CGFS is a distributed file system written in C++, similar to GFS or NDFS. It aims to provide software for storing very large stream-oriented files across a set of commodity computers. Files are replicated across machines for safety, and load is balanced fairly across the machine set.
The CGFS project always welcomes new contributors.
See the SourceForge project page:
http://www.sourceforge.net/projects/cgfs