G-Hadoop is a MapReduce implementation targeting on a distributed system with multiple clusters, such as a Grid infrastructures, a Cloud, distributed virtual machines, or a multi-data centre infrastructure.
In order to share data across multiple administrative domains, G-Hadoop replaces Hadoop’s native distributed file system with the Gfarm Grid file system.