Apache TEZ
FlumeJava: easy, efficient data-parallel pipelines
Dryad: distributed data-parallel programs from sequential building blocks