Stratosphere is an open-source system for Big Data Analytics that can be deployed in a local cluster using HDFS or in the Amazon cloud. Stratosphere extends the MapReduce programming model, making it easy to write complex analytic queries that include binary operators such as joins. Stratosphere contains a cost-based optimizer, that automatically picks the best parallel schedule for a program, relieving the programmer from the task of hand-optimizing jobs. The platform is jointly developed by TU Berlin, HU Berlin, and HPI, three Universities in the greater Berlin area.
LINK Download | |
Updates June 2014:
We are happy to announce a new major Stratosphere release, version 0.5. This release adds many new features and improves the
interoperability, stability, and performance of the system.
The major theme of the release is the completely new Java API that makes it easy to write powerful distributed programs. This programming
model significantly eases the development of Stratosphere programs,supports flexible use of regular Java classes as data types, and adds
many new built-in operators to simplify the writing of powerful programs. The result are programs that need less code, are more
readable, interoperate better with existing code, and execute faster.
for a complete list of new features.
In total, 26 people have contributed to Stratosphere since the last release. Thank you for making this project possible!
The Stratosphere project has been accepted to the Apache Incubator and will continue its work under the umbrella of the Apache Software Foundation.