Book: Learning Spark
Spark
is an open source project that has been built and is maintained by a thriving and
diverse community of developers. If you or your organization are trying Spark for
the first time, you might be interested in the history of the project. Spark
started in 2009 as a research project in the UC Berkeley RAD Lab, later to
become the AMP Lab. The researchers in the lab had previously been working on
Hadoop Map‐Reduce, and observed that MapReduce was
inefficient for iterative and interactive computing jobs. Thus, from the
beginning, Spark was designed to be fast for interactive queries and iterative
algorithms, bringing in ideas like support for in-memory storage and efficient
fault recovery.
No comments:
Post a Comment