Apache Spark is an open-source cluster-computing framework. Originally developed at the University of California, Berkeley’s AMPLab, the Spark code base was later donated to the Apache Software Foundation, which has maintained it since. Spark provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.
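To make "implicit data parallelism" concrete, here is a minimal sketch in plain Python. It is not Spark's API: the helper names `split_into_partitions` and `parallel_map_reduce` are hypothetical, and a local thread pool stands in for a cluster. The point is only that the caller supplies a map function and a reduce function, while the splitting and parallel execution happen behind the scenes. Fault tolerance is not modeled here.

```python
from concurrent.futures import ThreadPoolExecutor
from functools import reduce


def split_into_partitions(data, num_partitions):
    """Split a list into roughly equal chunks, one per worker (hypothetical helper)."""
    size = max(1, -(-len(data) // num_partitions))  # ceiling division
    return [data[i:i + size] for i in range(0, len(data), size)]


def parallel_map_reduce(data, map_fn, reduce_fn, num_partitions=4):
    """Apply map_fn to every element, then combine the results with reduce_fn.

    reduce_fn is assumed to be associative, so partial results can be
    combined in any grouping. The caller never handles threads or
    partitions directly; the parallelism is implicit.
    """
    if not data:
        raise ValueError("data must not be empty")
    partitions = split_into_partitions(data, num_partitions)

    def process(partition):
        # Each partition is mapped and locally reduced independently.
        return reduce(reduce_fn, (map_fn(x) for x in partition))

    with ThreadPoolExecutor(max_workers=num_partitions) as pool:
        partials = list(pool.map(process, partitions))

    # Combine the per-partition results into one final value.
    return reduce(reduce_fn, partials)


# Sum of squares of 1..100, computed across four partitions.
total = parallel_map_reduce(
    list(range(1, 101)),
    lambda x: x * x,
    lambda a, b: a + b,
)
print(total)
```

A real cluster framework would additionally distribute the partitions across machines and recompute lost partitions after a failure; this sketch only illustrates the programming model.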
Advantages
Yes, Hadoop is the ultimate store for all your semi-structured data in HDFS, and yes, you can query all of that data using MapReduce. Even so, the numerous advantages of Apache Spark make it a very attractive big data framework.