Tutorial 4
Question 1 - What is Apache spark? why spark has come into existence? brief architecture of Apache Spark Answer 1 - Apache Spark is a lightning-fast cluster computing technology, designed for fast computation. It is based on Hadoop MapReduce and it extends the MapReduce model to efficiently use it for more types of computations, which includes interactive queries and stream processing. The main feature of Spark is its in-memory cluster computing that increases the processing speed of an application. Major issues with Apache Hadoop . 1. Issue with Small Files 2. It has Slow Processing Speed 3. Only support for Batch Processing only 4. There is no support for Real-time Data Processing To overcome all these issues, Apache Spark comes into the picture . one more reason behind Evolution of Apache Spark is that there were many general purpose computin...
Comments
Post a Comment