What is the difference between Apache Storm and Apache Spark? Hadoop interview questions


  • Apache Storm operates on data in motion (a continuous stream of data). Its real-time nature comes from its ability to operate on streaming data: each record is processed as it flows through a fixed set of queries, which Storm calls a topology.


  • Apache Spark operates on data at rest. Its real-time feel comes from its ability to run computations on datasets (RDDs) very quickly, but these are still batch computations, as in Hadoop.


  • Spark Streaming, however, combines both: it treats a streaming computation as a series of deterministic batch computations over small time intervals.
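
The difference between the two processing models can be illustrated without either framework. The following is a minimal plain-Python sketch, not Storm or Spark code: the function names `per_record_counts` and `micro_batch_counts`, the `Event` type, and the sample data are all hypothetical and chosen only for illustration. The first function updates a word count as each record arrives (roughly the Storm style); the second groups records into fixed time intervals and runs an ordinary batch word count on each group (roughly the Spark Streaming style).

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple

# Hypothetical event shape: (timestamp in seconds, word)
Event = Tuple[float, str]


def per_record_counts(events: Iterable[Event]) -> Iterator[Tuple[str, int]]:
    """Record-at-a-time style: emit an updated count as each event arrives."""
    counts: Dict[str, int] = defaultdict(int)
    for _, word in events:
        counts[word] += 1
        yield word, counts[word]


def micro_batch_counts(
    events: Iterable[Event], interval: float
) -> List[Tuple[int, Dict[str, int]]]:
    """Micro-batch style: bucket events by time interval, then run a
    deterministic batch word count on each bucket."""
    batches: Dict[int, List[str]] = defaultdict(list)
    for ts, word in events:
        # Events whose timestamps fall in the same interval share a batch id.
        batches[int(ts // interval)].append(word)

    results: List[Tuple[int, Dict[str, int]]] = []
    for batch_id in sorted(batches):
        counts: Dict[str, int] = defaultdict(int)
        for word in batches[batch_id]:
            counts[word] += 1
        results.append((batch_id, dict(counts)))
    return results


# Sample data, invented for this sketch.
events = [(0.1, "a"), (0.4, "b"), (0.9, "a"), (1.2, "a"), (2.5, "b")]

if __name__ == "__main__":
    for word, count in per_record_counts(events):
        print("record:", word, count)
    for batch_id, counts in micro_batch_counts(events, interval=1.0):
        print("batch:", batch_id, counts)
```

With the per-record version, a result is available after every event; with the micro-batch version, results appear once per interval, so latency is bounded below by the interval length, but each batch is an ordinary, repeatable batch computation.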

