Apache Spark consists of following components.
- Spark Core
- Spark SQL
- Spark Streaming
- Spark MLlib
- Spark GraphX
Spark Core is the heart of Spark Ecosystem.
It is the fundamental component of Spark. It is an API. It uses a specialized fundamental data structure known as RDD (Resilient Distributed Datasets). It contains the basic functionality of Spark, including components for task scheduling, memory management, fault recovery, interacting with storage systems.
All other components of spark ecosystem are built on top of spark core.