Apache Spark Fundamentals
Spark Basics
Introduction & Contents (3:30)
Why Spark - Vertical vs Horizontal Scaling (3:55)
What Spark Is Good For (4:45)
Spark Driver, Context & Executors (4:11)
Cluster Types (1:59)
Client vs Cluster Deployment (6:11)
Where to Run Spark (3:38)
Course Data & Environment
Tools in the Spark Course (2:35)
The Dataset (4:11)
Docker Setup (2:52)
Jupyter Notebook Setup & Run (5:31)
Spark Coding Basics
RDDs (3:57)
DataFrames (1:40)
Transformations & Actions Overview (2:59)
Transformations (2:22)
Actions (3:06)
Hands-On Part
Link to GitHub Repository
Download Datasets
Notebook 1: JSON Transformations (9:52)
Notebook 2: Working with Schemas (8:23)
Notebook 3: Working with DataFrames (10:09)
Notebook 4: SparkSQL (5:04)
Notebook 5: Working with RDDs (12:52)
Where to Run Spark