Chapter: Introduction

Introduction and Course Overview

About the Author

Spark’s concepts and approach

Resilient Distributed Datasets (RDD)

Creating a Project in IDEA

How To Access Your Working Files

Chapter: Spark Core API & Best practices

Base RDD

Actions - Part 1

Actions - Part 2

Hadoop Combiners In Spark

Directed Acyclic Graph And Lazy Evaluation

Chapter: Closure serialization

How does the magic of Spark work

Serializers and how to change them

Caching & Persistence

Chapter: Spark SQL

Spark SQL

Inferring A Schema

Applying A Schema

Loading And Writing

SQL Caching And UDF

Chapter: Spark MLlib

Spark MLlib And Supervised Example - SVM

Unsupervised With Iris Dataset - KMeans

Chapter: Spark GraphX

Graph Construction

Graph Algorithms

Chapter: Spark Streaming

Streaming And The Microbatch

Mutable Transformations And Checkpointing

Windows And RDD Transformations

Streaming With Spark SQL, MLlib And Core

Chapter: Deployment and Infrastructure

Cluster Managers And Submission - Standalone, Mesos And YARN

Chapter: Conclusion

Resources And Where To Go From Here

