Table of Contents

Chapter: Introduction

Introduction And Course Overview

02m 1s

About The Author

01m 1s

Installing Python

04m 37s

Installing IPython And Using Notebooks

06m 27s

How To Access Your Working Files

01m 15s

Chapter: Installing Spark

Download And Setup

03m 24s

Running The Spark Shell

05m 34s

Running The Spark Shell With IPython

06m 37s

Chapter: Spark Fundamentals

What Is A Resilient Distributed Dataset - RDD?

04m 53s

Reading A Text File

03m 34s

Actions

02m 13s

Transformations

02m 29s

Persisting Data

04m 10s

Chapter: Transformations

Map

03m 4s

Filter

03m 56s

FlatMap

03m 15s

MapPartitions

04m 7s

MapPartitionsWithIndex

01m 50s

Sample

02m 36s

Union

01m 11s

Intersection

01m 28s

Distinct

02m 1s

Cartesian

03m 17s

Pipe

03m 39s

Coalesce

02m 12s

Repartition

02m 29s

RepartitionAndSortWithinPartitions

03m 57s

Chapter: Actions

Reduce

04m 19s

Collect

01m 56s

Count

03m 4s

First

01m 19s

Take

01m 5s

TakeSample

03m 3s

TakeOrdered

02m 10s

SaveAsTextFile

04m 9s

CountByKey

02m 40s

ForEach

03m 11s

Chapter: Key-Value Pair RDDs

GroupByKey

02m 30s

ReduceByKey

03m 30s

AggregateByKey

03m 44s

SortByKey

02m 46s

Join

04m 16s

CoGroup

02m 9s

Chapter: Input And Output

WholeTextFiles

03m 14s

Pickle Files

03m 59s

HadoopInputFormat

05m 34s

HadoopOutputFormat

05m 31s

Chapter: Performance

Broadcast Variables

04m 17s

Accumulators

05m 8s

Using A Custom Accumulator

04m 52s

Partitioning

07m 55s

Chapter: Running On A Cluster

Spark Standalone Cluster

04m 26s

Mesos

03m 37s

YARN

02m 28s

Client Versus Cluster Mode

02m 41s

Chapter: Advanced Spark

Spark Streaming

04m 21s

DataFrames And SQL

03m 27s

MLlib

04m 28s

Chapter: Conclusion

Resources And Where To Go From Here

01m 2s

Wrap Up

01m 27s