Process large amounts of data in real time using Spark Streaming
About This Video
Process streams of real-time data from various sources with Spark Streaming
Query your streaming data in real-time using Spark SQL
A comprehensive tutorial with practical examples to help you develop real-time Spark applications
Businesses these days require constant, real-time analysis of large amounts of data, along with meaningful insights which influence business decisions. Apache Spark has emerged as the most popular tool in the Big Data market for efficient real-time analytics of Big Data. Spanning over 5 hours, this course will teach you the basics of Apache Spark and how to use Spark Streaming - a module of Apache Spark which involves handling and processing of Big Data on a real-time basis. You will learn how to create Spark applications with Scala to process streams of real-time data. Whether you want to analyze continuously incoming website traffic, analyze real-time streams of Twitter feeds or query your streaming data in real time, this course has got you covered. You will also learn how to use the MLlib module of Spark to train machine learning models with streaming data, and use those models to make real-time predictions. The course assumes some programming experience, and uses Scala to develop Spark applications. It includes a crash course in the Scala programming language in case you're new to it.
Taming Big Data with Spark Streaming and Scala – Hands On!
5 hours 54 minutes
Frank Kane spent 9 years at Amazon and IMDb, developing and managing the technology that automatically delivers product and movie recommendations to hundreds of millions of customers, all the time. Frank holds 17 issued patents in the fields of distributed computing, data mining, and machine learning. In 2012, Frank left to start his own successful company, Sundog Software, which focuses on virtual reality environment technology, and teaching others about big data analysis.