Understand the fundamentals of Scala and the Apache Spark ecosystem
Handle large streams of data with Spark Streaming and perform machine learning in real time with Spark MLlib
A comprehensive tutorial packed with practical examples to help you develop real-world Big Data applications with Spark and Scala
With the rise in popularity of the term ‘Big Data’, there is an increasing need to process large amounts of data in real time, with maximum efficiency. This has led to Apache Spark gaining popularity in the Big Data market very quickly. The Spark ecosystem allows you to process large streams of data in real time. As Spark is built on Scala, knowledge of both has become vital for data scientists and data analysts today. This comprehensive 7-hour course will empower you to build efficient Spark applications to fulfill your Big Data needs.
You will start by quickly learning the basics of Scala, then set up the development environment for Apache Spark and Scala for Big Data processing. You will learn about the different modules of Spark, such as Spark SQL, Spark Streaming, and GraphX, along with when and how to use them. While doing so, you will build practical, real-world Spark applications in Scala and see how you can deploy them on the cloud. You will also learn how to perform machine learning in real time using Spark’s MLlib module. Finally, you will learn how to run Spark on Hadoop clusters, along with best practices and troubleshooting techniques.
With over 20 carefully selected examples and thorough explanations of even the most difficult concepts, this course will ensure your success in taming your Big Data challenges using Spark with Scala.
Frank Kane spent 9 years at Amazon and IMDb, developing and managing the technology that automatically delivers product and movie recommendations to hundreds of millions of customers, all the time. Frank holds 17 issued patents in the fields of distributed computing, data mining, and machine learning. In 2012, Frank left to start his own successful company, Sundog Software, which focuses on virtual reality environment technology and on teaching others about big data analysis.