Learning Spark
Lightning-Fast Big Data Analytics
Publisher: O'Reilly Media
Final Release Date: June 2014
Pages: 300

With Early Release ebooks, you get books in their earliest form — the author's raw and unedited content as he or she writes — so you can take advantage of these technologies long before the official release of these titles. You'll also receive updates when significant changes are made, new chapters as they're written, and the final ebook bundle.

The Web is getting faster, and the data it delivers is getting bigger. How can you handle everything efficiently? This book introduces Spark, an open source cluster computing system that makes data analytics fast to run and fast to write. You’ll learn how to run programs faster, using primitives for in-memory cluster computing. With Spark, your job can load data into memory and query it repeatedly, much more quickly than with disk-based systems like Hadoop MapReduce.

Written by the developers of Spark, this book will have you up and running in no time. You’ll learn how to express MapReduce jobs with just a few simple lines of Spark code, instead of spending extra time and effort working with Hadoop’s raw Java API.

  • Quickly dive into Spark capabilities such as collect, count, reduce, and save
  • Use one programming paradigm instead of mixing and matching tools such as Hive, Hadoop, Mahout, and S4/Storm
  • Learn how to run interactive, iterative, and incremental analyses
  • Integrate with Scala to manipulate distributed datasets like local collections
  • Tackle partitioning issues, data locality, default hash partitioning, user-defined partitioners, and custom serialization
  • Use other languages by means of pipe() to achieve the equivalent of Hadoop streaming
Customer Reviews

Review Snapshot

4.3 (based on 3 reviews)

Ratings Distribution

  • 5 Stars: 1
  • 4 Stars: 2
  • 3 Stars: 0
  • 2 Stars: 0
  • 1 Star: 0

67% of respondents would recommend this to a friend.

(1 of 1 customers found this review helpful)

 
4.0

Excellent so far

By Ramesh M

from Indianapolis, IN

About Me Designer, Developer

Verified Buyer

Pros

  • Concise
  • Easy to understand
  • Helpful examples
  • Well-written

Cons

    Best Uses

    • Intermediate
    • Novice
    • Student

    Comments about O'Reilly Learning Spark:

    Even though basic Spark programming concepts and examples can be found on the Internet, reading a book with a structured format is easier and less time-consuming. The author has done a great job of explaining the Spark functions with examples. That allowed me to get a better understanding of the basic concepts and start applying them to my project.

    I'm looking forward to the next early-access update.

     
    5.0

    Brilliant intro to Spark

    By Helipilot50

    from Dallas, TX, USA

    About Me Designer, Developer, Maker, Seasoned IT Professional

    Verified Buyer

    Pros

    • Accurate
    • Concise
    • Easy to understand
    • Well-written

    Cons

    • Needs To Be Finished

    Best Uses

    • Intermediate
    • Novice

    Comments about O'Reilly Learning Spark:

    Spark is a credible compute engine that scales massively. This book not only assists you in getting started with Spark, but also helps you to adjust your thinking so you can completely exploit the parallel processing. If you have a need for Complex Event Processing, or "as it happens" processing rather than the batch processing of Hadoop, then Spark is your solution.

    (0 of 3 customers found this review helpful)

     
    4.0

    Really cool

    By Drodri

    from Barcelona

    Verified Buyer

    Comments about O'Reilly Learning Spark:

    Waiting for the next update!

    Buying Options
    Pre-Order  Print: $39.99
    February 2015 (est.)