Big Data for Chimps
A Guide to Massive-Scale Data Processing in Practice
Publisher: O'Reilly Media
Final Release Date: September 2015
Pages: 220

Finding patterns in massive event streams can be difficult, but learning how to find them doesn’t have to be. This unique hands-on guide shows you how to solve this and many other problems in large-scale data processing with simple, fun, and elegant tools that leverage Apache Hadoop. You’ll gain a practical, actionable view of big data by working with real data and real problems.

Perfect for beginners, this book’s approach will also appeal to experienced practitioners who want to brush up on their skills. Part I explains how Hadoop and MapReduce work, while Part II covers many analytic patterns you can use to process any data. As you work through several exercises, you’ll also learn how to use Apache Pig to process data.

  • Learn the necessary mechanics of working with Hadoop, including how data and computation move around the cluster
  • Dive into map/reduce mechanics and build your first map/reduce job in Python
  • Understand how to run chains of map/reduce jobs in the form of Pig scripts
  • Use a real-world dataset—baseball performance statistics—throughout the book
  • Work with examples of several analytic patterns, and learn when and where you might use them
Table of Contents
Product Details
About the Author
Colophon
Recommended for You
Customer Reviews

REVIEW SNAPSHOT®

by PowerReviews
oreillyBig Data for Chimps
 
3.7

(based on 3 reviews)

Ratings Distribution

  • 5 Stars

     

    (2)

  • 4 Stars

     

    (0)

  • 3 Stars

     

    (0)

  • 2 Stars

     

    (0)

  • 1 Stars

     

    (1)

67%

of respondents would recommend this to a friend.

Pros

No Pros

Cons

No Cons

Best Uses

No Best Uses
    • Reviewer Profile:
    • Developer (3)

Reviewed by 3 customers

Displaying reviews 1-3

Back to top

 
5.0

A Seriously Fun Guide to Data Science in Practice

By mrflip

from Austin, TX

About Me Developer

Verified Reviewer

Pros

  • Easy to understand
  • Helpful examples
  • Well-written

Cons

    Best Uses

    • Intermediate
    • Novice
    • Student

    Comments about oreilly Big Data for Chimps:

    This book leaves you with two things: physical intuition for how data moves through the system in a Hadoop job, and a practical cookbook for the full range of database constructs needed by the practicing data scientist. It's a great resource for both beginning and intermediate practitioners.

    It covers the big data toolkit from the outside, maximizing programmer efficiency and maintainability over raw performance and sophistication. It emphasizes high-level tools that get the job done rather than grinding through the minutiae of the primitive Java APIs.

    (1 of 1 customers found this review helpful)

     
    5.0

    Best book I ever wrote!

    By Russ the Writer

    from Pacifica, CA

    About Me Developer, Sys Admin

    Pros

    • Accurate
    • Concise
    • Easy to understand
    • Helpful examples
    • Well-written

    Cons

      Best Uses

      • Expert
      • Intermediate
      • Student

      Comments about oreilly Big Data for Chimps:

      This book teaches the how of Big Data in a way no other book does.

      (0 of 4 customers found this review helpful)

       
      1.0

      waste of money

      By Vik

      from Ohio

      About Me Developer

      Verified Reviewer

      Pros

      • None

      Cons

      • Difficult to understand
      • Not comprehensive enough
      • Too many errors

      Best Uses

        Comments about oreilly Big Data for Chimps:

        this book I am sad to write was honestly a good attempt but the outcome is absysmal. if you want to waste your money please buy it..

        Displaying reviews 1-3

        Back to top

         
        Buy 2 Get 1 Free Free Shipping Guarantee
        Buying Options
        Immediate Access - Go Digital what's this?
        Ebook:  $33.99
        Formats:  DAISY, ePub, Mobi, PDF
        Print & Ebook:  $43.99
        Print:  $39.99