MapReduce Design Patterns
Building Effective Algorithms and Analytics for Hadoop and Other Systems
Publisher: O'Reilly Media
Released: November 2012
Pages: 252

Until now, design patterns for the MapReduce framework have been scattered among various research papers, blogs, and books. This handy guide brings together a unique collection of valuable MapReduce patterns that will save you time and effort regardless of the domain, language, or development framework you’re using.

Each pattern is explained in context, with pitfalls and caveats clearly identified to help you avoid common design mistakes when modeling your big data architecture. This book also provides a complete overview of MapReduce that explains its origins and implementations, and why design patterns are so important. All code examples are written for Hadoop.

  • Summarization patterns: get a top-level view by summarizing and grouping data
  • Filtering patterns: view data subsets such as records generated from one user
  • Data organization patterns: reorganize data to work with other systems, or to make MapReduce analysis easier
  • Join patterns: analyze different datasets together to discover interesting relationships
  • Metapatterns: piece together several patterns to solve multi-stage problems, or to perform several analytics in the same job
  • Input and output patterns: customize the way you use Hadoop to load or store data

"A clear exposition of MapReduce programs for common data processing patterns—this book is indespensible for anyone using Hadoop."

--Tom White, author of Hadoop: The Definitive Guide

Table of Contents
Product Details
About the Author
Colophon
Recommended for You
Customer Reviews

REVIEW SNAPSHOT®

by PowerReviews
oreillyMapReduce Design Patterns
 
4.3

(based on 8 reviews)

Ratings Distribution

  • 5 Stars

     

    (3)

  • 4 Stars

     

    (4)

  • 3 Stars

     

    (1)

  • 2 Stars

     

    (0)

  • 1 Stars

     

    (0)

100%

of respondents would recommend this to a friend.

Pros

  • Helpful examples (6)
  • Well-written (6)
  • Accurate (5)
  • Concise (5)
  • Easy to understand (3)

Cons

    Best Uses

    • Intermediate (7)
    • Expert (5)
      • Reviewer Profile:
      • Developer (6), Educator (4)

    Reviewed by 8 customers

    Sort by

    Displaying reviews 1-8

    Back to top

    (1 of 1 customers found this review helpful)

     
    4.0

    Harbinger of Big Data Maturation!

    By CoE4BD

    from St. Paul, MN

    About Me Educator

    Verified Reviewer

    Pros

    • Comprehendable
    • Easy to understand
    • Familiar Template
    • Helpful examples
    • Timely
    • Well-written

    Cons

    • A Good Start-we Need More

    Best Uses

    • Intermediate

    Comments about oreilly MapReduce Design Patterns:

    Miner and Shook use the format from the popular Gamma, Helms, Johnson & Vlissides' book Design Patterns: Elements of Reusable Object-Oriented Software (Addison-Wesley, 1994) with a few additions. Their intents are noble: to establish a vocabulary for MapReduce software as the field matures, and to establish foundations for the conversation. This book is not for beginners. It is a compendium of frequently used techniques along with suitable data structures for implementing them. An open question is whether the maturity of Pig and Hive will eliminate the need for patterns such as these. For each pattern they use the template: Intent, Motivation, Applicability, Structure, Consequences, Resemblances, Performance Analytics, and Examples.

     
    5.0

    Great read

    By mhausenblas

    from Galway, Ireland

    About Me Advocate, Developer, Educator

    Verified Reviewer

    Pros

    • Accurate
    • Concise
    • Helpful examples
    • Well-written

    Cons

      Best Uses

      • Expert
      • Intermediate
      • Novice

      Comments about oreilly MapReduce Design Patterns:

      A very useful book for devops and app developers. What I found most useful is that the recipes are always well motivated and the respective SQL equivalency are pointed out which lowers the barrier for people with an RDBMS background.

       
      5.0

      well illustrated with java code

      By produke

      from madrid

      About Me Developer

      Verified Reviewer

      Pros

      • Accurate
      • Concise
      • Easy to understand
      • Helpful examples
      • Well-written

      Cons

        Best Uses

        • Expert
        • Intermediate
        • Programmer

        Comments about oreilly MapReduce Design Patterns:

        Already certified by Cloudera, was looking a cookbook as reference and examples. This book is having enough concise information to cover the different patterns and well illustrated with java code. Would recommend it for beginners and already users of hadoop api. The book can be find here http://shop.oreilly.com/product/0636920025122.do Disclaimer: the book was given as part from blogger review scheme from O'Reilly.

         
        5.0

        The book is good

        By Jay Chen

        from San Jose

        About Me Designer, Developer, Educator

        Pros

        • Accurate
        • Concise
        • Well-written

        Cons

        • Difficult to understand

        Best Uses

        • Expert
        • Intermediate

        Comments about oreilly MapReduce Design Patterns:

        1) Reference the rules, 2) It help us to understand how to apply design patterns on MapReduce.
        The only drawback is that the book is for advanced MapReduce programmer.

        (2 of 2 customers found this review helpful)

         
        4.0

        The book of MapReduce developers.

        By Icaro

        from Athens, Greece

        About Me Designer, Developer

        Verified Reviewer

        Pros

        • Accurate
        • Helpful examples
        • Well-written

        Cons

          Best Uses

          • Intermediate

          Comments about oreilly MapReduce Design Patterns:

          A book that all MapReduce developers must have. I admit that it is not a book to start with, since you must already be familiar with MapReduce and also to have a good knowledge of Hadoop. Still this book is a great reference of various patterns and algorithms used in MapReduce programming.

          (3 of 5 customers found this review helpful)

           
          4.0

          Decent hadoop cookbook reference

          By Produke

          from Spain

          About Me Certified Developer, Developer

          Verified Reviewer

          Pros

          • Accurate
          • Concise
          • Easy to understand
          • Helpful examples

          Cons

            Best Uses

            • Expert
            • Intermediate
            • Novice
            • Student

            Comments about oreilly MapReduce Design Patterns:

            Already certified by Cloudera, was looking a cookbook as reference and examples. This book is having enough concise information to cover the different patterns and well illustrated with java code. Would recommend it for beginners and already users of hadoop api.
            Disclaimer: the book was given as part of the blogger review scheme from O'Reilly.

            (2 of 4 customers found this review helpful)

             
            4.0

            Excellent MapReduce book

            By surachart

            from Bangkok, Thailand

            About Me DBA, Developer, Educator, Sys Admin

            Pros

            • Concise
            • Helpful examples
            • Well-written

            Cons

            • Difficult to understand
            • Not comprehensive enough

            Best Uses

            • Expert
            • Intermediate

            Comments about oreilly MapReduce Design Patterns:

            I read a book with ebub format. It's easy to read with great template format. All code examples in book are written for Hadoop. I could learn from many examples. This book looks like "cook book" (Each example, you will see question, how to do, idea, example code and comparing with sql & pig), but we should even know about Hadoop and java programming. However, I believe it's a good idea to use this book as reference, because readers can reproduce code in book with their work.

            (4 of 7 customers found this review helpful)

             
            3.0

            Not an easy go

            By mko

            from Poland

            Verified Reviewer

            Comments about oreilly MapReduce Design Patterns:

            I admit, this was not a good idea to start MapReduce adventure by jumping directly into Design Patterns. However, I had my goal in it. Long, long time ago in a C++ galaxy far away there was a design patterns related book: "Design Patterns: Elements of Reusable Object-Oriented Software". And it was good.

            "MapReduce Design Patters" tries to be something similar within MapReduce area. However, this book focuses on the implementation too much. If you are not familiar with MapReduced yet, it will not help you becoming fascinated about the whole idea. After reading dozen of pages I had to start from the scratch by looking for some additional MapReduce introductions before I have gotten back to this one.

            It's really hard for me to judge the technical side of the book. This is entirely my fault as I am completely new to MapReduce. Yes, I know, it was like
            jumping in at the deep end. However, I was hoping that design patterns will be presented in more general way, more the way design patterns were presented in already mentioned "Design Patterns: Elements of Reusable Object-Oriented Software". My point is. I don't claim that MapReduce Design Patterns is written badly. No, not at all. On the other hand, if you'd like to become more familiar with the general idea of MapReduce, this is not a good place to start.

            Big plus goes to the authors for collecting various patterns and indexing them in a one place.

            And one more explanation. This book is not designed for the complete beginners like me myself. Authors themselves say: "The motivation for us to write this book was to fill a missing gap we saw in a lot of new MapReduce developers. They had learned how to use the system, got comfortable with writing MapReduce, but were lacking the experience to understand how to do things right or well".

            Displaying reviews 1-8

            Back to top

             
            Buy 2 Get 1 Free Free Shipping Guarantee
            Buying Options
            Immediate Access - Go Digital what's this?
            Ebook: $39.99
            Formats:  DAISY, ePub, Mobi, PDF
            Print & Ebook: $49.49
            Print: $44.99