Hadoop Cluster Deployment
By Danil Zburivsky
Publisher: Packt Publishing
Released: November 2013
Pages: 126

In Detail

Big Data is the hottest trend in the IT industry at the moment. Companies are realizing the value of collecting, retaining, and analyzing as much data as possible. They are therefore rushing to implement the next generation of data platform, and Hadoop is the centerpiece of these platforms.

This practical guide is filled with examples which will show you how to successfully build a data platform using Hadoop. Step-by-step instructions will explain how to install, configure, and tie all major Hadoop components together. This book will allow you to avoid common pitfalls, follow best practices, and go beyond the basics when building a Hadoop cluster.

This book will walk you through the process of building a Hadoop cluster from the ground up. By using practical examples and command samples, you will be able to get a cluster up and running in no time, and you will also gain a deep understanding of how various Hadoop components work and interact with each other.

You will learn how to pick the right hardware for different types of Hadoop clusters and about the differences between various Hadoop distributions. By the end of this book, you will be able to install and configure several of the most popular Hadoop ecosystem projects including Hive, Impala, and Sqoop, and you will also be given a sneak peek into the pros and cons of using Hadoop in the cloud.

Approach

This book is a step-by-step tutorial filled with practical examples which will show you how to build and manage a Hadoop cluster along with its intricacies.

Who this book is for

This book is ideal for database administrators, data engineers, and system administrators, and it will act as an invaluable reference if you are planning to use the Hadoop platform in your organization. It is expected that you have basic Linux skills since all the examples in this book use this operating system. It is also useful if you have access to test hardware or virtual machines to be able to follow the examples in the book.

Product Details
Recommended for You
Customer Reviews

REVIEW SNAPSHOT®

by PowerReviews
oreillyHadoop Cluster Deployment
 
4.0

(based on 1 review)

Ratings Distribution

  • 5 Stars

     

    (0)

  • 4 Stars

     

    (1)

  • 3 Stars

     

    (0)

  • 2 Stars

     

    (0)

  • 1 Stars

     

    (0)

Reviewed by 1 customer

Displaying review 1

Back to top

 
4.0

Informative

By JamR

from London

About Me Educator

Verified Reviewer

Pros

  • Accurate
  • Helpful examples
  • Well-written

Cons

    Best Uses

    • Intermediate

    Comments about oreilly Hadoop Cluster Deployment:

    This is a lot of information to take in and if you are new to Hadoop as I was I would recommend viewing an overview video or two online to become familiar with all the vocabulary before diving into this book. I got a LOT out of this book by tackling it in two passes; First, I read it all the way through but just glanced over the examples – understanding the use cases, but not getting all wrapped around the Java. It has easily been 10 years since I last developed anything in Java, so I postponed my personal syntax journey. In my second pass through the book I focused on working and understanding the examples. By reading first and then working the examples I found that I had more focus on understanding the details and concepts by reading with an uninterrupted flow. Following with a "working the examples" pass allowed me to review and reinforce concepts with example activities. I tackled this material using a "lecture then lab" approach that works well for me.

    Displaying review 1

    Back to top

     
    Buy 2 Get 1 Free Free Shipping Guarantee
    Buying Options
    Immediate Access - Go Digital what's this?
    Ebook: $20.99
    Formats:  ePub, Mobi, PDF