Publisher: O'Reilly Media / Yahoo Press Released: May 2012 Pages: 688
Ready to unlock the power of your data? With this comprehensive guide, you’ll learn how to build and maintain reliable, scalable, distributed systems with Apache Hadoop. This book is ideal for programmers looking to analyze datasets of any size, and for administrators who want to set up and run Hadoop clusters. You’ll find illuminating case studies that demonstrate how Hadoop is used to solve specific problems. This third edition covers recent changes to Hadoop, including material on the new MapReduce API, as well as MapReduce 2 and its more flexible execution model (YARN). - Store large datasets with the Hadoop Distributed File System (HDFS)
- Run distributed computations with MapReduce
- Use Hadoop’s data and I/O building blocks for compression, data integrity, serialization (including Avro), and persistence
- Discover common pitfalls and advanced features for writing real-world MapReduce programs
- Design, build, and administer a dedicated Hadoop cluster—or run Hadoop in the cloud
- Load data from relational databases into HDFS, using Sqoop
- Perform large-scale data processing with the Pig query language
- Analyze datasets with Hive, Hadoop’s data warehousing system
- Take advantage of HBase for structured and semi-structured data, and ZooKeeper for building distributed systems
|
- Title:
- Hadoop: The Definitive Guide, 3rd Edition
- By:
- Tom White
- Publisher:
- O'Reilly Media / Yahoo Press
- Formats:
-
- Print
- Ebook
- Safari Books Online
- Print:
- May 2012
- Ebook:
- May 2012
- Pages:
- 688
- Print ISBN:
- 978-1-4493-1152-0
- | ISBN 10:
- 1-4493-1152-0
- Ebook ISBN:
- 978-1-4493-1151-3
- | ISBN 10:
- 1-4493-1151-2
|
-
Tom White Tom White has been an Apache Hadoop committer since February 2007, and is a member of the Apache Software Foundation. He works for Cloudera, a company set up to offer Hadoop support and training. Previously he was as an independent Hadoop consultant, working with companies to set up, use, and extend Hadoop. He has written numerous articles for O'Reilly, java.net and IBM's developerWorks, and has spoken at several conferences, including at ApacheCon 2008 on Hadoop. Tom has a Bachelor's degree in Mathematics from the University of Cambridge and a Master's in Philosophy of Science from the University of Leeds, UK. View Tom White's full profile page. |
Colophon The animal on the cover of Hadoop: The Definitive Guide is an African elephant. Thesemembers of the genus Loxodonta are the largest land animals on earth (slightly largerthan their cousin, the Asian elephant) and can be identified by their ears, which havebeen said to look somewhat like the continent of Asia. Males stand 12 feet tall at theshoulder and weigh 12,000 pounds, but they can get as big as 15,000 pounds, whereasfemales stand 10 feet tall and weigh 8,000–11,000 pounds. Even young elephants arevery large: at birth, they already weigh approximately 200 pounds and stand about 3feet tall. African elephants live throughout sub-Saharan Africa. Most of the continent’s elephantslive on savannas and in dry woodlands. In some regions, they can be found indesert areas; in others, they are found in mountains. The species plays an important role in the forest and savanna ecosystems in which theylive. Many plant species are dependent on passing through an elephant’s digestive tractbefore they can germinate; it is estimated that at least a third of tree species in westAfrican forests rely on elephants in this way. Elephants grazing on vegetation also affectthe structure of habitats and influence bush fire patterns. For example, under naturalconditions, elephants make gaps through the rainforest, enabling the sunlight to enter,which allows the growth of various plant species. This, in turn, facilitates more abundanceand more diversity of smaller animals. As a result of the influence elephants haveover many plants and animals, they are often referred to as a keystone species becausethey are vital to the long-term survival of the ecosystems in which they live. The cover image is from the Dover Pictorial Archive. The cover font is Adobe ITCGaramond. The text font is Linotype Birka; the heading font is Adobe Myriad Condensed;and the code font is LucasFont’s TheSansMonoCondensed. |
|
Description
|
Table of Contents
|
Product Details
|
About the Author
|
Colophon
|
 |
|
 |
|
|
|
Recommended for You
|
Recently Viewed
|
 |
|
By Preston Gralla, Brian Sawyer
December 2011
By Thibault Imbert
January 2012
By Jonathan Stark, Brian Jepson
January 2012
Ebook: $23.99
Print & Ebook: $32.99
Print: $29.99
|
Customer Reviews
3/23/2012 (2 of 3 customers found this review helpful) 3.0Thorough, but not divided up well
|
|
|