Moving Hadoop to the Cloud
Harnessing Cloud Features and Flexibility for Hadoop Clusters
Publisher: O'Reilly Media
Final Release Date: November 2016
Pages: 300

With Early Release ebooks, you get books in their earliest form—the author's raw and unedited content as he or she writes—so you can take advantage of these technologies long before the official release of these titles. You'll also receive updates when significant changes are made, new chapters are available, and the final ebook bundle is released.

Up until recently, Hadoop deployments have existed on hardware owned and run by organizations. Now cloud service providers let customers effectively rent hardware and the associated network connectivity. But there’s a lot more to installing a Hadoop cluster in the public cloud than simply renting machines.

This practical book shows developers and systems administrators familiar with Hadoop how to install, use, and manage cloud-born clusters efficiently. You’ll learn how to architect clusters in a way that works with cloud-provider features—not just to avoid potential pitfalls, but also to take full advantage of what these services can do. You’ll also compare the Amazon, Google, and Microsoft clouds, and learn how to set up clusters in each of them.

  • Learn the advantages and disadvantages of running Hadoop in the cloud
  • Get a cloud primer on instances, networking and security, and storage
  • Build a simple Hadoop cloud cluster, and run a MapReduce job
  • Explore use cases for high availability, relational data with Hive, and complex analytics with Spark
  • Learn best practices for designing and managing cloud clusters, including network topologies, day-to-day tasks, and troubleshooting
Table of Contents
Product Details
About the Author
Recommended for You
Customer Reviews
 
Buy 2 Get 1 Free Free Shipping Guarantee
Buying Options
Immediate Access - Go Digital what's this?
Pre-Order  Print:  $39.99
July 2017 (est.)