With Early Release ebooks, you get books in their earliest form—the author's raw and unedited content as he or she writes—so you can take advantage of these technologies long before the official release of these titles. You’ll also receive updates when significant changes are made, new chapters are available, and the final ebook bundle is released.
This practical book provides a comprehensive guide on how to make enterprise Hadoop integration successful. You’ll learn how to build a Hadoop infrastructure, architect an enterprise Hadoop platform, and even take Hadoop to the cloud.
Ideal for engineers, technical and enterprise architects, and technical leads, this guide includes many practical examples that technical architects can easily follow and that engineers, developers, and cluster operators can put into practice.
Chapter 2: Compute and Storage
Chapter 4: Organizational Considerations
Chapter 5: Data Center Considerations
Chapter 6: Cluster Nodes Operating System
Chapter 7: Service Databases
Chapter 8: Hadoop Deployment
Chapter 9: Platform Validation Testing
Chapter 11: Cluster Access
Chapter 13: Backup & Disaster Recovery
Chapter 14: General Considerations
Chapter 15: How to build my security architecture in a Hadoop environment?
Chapter 16: How to build HA for my Hadoop Cloud environment
Chapter 17: How can I build an on-premise private cloud for Hadoop?
Chapter 18: How do I run Hadoop on public cloud infrastructure?
Chapter 19: Leveraging Cloud Storage
Chapter 20: How can I work with managed service providers?
Jan Kunigk has worked on enterprise Hadoop solutions since 2010. Before joining Cloudera in 2014, his tasks included building optimized systems architectures for Hadoop at IBM and implementing a Hadoop-as-a-service offering at a global IT hosting corporation. In his current role as a Solutions Architect, he makes Hadoop projects at Cloudera’s enterprise customers successful, covering a wide spectrum of work on a day-to-day basis, from architectural decisions to the implementation of big data applications across all industry sectors.
Lars George has been involved with Hadoop and HBase since 2007, and became a full HBase committer in 2009. He has spoken at many Hadoop User Group meetings and at conferences such as Hadoop World, Hadoop Summit, ApacheCon, FOSDEM, and QCon. He also started the Munich OpenHUG meetings. Lars worked for Cloudera for over five years as the EMEA Chief Architect, acting as a liaison between the Cloudera professional services team and customers as well as partners in and around Europe, building the next data-driven solutions. In 2016 he started his own Hadoop advisory firm, building on what he has learned and seen in the field for more than eight years. He is also the author of O'Reilly's "HBase: The Definitive Guide".
Paul Wilkinson has been wrestling with big data in the public sector since before Hadoop existed and was very glad when it arrived in his life in 2009. He became a Cloudera consultant in 2012, advising customers on all things Hadoop: application design, information architecture, cluster management, and infrastructure planning (the full stack). After a torrent of professional services work across financial services, cybersecurity, adtech, gaming, and government, he’s seen it all, warts and all. Or at least, he hopes he has.
Ian Buss began his journey into distributed computing with parallel computational electromagnetics whilst studying for a PhD in photonics at the University of Bristol. After simulating LEDs on supercomputers, he made the move from big compute in academia to big data in the public sector, first encountering Hadoop in 2012. After having fun building, deploying, managing and using Hadoop clusters, Ian joined Cloudera as a Solutions Architect in 2014. His day job now involves integrating Hadoop into enterprises and making stuff work in the real world.