Table of Contents

  1. Chapter 1 Introduction

  2. Chapter 2 HDFS

    1. Goals and Motivation

    2. Design

    3. Daemons

    4. Reading and Writing Data

    5. Managing Filesystem Metadata

    6. Namenode High Availability

    7. Namenode Federation

    8. Access and Integration

  3. Chapter 3 MapReduce

    1. The Stages of MapReduce

    2. Introducing Hadoop MapReduce

    3. YARN

  4. Chapter 4 Planning a Hadoop Cluster

    1. Picking a Distribution and Version of Hadoop

    2. Hardware Selection

    3. Operating System Selection and Preparation

    4. Kernel Tuning

    5. Disk Configuration

    6. Network Design

  5. Chapter 5 Installation and Configuration

    1. Installing Hadoop

    2. Configuration: An Overview

    3. Environment Variables and Shell Scripts

    4. Logging Configuration

    5. HDFS

    6. Namenode High Availability

    7. Namenode Federation

    8. MapReduce

    9. Rack Topology

    10. Security

  6. Chapter 6 Identity, Authentication, and Authorization

    1. Identity

    2. Kerberos and Hadoop

    3. Authorization

    4. Tying It Together

  7. Chapter 7 Resource Management

    1. What Is Resource Management?

    2. HDFS Quotas

    3. MapReduce Schedulers

  8. Chapter 8 Cluster Maintenance

    1. Managing Hadoop Processes

    2. HDFS Maintenance Tasks

    3. MapReduce Maintenance Tasks

  9. Chapter 9 Troubleshooting

    1. Differential Diagnosis Applied to Systems

    2. Common Failures and Problems

    3. “Is the Computer Plugged In?”

    4. Treatment and Care

    5. War Stories

  10. Chapter 10 Monitoring

    1. An Overview

    2. Hadoop Metrics

    3. Health Monitoring

  11. Chapter 11 Backup and Recovery

    1. Data Backup

    2. Namenode Metadata

  1. Appendix Deprecated Configuration Properties

  2. Colophon