Books & Videos

Table of Contents

  1. Chapter 1 Core Technologies

    1. Hadoop Distributed File System (HDFS)

    2. MapReduce

    3. YARN

    4. Spark

  2. Chapter 2 Database and Data Management

    1. Cassandra

    2. HBase

    3. Accumulo

    4. Memcached

    5. Blur

    6. Solr

    7. MongoDB

    8. Hive

    9. Spark SQL (formerly Shark)

    10. Giraph

  3. Chapter 3 Serialization

    1. Avro

    2. JSON

    3. Protocol Buffers (protobuf)

    4. Parquet

  4. Chapter 4 Management and Monitoring

    1. Ambari

    2. HCatalog

    3. Nagios

    4. Puppet

    5. Chef

    6. ZooKeeper

    7. Oozie

    8. Ganglia

  5. Chapter 5 Analytic Helpers

    1. MapReduce Interfaces

    2. Analytic Libraries

    3. Pig

    4. Hadoop Streaming

    5. Mahout

    6. MLLib

    7. Hadoop Image Processing Interface (HIPI)

    8. SpatialHadoop

  6. Chapter 6 Data Transfer

    1. Sqoop

    2. Flume

    3. DistCp

    4. Storm

  7. Chapter 7 Security, Access Control, and Auditing

    1. Sentry

    2. Kerberos

    3. Knox

  8. Chapter 8 Cloud Computing and Virtualization

    1. Serengeti

    2. Docker

    3. Whirr