Books & Videos

Table of Contents

  1. Chapter 1 Introduction to Amazon Elastic MapReduce

    1. Amazon Web Services Used in This Book

    2. Amazon Elastic MapReduce

    3. Amazon EMR and the Hadoop Ecosystem

    4. Amazon Elastic MapReduce Versus Traditional Hadoop Installs

    5. Application Building Blocks

  2. Chapter 2 Data Collection and Data Analysis with AWS

    1. Log Analysis Application

    2. Log Messages as a Data Set for Analytics

    3. Understanding MapReduce

    4. Collection Stage

    5. Simulating Syslog Data

    6. Developing a MapReduce Application

    7. Custom JAR MapReduce Job

    8. Running an Amazon EMR Cluster

    9. Viewing Our Results

    10. Debugging a Job Flow

    11. Our Application and Real-World Uses

  3. Chapter 3 Data Filtering Design Patterns and Scheduling Work

    1. Extending the Application Example

    2. Understanding Web Server Logs

    3. Finding Errors in the Web Logs Using Data Filtering

    4. Building Summary Counts in Data Sets

    5. Job Flow Scheduling

    6. Scheduling with AWS Data Pipeline

    7. Real-World Uses

  4. Chapter 4 Data Analysis with Hive and Pig in Amazon EMR

    1. Amazon Job Flow Technologies

    2. What Is Pig?

    3. Utilizing Pig in Amazon EMR

    4. What Is Hive?

    5. Utilizing Hive in Amazon EMR

    6. Our Application with Hive and Pig

  5. Chapter 5 Machine Learning Using EMR

    1. A Quick Tour of Machine Learning

    2. Python and EMR

    3. What’s Next?

  6. Chapter 6 Planning AWS Projects and Managing Costs

    1. Developing a Project Cost Model

    2. Optimizing AWS Resources to Reduce Project Costs

    3. Amazon Tools for Estimating Your Project Costs

  1. Appendix Amazon Web Services Resources and Tools

    1. Amazon AWS Online Resources

    2. Amazon AWS Cost Estimation Tools

    3. AWS Best Practices and Architecture

    4. Amazon EMR Distributions

  2. Appendix Cloud Computing, Amazon Web Services, and Their Impacts

    1. AWS Service Delivery Models

    2. Performance

    3. Elasticity and Growth

    4. Security

    5. Uptime and Availability

  3. Appendix Installation and Setup

    1. Prerequisites

    2. Installing Hadoop

    3. Building MapReduce Applications

    4. Running MapReduce Applications Locally

    5. Installing Pig

    6. Installing Hive

  4. Index

  5. Colophon