Books & Videos

Table of Contents

  1. Chapter 1 Introduction

    1. What Is Pig?

    2. Pig’s History

  2. Chapter 2 Installing and Running Pig

    1. Downloading and Installing Pig

    2. Running Pig

  3. Chapter 3 Grunt

    1. Entering Pig Latin Scripts in Grunt

    2. HDFS Commands in Grunt

    3. Controlling Pig from Grunt

  4. Chapter 4 Pig’s Data Model

    1. Types

    2. Schemas

  5. Chapter 5 Introduction to Pig Latin

    1. Preliminary Matters

    2. Input and Output

    3. Relational Operations

    4. User Defined Functions

  6. Chapter 6 Advanced Pig Latin

    1. Advanced Relational Operations

    2. Integrating Pig with Legacy Code and MapReduce

    3. Nonlinear Data Flows

    4. Controlling Execution

    5. Pig Latin Preprocessor

  7. Chapter 7 Developing and Testing Pig Latin Scripts

    1. Development Tools

    2. Testing Your Scripts with PigUnit

  8. Chapter 8 Making Pig Fly

    1. Writing Your Scripts to Perform Well

    2. Writing Your UDF to Perform

    3. Tune Pig and Hadoop for Your Job

    4. Using Compression in Intermediate Results

    5. Data Layout Optimization

    6. Bad Record Handling

  9. Chapter 9 Embedding Pig Latin in Python

    1. Compile

    2. Bind

    3. Run

    4. Utility Methods

  10. Chapter 10 Writing Evaluation and Filter Functions

    1. Writing an Evaluation Function in Java

    2. Algebraic Interface

    3. Accumulator Interface

    4. Python UDFs

    5. Writing Filter Functions

  11. Chapter 11 Writing Load and Store Functions

    1. Load Functions

    2. Store Functions

  12. Chapter 12 Pig and Other Members of the Hadoop Community

    1. Pig and Hive

    2. Cascading

    3. NoSQL Databases

    4. Metadata in Hadoop

  1. Appendix Built-in User Defined Functions and Piggybank

    1. Built-in UDFs

    2. Piggybank

  2. Appendix Overview of Hadoop

    1. MapReduce

    2. Hadoop Distributed File System

  3. Colophon