Books & Videos

Table of Contents

  1. Setup

    1. Chapter 1 Theory

      1. Agile Big Data
      2. Big Words Defined
      3. Agile Big Data Teams
      4. Agile Big Data Process
      5. Code Review and Pair Programming
      6. Agile Environments: Engineering Productivity
      7. Realizing Ideas with Large-Format Printing
    2. Chapter 2 Data

      1. Email
      2. Working with Raw Data
      3. SQL
      4. NoSQL
      5. Data Perspectives
    3. Chapter 3 Agile Tools

      1. Scalability = Simplicity
      2. Agile Big Data Processing
      3. Setting Up a Virtual Environment for Python
      4. Serializing Events with Avro
      5. Collecting Data
      6. Data Processing with Pig
      7. Publishing Data with MongoDB
      8. Searching Data with ElasticSearch
      9. Reflecting on our Workflow
      10. Lightweight Web Applications
      11. Presenting Our Data
      12. Conclusion
    4. Chapter 4 To the Cloud!

      1. Introduction
      2. GitHub
      3. dotCloud
      4. Amazon Web Services
      5. Instrumentation
  2. Climbing the Pyramid

    1. Chapter 5 Collecting and Displaying Records

      1. Putting It All Together
      2. Collect and Serialize Our Inbox
      3. Process and Publish Our Emails
      4. Presenting Emails in a Browser
      5. Agile Checkpoint
      6. Listing Emails
      7. Searching Our Email
      8. Conclusion
    2. Chapter 6 Visualizing Data with Charts

      1. Good Charts
      2. Extracting Entities: Email Addresses
      3. Visualizing Time
      4. Conclusion
    3. Chapter 7 Exploring Data with Reports

      1. Building Reports with Multiple Charts
      2. Linking Records
      3. Extracting Keywords from Emails with TF-IDF
      4. Conclusion
    4. Chapter 8 Making Predictions

      1. Predicting Response Rates to Emails
      2. Personalization
      3. Conclusion
    5. Chapter 9 Driving Actions

      1. Properties of Successful Emails
      2. Better Predictions with Naive Bayes
      3. P(Reply | From & To)
      4. P(Reply | Token)
      5. Making Predictions in Real Time
      6. Logging Events
      7. Conclusion
  1. Colophon