Books & Videos

Table of Contents

  1. Chapter 1 Terms

    1. Document-Oriented

    2. Key/Value Stores

    3. Horizontal or Vertical Scaling

    4. MapReduce

    5. Sharding

  2. Chapter 2 NoSQL Databases

    1. MongoDB

    2. CouchDB

    3. Cassandra

    4. Redis

    5. BigTable

    6. HBase

    7. Hypertable

    8. Voldemort

    9. Riak

    10. ZooKeeper

  3. Chapter 3 MapReduce

    1. Hadoop

    2. Hive

    3. Pig

    4. Cascading

    5. Cascalog

    6. mrjob

    7. Caffeine

    8. S4

    9. MapR

    10. Acunu

    11. Flume

    12. Kafka

    13. Azkaban

    14. Oozie

    15. Greenplum

  4. Chapter 4 Storage

    1. S3

    2. Hadoop Distributed File System

  5. Chapter 5 Servers

    1. EC2

    2. Google App Engine

    3. Elastic Beanstalk

    4. Heroku

  6. Chapter 6 Processing

    1. R

    2. Yahoo! Pipes

    3. Mechanical Turk

    4. Solr/Lucene

    5. ElasticSearch

    6. Datameer

    7. BigSheets

    8. Tinkerpop

  7. Chapter 7 NLP

    1. Natural Language Toolkit

    2. OpenNLP

    3. Boilerpipe

    4. OpenCalais

  8. Chapter 8 Machine Learning

    1. WEKA

    2. Mahout

    3. scikits.learn

  9. Chapter 9 Visualization

    1. Gephi

    2. GraphViz

    3. Processing

    4. Protovis

    5. Fusion Tables

    6. Tableau

  10. Chapter 10 Acquisition

    1. Google Refine

    2. Needlebase

    3. ScraperWiki

  11. Chapter 11 Serialization

    1. JSON

    2. BSON

    3. Thrift

    4. Avro

    5. Protocol Buffers