Data Science Master Kit

Everything you need to become a data science expert

If you're ready to take your data science skills to the next level, the Data Science Master Kit walks you through some of the most challenging aspects of data science you might face: from designing data-intensive applications and scalable Hadoop architectures to full-text search, real-time streaming, and advanced analytics. The skills you'll learn in this comprehensive kit are what separate ordinary data wranglers from the experts.

Buy any two titles and get the third free with discount code OPC10

Or, get them all for $170.20 (60% savings)

Designing Data-Intensive Applications

Designing Data-Intensive Applications: This book examines the key principles, algorithms, and trade-offs of data systems, using the internals of various popular software packages and frameworks as examples.

Advanced Analytics with Spark

Advanced Analytics with Spark: In this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with Spark. The authors bring Spark, statistical methods, and real-world data sets together to teach you how to approach analytics problems by example.

Hadoop: The Definitive Guide

Hadoop: The Definitive Guide: With the fourth edition of this comprehensive guide, you'll learn how to build and maintain reliable, scalable, distributed systems with Apache Hadoop. This book is ideal for programmers looking to analyze datasets of any size, and for administrators who want to set up and run Hadoop clusters.

HBase: The Definitive Guide

HBase: The Definitive Guide: If you're looking for a scalable storage solution to accommodate a virtually endless amount of data, this ebook shows you how Apache HBase can meet your needs. Modeled after Google's Bigtable architecture, HBase scales to billions of rows and millions of columns, while ensuring that write and read performance remain constant.

Hadoop Application Architectures

Hadoop Application Architectures: While many sources explain how to use various components in the Hadoop ecosystem, this practical book takes you through architectural considerations necessary to tie those components together into a complete tailored application, based on your particular use case.

Introduction to Apache Kafka

Introduction to Apache Kafka: In this video course, host Gwen Shapira from Cloudera shows developers and administrators how to integrate Kafka into a data processing pipeline.

Elasticsearch: The Definitive Guide

Elasticsearch: The Definitive Guide: This practical guide not only shows you how to search, analyze, and explore data with Elasticsearch, but also helps you deal with the complexities of human language, geolocation, and relationships.

Large-scale Real-time Stream Processing and Analytics

Large-scale Real-time Stream Processing and Analytics: In this unique O’Reilly video collection—taken from live sessions at Strata + Hadoop World 2015 in San Jose, California—you’ll learn about several analytics tools and event mining techniques from experts in the field.

Strata + Hadoop World

San Jose · London · Beijing · New York · Singapore

Tap into the collective intelligence of the leading minds in data: decision makers using the power of big data to drive business strategy, and practitioners who collect, analyze, and manipulate data. Strata gives you the skills, tools, and technologies you need to make data work today, along with the insights and visionary thinking O'Reilly is known for.
