Strata Conference New York + Hadoop World 2014: Video Compilation

Video description

Use the power of big data to drive business strategy

What happens when cutting-edge data science and new business fundamentals intersect? Find out with this complete video compilation of Strata + Hadoop World 2014 in New York, where you’ll get a front-row seat to every keynote, workshop, and session.

Ten conference tracks were required to capture the most challenging problems and compelling opportunities in data today, with presentations from Mike Olson (Cloudera), Kim Rees (Periscopic), Roger Magoulas (O'Reilly), Douglas Merrill (ZestFinance), Amanda Cox (The New York Times), and scores of other experienced data practitioners from finance, media, government, and education.

Download these videos or stream them through our HD player, and gain a clear perspective on the future of big data, including all the analytics, architectures, techniques, tools, and technologies you need to use data successfully.

Tracks include:

  • Business & Industry: How organizations of all sizes use data to make better decisions
  • Connected World: Navigating in an always-connected, always-on world
  • Data Science: Everything from the latest algorithms and advances in machine learning to cultural change and team-building
  • Design & Interfaces: Capturing user experience, design, new interfaces, and visualization
  • Law, Ethics & Open Data: Issues on governance, ethics, and compliance in the era of open data
  • Machine Data: Extracting meaningful insights from data collected and generated by things
  • Security: Fighting fraud, detecting threats, increasing trust—and securing data
  • Beyond Hadoop: How tools like Cassandra, Storm, Accumulo, Kafka and Spark fit in the data science toolkit
  • Hadoop in Action: Real-world case studies of the Hadoop ecosystem in action
  • The Hadoop Platform: A deep dive into the dominant big data stack, with practical lessons and integration tricks

Publisher resources

View/Submit Errata

Table of contents

  1. Keynotes
    1. Open Standards and the Modern Data Center - Mike Olson
    2. What Would Google Do? Understanding the Future of Big Data - M. C. Srivas
    3. Keynote with Miriah Meyer
    4. Accelerating Parkinson’s Research with Big Data Technologies - Ron Kasabian
    5. Data The New Era of Interactive Storytelling - Sharmila Shahani-Mulligan
    6. Spark Needs a Business Analyst Workflow - Ben Werther
    7. Statistics Without the Agonizing Pain - John Rauser
    8. Pax Data - Eli Collins
    9. The Power of Emotions: When Big Data meets Emotion Data - Rana El Kaliouby
    10. A New Data Science Economy - Joseph Sirosh
    11. Style Stalking: The Stochastic Patterns that Drive Fashion Trends - Karen Moon
    12. Pasta Mathematica - George Legendre
    13. Big Data - 2020 vision - John Schitka
    14. Turning Data into Decisions in a Big Data World - Rachel Hawley
    15. The Hidden Brain - Shankar Vedantam
    16. A Word Too Much Repeated Falls Out of Being - So Why is Big Data Being Talked About so Much? - Paul Zikopoulos
    17. Is Privacy Becoming a Luxury Good? - Julia Angwin
  2. Business Industry
    1. Building Privacy Protected Data Systems - Ari Gesher, John Grant, and Courtney Bowman - Part 1
    2. Building Privacy Protected Data Systems - Ari Gesher, John Grant, and Courtney Bowman - Part 2
    3. Building Privacy Protected Data Systems - Ari Gesher, John Grant, and Courtney Bowman - Part 3
    4. Building Privacy Protected Data Systems - Ari Gesher, John Grant, and Courtney Bowman - Part 4
    5. Just Enough Math - Paco Nathan and Allen Day - Part 1
    6. Just Enough Math - Paco Nathan and Allen Day - Part 2
    7. Just Enough Math - Paco Nathan and Allen Day - Part 3
    8. Just Enough Math - Paco Nathan and Allen Day - Part 4
    9. Solving the Right Problem - Max Shron and Sasha Laundy
    10. Transforming to a Data Driven Operations Model - Denise Asplund
    11. From Experiments to Insights at Pinterest - Andrea Burbank
    12. Case Study: -A Forensic Look at Success and Failure of Predictive Analytics in Healthcare - Eugene Kolker
    13. The Open Data 500: Building Businesses on Free Government Data - Joel Gurin and Laura Manley
    14. Decided by Data: Case Studies from a Data Driven Product Culture - Nellwyn Thomas
    15. Preemptive Shipping: How Gilt Predicts Which Customers Will Buy Products It Has Never Sold Before - Igor Elbert
    16. What are VCs Really Looking For? - Michael Dauber, Renee DiResta, Matt Turck, James Cham, and Jake Flomenberg
    17. PDF Prison Break: Freeing Data, Empowering Experts at Edmunds.com - John Akred and Karim Qazi
    18. Fashioning Fit: Determining Fit Through Data - Liza Kindred, David Whittemore, Gina Mancuso, and Rasmus Thofte
    19. From Runway to Database, the Season's Hottest Fashion: Data - Rachel Kalmar
    20. How Public Data Creates Revenue for a Scandinavian Retailer - Majken Sander
  3. Connected World
    1. Generating Possible A/B Tests for Uber Via a City Simulation Framework - Bradley Voytek
    2. The State GeoSpatial BigData - Mansour Raad
    3. Architecting World's Largest Biometric Identity System - Aadhaar Experience - Pramod Varma
    4. Pairing EMR Data with an Open Commons to Engage Communities, Provide Work Force Development and Predict Community Health Futures - Brigitte Piniewski
    5. Nanocubes: Interactive Visual Exploration of Large, Geospatial, Temporal Datasets - Lauro Lins
  4. Data Science
    1. Data Science at the Command Line - Jeroen Janssens - Part 1
    2. Data Science at the Command Line - Jeroen Janssens - Part 2
    3. Data Science at the Command Line - Jeroen Janssens - Part 3
    4. Data Science at the Command Line - Jeroen Janssens - Part 4
    5. Becoming a Scalable Data Scientist - Alice Zheng
    6. All the Data and Still Not Enough! - Claudia Perlich
    7. The Great Debate: If You Can't Code, You Can't Be a Data Scientist - Joseph Adler, Hilary Mason, Scott Nicholson, Lucian Lita, and Roger Magoulas
    8. Data Science Bootcamp - Laurie Skelly
    9. The Day Zach Galifianakis Saved Healthcare - Chris Harland
    10. Computing Professional Identity for the Economic Graph - Vitaly Gordon
    11. Multi-language Data Science with IPython, IJulia, IR, and Friends - Brian Granger and Fernando Pérez
    12. Using Data Science on Internet Search Behavior as a Proxy for Human Behavior - Juan Miguel Lavista
    13. AI in 2014: Progress and Problems - Beau Cronin
    14. Big Data Anti-Patterns - Douglas Moore
    15. Machine Learning system architecture – Microsoft Translator, a Case Study - Vishal Chowdhary
    16. Secure Machine Learning - Bahman Bahmani
    17. Fashioning Data: The Balance Between Creativity and Data-Driven Decisions - Karen Moon, Vijay Subramanian, and Liza Kindred
    18. Distributed Gradient Boosting Machine - Cliff Click
    19. Deploying and Evaluating Data Products - Josh Levy
  5. Design Interfaces
    1. D3.js Tutorial - D3 For Everyone! - Sebastian Gutierrez - Part 1
    2. D3.js Tutorial - D3 For Everyone! - Sebastian Gutierrez - Part 2
    3. D3.js Tutorial - D3 For Everyone! - Sebastian Gutierrez - Part 3
    4. D3.js Tutorial - D3 For Everyone! - Sebastian Gutierrez - Part 4
    5. Visual Change: The Power of Scaled Data Visualization in Action - Nathan Shetterley, Joshua Patterson, Allan Enemark, and Kathleen Moynahan
    6. The Future of Storytelling in Data Communication - Andrew Hill
    7. Graphistry: Scaling Visual Exploration with GPUs and Design - Leo Meyerovich
    8. Design and Data, A Human Centered Approach to Analysis, Experiment Design, and Visualization - Arianna McClain and Alisa Lemberg
    9. Visualization Typography: Designing Legends, Labels, Titles, and Text - Trina Chiasson
  6. Hadoop Beyond
    1. Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin and Helena Edelson - Part 1
    2. Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin and Helena Edelson - Part 2
    3. Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin and Helena Edelson - Part 3
    4. Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin and Helena Edelson - Part 4
    5. Tackling Data Curation in Three Generations - Michael Stonebraker
    6. Advantages of a Domain-Specific Language Approach to Data Transformation - Joe Hellerstein and Sean Kandel
    7. Stories from the Trenches: The Challenges of Building an Analytics Stack - Fangjin Yang and Xavier Léauté
    8. Tachyon: A Memory Centric Storage System for Big Data Computing - Haoyuan Li
    9. Anomaly Detection with Apache Spark - Sean Owen
    10. Mixing Structured Data and Analytics with Spark SQL - Michael Armbrust
    11. Interactive Visual Data Exploration with Spark - Hossein Falaki
    12. Open Source Real Time BI using Storm, Hadoop, Titan, Druid D3 - Anil Madan
    13. Highly Scalable Tile-Based Visualization for Exploratory Data Analysis - David Jonker and Rob Harper
  7. Hadoop Platform
    1. Building A Data Platform - Stephen O'Sullivan, John Akred, and Richard Williamson - Part 1
    2. Building A Data Platform - Stephen O'Sullivan, John Akred, and Richard Williamson - Part 2
    3. Building A Data Platform - Stephen O'Sullivan, John Akred, and Richard Williamson - Part 3
    4. Building A Data Platform - Stephen O'Sullivan, John Akred, and Richard Williamson - Part 4
    5. From Raw Data to Analytics with No ETL - Marcel Kornacker and Lenni Kuff
    6. SQL on Everything, in Memory - Julian Hyde
    7. From Oracle to Hadoop - Guy Harrison, David Robson, and Kathleen Ting
    8. Hive on Apache Tez: Benchmarked at Yahoo! Scale - Mithun Radhakrishnan
    9. Scaling Storm: Cluster Sizing and Performance Optimization - P. Taylor Goetz
    10. Building Real-time Data Products at LinkedIn with Apache Samza - Martin Kleppmann
    11. HBase: Where Online Meets Low Latency - Nick Dimiduk and Nicolas Liochon
    12. Apache HBase Application Archetypes - Jonathan Hsieh and Lars George
    13. Hadoop Operations - Best Practices from the Field - Chris Nauroth and Suresh Srinivas
    14. Resource Management with YARN - Anubhav Dhoot
    15. Bulk Loading Your Big Data into Apache HBase, a Full Walkthrough - Jean-Daniel Cryans
    16. An Independent Comparison of Open Source SQL-on-Hadoop - Greg Rahn
    17. Bringing PyData to Impala - Uri Laserson
  8. Hadoop in Action
    1. Architectural Considerations for Hadoop Applications - Mark Grover, Jonathan Seidman, Gwen Shapira, and Ted Malaska - Part 1
    2. Architectural Considerations for Hadoop Applications - Mark Grover, Jonathan Seidman, Gwen Shapira, and Ted Malaska - Part 2
    3. Architectural Considerations for Hadoop Applications - Mark Grover, Jonathan Seidman, Gwen Shapira, and Ted Malaska - Part 3
    4. Architectural Considerations for Hadoop Applications - Mark Grover, Jonathan Seidman, Gwen Shapira, and Ted Malaska - Part 4
    5. Getting Started with HBase Application Development - Sridhar Reddy and Carol McDonald - Part 1
    6. Getting Started with HBase Application Development - Sridhar Reddy and Carol McDonald - Part 2
    7. Getting Started with HBase Application Development - Sridhar Reddy and Carol McDonald - Part 3
    8. Getting Started with HBase Application Development - Sridhar Reddy and Carol McDonald - Part 4
    9. How Goldman Sachs is Using Knowledge to Create an Information Edge - Peter Ferns
    10. Customer Intelligence: Harnessing Elephants at Transamerica - Stephen Lloyd, Vishal Bamba, and David Beaudoin
    11. Transitioning from Original Big Data to the New Big Data: L.L.Bean’s Journey - Chris Wilson and Doug Bryan
    12. Unlocking Big Data at CERN - Matthias Braeger and Manish Devgan
    13. Big Data Modeling: How FICO is Turning DBAs and into Data Engineers - Lelanie Moll, Deb Brooks, and Silaphet Mounkhaty
    14. How LinkedIn Democratizes Big Data Visualization - Praveen Neppalli Naga, Chi-Yi Kuan, and Jonathan Wu
    15. Better Care with Big Data: A Panel Discussion - Ryan Goldman, Ryan Brush, Sabrina Dahlgren, Aashima Gupta, and Michael Thompson
    16. Renaissance in Medicine: Next-Generation Big Data Workloads - Allen Day
    17. Image Processing on Hadoop - Ailey Crow
    18. The Next Generation of Big Data in the Cloud - Daniel Weeks
    19. Building an Enterprise Data Hub to Bridge the Gap Between Business and IT - Sabrina Dahlgren and Rajiv Synghal
  9. Law, Ethics Open Data
    1. Better Accountability Through Open Data - Merici Vinton and Micheál Keane
    2. Wonk, Meet Geek - Jim Adler
    3. You Have Zero Privacy, You Own Your Data, and Other Myths - Gilad Rosner
    4. Homelessness Prevention by the Numbers - Stefan Heeke and Adeen Flinker
    5. Why Big Data Needs Thick Data - Tricia Wang and Matt LeMay
  10. Machine Data
    1. Connectivity, Real-Time Data, and Edge Analytics to Enable Intelligent Machines for the Industrial Internet - Alisher Maksumov and Jean Lau
    2. Data is a Local Problem - Alasdair Allan
    3. Super Simple Internet of Things Backend: Persistence Post Hadoop with Crate Data - Jodok Batlogg
    4. SmartCity StreamApp: An Internet of Things Service for Real-time Traffic Management - Damian Black
  11. Security
    1. Resolving Data Inaccuracy - Mike Armstrong
    2. Big Data vs Zombies: Using Algorithms, Big Data, and Large Scale Distributed Processing to Combat Identity Fraud - Jesse Shaw
    3. Why Should Anyone Care at All about Privacy, Privacy Engineering, or Data? - Michelle Dennedy
    4. Real-Time Cyber Threat Detection with Sqrrl and Spark - Adam Fuchs
    5. Big Data Framework for Anomaly Detection Root Cause Analysis on Streaming Time Series Data - Roy Singh
  12. Enterprise Adoption
    1. In the Data Lake - Barry Devlin
    2. Unseating the Giants - Monte Zweben
    3. What’s Holding Up Your Hadoop? - Eddie Garcia
  13. Spark Camp
    1. Spark Camp - Paco Nathan and Patrick Wendell - Part 1
    2. Spark Camp - Michael Armbrust - Part 2
    3. Spark Camp - Joseph Bradley - Part 3
    4. Spark Camp - Tathagata Das - Part 4
    5. Spark Camp - Sameer Farooqui and Holden Karau - Part 5
    6. Spark Camp - Sameer Farooqui and Holden Karau - Part 6
    7. Spark Camp - Sameer Farooqui and Holden Karau - Part 7
    8. Spark Camp - Sameer Farooqui and Holden Karau - Part 8
  14. Hardcore Data Science
    1. Doing the Impossible (Almost) - Ted Dunning
    2. Tupleware: Redefining Modern Analytics - Tim Kraska
    3. Data Science for Humans, Not Robots - Alice Zheng
    4. Big Data: Efficient Collection and Processing - Anna Gilbert
    5. Computational Problems in Managing Social Information - Jon Kleinberg
    6. Small Data Problems - Kira Radinsky
    7. Building and Deploying Large-scale Machine Learning Pipelines Using the Berkeley Data Analytics Stack - Ben Recht
    8. Learning About Music and Listeners - Brian Whitman
    9. Statistical Topic Modeling - Hanna Wallach
    10. The Aha! Moment: From Data to Insight - Dafna Shahaf
  15. Data-Driven Business Day
    1. Designing for Interruption - Alistair Croll
    2. Check Your Bias, Feed Your Empathy - Farrah Bostic
    3. The Data Lake Dream - Edd Dumbill
    4. Why Marketing’s Approach to Big Data is All Wrong - Jennifer Zeszut
    5. Bigger is Better, but at What Cost? Towards Understanding the Economic Value of Data - Brian d'Alessandro
    6. The Sounds of (Data) Silence - Jana Eggers
    7. Panel: Deciding Better - Joe Caserta, Farrah Bostic, and Halle Tecco
    8. Making Strategic Decisions: Business Requirements for Analytics Projects - Joy Beatty
    9. The Future of Data - Kim Rees
    10. How Goldman Sachs is Using Knowledge to Create an Information Edge - Peter Ferns
    11. The Big (Data) Picture - Rohit Jain
    12. Improving Healthcare Business Strategies through Lean Data Partnerships - Brigitte Piniewski
    13. Building with Data: Lessons from Etsy - Nellwyn Thomas
    14. Reducing Employee Turnover by 75%: Applying Data and Predictive Analytics to Hiring and Team Assembly - Michael Rosenbaum
    15. Better Accountability Through Open Data - Merici Vinton
    16. The Unit: Building Data Science Teams the Special Operations Way - Amy Gaskins
    17. MapReduce ETL Processing for Healthcare Process Improvement Dashboards - Mary Ann Wayer
  16. Industrial Internet
    1. Industrial Internet Day Opening Remarks - Jon Bruner
    2. Taking the Industrial Internet to the Ends of the Earth - Daniel Koffler
    3. Oceans 2.0: The Last Remaining Wild West - Ami Daniel
    4. Big Data Analytics: Enabling Innovation while Reducing Risk - David Simchi-Levi
    5. Video Analytics in the Big Fast Streaming Data Era - Victor Fang and Yu Cao
    6. The Industrial Internet and the Data Revolution - Nathan Oostendorp
    7. Bring Your Own Internet (of Things) - Alasdair Allan
    8. IIOT Applied: 10 Things I Learned While Deploying an IIoT Machine Learning System - Cameron Turner
    9. Industrial Internet Day Closing Panel - Jon Bruner, Leo Spiegel, Edy Liongosari, and Mark Grabb
  17. PyData at Strata
    1. IPython - Brian Granger and Fernando Pérez
    2. Collaborative Data Science with coLaboratory - Kayur Patel and Kester Tong
    3. Intro to NumPy and matplotlib - Jake Vanderplas - Part 1
    4. Intro to NumPy and matplotlib - Jake Vanderplas - Part 2
    5. Introduction to Machine Learning with IPython and scikit-learn - Olivier Grisel - Part 1
    6. Introduction to Machine Learning with IPython and scikit-learn - Olivier Grisel - Part 2
    7. Visualizing Data with Blaze and Bokeh - Andy Terrel
    8. Interactive Visualization with Bokeh - Peter Wang
    9. SciPy – An Exploration of the Most Useful Bits - Travis Oliphant - Part 1
    10. SciPy – An Exploration of the Most Useful Bits - Travis Oliphant - Part 2
    11. New and Upcoming Features in Pandas - Wes McKinney
    12. High Performance Python - Trent Nelson
  18. Sponsored
    1. Got the T-shirt: Real Experiences from a Hadoop Veteran - Jim Scott
    2. See the Fastest Spark-Powered Disparate Data Blending Analysis Solution - Vaibhav Nivargi
    3. Disrupting the Traditional Analyst Workflow with Platfora and Spark - Peter Schlampp and Ed Smith
    4. Big Data Architectural Patterns - Todd Papaioannou
    5. An End-to-End Approach to Offloading the Data Warehouse with Hadoop - Jorge A Lopez
    6. Global Hadoop: Storage and Compute Challenges in Multi-Data Center Deployments - Jagane Sundar and Brett Rudenstein
    7. Using Graph to Discover Unseen Relationships in Big Data - Mike Hoskins
    8. Hadoop Effortlessly: A Data Inventory is Key to Data Self-service - Moderated by: Alex Gorelik - Panelists: Suresh Srinivas, Mike Sutten, John Mount, Clark Farrey, and Sunil Soares
    9. Building Real-Time Platforms with MemSQL and Apache Spark - Eric Frenkiel
    10. Unlocking Hadoop’s Potential with YARN - Sanjay Radia
    11. Real-time streaming and analytics with Amazon Elastic MapReduce and Amazon Kinesis - Steve McPherson
    12. NoSQL Solutions for Big Data Problems - Don Pinto
    13. Big Data SQL and Query Franchising: An Architecture for SQL Beyond Hadoop - Dan McClary
    14. Drive Data Quality at Your Company: Create a Data Lake - George Corugedo
    15. Important Advances in Hadoop: A Panel Discussion - Joey Jablonski, Armando Costa, Jim Burmingham, and Rob Johnson
    16. Cloud Machine Learning - Joseph Sirosh
    17. Embracing Diversity - Sid Sipes
    18. The Art of Prediction: Seamless Visualization and Modeling With Hadoop - Adam Pilz
    19. Extending "Variety" of Data to "Variety" of Users - Tina Groves
    20. How to Architect Big Data Apps with the Lambda Architecture - with Real Work Examples on Merging Batch and Real-Time Processing - Altan Khendup and Ron Bodkin
    21. What do Al Capone Hadoop Have in Common? Visualizing Data at Scale – Making Sense Out of Big Data - James Dixon
    22. Distributed R - A Scalable and High-performance Platform for R - Sunil Venkayala and Indrajit Roy
    23. Getting Big Data to Work: Agile Data Transformation in Hadoop - Stephanie McReynolds, Xavier Quintuna, Shirshanka Das, Charlie Crocker, and Anna Dorofiyenko
    24. Now Playing at Netflix: Advanced Decision-Making with Hadoop, Starring MicroStrategy - Michael Hiskey
    25. Analytics the Way Nature Intended - Donald Farmer
    26. Western Union: Implementing a Hadoop-based Enterprise Data Hub with Informatica - Pravin Darbare and Sumeet Agrawal
    27. For Red Hat, it's 1994 All Over Again - Sarangan Rangachari
    28. Hadoop Responsibly with Big Data Governance - Moderated by: Barry Devlin - Panelists: Sunil Soares, Joseph Dossantos, and Jay Zaidi
    29. Big Content: Finding the Why Behind the What - Sid Probstein
  19. Solutions Showcase Theater
    1. Innovative Healthcare, Tech Retail Companies Mix CRM Info with Big Data to Make Reps 10x More Productive, 40x More Useful and 30% More Profitable - Michael Hiskey
    2. Real-time Classification and Sentiment Analysis of Multi-lingual Content Using Advanced Analytics on Apache Storm - Anand Venugopal
    3. Hadoop at Bloomberg - Sudarshan Kadambi
    4. EVP Data Lake: Store Everything, Analyze Anything, Build What You Need - Ryan Peterson
    5. 10 Amazing Things to do With A Hadoop-based Data Lake - Greg Chase
    6. Solve Data Ingest Limitation with High Performance Networks Offloads - Asaf Wachtel
    7. Real-Time Big Data Architecture @ LivePerson - Shane K. Johnson
    8. From Infrastructure to Data Applications - Jonathan Gray
    9. From Big Iron to Big Data: Offloading Data Workloads to Hadoop at a Major US Bank - Jorge A. Lopez
    10. Managing Data in Regulated Industries - Jim Clark
    11. The Pain Curve - Lack of Automation Leads to Failure - Greg Bruno
    12. Building the Enterprise Data Hub - Joe Caserta
    13. QlikView and Big Data Analytics at King - Donald Farmer
    14. Driving Growth in Transportation Using Big Data and Data Science - Marie Goodell
    15. Competitiveness in the Age of Big Data - Satyendra Rana
    16. Unraveling Hadoop's Meltdown Mysteries - Sean Suchter
    17. Let's Stop Pretending that One Size Fits All When it Comes to the Challenges of Working with Enterprise Data - Nenshad Bardoliwalla
    18. Waking Analysts from their Nightmare - George Corugedo
    19. "Mining" the IoT for Business Value: How WWT Helped One of the Largest Mining Companies Predict Engine Failures - Yoni Malchi
    20. All Hands on Deck: How to Get Non-technical Business Users to Tackle Big Data so you Can Focus on Complex Queries - Amit Bendov
    21. The Spark-Inspired Workflow - Kevin Beyer
    22. Do you Prefer to Hike up Machu Pichu or Take the Train? - Todd Goldman
    23. Using Big Data to Improve Patient Outcomes - John Armstrong
    24. Get Real with Hadoop - Jim Scott
    25. Big Data Analytics Heavyweight Sounds Off on Financial Services Use Cases - Matt Schumpert
    26. Real World Showcase of How a Retail Customer Uses and Can Use Microsoft Big Data and Business Analytics Technologies - Sanjay Soni
    27. Using Hadoop to Run Real-Time, Operational Applications - Rich Reimer
    28. Automated Data Inventory for Hadoop - Oliver Claude
    29. Keys to Optimizing Product Inventory and Pricing at One of the Largest Global Retailers - Julien Sauvage
    30. Consumer Behavior Analytics with Cubes on Hadoop - Ajay Anand
    31. Omneo’s Enterprise Data Hub: Helping Manufacturers Save Millions - Kathleen deValk
    32. Building an Enterprise Grade Big Data Risk Management Solution for Financial Services - Vamsi Chemitiganti
    33. Orange Silicon Valley spins up private Big Data as a Service with BlueData to create on-demand Spark and Hadoop Clusters - Tom Phelan
    34. Everything You Don't Know About HBase in 10 Minutes or Less - Alex Newman
    35. Big Data News Cases… What in the World are People Doing with Hadoop? - Gord Sissons
    36. Build Intelligent Applications with H20's Open Source - Joel Horwitz
    37. NoSQL Key Value Stores - The Key to Velocity - Brian Bulkowski
    38. Java Big Data in Real Time - Matt Schuetze
    39. Using Operational Intelligence to Track 10M Cable TV Viewers in Real Time - Dr. William Bain
    40. Unlock the Value of Big Data with Hunk for Hadoop - Adrish Sannyasi
    41. Big Cybersecurity Data for Insider Threat Analysis - Joe Travaglini
    42. Customer Spotlight: Big Data, The Elephant and the Bear - Lawrence Schwartz
    43. Case Study: Improving Customer Experience by Employing Big Data Technologies in the Banking Industry - Martin Triska
    44. Better Manufacturing with Data: Using 3D Visual Analytics on the Shop Floor - Carl Byers
    45. MemSQL Shutterstock: Insights in Real Time - Eric Frenkiel and Chris Fischer
    46. Running In-Memory Jobs and Traditional Jobs on the Same Hadoop Cluster - David Chaiken
    47. Data Transformation on Hadoop: Balancing Technology and Human Needs to Boost Performance and Increase ROI - Ravi Hubbly
    48. Big Data Analytics / IoT: New Customer Insights Using Network Data - Ankur Gupta
    49. Extending Enterprise Data Security to Hadoop - Raul Ortega
    50. Industrialized Hadoop Analytics and SQL: Unleashing the Business User - John Santaferraro
    51. Connection Analytics: Extracting Value from Social Networks Data - Sri Raghavan
    52. The Emergence of the Streamlined Data Refinery - Chuck Yarbrough
    53. Hardware Still Matters: Manageable Infrastructure Platforms for Dynamic Big Data Environments - Robert Novak

Product information

  • Title: Strata Conference New York + Hadoop World 2014: Video Compilation
  • Author(s):
  • Release date: November 2014
  • Publisher(s): O'Reilly Media, Inc.
  • ISBN: 9781491900345