Publisher: O'Reilly Media
Final Release Date: July 2013
Pages: 414
Written by renowned data science experts Foster Provost and Tom Fawcett, Data Science for Business introduces the fundamental principles of data science, and walks you through the "data-analytic thinking" necessary for extracting useful knowledge and business value from the data you collect. This guide also helps you understand the many data-mining techniques in use today.
Based on an MBA course Provost has taught at New York University over the past ten years, Data Science for Business provides examples of real-world business problems to illustrate these principles. You’ll not only learn how to improve communication between business stakeholders and data scientists, but also how participate intelligently in your company’s data science projects. You’ll also discover how to think data-analytically, and fully appreciate how data science methods can support business decision-making.
- Understand how data science fits in your organization—and how you can use it for competitive advantage
- Treat data as a business asset that requires careful investment if you’re to gain real value
- Approach business problems data-analytically, using the data-mining process to gather good data in the most appropriate way
- Learn general concepts for actually extracting knowledge from data
- Apply data science principles when interviewing data science job candidates
|
-
Chapter 1 Introduction: Data-Analytic Thinking -
The Ubiquity of Data Opportunities -
Example: Hurricane Frances -
Example: Predicting Customer Churn -
Data Science, Engineering, and Data-Driven Decision Making -
Data Processing and “Big Data” -
From Big Data 1.0 to Big Data 2.0 -
Data and Data Science Capability as a Strategic Asset -
Data-Analytic Thinking -
This Book -
Data Mining and Data Science, Revisited -
Chemistry Is Not About Test Tubes: Data Science Versus the Work of the Data Scientist -
Summary -
Chapter 2 Business Problems and Data Science Solutions -
From Business Problems to Data Mining Tasks -
Supervised Versus Unsupervised Methods -
Data Mining and Its Results -
The Data Mining Process -
Implications for Managing the Data Science Team -
Other Analytics Techniques and Technologies -
Summary -
Chapter 3 Introduction to Predictive Modeling: From Correlation to Supervised Segmentation -
Models, Induction, and Prediction -
Supervised Segmentation -
Visualizing Segmentations -
Trees as Sets of Rules -
Probability Estimation -
Example: Addressing the Churn Problem with Tree Induction -
Summary -
Chapter 4 Fitting a Model to Data -
Classification via Mathematical Functions -
Regression via Mathematical Functions -
Class Probability Estimation and Logistic “Regression” -
Example: Logistic Regression versus Tree Induction -
Nonlinear Functions, Support Vector Machines, and Neural Networks -
Summary -
Chapter 5 Overfitting and Its Avoidance -
Generalization -
Overfitting -
Overfitting Examined -
Example: Overfitting Linear Functions -
* Example: Why Is Overfitting Bad? -
From Holdout Evaluation to Cross-Validation -
The Churn Dataset Revisited -
Learning Curves -
Overfitting Avoidance and Complexity Control -
Summary -
Chapter 6 Similarity, Neighbors, and Clusters -
Similarity and Distance -
Nearest-Neighbor Reasoning -
Some Important Technical Details Relating to Similarities and Neighbors -
Clustering -
Stepping Back: Solving a Business Problem Versus Data Exploration -
Summary -
Chapter 7 Decision Analytic Thinking I: What Is a Good Model? -
Evaluating Classifiers -
Generalizing Beyond Classification -
A Key Analytical Framework: Expected Value -
Evaluation, Baseline Performance, and Implications for Investments in Data -
Summary -
Chapter 8 Visualizing Model Performance -
Ranking Instead of Classifying -
Profit Curves -
ROC Graphs and Curves -
The Area Under the ROC Curve (AUC) -
Cumulative Response and Lift Curves -
Example: churnperformance analytics for modeling performance analytics, for modeling churn Performance Analytics for Churn Modeling -
Summary -
Chapter 9 Evidence and Probabilities -
Example: Targeting Online Consumers With Advertisements -
Combining Evidence Probabilistically -
Applying Bayes’ Rule to Data Science -
A Model of Evidence “Lift” -
Example: Evidence Lifts from Facebook "Likes" -
Summary -
Chapter 10 Representing and Mining Text -
Why Text Is Important -
Why Text Is Difficult -
Representation -
Example: Jazz Musicians -
* The Relationship of IDF to Entropy -
Beyond Bag of Words -
Example: Mining News Stories to Predict Stock Price Movement -
Summary -
Chapter 11 Decision Analytic Thinking II: Toward Analytical Engineering -
Targeting the Best Prospects for a Charity Mailing -
Our Churn Example Revisited with Even More Sophistication -
Chapter 12 Other Data Science Tasks and Techniques -
Co-occurrences and Associations: Finding Items That Go Together -
Profiling: Finding Typical Behavior -
Link Prediction and Social Recommendation -
Data Reduction, Latent Information, and Movie Recommendation -
Bias, Variance, and Ensemble Methods -
Data-Driven Causal Explanation and a Viral Marketing Example -
Summary -
Chapter 13 Data Science and Business Strategy -
Thinking Data-Analytically, Redux -
Achieving Competitive Advantage with Data Science -
Sustaining Competitive Advantage with Data Science -
Attracting and Nurturing Data Scientists and Their Teams -
Examine Data Science Case Studies -
Be Ready to Accept Creative Ideas from Any Source -
Be Ready to Evaluate Proposals for Data Science Projects -
A Firm’s Data Science Maturity -
Chapter 14 Conclusion -
The Fundamental Concepts of Data Science -
What Data Can’t Do: Humans in the Loop, Revisited -
Privacy, Ethics, and Mining Data About Individuals -
Is There More to Data Science? -
Final Example: From Crowd-Sourcing to Cloud-Sourcing -
Final Words -
Appendix Proposal Review Guide -
Business and Data Understanding -
Data Preparation -
Modeling -
Evaluation and Deployment -
Appendix Another Sample Proposal -
Scenario and Proposal -
Glossary -
Appendix Bibliography -
Index -
Colophon |
- Title:
- Data Science for Business
- By:
- Foster Provost, Tom Fawcett
- Publisher:
- O'Reilly Media
- Formats:
-
- Print
- Ebook
- Safari Books Online
- Print:
- August 2013
- Ebook:
- July 2013
- Pages:
- 414
- Print ISBN:
- 978-1-4493-6132-7
- | ISBN 10:
- 1-4493-6132-3
- Ebook ISBN:
- 978-1-4493-6131-0
- | ISBN 10:
- 1-4493-6131-5
|
-
Foster Provost Foster Provost is Professor and NEC Faculty Fellow at the NYU Stern School of Business, where he teaches in the MBA, Business Analytics, and Data Science programs. Former Editor-in-Chief for the journal Machine Learning, Professor Provost has co-founded several successful companies focusing on data science for marketing. View Foster Provost's full profile page. -
Tom Fawcett Tom Fawcett holds a Ph.D. in machine learning and has worked in industry R&D for more than two decades for companies such as GTE Laboratories, NYNEX/Verizon Labs, and HP Labs. His published work has become standard reading in data science both on methodology (evaluating data mining results) and on applications (fraud detection and spam filtering). View Tom Fawcett's full profile page. |
|
Table of Contents
|
Product Details
|
About the Author
|
 |
|
 |
|
|
|
Recommended for You
|
Recently Viewed
|
 |
|
By Stephen Wendel
November 2013
Ebook
:
$33.99
Print & Ebook
:
$43.99
Print
:
$39.99
|
Customer Reviews

3/22/2017 (1 of 1 customers found this review helpful) 5.0Hands-down, best data science book for the non-statistician By Ryan from Minneapolis, MN 1/23/2016 4.0liked the introduction to text mining 11/12/2015 (1 of 1 customers found this review helpful) - Accurate
- Easy to understand
- Well-written
- Intermediate
- Novice
- Student
4/16/2014 (3 of 3 customers found this review helpful) 5.0Solid Informative Entry to Data Analysis By Jason from Washington, DC - Easy to understand
- Helpful examples
- Well-written
3/22/2014 (3 of 3 customers found this review helpful) 5.0Perfect survey for non-specialists. By Edmon from Knoxville, TN About Me Designer, Developer, Maker - Accurate
- Easy to understand
- Helpful examples
- Well-written
2/23/2014 (2 of 2 customers found this review helpful) 5.0Insight into any world By ~ icp from New York, NY About Me Sales, Sales Executive - Accurate
- Concise
- Easy to understand
- Helpful examples
- Well-written
- Expert
- Intermediate
- Novice
- Student
- Easy to understand
- Well-written
- Intermediate
- Novice
- Student
12/15/2013 (8 of 8 customers found this review helpful) 3.0Not a How-To Guide, More for Managers By Will J from Chicago, IL About Me Analyst, Professional - Easy to understand
- Well-written
- Not comprehensive enough
- Too basic
10/24/2013 (1 of 1 customers found this review helpful) 5.0Introduction to data science By Victor Sheng from Conway, Arkansas About Me Designer, Educator - Accurate
- Concise
- Easy to understand
- Helpful examples
- Well-written
- Expert
- Intermediate
- Novice
- Student
9/26/2013 (1 of 1 customers found this review helpful) 5.0Excellent for understanding background By ahmetRasit from Ankara, Turkey About Me Developer, Maker - Easy to understand
- Helpful examples
- Well-written
- Intermediate
- Novice
- Student
|
|
|