Managing with Data Science - Harvard Business School MBA Program

Managing with Data Science

Course Number 1365

Professor Srikant M. Datar
Fall; Q1; 1.5 credits
X schedule starting September 5th, 1:15pm to 2:45pm

Overview

The last few years have seen an explosion of data. Data is being collected at a staggering rate from a wide range of sources as the scale of digital activities continues to increase. Looking outward, companies have enormous amounts of data on their customers, such as what they buy, how they buy, and where they buy. Looking inward, they also have data on many of their own activities, from operations to employee engagement. However, value is not created by data alone; it is created by applying data to meet a business need.

Data science seeks to make sense of and gain insights from data. A recent McKinsey study estimates that over the next few years, the demand for managers with data skills will total 1.5 million. This course focuses on helping students develop the basic data skills needed to guide an organization towards becoming data-centric and to think critically about the use of data science within an organization.

Course Objectives

Data science is a “team sport” whose practitioners draw on concepts from statistics, computer science, and machine learning to build predictive models that can inform decision making. The course objectives are twofold:

  • Familiarize students with the fundamentals of data science so that they can work effectively with a data science team in an organization, both to shape the “ask” and to interpret outputs.
  • Develop their understanding of data science’s implications for management and decision making in a data-rich environment.

Neither a mathematical nor a programming background is required for this course. However, if you have a strong background in mathematics or computer programming, this course may be too basic for you.

Topics Covered

Through a series of new cases, caselets, and assignments, students will learn to:

  • Shape actionable business “asks”
  • Find, evaluate, and augment data
  • Apply basic algorithms to build models (decision trees, random forests, regression, neural networks, clustering)
  • Identify limitations of models and their outputs
  • Think critically about all parts of the data science ecosystem and working processes to make effective decisions

A new modeling platform, DataRobot, will be a vehicle for some of the learning. You can explore the platform at www.DataRobot.com. The company is headquartered in Boston and will support us during the course.

Requirements

The most skillful managers in a data-rich world are not data scientists themselves, but those who understand the data science ecosystem and individual modeling techniques deeply enough to know their value and their limitations. Therefore, the curriculum balances a high-level overview of data science and its role in business with the basic mechanics of modeling techniques and model building. We will work through the mechanics of data science in Excel and also train on the DataRobot platform, so that students will not have to program in Python or R to work on their projects. Throughout the course, the focus will be on thinking critically about data, models, and conclusions in a managerial context.

Part 1: The Data Science Ecosystem & Introduction to Data

1. The Oakland Athletics: Strategy & Metrics for a Budget - Introduction to Data Science

2. Busbud: Building a Data Company - Building a data product and crafting an SEO and SEM strategy

Part 2: Data Considerations & Modeling Techniques

3. Chateau Winery: Unsupervised Learning - Exploring data through K-means clustering

4. Chateau Winery: Supervised Learning - Recommendation techniques

5. Lending Club: Predicting Default (A) - Basic prediction using decision trees

6. Lending Club: Predicting Default (B) - Bootstrapping, Bagging, and Random Forest (an ensemble model)

7. Predicting Purchasing Behavior at PriceMart (A) - Prediction using OLS & logistic regression

8. Predicting Purchasing Behavior at PriceMart (B) - Advanced regularization methods

9. Tamarin App: Natural Language Processing - Naïve Bayes, sentiment analysis

10. Topic: Neural Networks & Finance

Part 3: Communication, Ethics, and Governance

11. Topic: Data Visualization & Communication

12. Data Science at Target - Building a data science organization

13. Targeted Fundraising at St. Camillus - Governance and ethics

14. Final Exam

Who is eligible?

This course is open to HBS MBA students.