Publications
Publications
- March 2022 (Revised July 2022)
- HBS Case Collection
Prediction & Machine Learning
By: Iavor I. Bojinov, Michael Parzen and Paul Hamilton
Abstract
This note provides an introduction to machine learning for an introductory data science course. The note begins with a description of supervised, unsupervised, and reinforcement learning. Then, the note provides a brief explanation of the difference between traditional statistical modeling and machine learning. Next, the note covers two models used for classification, logistic regression and decision trees. After introducing these two models, the note explains how train, validation, and holdout sets (and k-fold cross validation) are used to tune and evaluate different models. Finally, the note concludes with a discussion of different performance metrics (ROC curves, confusion matrices, log loss) that are used to evaluate classification models.
Keywords
Citation
Bojinov, Iavor I., Michael Parzen, and Paul Hamilton. "Prediction & Machine Learning." Harvard Business School Technical Note 622-101, March 2022. (Revised July 2022.)