Publications
- July 2023 (Revised July 2023)
- HBS Case Collection
Generative AI Value Chain
By: Andy Wu and Matt Higgins
Abstract
Generative AI refers to a type of artificial intelligence (AI) that can create new content (e.g., text, image, or audio) in response to a prompt from a user. ChatGPT, Bard, and Claude are examples of text-generating AIs, and DALL-E, Midjourney, and Stable Diffusion are examples of the image-generating variety. During training, a generative AI learns the underlying structure of the desired output by absorbing a mass of relevant data, such as all of the books in the public domain or petabytes of text scraped from across the internet. Once trained, generative AIs work by creating outputs that recreate, with calculated variation, the underlying patterns learned in training.
In 2023, all these types of generative AI were created through a similar process. At the core of any generative AI system is the model, a mathematical representation of patterns that forms the basis of ‘knowledge’ for the system. The structure of the model is determined by its architecture, the theoretical organization of parameters in an artificial neural network that the system uses to generate its outputs. To learn, the model relies on a mountain of training data, a collection of examples relevant to the task the model is being trained to perform. During an initial pre-training process, the model learns to adjust its parameter weights (as specified by the architecture), improving its prediction quality over many iterations; that model is further refined through a fine-tuning process. Training an AI system requires specialized hardware, such as GPUs in data centers, which consumes enormous amounts of electricity to handle heavy and massively parallel computational loads. Outside of the model, commercial AI companies could then implement further user-facing guardrails to keep the model from generating undesired content. From there, the model can be used for inference by developers (through an API) or users (through an application). (Exhibit 1 shows an overview of the generative AI value chain.)
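The idea of a model adjusting its parameter weights over many iterations can be illustrated with a deliberately tiny sketch. This is not how any production generative AI is trained; it assumes a hypothetical one-weight model, a made-up three-example data set, and plain gradient descent, chosen only to show the loop of predicting, measuring error, and nudging the weight.

```python
# Toy illustration only: a "model" with a single parameter weight, fitted by
# gradient descent. The data, learning rate, and step count are assumptions
# chosen for the example, not values from the case.

def train(examples, steps=200, lr=0.01):
    """Fit y ~= w * x by repeatedly nudging w to reduce the squared error."""
    w = 0.0  # the single parameter weight, starting from an arbitrary value
    losses = []
    n = len(examples)
    for _ in range(steps):
        # Mean squared prediction error on the training data
        loss = sum((w * x - y) ** 2 for x, y in examples) / n
        # Gradient of that error with respect to the weight
        grad = sum(2 * (w * x - y) * x for x, y in examples) / n
        losses.append(loss)
        # Adjust the weight a small step in the direction that lowers the error
        w -= lr * grad
    return w, losses


# Hypothetical training data following the pattern y = 2x
examples = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]
w, losses = train(examples)
print(f"learned weight: {w:.4f}")
print(f"first loss: {losses[0]:.4f}, last loss: {losses[-1]:.10f}")
```

Real systems repeat the same kind of loop over billions of weights and far larger data sets, which is why training calls for the specialized, massively parallel hardware described above.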
Keywords
AI; Artificial Intelligence; Model; Hardware; Data Centers; AI and Machine Learning; Applications and Software; Analytics and Data Science; Value
Citation
Wu, Andy, and Matt Higgins. "Generative AI Value Chain." Harvard Business School Background Note 724-355, July 2023. (Revised July 2023.)