Skip to Main Content
HBS Home
  • About
  • Academic Programs
  • Alumni
  • Faculty & Research
  • Baker Library
  • Giving
  • Harvard Business Review
  • Initiatives
  • News
  • Recruit
  • Map / Directions
Faculty & Research
  • Faculty
  • Research
  • Featured Topics
  • Academic Units
  • …→
  • Harvard Business School→
  • Faculty & Research→
Publications
Publications
  • 2025
  • Working Paper
  • HBS Working Paper Series

Dynamic Personalization with Multiple Customer Signals: Multi-Response State Representation in Reinforcement Learning

By: Liangzong Ma, Ta-Wei Huang, Eva Ascarza and Ayelet Israeli
  • Format:Print
  • | Language:English
  • | Pages:56
ShareBar

Abstract

Reinforcement learning (RL) offers potential for optimizing sequences of customer interactions by modeling the relationships between customer states, company actions, and long-term value. However, its practical implementation often faces significant challenges. First, while companies collect detailed customer characteristics to represent customer states, these data often contain noise or irrelevant information, obscuring the true customer states. Second, existing state construction techniques focus primarily on summarizing characteristics related to short-term values, rather than capturing the broader behaviors that drive long-term customer value. These limitations hinder RL’s ability to effectively learn customer dynamics and maximize long-term value. To address these challenges, we introduce a novel Multi-Response State Representation (MRSR) Learning method to enhance existing RL methods. Unlike state construction methods, MRSR utilizes rich customer signals—such as recency, engagement, and spending—to construct low-dimensional state representations that effectively predict behaviors driving long-term customer value. Using data from a free-to-play mobile game with dynamic difficulty adjustments, MRSR demonstrates significant improvements, increasing 30-day in-game currency spending by 37% compared to standard offline RL methods and 24% over advanced state representation techniques. Policy interpretation further highlights MRSR’s ability to identify distinct and relevant customer states, enabling precise and targeted interventions to boost long-term engagement and spending.

Keywords

Dynamic Policy; Deep Reinforcement Learning; Representation Learning; Dynamic Difficulty Adjustment; Latent Variable Models; Customer Relationship Management; Customer Value and Value Chain; Foreign Direct Investment; Analytics and Data Science

Citation

Ma, Liangzong, Ta-Wei Huang, Eva Ascarza, and Ayelet Israeli. "Dynamic Personalization with Multiple Customer Signals: Multi-Response State Representation in Reinforcement Learning." Harvard Business School Working Paper, No. 25-037, February 2025.
  • SSRN
  • Read Now

About The Authors

Eva Ascarza

Marketing
→More Publications

Ayelet Israeli

Marketing
→More Publications

More from the Authors

    • 2025
    • Faculty Research

    In Privacy We Trust: The Effect of Privacy Regulations on Data Sharing Behavior

    By: Ozge Demirci, Ayelet Israeli and Eva Ascarza
    • June 2025
    • Faculty Research

    Full-Funnel Advertising on TikTok: An Experiment (Solution)

    By: Jeremy Yang and Ayelet Israeli
    • June 2025
    • Faculty Research

    Full-Funnel Advertising on TikTok: An Experiment

    By: Jeremy Yang and Ayelet Israeli
More from the Authors
  • In Privacy We Trust: The Effect of Privacy Regulations on Data Sharing Behavior By: Ozge Demirci, Ayelet Israeli and Eva Ascarza
  • Full-Funnel Advertising on TikTok: An Experiment (Solution) By: Jeremy Yang and Ayelet Israeli
  • Full-Funnel Advertising on TikTok: An Experiment By: Jeremy Yang and Ayelet Israeli
ǁ
Campus Map
Harvard Business School
Soldiers Field
Boston, MA 02163
→Map & Directions
→More Contact Information
  • Make a Gift
  • Site Map
  • Jobs
  • Harvard University
  • Trademarks
  • Policies
  • Accessibility
  • Digital Accessibility
Copyright © President & Fellows of Harvard College.