
Reinforcement with Fading Memories

By Kuang Xu, Se-Young Yun

Mathematics of Operations Research

November 2020, Vol. 45, Issue 4, Pages 1258–1288.

Operations, Information & Technology


We study the effect of imperfect memory on decision making in the context of a stochastic sequential action-reward problem. An agent chooses a sequence of actions, which generate discrete rewards at different rates. She is allowed to make new choices at rate β, whereas past rewards disappear from her memory at rate μ. We focus on a family of decision rules where the agent makes a new choice by randomly selecting an action with a probability approximately proportional to the amount of past rewards associated with each action in her memory. We provide closed-form formulas for the agent’s steady-state choice distribution in the regime where the memory span is large (μ → 0) and show that the agent’s success critically depends on how quickly she updates her choices relative to the speed of memory decay. If β ≫ μ, the agent almost always chooses the best action (that is, the one with the highest reward rate). Conversely, if β ≪ μ, the agent chooses an action with a probability roughly proportional to its reward rate.
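The dynamics described in the abstract can be explored numerically. The following is a minimal simulation sketch, not the authors' method or code. It rests on several assumptions that the abstract does not state: rewards arrive as a Poisson process at the rate of the currently chosen action only, each remembered reward is forgotten independently at rate μ, choice opportunities arrive as a Poisson process at rate β, and "approximately proportional" is modeled by adding a hypothetical smoothing constant `alpha` to each action's remembered reward count. The function name `simulate_choice_fractions` and all parameter names are illustrative.

```python
import random


def simulate_choice_fractions(reward_rates, beta, mu, horizon, alpha=1.0, seed=0):
    """Return the fraction of time spent on each action up to `horizon`.

    Assumed model (illustrative, not taken from the paper's formal setup):
    a continuous-time Markov chain with three kinds of events:
      * a reward for the current action, at rate reward_rates[action];
      * a new choice, at rate beta;
      * forgetting of one remembered reward, at rate mu per remembered reward.
    """
    rng = random.Random(seed)
    k = len(reward_rates)
    memory = [0] * k       # remembered rewards per action
    time_on = [0.0] * k    # total time spent playing each action
    action = rng.randrange(k)
    t = 0.0

    while True:
        forget_rate = mu * sum(memory)
        total_rate = reward_rates[action] + beta + forget_rate
        dt = rng.expovariate(total_rate)  # time until the next event
        if t + dt >= horizon:
            time_on[action] += horizon - t
            break
        time_on[action] += dt
        t += dt

        # Pick which event fired, with probability proportional to its rate.
        u = rng.random() * total_rate
        if u < reward_rates[action]:
            memory[action] += 1
        elif u < reward_rates[action] + beta:
            # New choice: probability proportional to remembered rewards,
            # smoothed by the assumed constant alpha.
            weights = [m + alpha for m in memory]
            action = rng.choices(range(k), weights=weights)[0]
        else:
            # Forget one remembered reward, chosen uniformly among all of them.
            forgotten = rng.choices(range(k), weights=memory)[0]
            memory[forgotten] -= 1

    return [x / horizon for x in time_on]
```

Calling `simulate_choice_fractions([1.0, 2.0], beta=1.0, mu=0.01, horizon=10000.0)` and then repeating the run with `beta` much smaller than `mu` gives a rough empirical view of the two regimes the abstract contrasts. The results are noisy single-run estimates under the assumptions above and are no substitute for the closed-form steady-state formulas in the paper.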
