讲座:Stochastic Gradient Descent with Adaptive Data 发布时间:2024-03-07

  • 活动时间:
  • 活动地址:
  • 主讲人:

题 目:Stochastic Gradient Descent with Adaptive Data

嘉 宾:董婧,副教授,哥伦比亚大学商学院

主持人:江浦平 助理教授 上海交通大学安泰经济与管理学院

时 间:2024年3月13日(周三)10:00-11:30am

地 点:腾讯会议(校内师生如需会议号和密码,请发邮件至yuxin.su@sjtu.edu.cn获取)

内容简介:

Stochastic gradient descent (SGD) is a powerful optimization technique, particularly useful in online learning scenarios. Its convergence analysis/effectiveness is relatively well understood under the assumption that the data samples are independent and identically distributed (iid). However, applying online learning to policy optimization problems in operations research involves a distinct challenge: the policy changes the environment and thereby affects the data used to update the policy. The adaptively generated data stream involves samples that are non-stationary, no longer independent from each other, and affected by previous decisions. The influence of previous decisions on the data-generating environment introduces estimation bias in the gradients, which presents a potential source of instability for online learning not present in the iid case. In this paper, we introduce simple criteria for the adaptively generated data stream to guarantee the convergence of SGD. We show that the convergence speed of SGD with adaptive data is largely similar to the classical iid setting, as long as the mixing time of the policy-induced dynamics is factored in. Our Lyapunov-function analysis allows one to translate existing stability analysis of systems studied in operations research into convergence rates for SGD, and we demonstrate this for queuing and inventory management problems. We also showcase how our result can be applied to study an actor-critic policy gradient algorithm.

演讲人简介:

Jing Dong is the DeRosa Family Associate Professor of Business at the Decision, Risk, and Operations Division at Columbia Business School. Her research is at the interface of applied probability and service operations management, with a special focus on patient flow management in healthcare delivery systems. She received an NSF CAREER Award in 2020. She currently serves on the editorial boards of Operations Research, Mathematics of Operations Research, Management Science, Manufacturing and Service Operations Management, and Operations Research Letters. She received her Ph.D. in Operations Research from Columbia University. Before joining Columbia Business School, she was on the faculty of Northwestern University.

 

欢迎广大师生参加!