讲座:Offline and Online Classification Learning under Selective Labels 发布时间:2025-06-04

  • 活动时间:
  • 活动地址:
  • 主讲人:

题 目:Offline and Online Classification Learning under Selective Labels

嘉 宾:毛小介 副教授 清华大学

主持人:许欢 教授 上海交通大学安泰经济与管理学院

时 间:2025年6月11日(周三)10:00-11:30

地 点:安泰楼A503室

内容简介:

The problem of selective label observations is common in many decision-making applications involving human subjects. In these applications, whether an individual’s outcome label can be observed depends on a certain decision. For example, in lending, the default status of a loan applicant cannot be observed if the loan is not approved, creating a selection bias in labeled data. The selective label problem presents significant challenges for developing effective machine learning algorithms. In this talk, I will present our research on learning classifiers from selectively labeled data in both offline and online settings. In the offline setting, we follow the existing literature and consider selective labels arising from the decisions of multiple heterogeneous decision-makers. We formalize this setup through an instrumental variable framework, provide principled identification analysis, and propose a cost-sensitive learning algorithm to tackle the selective label problem. In the online setting, we additionally consider individuals’ strategic behaviors of manipulating their features to achieve favorable classification (e.g., get loan approval). We propose an online optimization algorithm to learn linear classifiers under both selective labels and strategic manipulation. We further prove that the expected regret of this algorithm is sublinear.

演讲人简介:

Xiaojie Mao is an associate professor in Management Science and Engineering at Tsinghua University. He did his undergraduate in Mathematical Economics at Wuhan University and Ph.D. in Statistics and Data Science at Cornell University. His research interest lies in causal inference and data-driven decision-making. His research has appeared in top journals and conferences across multiple fields, such as Operations Research, Management Science, Journal of Machine Learning Research, Journal of the Royal Statistical Society Series B, NeurIPS, ICML, COLT, etc.

 

欢迎广大师生参加!