Yuantong Li (李沅桐)

Research Scientist
Meta

E-mail: liyuantong93 [@] gmail [DOT] com

About me

2024 - Now, Research Scientist at Meta.

2019 - 2024, Ph.D. in Statistics at UCLA.

Research interests

I currently focus on data-efficient reinforcement learning from human feedback (RLHF) for fine-tuning large language models (LLMs), and on pretraining LLMs for applications in search, recommendation, and advertising systems. I am also interested in foundational research in bandit theory and RL, and in designing and optimizing multi-agent systems within social systems.

Publications/Manuscripts

2024

Epinet for Content Cold Start
Hong Jun Jeon, Songbin Liu, Yuantong Li, Hunter Song, Ji Liu, Peng Wu, and Zheqing Zhu.
submitted.

Dynamic Bayesian Incentive Compatible for Recommendation In Online Two-Sided Market
Yuantong Li, Guang Cheng, and Xiaowu Dai.
submitted.

Dynamic Matching Bandit For Two-Sided Online Markets
Yuantong Li, Chi-hua Wang, Guang Cheng, and Will Wei Sun.
submitted.

Two-sided Competing Matching Recommendation Markets With Quota and Complementary Preferences Constraints
Yuantong Li, Guang Cheng, and Xiaowu Dai.
ICML 2024.

2023

Discussion of “Estimating Means of Bounded Random Variables by Betting” by Waudby-Smith and Ramdas
Jiayi Li, Yuantong Li, and Xiaowu Dai.
JRSSB 2023.

2022

Graph Federated Learning with Hidden Representation Sharing
Shuang Wu, Mingxuan Zhang, Yuantong Li, Carl Yang, and Pan Li.
CIKM-FedGraph 2022.

Residual Bootstrap Exploration for Stochastic Linear Bandit
Shuang Wu, Chi-hua Wang, Yuantong Li, and Guang Cheng.
UAI 2022.

Debiasing Neural Retrieval via In-batch Balancing Regularization
Yuantong Li, Xiaokai Wei, Zijian Wang, Shen Wang, Parminder Bhatia, Xiaofei Ma, and Andrew Arnold.
NAACL Workshop 2022.

Online Bootstrap Inference For Policy Evaluation in Reinforcement Learning
Pratik Ramprasad, Yuantong Li, Zhuoran Yang, Zhaoran Wang, Will Wei Sun, and Guang Cheng.
JASA 2022.

2021

Peel learning for pathway-related outcome prediction
Yuantong Li, Fei Wang, Mengying Yang, Fan Yang, Edward Cantu, Hengyi Rao, and Rui Feng.
Bioinformatics 2021.

Online Forgetting Process for Linear Regression Models
Yuantong Li, Chi-hua Wang, and Guang Cheng.
AISTATS 2021.

2020

A Non-Iterative Quantile Change Detection Method in Mixture Model with Heavy-Tailed Components
Yuantong Li, Qi Ma, and Sujit K Ghosh.
KDD 2020.

Interactive Attention Networks for Semantic Text Matching
Sendong Zhao, Yong Huang, Chang Su, Yuantong Li, and Fei Wang.
ICDM 2020.