PhD. Student |
I am currently a PhD. Student at Department of Computer Science and Engineering, Shanghai Jiao Tong University as well as a member of PAIR-Lab, co-advised by Prof. Yaodong Yang and Prof. Ying Wen. My research interest lies in AI Alignment and Preference-based RL. If you would like to discuss about potential collaboration or common research interests, please do not hesitate to contact me.
(* indicates equal contribution)
Measuring Value Understanding in Language Models through Discriminator-Critique Gap
Zhaowei Zhang, Fengshuo Bai*, Jun Gao*, Yaodong Yang
arXiv preprint arXiv:2310.00378
Zero-shot Preference Learning for Offline RL via Optimal Transport
Runze Liu, Yali Du, Fengshuo Bai, Jiafei Lyu, Xiu Li
arXiv preprint arXiv:2306.03615
PiCor: multi-task deep reinforcement learning with policy correction
Fengshuo Bai, Hongming Zhang, Tianyang Tao, Zhiheng Wu, Yanna Wang, Bo Xu
Proceedings of the AAAI Conference on Artificial Intelligence, 2023
[PDF]
Meta-reward-net: Implicitly differentiable reward learning for preference-based reinforcement learning
Runze Liu, Fengshuo Bai, Yali Du, Yaodong Yang
Conference on Neural Information Processing Systems (NeurIPS), 2022
[PDF] [Code] [Website]
Institute for AI, Peking University (Jan 2022 - Oct 2022)
Research on algorithms of Reinforcement Learning from the Human Feedback.
Supervisor: Dr. Yali Du, Dr. Yaodong Yang
Pacemaker to Merit Student, 2018.09
President Scholarship of Southeast University, 2018.09
China National Scholarship, 2017.09
Sliver Prize (25/1178), Kaggle Lux AI Challenge, 2021.12
Third Prize (Intro), 7th (Research) NeurIPS 2021 MineRL Diamond Competition, 2021.10
14th, IJCAI 2020 Mahjong Artificial Intelligence Competition, 2021.01
I was a reviewer / PC member of conferences:
DAI 2023
AAMAS 2024