News


1 May 2025
Two full papers, on LLM alignment and auction learning, were accepted to ICML'25.

1 May 2024
One full paper on auction learning was accepted to ICML'24.

Kexin Huang 

PhD student

Lab of Data Science
University of Science and Technology of China

443 Huangshan Road, Hefei, China 230027

Advisors: Prof. Xiangnan He and Prof. Xiang Wang
Email: huangkx AT mail.ustc.edu.cn
GitHub | Google Scholar

I am currently in the third year of my PhD program at USTC Lab for Data Science, supervised by Prof. Xiangnan He and Prof. Xiang Wang. My research focuses on advancing large language models (LLMs) through post-training techniques such as reinforcement learning.

Education

University of Science and Technology of China (USTC)
PhD Candidate in School of AI and Data Science      2023.09 - 2028.06 (Expected)
Advisors: Prof. Xiangnan He and Prof. Xiang Wang
University of Science and Technology of China (USTC)
Bachelor in School of Computer Sciences      2019.09 - 2023.06
Advisor: Prof. Xiangnan He

Publications


In the Year of 2025:


RePO: ReLU-based Preference Optimization
Junkang Wu, Kexin Huang, Xue Wang, Jinyang Gao, Bolin Ding, Jiancan Wu, Xiangnan He, Xiang Wang
NeurIPS 2025 [PDF] [Codes]

LaMP-Val: Large Language Models Empower Personalized Valuation in Auction
Jie Sun, Tianyu Zhang, Houcheng Jiang, Kexin Huang, Xiang Shu, Zhibo Zhu, Lintao Ma, Xingyu Lu, Jun Zhou, Junkang Wu, Chi Luo, An Zhang, Jiancan Wu, Xiang Wang
EMNLP 2025 (Findings) [PDF] [Codes]

Larger or Smaller Reward Margins to Select Preferences for Alignment?
Kexin Huang, Junkang Wu, Ziqian Chen, Xue Wang, Jinyang Gao, Bolin Ding, Jiancan Wu, Xiangnan He, Xiang Wang
ICML 2025 [PDF] [Codes]

Learning Bayesian Nash Equilibrium in Auction Games via Approximate Best Response
Kexin Huang, Ziqian Chen, Xue Wang, Chongming Gao, Jinyang Gao, Bolin Ding, Xiang Wang
ICML 2025 [PDF] [Codes]

SPRec: Leveraging Self-Play to Debias Preference Alignment for Large Language Model-based Recommendations
Chongming Gao, Ruijun Chen, Shuai Yuan, Kexin Huang, Yuanqing Yu, Xiangnan He
WWW 2025 Oral [PDF] [Codes]

In the Year of 2024:


Auctionformer: A Unified Deep Learning Algorithm for Solving Equilibrium Strategies in Auction Games
Kexin Huang, Ziqian Chen, Xue Wang, Chongming Gao, Jinyang Gao, Bolin Ding, Xiang Wang
ICML 2024 [PDF] [Codes]

In the Year of 2023:


Alleviating Matthew Effect of Offline Reinforcement Learning in Recommendation
Chongming Gao, Kexin Huang, Jiawei Chen, Yuan Zhang, Biao Li, Peng Jiang, Shiqi Wang, Zhong Zhang, Xiangnan He
SIGIR 2023 [PDF] [Codes]   (Best Paper Honorable Mention)

Learn to Explore: on Bootstrapping Interactive Data Exploration with Meta-learning
Yukun Cao, Xike Xie, Kexin Huang
ICDE 2023 [PDF]

Preprints



Future-Conditioned Recommendations with Multi-Objective Controllable Decision Transformer
Chongming Gao, Kexin Huang, Ziang Fei, Jiaju Chen, Jiawei Chen, Jianshan Sun, Shuchang Liu, Qingpeng Cai, Peng Jiang
arXiv 2025 [PDF]

Quantile Advantage Estimation for Entropy-Safe Reasoning
Junkang Wu, Kexin Huang, Jiancan Wu, An Zhang, Xiang Wang, Xiangnan He
arXiv 2025 [PDF] [Codes]

Experiences & Services

Research Intern, Alibaba Tongyi Lab, Hangzhou, April 2023 - Present
- Qwen Pilot, Mentor: Guoyin Wang
- DAIL Lab, Mentors: Ziqian Chen, Xue Wang, Jinyang Gao
TA of Graph Theory, University of Science and Technology of China, Fall 2022
Teaching Professor: Yinlong Xu
Invited Reviewer of Conferences/Journals
2025: ICLR, TKDE

Honors

SIGIR'23 Best Paper Honorable Mention, 2023

Webpage template borrowed from Junkang Wu.