News
25 Jan 2026
One full paper (first-author) is accepted by ICLR'26 on LLM reasoning.
1 May 2025
Two full papers (first-author) are accepted by ICML'25 on LLM alignment and auction learning.
1 May 2024
One full paper (first-author) is accepted by ICML'24 on auction learning.
|
Kexin Huang
PhD student
Lab of Data Science 443 Huangshan Road, Hefei, China 230027
Advisor: Prof. Xiangnan He and Prof. Xiang Wang |
I am currently in the 3th year of my PhD program at USTC Lab for Data Science, supervised by Prof. Xiangnan He and Prof. Xiang Wang.
My research focuses on advancing large language models (LLMs) through post-training techniques, e.g., reinforcement learning.
Education
|
University of Science and Technology of China (USTC) PhD Candidate in School of AI and Data Science 2023.09 - 2028.06 (Expected) Advisor: Prof. Xiangnan He and Prof. Xiang Wang |
|
University of Science and Technology of China (USTC) Bachelor in School of Computer Sciences 2019.09 - 2023.06 Advisor: Prof. Xiangnan He |
Publications
In the Year of 2026:![]() |
Beyond Magnitude: Leveraging Direction of RLVR Updates for LLM Reasoning Kexin Huang, Haoming Meng, Junkang Wu, Jinda Lu, Chiyu Ma, Ziqian Chen, Xue Wang, Bolin Ding, Jiancan Wu, Xiang Wang, Xiangnan He, Guoyin Wang, Jingren Zhou ICLR 2026 [PDF] [Codes] |
![]() |
Quantile Advantage Estimation for Entropy-Safe Reasoning Junkang Wu, Kexin Huang, Jiancan Wu, An Zhang, Xiang Wang, Xiangnan He ICLR 2026 [PDF] [Codes] |
![]() |
Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs Haoming Meng, Kexin Huang, Shaohang Wei, Chiyu Ma, Shuo Yang, Xue Wang, Guoyin Wang, Bolin Ding, Jingren Zhou ICLR 2026 [PDF] |
In the Year of 2025:
![]() |
RePO: ReLU-based Preference Optimization Junkang Wu, Kexin Huang, Xue Wang, Jinyang Gao, Bolin Ding, Jiancan Wu, Xiangnan He, Xiang Wang NeurIPS 2025 [PDF] [Codes] |
![]() |
LaMP-Val: Large Language Models Empower Personalized Valuation in Auction Jie Sun, Tianyu Zhang, Houcheng Jiang, Kexin Huang, Xiang Shu, Zhibo Zhu, Lintao Ma, Xingyu Lu, Jun Zhou, Junkang Wu, Chi Luo, An Zhang, Jiancan Wu, Xiang Wang EMNLP 2025 (Findings) [PDF] [Codes] |
![]() |
Larger or Smaller Reward Margins to Select Preferences for Alignment? Kexin Huang, Junkang Wu, Ziqian Chen, Xue Wang, Jinyang Gao, Bolin Ding, Jiancan Wu, Xiangnan He, Xiang Wang ICML 2025 [PDF] [Codes] |
![]() |
Learning Bayesian Nash Equilibrium in Auction Games via Approximate Best Response Kexin Huang, Ziqian Chen, Xue Wang, Chongming Gao, Jinyang Gao, Bolin Ding, Xiang Wang ICML 2025 [PDF] [Codes] |
![]() |
SPRec: Leveraging Self-Play to Debias Preference Alignment for Large Language Model-based Recommendations Chongming Gao, Ruijun Chen, Shuai Yuan, Kexin Huang, Yuanqing Yu, Xiangnan He WWW 2025 Oral [PDF] [Codes] |
![]() |
Auctionformer: A Unified Deep Learning Algorithm for Solving Equilibrium Strategies in Auction Games Kexin Huang, Ziqian Chen, Xue Wang, Chongming Gao, Jinyang Gao, Bolin Ding, Xiang Wang ICML 2024 [PDF] [Codes] |
![]() |
Alleviating Matthew Effect of Offline Reinforcement Learning in Recommendation Chongming Gao, Kexin Huang, Jiawei Chen, Yuan Zhang, Biao Li, Peng Jiang, Shiqi Wang, Zhong Zhang, Xiangnan He SIGIR 2023 [PDF] [Codes] (Best Paper Honorable Mention) |
![]() |
Learn to Explore: on Bootstrapping Interactive Data Exploration with Meta-learning Yukun Cao, Xike Xie, Kexin Huang ICDE 2023 [PDF] |
Preprints
![]() |
Future-Conditioned Recommendations with Multi-Objective Controllable Decision Transformer Chongming Gao, Kexin Huang, Ziang Fei, Jiaju Chen, Jiawei Chen, Jianshan Sun, Shuchang Liu, Qingpeng Cai, Peng Jiang arxiv 2025 [PDF] |
Experiences & Services
| Research Intern, Alibaba Tongyi Lab, Hangzhou, April 2023 - January 2026 - Qwen Pilot, Mentor: Guoyin Wang - DAIL Lab, Mentor: Ziqian Chen, Xue Wang, Jinyang Gao, Bolin Ding |
| TA of Graph Theory, University of Science and Technology of China, Fall 2022 Teaching Professor: Yinlong Xu |
|
Invited Reviewer of Conferences/Journals 2025: ICLR, TKDE |
Grants & Honors
| NSFC Young Student Basic Research Program (PhD Student), 2026-2027 |
| National Scholarship, University of Science and Technology of China, 2025 |
| SIGIR'23 Best Paper Honorable Mention, 2023 |

