About Me
I am a 4th year CS Ph.D. candidate at the LAUNCH Lab, University of Michigan – Ann Arbor, advised by Prof. Lu Wang. I obtained my Bachelor’s degree from Peking University, advised by Prof. Xiaojun Wan.
I work on LLM reasoning and agents.
News
- 2026-05: Excited to start a Research Scientist internship at Meta!
- 2026-05: Thrilled to receive ICML 2026 Silver Reviewer Award!
- 2026-04: AdaMEM, which focuses on test-time adaptive agent memory, has been accepted to ICML 2026.
- 2026-04: ThinkLogit, which focuses on inference-time compute for long reasoning, has been accepted to ACL Findings 2026. See you in San Diego!
- 2026-01: MLE-Ideator, which focuses on reinforcement learning for MLE agent, has been accepted to EACL 2026.
Selected Publications
- AdaMEM: Test-Time Adaptive Memory for Language Agents
Yunxiang Zhang, Yiheng Li, Ali Payani, Lu Wang
ICML 2026 [paper] [code] [project page] - Logit Arithmetic Elicits Long Reasoning Capabilities Without Training
Yunxiang Zhang, Muhammad Khalifa, Lechen Zhang, Xin Liu, Ayoung Lee, Xinliang Frederick Zhang, Farima Fatahi Bayat, Lu Wang
Findings of ACL 2026 [paper] [code] - Learning to Ideate for Machine Learning Engineering Agents
Yunxiang Zhang, Kang Zhou, Zhichao Xu, Kiran Ramnath, Yun Zhou, Sangmin Woo, Haibo Ding, Lin Lee Cheong
EACL 2026 [paper] - MLRC-Bench: Can Language Agents Solve Machine Learning Research Challenges?
Yunxiang Zhang, Muhammad Khalifa, Shitanshu Bhushan, Grant D Murphy, Lajanugen Logeswaran, Jaekyeom Kim, Moontae Lee, Honglak Lee, Lu Wang
NeurIPS 2025 Datasets and Benchmarks Track [paper] [project page] [code] - Small Language Models Need Strong Verifiers to Self-Correct Reasoning
Yunxiang Zhang, Muhammad Khalifa, Lajanugen Logeswaran, Jaekyeom Kim, Moontae Lee, Honglak Lee, Lu Wang
Findings of ACL 2024 [paper] [code] [project page] - Merging Generated and Retrieved Knowledge for Open-Domain QA
Yunxiang Zhang, Muhammad Khalifa, Lajanugen Logeswaran, Moontae Lee, Honglak Lee, Lu Wang
EMNLP 2023 [paper] [code] - SituatedGen: Incorporating Geographical and Temporal Contexts into Generative Commonsense Reasoning
Yunxiang Zhang, Xiaojun Wan
NeurIPS 2023 Datasets and Benchmarks Track [paper] [data] - MOVER: Mask, Over-generate and Rank for Hyperbole Generation
Yunxiang Zhang, Xiaojun Wan
NAACL 2022 [paper] [code] - Interpreting the Robustness of Neural NLP Models to Textual Perturbations
Yunxiang Zhang, Liangming Pan, Samson Tan, Min-Yen Kan
Findings of ACL 2022 [paper] - BiRdQA: A Bilingual Dataset for Question Answering on Tricky Riddles
Yunxiang Zhang, Xiaojun Wan
AAAI 2022 [paper] [data]
Services
Co-Organizer, 1st Workshop on Test-Time Scaling and Reasoning Models, COLM 2025
Student Volunteer, ACL 2024
Reviewer: TMLR, ICML (2025-26), AISTATS (2025), ICLR (2025-26), NeurIPS (2024-25), ACL Rolling Review (2024-26), COLM (2024, 2026), EMNLP (2022-24), ACL (2023), CoNLL (2023-24), ACM Computing Surveys
