Bio
I’m a second-year Ph.D. student at The Ohio State University, proudly advised by Prof. Huan Sun. My recent research interests focus on leveraging LLMs for scientific coding tasks, as well as enhancing the reasoning ability and faithfulness of large language models. Previously, I received my Master’s degree from Peking University in 2023 and Bachelor’s degree from Northeastern University (China) in 2020.
I’m currently looking for 2026 summer internship opportunities. Feel free to reach out to me if you’re interested in my research!
News
[06/2025] Releasing AutoSDT, an automated pipeline for scaling high-quality data-driven scientific coding tasks.
[06/2025] Check out our new preprint Mind2Web2, a benchmark for evaluating agentic search with agent-as-a-judge.
[01/2025] Our paper ScienceAgentBench is accepted to ICLR 2025.
[10/2024] Check out our new preprint ScienceAgentBench, a new benchmark to rigorously assess language agents for data-driven scientific discovery.
[08/2024] Will be in-person attending ACL 2024 at Bangkok, Thailand from Aug 10th to Aug 16th. Welcome to coffee chats!
[05/2024] Our paper AttributionBench got accepted by ACL 2024 Findings !
[05/2024] Our paper Math-Shepherd got accepted by ACL 2024!
[04/2024] Our paper TableLlama got accepted by NAACL 2024 as an Oral Paper 🔥 (145/2434=6.0%)!!
[03/2024] Our paper TableLlama got accepted by NAACL 2024 (565/2434=23.2%)!
Preprints
- [arXiv 06/2025] AutoSDT: Scaling Data-Driven Discovery Tasks Toward Open Co-Scientists
Yifei Li$^{*}$, Hanane Nour Moussa$^{*}$, Ziru Chen, Shijie Chen, Botao Yu, Mingyi Xue, Benjamin Burns, Tzu-Yao Chiu, Vishal Dey, Zitong Lu, Chen Wei, Qianheng Zhang, Tianyu Zhang, Song Gao, Xuhui Huang, Xia Ning, Nesreen K. Ahmed, Ali Payani, Huan Sun
[pdf][code][data][website] - [arXiv 06/2025] Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge
Boyu Gou$^{*}$, Zanming Huang$^{*}$, Yuting Ning$^{*}$, Yu Gu, Michael Lin, Weijian Qi, Andrei Kopanev, Botao Yu, Bernal Jiménez Gutiérrez, Yiheng Shu, Chan Hee Song, Jiaman Wu, Shijie Chen, Hanane Nour Moussa, Tianshu Zhang, Jian Xie, Yifei Li, Tianci Xue, Zeyi Liao, Kai Zhang, Boyuan Zheng, Zhaowei Cai, Viktor Rozgic, Morteza Ziyadi, Huan Sun, Yu Su
[pdf][code][data][website]
Publications
- [ICLR 2024] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery
Ziru Chen$^{*}$, Shijie Chen$^{*}$, Yuting Ning, Qianheng Zhang, Boshi Wang, Botao Yu, Yifei Li, Zeyi Liao, Chen Wei, Zitong Lu, Vishal Dey, Mingyi Xue, Frazier N Baker, Benjamin Burns, Daniel Adu-Ampratwum, Xuhui Huang, Xia Ning, Song Gao, Yu Su, Huan Sun
[pdf][code][data][website] - [ACL Findings 2024 (long)] AttributionBench: How Hard is Automatic Attribution Evaluation?
Yifei Li$^*$, Xiang Yue, Zeyi Liao, Huan Sun
[pdf][code][data][website][Poster] - [ACL 2024 (long)] Math-Shepherd: A Label-Free Step-by-Step Verifier for LLMs in Mathematical Reasoning
Peiyi Wang*, Lei Li, Zhihong Shao, RX Xu, Damai Dai, Yifei Li, Deli Chen, Yu Wu, Zhifang Sui
[pdf][data][model][website] - [NAACL 2024 (long)] TableLlama: Towards Open Large Generalist Models for Tables
Tianshu Zhang$^*$, Xiang Yue, Yifei Li, Huan Sun
[pdf][code][data][model][website] [ACL 2023 (long)] Making language models better reasoners with step-aware verifier
Yifei Li$^*$, Zeqi Lin, Shizhuo Zhang, Qiang Fu, Bei Chen, Jian-Guang Lou, Weizhu Chen
[pdf][code][data][poster][video]- [KBS 2022] Exploiting high-order local and global user–item interactions for effective recommendation
Sheng Tian$^*$, Guibing Guo, Yifei Li, Yuan Liu, Xingwei Wang
[pdf]
Earlier Preprints
- [arxiv 2022] Input-tuning: Adapting unfamiliar inputs to frozen pretrained models
Shengnan An$^*$, Yifei Li$^*$, Zeqi Lin, Qian Liu, Bei Chen, Qiang Fu, Weizhu Chen, Nanning Zheng, Jian-Guang Lou
[pdf]
Last Update: 08/02/2025