Wenhao Yu 于文豪

Research Scientist at OpenAI

About Me

I am a Research Scientist at OpenAI, where I work on large-scale language model (LLM) training. Before that, my research focused on building self-improving LLMs that can continuously learn from interaction, feedback, and experience. To support this goal, I have extensively studied and applied reinforcement learning (RL) for post-training, reasoning, and agentic behaviors in large-scale models.

I earned my Ph.D. in Computer Science and Engineering from the University of Notre Dame in 2023, advised by Prof. Meng Jiang. My research during my Ph.D. was generously supported by the Bloomberg Ph.D. Fellowship. I also enjoyed amazing internship experiences at Microsoft Research, AI2, and Bloomberg.

What's New

Selected Publications

For a full list of publications, please refer to my Google Scholar page.

R-Zero: Self-Evolving Reasoning LLM from Zero Data
Chengsong Huang, Wenhao Yu, Xiaoyang Wang, Hongming Zhang, Zongxia Li, Ruosen Li, Jiaxin Huang, Haitao Mi, Dong Yu
[ICLR 2026] International Conference on Learning Representations
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning
Tong Zheng, Hongming Zhang, Wenhao Yu, Xiaoyang Wang, Runpeng Dai, Rui Liu, Huiwen Bao, Chengsong Huang, Heng Huang, Dong Yu
[ICLR 2026] International Conference on Learning Representations
MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly
Zhaowei Wang, Wenhao Yu, Xiyu Ren, Jipeng Zhang, Yu Zhao, Rohit Saxena, Liang Cheng, Ginny Wong, Simon See, Pasquale Minervini, Yangqiu Song, Mark Steedman
[NeurIPS 2025] Conference on Neural Information Processing Systems
LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory
Di Wu, Hongwei Wang, Wenhao Yu, Yuwei Zhang, Kai-Wei Chang, Dong Yu
[ICLR 2025] International Conference on Learning Representations
RepoGraph: Enhancing AI Software Engineering with Repository-level Code Graph
Siru Ouyang, Wenhao Yu, Kaixin Ma, Zilin Xiao, Zhihan Zhang, Mengzhao Jia, Jiawei Han, Hongming Zhang, Dong Yu
[ICLR 2025] International Conference on Learning Representations
WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models
Hongliang He, Wenlin Yao, Kaixin Ma, Wenhao Yu, Yong Dai, Hongming Zhang, Zhenzhong Lan, Dong Yu
[ACL 2024] 2024 Annual Meeting of the Association for Computational Linguistics

Industry Experience

OpenAI, San Francisco, CA
Apr. 2026 - Present
Member of Technical Staff (Researcher) – RL team; Worked on synthetic RL
Tencent AI, Seattle, WA
Sep. 2023 - Mar. 2026
Senior Researcher – Hunyuan Frontier Lab; Worked on LLM Post-training
Allen Institute for AI (AI2), Seattle, WA
Sep. 2022 - Apr. 2023
Research intern – Aristo team; Worked on LLM research
Microsoft Research, Redmond, WA
May - Aug. 2021 & 2022
Research intern – MSR; Worked on RAG and generative IR

Mentoring

I’ve been fortunate to mentor and work alongside many talented students: