Yizhong Wang

Research Scientist, ByteDance Seed
Incoming Assistant Professor, CS Department, University of Texas at Austin

I am a Research Scientist at ByteDance Seed and an incoming Assistant Professor at the University of Texas at Austin. I received my PhD from the Paul G. Allen School of Computer Science & Engineering at the University of Washington, where I was co-advised by Hannaneh Hajishirzi and Noah Smith.

I study how (natural/human) language can help AI understand, reason, learn, communicate, and interact with the world. This interest has led to my work on instruction tuning, synthetic data generation, RLVR, and open language models. More recently, I have been thinking about learning algorithms that enable greater autonomy in AI, and how to apply them in high-value but challenging domains (e.g., scientific discovery).

Prospective students and collaborators, please see the Prospective Students section below.

News

July 27, 2025 — We presented a tutorial on Synthetic Data in the Era of LLMs at ACL 2025.
June 16, 2025 — I joined ByteDance Seed as a Research Scientist.
May 27, 2025 — I have defended my PhD dissertation!

Selected Publications

* indicates equal contribution. For a full list, see my Google Scholar page.

Tülu 3: Pushing Frontiers in Open Language Model Post-Training
Nathan Lambert, Jacob Morrison, Valentina Pyatkin, Shengyi Huang, Hamish Ivison, Faeze Brahman, Lj Miranda, ..., Luca Soldaini, Noah A. Smith, Yizhong Wang, Pradeep Dasigi, Hannaneh Hajishirzi
COLM 2025
Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback
Lj Miranda*, Yizhong Wang*, Yanai Elazar, Sachin Kumar, Valentina Pyatkin, Faeze Brahman, Noah A. Smith, Hannaneh Hajishirzi, Pradeep Dasigi
ACL 2025
Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback
Hamish Ivison, Yizhong Wang, Jiacheng Liu, Zeqiu Wu, Valentina Pyatkin, Nathan Lambert, Noah A. Smith, Yejin Choi, Hannaneh Hajishirzi
NeurIPS 2024
OLMo: Accelerating the Science of Language Models
Dirk Groeneveld, Iz Beltagy, Pete Walsh, Akshita Bhagia, Rodney Kinney, Oyvind Tafjord, Ananya Harsh Jha, Hamish Ivison, Ian Magnusson, Yizhong Wang, et al.
ACL 2024 (Best Theme Paper)
How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources
Yizhong Wang*, Hamish Ivison*, Pradeep Dasigi, Jack Hessel, Tushar Khot, Khyathi Raghavi Chandu, David Wadden, Kelsey MacMillan, Noah A. Smith, Iz Beltagy, Hannaneh Hajishirzi
NeurIPS 2023
Self-Instruct: Aligning Language Models with Self-Generated Instructions
Yizhong Wang, Yeganeh Kordi, Swaroop Mishra, Alisa Liu, Noah A Smith, Daniel Khashabi, Hannaneh Hajishirzi
ACL 2023
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks
Yizhong Wang*, Swaroop Mishra*, Pegah Alipoormolabashi, Yeganeh Kordi, et al.
EMNLP 2022
DROP: A Reading Comprehension Benchmark Requiring Discrete Reasoning Over Paragraphs
Dheeru Dua, Yizhong Wang, Pradeep Dasigi, Gabriel Stanovsky, Sameer Singh, Matt Gardner
NAACL 2019
A Two-Stage Parsing Method for Text-level Discourse Analysis
Yizhong Wang, Sujian Li, Houfeng Wang
ACL 2017 (Outstanding Paper Award)

Prospective Students

I plan to recruit multiple CS PhD students to start in Fall 2026 at the University of Texas at Austin. If your research interests align with mine (or something new about AI really excites you), I strongly encourage you to apply directly to the UT Austin CS PhD program. Please mention my name as a potential advisor in your application.

You are also welcome to email me with your CV and a brief description of your research interests if you would like to join my group as a postdoc, PhD student, or research assistant. I will do my best to respond.