Yizhong Wang

PhD student
Paul G. Allen School of Computer Science & Engineering
University of Washington, Seattle, WA

Email: yizhongw [at] cs.washington.edu


Short Bio

I am a fifth-year PhD student at the Paul G. Allen School of Computer Science & Engineering, University of Washington, where I am very fortunate to be co-advised by Hannaneh Hajishirzi and Noah Smith. I am also a part-time research intern at the Allen Institute for Artificial Intelligence (AI2), and I have previously interned at Meta AI, Microsoft Research, and Baidu NLP. Prior to UW, I obtained my Master's degree from Peking University and my Bachelor's degree from Shanghai Jiao Tong University.

My primary research interests lie in natural language processing and machine learning. I am excited about the generality of large language models (LLMs). In particular, I have been thinking about the following topics lately:

  • Adaptation of LLMs. How can we better build and evaluate instruction-following models? What perspectives do we need to consider during adaptation, and how do they affect general-purpose models? What types of supervision formats are effective and scalable?
  • Continual learning of LLMs. What is the boundary between pretraining and finetuning? What architectures and learning methods can enable LLMs to keep evolving after pretraining? How does a model's internal knowledge interact with newly learned knowledge?
  • Large-scale synthetic data. Generative models are producing data at an unprecedented speed. What roles does model-generated data play in model development, and even on the Internet and in society? How can we ensure diverse and high-quality data generation at scale? Can we distinguish model-generated data from human-written data?

I believe answering these questions will be critical in the coming era of generative AI. Feel free to drop me a message if you want to chat about these topics or would like to collaborate!

My name is written as 王义中 in Chinese characters.


News

  • Feb. 1, 2024: I am excited to be part of the first OLMo release. Check out the blog post and tech report.
  • Jan. 16, 2024: Self-RAG and BTR were accepted to ICLR 2024!
  • Nov. 18, 2023: We released Tülu 2, which tops open models on several benchmarks (e.g., AlpacaEval and Chatbot Arena)!
  • Sep. 22, 2023: 📢 We are organizing a Workshop on Instruction Tuning and Instruction Following at NeurIPS 2023. Please consider submitting your paper or joining us at the conference!
  • Sep. 22, 2023: Tülu was accepted to the NeurIPS 2023 Datasets and Benchmarks Track. See you in New Orleans!
  • June 9, 2023: We posted a paper on arXiv that systematically studies instruction-tuning resources and released Tülu, a suite of full-parameter instruction-tuned models from 7B to 65B! [Tweets]
  • May 2, 2023: We had three papers accepted to ACL 2023. Looking forward to meeting people in Toronto!
  • Apr. 18, 2023: I gave a guest lecture on instruction tuning of large language models at JHU. [Slides][Video]
  • Jan. 23, 2023: I started a part-time research internship at AI2.
  • Dec. 20, 2022: We posted Self-Instruct on arXiv, a new way to align language models with little human annotation. [Tweets]
  • Apr. 16, 2022: We released Natural Instructions V2, which covers 1600+ NLP tasks together with their instructions!

Selected Publications [See all on my Google Scholar]

Camels in a Changing Climate: Enhancing LM Adaptation with Tülu 2

Hamish Ivison*, Yizhong Wang*, Valentina Pyatkin, Nathan Lambert, Matthew Peters, Pradeep Dasigi, Joel Jang, David Wadden, Noah A. Smith, Iz Beltagy, Hannaneh Hajishirzi

arXiv preprint

How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources (Spotlight)

Yizhong Wang*, Hamish Ivison*, Pradeep Dasigi, Jack Hessel, Tushar Khot, Khyathi Raghavi Chandu, David Wadden, Kelsey MacMillan, Noah A. Smith, Iz Beltagy, Hannaneh Hajishirzi

NeurIPS 2023 (Datasets and Benchmarks Track)

Self-Instruct: Aligning Language Models with Self-Generated Instructions

Yizhong Wang, Yeganeh Kordi, Swaroop Mishra, Alisa Liu, Noah A. Smith, Daniel Khashabi, Hannaneh Hajishirzi

ACL 2023

Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks

Yizhong Wang*, Swaroop Mishra*, Pegah Alipoormolabashi, Yeganeh Kordi, et al.

EMNLP 2022

Probing Across Time: What Does RoBERTa Know and When?

Leo Z. Liu*, Yizhong Wang*, Jungo Kasai, Hannaneh Hajishirzi, Noah A. Smith

EMNLP 2021 Findings

Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics

Swabha Swayamdipta, Roy Schwartz, Nicholas Lourie, Yizhong Wang, Hannaneh Hajishirzi, Noah A. Smith, Yejin Choi

EMNLP 2020

Do Neural NLP Models Know Numbers? Probing Numeracy in Embeddings

Eric Wallace*, Yizhong Wang*, Sujian Li, Sameer Singh, Matt Gardner

EMNLP-IJCNLP 2019

DROP: A Reading Comprehension Benchmark Requiring Discrete Reasoning Over Paragraphs

Dheeru Dua, Yizhong Wang, Pradeep Dasigi, Gabriel Stanovsky, Sameer Singh, Matt Gardner

NAACL 2019

Multi-Passage Machine Reading Comprehension with Cross-Passage Answer Verification

Yizhong Wang, Kai Liu, Jing Liu, Wei He, Yajuan Lyu, Hua Wu, Sujian Li, Haifeng Wang

ACL 2018

DuReader: a Chinese Machine Reading Comprehension Dataset from Real-world Applications

Wei He, Kai Liu, Yajuan Lyu, Shiqi Zhao, Xinyan Xiao, Yuan Liu, Yizhong Wang, Hua Wu, Qiaoqiao She, Xuan Liu, Tian Wu, Haifeng Wang

ACL 2018 Workshop on Machine Reading for Question Answering

A Two-Stage Parsing Method for Text-level Discourse Analysis (Outstanding Paper Award)

Yizhong Wang, Sujian Li, Houfeng Wang

ACL 2017

* indicates equal contribution.