Zihao He

I'm a Research Scientist on the Monetization GenAI team at Meta. I obtained my PhD in CS from USC, where I was advised by Prof. Kristina Lerman and was part of the SEA Lab. My research interests include NLP, LLM alignment & evaluation, and reinforcement learning from human feedback (RLHF).

Previously, I received my undergraduate degree in Communication Engineering from Beijing University of Posts and Telecommunications, and I spent time at Tsinghua University working with Shutao Xia. During my PhD, I interned at TikTok, Amazon, and DiDi Global.

Industrial Experience

Research Scientist @ Meta, Menlo Park, CA, Feb 2025 – Present
Research Scientist Intern @ TikTok, San Jose, CA, May–Aug 2024
Applied Scientist Intern @ Amazon, Remote, May–Aug 2022
Algorithm Development Intern @ DiDi Global, Beijing, Apr–Jul 2020

Education

Aug 2019 – Feb 2025 Ph.D. in Computer Science, University of Southern California, Los Angeles, CA, United States

Sep 2018 – Jun 2019 (Transferred) M.S. in Computer Engineering, Tsinghua University, Beijing, China

Sep 2014 – Jun 2018 B.E. in Communication Engineering, Beijing University of Posts and Telecommunications, Beijing, China

Selected Publications & Preprints

Multi-Task Reinforcement Learning for Enhanced Multimodal LLM-as-a-Judge

Junjie Wu, Xuan Kan, Zihao He, Shunwen Tan, Bo Pan, Kaitai Zhang

ACL Industry Track, 2026

BigTokDetect: A Clinically-Informed Vision-Language Modeling Framework for Detecting Pro-Bigorexia Videos on TikTok

Minh Duc Chu, Kshitij Pawar, Zihao He, Roxanna Sharifi, Ross Sonnenblick, Magdalayna Curry, Laura D'Adamo, Lindsay Young, Stuart B Murray, Kristina Lerman

EACL, 2026

STEER-BENCH: A Benchmark for Evaluating the Steerability of Large Language Models

Kai Chen, Zihao He, Taiwei Shi, Kristina Lerman

EMNLP, 2025

Smoothing Out Hallucinations: Mitigating LLM Hallucination with Smoothed Knowledge Distillation

Hieu Nguyen, Zihao He, Shoumik Atul Gandre, Ujjwal Pasupulety, Sharanya Kumari Shivakumar, Kristina Lerman

Preprint, 2025

Improving and Assessing the Fidelity of Large Language Models Alignment to Online Communities

Minh Duc Chu, Zihao He, Rebecca Dorn, Kristina Lerman

NAACL, 2025

COMMUNITY-CROSS-INSTRUCT: Unsupervised Instruction Generation for Aligning Large Language Models to Online Communities

Zihao He, Rebecca Dorn, Siyi Guo, Minh Duc Chu, Kristina Lerman

EMNLP, 2024

How Susceptible are Large Language Models to Ideological Manipulation?

Kai Chen, Zihao He, Jun Yan, Taiwei Shi, Kristina Lerman

EMNLP, 2024

Whose Emotions and Moral Sentiments Do Language Models Reflect?

Zihao He, Siyi Guo, Ashwin Rao, Kristina Lerman

ACL-Findings, 2024

Media

CPL-NoViD: Context-Aware Prompt-based Learning for Norm Violation Detection in Online Communities

Zihao He, Jonathan May, Kristina Lerman

ICWSM, 2024

Code

IsamasRed: A Public Dataset Tracking Reddit Discussions on Israel-Hamas Conflict

Kai Chen, Zihao He, Keith Burghardt, Jingxin Zhang, Kristina Lerman

ICWSM, 2024

Code

Don't Blame the Data, Blame the Model: Understanding Noise and Bias When Learning from Subjective Annotations

Abhishek Anand, Negar Mokhberian, Prathyusha Kumar, Anweasha Saha, Zihao He, Ashwin Rao, Fred Morstatter, Kristina Lerman

Proceedings of the 1st Workshop on Uncertainty-Aware NLP (UncertaiNLP), 2024

Reading Between the Tweets: Deciphering Ideological Stances of Interconnected Mixed-ideology Communities

Zihao He, Ashwin Rao, Siyi Guo, Negar Mokhberian, Kristina Lerman

EACL-Findings, 2024

Code

Measuring Online Emotional Reactions to Events

Siyi Guo, Zihao He, Ashwin Rao, Eugene Jang, Yuanfeixue Nan, Fred Morstatter, Jeffrey Brantingham, Kristina Lerman

ASONAM, 2023

Anger Breeds Controversy: Analyzing Controversy and Emotions on Reddit

Kai Chen, Zihao He, Rong-Ching Chang, Jonathan May, Kristina Lerman

SBP-BRiMS, 2023

ALCAP: Alignment-Augmented Music Captioner

Zihao He, Weituo Hao, Wei-Tsung Lu, Changyou Chen, Kristina Lerman, Xuchen Song

EMNLP, 2023

Code

Infusing Knowledge from Wikipedia to Enhance Stance Detection

Zihao He, Negar Mokhberian, Kristina Lerman

Proceedings of the 12th Workshop on Computational Approaches to Subjectivity, Sentiment & Social Media Analysis, 2022

Code

Detecting Polarized Topics Using Partisanship-aware Contextualized Topic Embeddings

Zihao He, Negar Mokhberian, António Câmara, Andres Abeliuk, Kristina Lerman

EMNLP-Findings, 2021

Code Media

Speaker Turn Modeling for Dialogue Act Classification

Zihao He, Leili Tavabi, Kristina Lerman, Mohammad Soleymani

EMNLP-Findings, 2021

Code

Professional Service

Program Committee Member / Reviewer:

ACL Rolling Review (ARR) — 11 review cycles, Aug 2023 to Jan 2026, feeding ACL, EMNLP, NAACL, and EACL
International Conference on Computational Linguistics (COLING / LREC-COLING) — 2024, 2025
AAAI International Conference on Web and Social Media (ICWSM) — 2024
Conference on Complex Systems (CCS) — 2021 (Program Committee)
Workshop on Trustworthy Natural Language Processing (TrustNLP) at NAACL 2024, NAACL 2025, ACL 2026
Workshop on Representation Learning for Responsible Human-Centric AI (R2HCAI) at AAAI 2023

Teaching Experience

DSCI-531: Fairness in Artificial Intelligence. Instructor: Kristina Lerman. Spring 2022
CSCI-566: Deep Learning and its Applications. Instructor: Yue Zhao. Spring 2024

Miscellany

In my spare time I enjoy cooking and hiking.
I was born in Yichang, Hubei, a key transportation hub on the Yangtze River. The city is known for its proximity to the Three Gorges Dam, the world's largest hydroelectric power station.
Many thanks to Nelson Liu for sharing the source code of this website!
