I am a final-year Ph.D. candidate at Paul G. Allen School of
Computer Science & Engineering, University of Washington, advised by Prof. Yejin Choi. I am also a graduate student researcher at NVIDIA and was previously a student researcher at Allen Institute for Artificial Intelligence (Ai2).
My research centers on
Humanistic,
Pluralistic, and
Coevolutionary AI Safety and Alignment
aiming to foster the long-term secure, sustainable, and synergistic coevolution of AI and
humanity:
From Human to AI: Developing human-centered, ever-evolving, and future-oriented AI systems,
anchored in interdisciplinary insights into human intelligence, values, and global needs.
From AI to Human: Advancing the frontiers of human knowledge, augmenting human capabilities, and
addressing consequential sociotechnical challenges through robust, efficient, and scalable innovations
in data, learning algorithms, and AI system design.
My current research focuses on developing data, algorithmic, and system-level solutions to address sociotechnical challenges in AI safety, security, and LLM alignment, often through multi-agent, RL, and data synthesis angles. My works have spearheaded research on moral and (pluralistic) value reasoning of LLMs. I also work on scalable, steerable methods for pluralistic alignment, proactive interventions to mitigate long-term risks such as the erosion of human creativity and democracy, and the creation of socially beneficial AI systems that advance collective well-being.
Artificial Intelligence
Natural Language Processing
AI Safety
Machine Learning
Pluralistic Alignment
Human-AI Interaction
โฐ๏ธ I'm on the job market for 2026!
Please feel free to reach out if you think my
background could be a good fit for your organization.
Education & Experience
Education
University of Washington
Ph.D. in Computer Science and Engineering
Sept 2019 - Dec 2025 (expected), Seattle, Washington
Colby College
B.A. in Computer Science and Mathematics (Summa Cum Laude, Top 0.5%)
Sept 2015 - Dec 2018, Waterville, Maine
Professional Experience
NVIDIA
Graduate Student Researcher at Nemo-Guardrail Team
March 2025 - Present, Santa Clara, California
Allen Institute for Artificial Intelligence (Ai2)
Graduate Student Researcher at Mosaic and AllenNLP Teams
June 2020 - December 2024, Seattle, Washington
Stanford University
Undergraduate Research Intern at the Computer Science Department
June 2017 - September 2019, Stanford, California
The Future Laboratory, Tsinghua University
Visiting Student Researcher
July 2019 - September 2019, Beijing, China
Awards
Outstanding Paper Award
AI Agents: Capabilities and Safety (AIA) Workshop @ COLM 2025 (Oct 2025)
Best Paper Award
CHI 2024 (May 2024)
Outstanding Paper Award
EMNLP 2023 (Dec 2023)
Best Paper Award
NAACL 2022 (Jul 2022)
Anne Dinning - Michael Wolf Endowed Regental Fellowship
University of Washington (2019)
Paul G. Allen School First-Year Ph.D. Fellowship
Member of the Phi Beta Kappa Society
Colby College (2018)
Elected as a member of Phi Beta Kappa with junior standing
Honorable Mention of Interdisciplinary Contest in Modeling (ICM)
COMAP (2018)
20th annual Interdisciplinary Contest in Modeling (ICM)
Phi Beta Kappa Undergraduate Scholastic Achievement Award
Colby College (2017)
Top two students in the sophomore and junior classes
Julius Seelye Bixler Scholar
Colby College (2016, 2017, 2018)
Top-ranking students as determined by the cumulative academic record, three-time
recipient
Phi Beta Kappa Summer Research Scholar
Colby College (2016)
Summer research stipend
Teaching & Services
Courses
Head TA for the NLP class with 230+ undergraduate and graduate students
Co-design the class module, including teaching materials and assignments
Lead TA for a graduate-level seminar with over 30 students
Conference Tutorials
July 2025, Co-Instructor, ACL 2025
Guest Lectures
COM SCI 162: Natural Language Processing, UCLA
Red-Teaming and Safeguarding Language Models: Current Practices, Challenges, and Future Directions
Slides
May 2025 (Instructor: Saadia Gabriel)
11-830: Ethics, Social Biases, and Positive Impact in Language Technologies, CMU
Red-Teaming and Safeguarding Language Models: Current Practices, Challenges, and Future Directions
Slides
Feb 2025 (Instructor: Maarten Sap)
IS504: Sociotechnical Information Systems, UIUC
How to Build AI with Deep Concerns for Human Traits, Values, and Needs?
Slides
Nov 2024 (Instructor: Yue Guo)
CS475: ML for NLP, KAIST, South Korea
LLM Reasoning (In-Context Learning, Prompting, and Reasoning)
Slides
Nov 2024 (Instructor: Alice Oh)
CSE 447: Natural Language Processing, University of Washington
In-Context Learning, Prompting, and Basics of Reasoning
Slides
Nov 2024 (Instructor: Yulia Tsvetkov)
CS1684/2084: Bias and Ethical Implications in Artificial Intelligence, University of Pittsburgh
How to Build AI with Deep Concerns for Human Traits, Values, and Needs?
Slides
Oct 2024 (Instructor: Xiang Lorraine Li)
CSE 163: Intermediate Data Programming, University of Washington
How to Build AI with Deep Concerns for Human Traits, Values, and Needs?
Aug 2024 (Instructor: Yuxuan Mei)
Ethics and Citizenship, The Downtown School, Seattle
Can We Teach Machines Human Ethics and Values?
Sept 2023, w/ Valentina Pyatkin and Taylor Sorensen
CS496: AI Perspectives: Symbolic Reasoning to Deep Learning, Northwestern University
Toward Interpretable and Interactive Socially & Ethically Informed AI
March 2023 (Instructor: Mohammed Anwarul Alam)
LAW E 553: Technology Law And Public Policy Seminar, University of Washington
Toward Interpretable and Interactive Socially & Ethically Informed AI
March 2023 (Instructor: Inyoung Cheong)
Ethics and Citizenship, The Downtown School, Seattle
Toward Socially Aware & Ethically Informed AI
Slides
Sept 2022, w/ Saadia Gabriel
HONORS 222 B: Artificial Intelligence Meets Society, University of Washington
Toward Ethically Informed & Socially Aware AI
May 2022 (Instructor: Richard Freeman)
Workshop Organizations
Oct 2025, Co-Organizer, COLM 2025
Dec 2024, Co-Organizer, NeurIPS 2024
Dec 2023, Co-Organizer, NeurIPS 2023
Talks
Netskope
WildTeaming and WildGuard: Building Robust Model-Level and System-Level Safeguards of Language Models
May 2025, Speaker
Darpa ITM PI Meeting
Can Language Models Reason about Individualistic Human Values and Preferences?
March 2025, Speaker
University of Washington, Foster School of Business, Computational Minds and Machines lab
How to Build Machines with Deep Concerns of Human Traits, Values, and Needs?โTowards Humanistic AI Alignment
Feb 2025, Speaker (Hosted by Max Kleiman-Weiner)
Annual Research Showcase and Open House Event, UW CSE
AI Safety Panel
Oct 2024, Panelist
All-Ai2 Meeting, Allen Institute for Artificial Intelligence (Ai2)
WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer LMs
July 2024, Speaker
The Big Picture Workshop, EMNLP, Singapore
On the Outcomes of Scientific Disagreements on Machine Morality
Dec 2023, Speaker
Darpa ITM Kickoff PI Meeting
Toward Interpretable and Interactive Socially & Ethically Informed AI
May 2023, Speaker
Mosaic Morality & AI Series, Allen Institute for Artificial Intelligence (Ai2)
Toward Interpretable, Interactive, Informative Machine Moral Reasoning
Feb 2023, Discussant
UW NLP Retreat
Toward Socially Aware & Ethically Informed AI
Sept 2022, Speaker
All-Ai2 Meeting, Allen Institute for Artificial Intelligence (Ai2)
Delphi: Toward Machine Ethics and Norms
Oct 2021, Speaker
Personal
I deeply value mentorship and am profoundly grateful to the mentors who have shaped and supported my research journey (in alphabetical order): Chandra Bhagavatula, Antoine Bosselut, Yejin Choi, Oren Etzioni, Erick Galinkin, Jena D. Hwang, Natasha Jaques, James Landay, Ronan Le Bras, Sydney Levine, Christopher Parisien, Sherry Ruan, Maarten Sap, and Yulia Tsvetkov.
I firmly believe that everyone has the potential to achieve anything they set their mind to. Keep going
and try again.
Your path is uniquely yours. Follow what ignites you. Every twist, every turn, every unexpected direction
is exactly where you need to be.
Two cats, an orange tabby named Loopy and an orange british shorthair named Loafy, adopted me as their owner.
My current life motto: be happy and be healthy.