About
I am a 5th-year Ph.D. student in the Department of Computer Science at the University of Maryland, College Park, where I am advised by Prof. Hal Daumé III.
I am broadly interested in problems related to trustworthiness in multimodal settings, with the aim of enhancing human-centered AI. My research focuses on equipping visual-language models with self-reasoning and self-improvement capabilities to better serve human needs. This includes:
(i) Correcting hallucinations and communicating uncertainties [EMNLP 24, EMNLP 23]
(ii) Fostering pragmatic understanding by simulating human behavior [ACL 23]
(iii) Generating faithful explanations [Paper coming soon]
(iv) Visual-language alignment for long context [ACL 25]
News
- [Jun 2025] Excited to start a research internship at Microsoft Semantic Machines!
- [Feb 2025] New paper on Can Hallucination Correction Improve Video-Language Alignment?, accepted by ACL 2025.
- [Feb 2024] New paper on Successfully Guiding Humans with Imperfect Instructions by Highlighting Potential Errors and Suggesting Alternatives, accepted by EMNLP 2024 (Oral). Project Website
- [Oct 2023] New paper on Hallucination Detection for Grounded Instruction Generation, accepted by EMNLP 2023. Project Website
- [Dec 2022] New paper on Define, Evaluate, and Improve Task-Oriented Cognitive Capabilities for Instruction Generation Models, accepted by ACL 2023 and the ICML Theory-of-Mind Workshop 2023 (Outstanding Paper Award). Project Website