About
I am a 5th-year Ph.D. student at the Computer Science Department of University of Maryland, College Park, where I am advised by Prof. Hal Daumé III. I graduated from Columbia University with M.S. in Computer Science, and Sun Yat-Sen University with B.S. in Computational Mathematics.
I am broadly interested in studying problems related to trustworthiness in multimodal setting, aiming to enhance human-centered AI. My research focuses on equipping visual-language models with self-reasoning and self-improvement capabilities to better serve human needs. This includes:
(i) Correcting hallucinations and communicating uncertainties
(ii) Fostering pragmatic understanding by simulating human behavior
(iii) Generating faithful explanations
(iv) Visual-language alignment for long context
News
- [Jun 2025] Excited to start research internship at Microsoft Semantic Machines!
- [Feb 2025] New paper on Can Hallucination Correction Improve Video-Language Alignment? Accepted by ACL 2025.
- [Nov 2024] I’m going to EMNLP 2024 in person, to give a presentation for this paper. Let’s chat!
- [Feb 2024] New paper on Successfully Guiding Humans with Imperfect Instructions by Highlighting Potential Errors and Suggesting Alternatives, accepted by EMNLP 2024 main conference (Oral). Project Website
- [Oct 2023] New paper on Hallucination Detection for Grounded Instruction Generation, accepted by EMNLP 2023. Project Website
- [Dec 2022] New paper on Define, Evaluate, and Improve Task-Oriented Cognitive Capabilities for Instruction Generation Models. Accepted by ACL 2023, and ICML ToM Workshop 2023 (received Outstanding Paper Award). Code and dataset released on Project Website