I am a third-year PhD student at UNC Chapel Hill. I currently work in the MURGe-Lab, where I am advised by Mohit Bansal. My research interests are in Deep Learning, Machine Learning, and Computer Vision. Recently, I have been particularly interested in multi-modal learning, parameter-efficient transfer learning, and continual learning, where my goal is to enable agents to learn and perform inference as humans do. Before joining MURGe-Lab, I also worked with Colin Raffel and Marc Niethammer.
I have also spent summers working as a research scientist intern at tech companies. In summer 2023, I interned at Meta with Abhimanyu Dubey, Filip Radenovic, and Abhishek Kadian on text-to-image generation. In summer 2022, I worked at Microsoft with Linjie Li, Kevin Lin, and Zhe Gan on vision-language (VL) model merging.
July 2023: "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" is accepted to ICCV 2023.
May 2023: Started a research internship at Meta.
April 2023: A preprint of "An Empirical Study of Multimodal Model Merging" is online.
Feb 2023: "Vision Transformers are Parameter-Efficient Audio-Visual Learners" is accepted to CVPR 2023.