Project Description
I am broadly interested in developing safe and generalizable multimodal intelligence through algorithms that learn effectively with minimal human supervision. Currently, working on multimodal AI across video, image, audio, and language, with the goal of advancing multimodal understanding, generation, and reasoning.
Research Classification
- Engineering and technology
Research Interests
- Multimodal AI
- Computer Vision
Faculty
Faculty of Science