Pritam Sarkar

Project Description

I am broadly interested in developing safe and generalizable multimodal intelligence through algorithms that learn effectively with minimal human supervision. Currently, working on multimodal AI across video, image, audio, and language, with the goal of advancing multimodal understanding, generation, and reasoning.
 

Research Classification

  • Engineering and technology

Research Interests

  • Multimodal AI
  • Computer Vision

Faculty

Faculty of Science