Pritam Sarkar

Project Description

I am broadly interested in developing safe and generalizable multimodal intelligence through algorithms that learn effectively with minimal human supervision. I am currently working on multimodal AI across video, image, audio, and language, with the goal of advancing multimodal understanding, generation, and reasoning.

Research Classification

Engineering and technology

Research Interests

Multimodal AI
Computer Vision

Faculty

Faculty of Science

Personal Website

https://pritamsarkar.com/
