Sayan is a Research Scientist at Adobe Research. His research interests include Multimodal Learning (Vision-Language, Audio-Visual), Self-Supervised Learning, and Time-Series Analysis. Before joining Adobe, Sayan earned his Ph.D. from the University of Toronto, Canada, in 2024.

More information, including a list of publications, can be found here.

Publications

SAFARI: Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation

Nag, Sayan., Goswami, Koustava., Karanam, Srikrishna. (Sep. 29, 2024)

European Conference on Computer Vision (ECCV)

MeLFusion: Synthesizing Music from Image and Language Cues using Diffusion Models

Chowdhury, Sanjoy., Nag, Sayan., J, Joseph., Srinivasan, Balaji., Manocha, Dinesh. (Jun. 17, 2024)

CVPR 2024 (Highlight)