Sayan is a Research Scientist at Adobe Research. His research interests include Multimodal Learning (Vision-Language, Audio-Visual), Self-Supervised Learning, and Time-Series Analysis. Before joining Adobe, Sayan earned his Ph.D. from the University of Toronto, Canada, in 2024.

More information, including a list of publications, can be found here.

Publications

Agentic Design Review System

Nag, Sayan., J, Joseph., Goswami, Koustava., Morariu, Vlad., Srinivasan, Balaji. (Jan. 20, 2026)

AAAI Conference on Artificial Intelligence 2026

Localizing Knowledge in Diffusion Transformers

Zarei, Arman., Basu, Samyadeep., Rezaei, Keivan., Lin, Zihao., Nag, Sayan., Feizi, Soheil. (Dec. 1, 2025)

Neural Information Processing Systems (NeurIPS)

SAFARI: Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation

Nag, Sayan., Goswami, Koustava., Karanam, Srikrishna. (Sep. 29, 2024)

European Conference on Computer Vision (ECCV)

MeLFusion: Synthesizing Music from Image and Language Cues using Diffusion Models

Chowdhury, Sanjoy., Nag, Sayan., J, Joseph., Srinivasan, Balaji., Manocha, Dinesh. (Jun. 17, 2024)

CVPR 2024 (Highlight)