I’m Saransh Sharma, a 2025 Dual Degree (B.Tech + M.Tech) graduate in Computer Science from IIT Kharagpur. My research focuses on designing intelligent document experiences, including developing new document interactions and functionalities, extracting multimodal insights, and generating multimodal artifacts that help users better understand, explore, and interact with document content. My thesis focused on leveraging large language models for intent detection in both text and multimodal settings. I have also worked on agentic frameworks for extractive summarization and enhancing multilingual understanding of language models.