Audio

We invent audio AI technology for human-centered creativity. Our goal is to empower people to bring their creative ideas to life through high-quality audio content. This ranges from music and podcasts to video and immersive experiences, as well as emerging media types. We are dramatically simplifying the creation process with AI so that people can quickly go from idea to produced content regardless of skill level, iterating on the creative aspects rather than the technical ones. We do this through our research in the analysis, processing, and generation of speech, music, everyday sounds, and more. We also work on problems at the intersection of audio with video, augmented reality, and natural language processing. To advance all of these research areas, we develop new machine learning models, novel signal processing algorithms, and new human-computer interaction paradigms.

Meet some of our researchers

Jiaqi Su

Audio Research Scientist

Jonah Casebeer

Research Scientist

Jui-Hsien Wang

Research Engineer

View our latest publications

SpeakEasy: Enhancing Text-to-Speech Interactions for Expressive Content Creation

Brade, S., Anderson, S., Kumar, R., Jin, Z., Truong, A. (Apr. 26, 2025)

CHI 2025

Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs

Ghosh, S., Evuru, C., Kumar, S., Tyagi, U., Nieto, O., Jin, Z., Manocha, D. (Apr. 21, 2025)

International Conference on Learning Representations (ICLR)

ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds

Ghosh, S., Kumar, S., Evuru, C., Nieto, O., Duraiswami, R., Manocha, D. (Apr. 7, 2025)

International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Project Super Sonic

With Project Super Sonic, users can generate sound effects for videos simply by entering a prompt. They can also click on objects within the video to create sounds, with no prompt needed. Timing can be controlled using voice input, making the experience highly intuitive. It’s a cool way to enhance your videos with custom audio that fits perfectly.

Join us!

We are looking for researchers, engineers, and interns to take our technologies to the next level. We would love to hear from you!