Justin Salamon

Principal Scientist

San Francisco

Justin is a Principal Scientist and head of the Sound Design AI (SODA) group at Adobe Research in San Francisco. SODA works on sound generation and understanding for creative video and audio applications. The team ships foundation audio and audiovisual models that solve real user needs, working with product teams to ship groundbreaking creative experiences. Before joining Adobe, Justin was a Senior Research Scientist at New York University. For further information and a full list of his publications please see: www.justinsalamon.com

Publications

SILA: Signal-to-Language Augmentation for Enhanced Control in Text-to-Audio Generation

Kumar, Sonal., Seetharaman, Prem., Salamon, Justin., Nieto, Oriol. (Oct. 12, 2025)

IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

FLAM: Frame-Wise Language-Audio Modeling

Wu, Yusong., Tsirigotis, Christos., Chen, Ke., Huang, Cheng-Zhi., Courville, Aaron., Nieto, Oriol., Seetharaman, Prem., Salamon, Justin. (May. 8, 2025)

International Conference on Machine Learning (ICML)

Sketch2Sound: Controllable Audio Generation via Time-Varying Signals and Sonic Imitations

García, Hugo., Nieto, Oriol., Salamon, Justin., Pardo, Bryan., Seetharaman, Prem. (Apr. 7, 2025)

International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Augment, Drop & Swap: Improving Diversity in LLM Captions for Efficient Music-Text Representation Learning

Manco, Ilaria., Salamon, Justin., Nieto, Oriol. (Nov. 10, 2024)

International Society for Music Information Retrieval Conference (ISMIR)

Bridging High-Quality Audio and Video via Language for Sound Effects Retrieval from Visual Queries

Wilkins, Julia., Salamon, Justin., Fuentes, Magdalena., Bello, Juan., Nieto, Oriol. (Oct. 22, 2023)

IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

Efficient Spoken Language Recognition Via Multilabel Classification

Nieto, Oriol., Jin, Zeyu., Dernoncourt, Franck., Salamon, Justin. (Aug. 24, 2023)

Interspeech 2023

Language-Guided Audio-Visual Source Separation via Trimodal Consistency

Tan, Reuben., Ray, Arijit., Plummer, Bryan., Salamon, Justin., Nieto, Oriol., Russell, Bryan., Saenko, Kate. (Jun. 18, 2023)

Highlight Paper (Top 10%)

Conference on Computer Vision and Pattern Recognition (CVPR)

Audio-Text Models Do Not Yet Leverage Natural Language

Wu, Ho-Hsiang., Nieto, Oriol., Bello, Juan., Salamon, Justin. (Jun. 4, 2023)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Automated Acoustic Monitoring Captures Timing and Intensity of Bird Migration

Doren, Benjamin., Lostanlen, Vincent., Cramer, Aurora., Salamon, Justin., Dokter, Adriaan., Kelling, Steve., Bello, Juan., Farnsworth, Andrew. (Dec. 14, 2022)

Journal of Applied Ecology

Filler Word Detection and Classification: A Dataset and Benchmark

Zhu, Ge., Caceres, Juan-Pablo., Salamon, Justin. (Sep. 18, 2022)

23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)

HEAR: Holistic Evaluation of Audio Representations

Turian, Joseph., Shier, Jordie., Khan, Humair., Raj, Bhiksha., Schuller, Björn., Steinmetz, Christian., Malloy, Colin., Tzanetakis, George., Velarde, Gissel., McNally, Kirk., Henry, Max., Pinto, Nicolas., Noufi, Camille., Clough, Christian., Herremans, Dorien., Fonseca, Eduardo., Engel, Jesse., Salamon, Justin., Esling, Philippe., Manocha, Pranay., Watanabe, Shinji., Jin, Zeyu., Bisk, Yonatan. (Jul. 20, 2022)

NeurIPS 2021

It’s Time for Artistic Correspondence in Music and Video

Surís, Dídac., Vondrick, Carl., Russell, Bryan., Salamon, Justin. (Jun. 19, 2022)

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Deep Embeddings and Section Fusion Improve Music Segmentation

Salamon, Justin., Nieto, Oriol., Bryan, Nicholas. (Nov. 8, 2021)

International Society for Music Information Retrieval Conference (ISMIR)

Emotion Embedding Spaces for Matching Music to Stories

Won, Minz., Salamon, Justin., Bryan, Nicholas., Mysore, Gautham., Serra, Xavier. (Nov. 8, 2021)

Best Student Paper Award

International Society for Music Information Retrieval Conference (ISMIR)

Who Calls the Shots? Rethinking Few-Shot Learning for Audio

Wang, Yu., Bryan, Nicholas., Salamon, Justin., Cartwright, Mark., Bello, Juan. (Oct. 18, 2021)

Best Audio Few-Shot Learning Paper Award

IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

Few-Shot Continual Learning for Audio Classification

Wang, Yu., Bryan, Nicholas., Cartwright, Mark., Bello, Juan., Salamon, Justin. (Jun. 8, 2021)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

What’s all the Fuss about Free Universal Sound Separation Data?

Wisdom, Scott., Erdogan, Hakan., Ellis, Daniel., Serizel, Romain., Turpault, Nicolas., Fonseca, Eduardo., Salamon, Justin., Seetharaman, Prem., Hershey, John. (Jun. 8, 2021)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Sound Event Detection and Separation: A Benchmark on DESED Synthetic Soundscapes

Turpault, Nicolas., Serizel, Romain., Wisdom, Scott., Erdogan, Hakan., Hershey, John., Fonseca, Eduardo., Seetharaman, Prem., Salamon, Justin. (Jun. 8, 2021)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Improving Sound Event Detection in Domestic Environments using Sound Separation

Turpault, Nicolas., Wisdom, Scott., Erdogan, Hakan., Hershey, John., Serizel, Romain., Fonseca, Eduardo., Seetharaman, Prem., Salamon, Justin. (Nov. 2, 2020)

Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE)

SONYC-UST-V2: An Urban Sound Tagging Dataset with Spatiotemporal Context

Cartwright, Mark., Cramer, Aurora., Méndez, Ana., Wang, Yu., Wu, Ho-Hsiang., Lostanlen, Vincent., Fuentes, Magdalena., Dove, Graham., Mydlarz, Charlie., Salamon, Justin., Nov, Oded., Bello, Juan. (Nov. 2, 2020)

Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE)

Controllable Neural Prosody Synthesis

Morrison, Maxwell., Jin, Zeyu., Salamon, Justin., Bryan, Nicholas., Mysore, Gautham. (Oct. 26, 2020)

Interspeech 2020

Metric Learning vs Classification for Disentangled Music Representation Learning

Lee, Jongpil., Bryan, Nicholas., Salamon, Justin., Jin, Zeyu., Nam, Juhan. (Oct. 11, 2020)

International Society for Music Information Retrieval Conference (ISMIR)

Few-Shot Drum Transcription in Polyphonic Music

Wang, Yu., Salamon, Justin., Cartwright, Mark., Bryan, Nicholas., Bello, Juan. (Oct. 11, 2020)

International Society for Music Information Retrieval Conference (ISMIR)

Telling Left From Right: Learning Spatial Correspondence of Sight and Sound

Yang, Karren., Russell, Bryan., Salamon, Justin. (Jun. 14, 2020)

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Chirping up the Right Tree: Incorporating Biological Taxonomies into Deep Bioacoustic Classifiers

Cramer, Jason., Lostanlen, Vincent., Farnsworth, Andrew., Salamon, Justin., Bello, Juan. (May. 4, 2020)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Sound Event Detection in Synthetic Domestic Environments

Serizel, Romain., Turpault, Nicolas., Shah, Ankit., Salamon, Justin. (May. 4, 2020)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Disentangled Multidimensional Metric Learning For Music Similarity

Lee, Jongpil., Bryan, Nicholas., Salamon, Justin., Jin, Zeyu., Nam, Juhan. (May. 4, 2020)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Few-Shot Sound Event Detection

Wang, Yu., Salamon, Justin., Bryan, Nicholas., Bello, Juan. (May. 4, 2020)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Robust Sound Event Detection in Bioacoustic Sensor Networks

Lostanlen, Vincent., Salamon, Justin., Farnsworth, Andrew., Kelling, Steve., Bello, Juan. (Oct. 26, 2019)

PLoS ONE 14(10): e0214168, 2019. DOI: https://doi.org/10.1371/journal.pone.0214168

Sound Event Detection in Domestic Environments with Weakly Labeled Data and Soundscape Synthesis

Turpault, Nicolas., Serizel, Romain., Shah, Ankit., Salamon, Justin. (Oct. 25, 2019)

Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019)

SONYC Urban Sound Tagging (SONYC-UST): A Multilabel Dataset from an Urban Acoustic Sensor Network

Cartwright, Mark., Mendez, Ana., Cramer, Jason., Lostanlen, Vincent., Dove, Graham., Wu, Ho-Hsiang., Salamon, Justin., Nov, Oded., Bello, Juan. (Oct. 25, 2019)

Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019)

TriCycle: Audio Representation Learning from Sensor Network Data Using Self-Supervision

Cartwright, Mark., Cramer, Jason., Salamon, Justin., Bello, Juan. (Oct. 20, 2019)

IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

What’s Broken in Music Informatics Research? Three Uncomfortable Statements

Salamon, Justin. (Jun. 15, 2019)

Machine Learning for Music Discovery workshop, International Conference on Machine Learning (ICML)

News