Justin Salamon

Senior Research Scientist

San Francisco

Justin is a senior research scientist and member of the Audio Research Group at Adobe Research in San Francisco. Previously, he was a senior research scientist at the Music and Audio Research Laboratory and the Center for Urban Science and Progress at New York University.

His research focuses on applying machine learning and signal processing to audio and video, with applications in machine listening, audiovisual and multimodal understanding, representation learning and self-supervision, audio for video, music information retrieval, bioacoustics, environmental sound analysis, and open-source software and data.

Please visit his personal website for a complete list of publications, research topics, code/data releases, and research news.

Publications

Sketch2Sound: Controllable Audio Generation via Time-Varying Signals and Sonic Imitations

García, Hugo., Nieto, Oriol., Salamon, Justin., Pardo, Bryan., Seetharaman, Prem. (Apr. 7, 2025)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Augment, Drop & Swap: Improving Diversity in LLM Captions for Efficient Music-Text Representation Learning

Manco, Ilaria., Salamon, Justin., Nieto, Oriol. (Nov. 10, 2024)

International Society for Music Information Retrieval Conference (ISMIR)

Bridging High-Quality Audio and Video via Language for Sound Effects Retrieval from Visual Queries

Wilkins, Julia., Salamon, Justin., Fuentes, Magdalena., Bello, Juan., Nieto, Oriol. (Oct. 22, 2023)

IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

Efficient Spoken Language Recognition Via Multilabel Classification

Nieto, Oriol., Jin, Zeyu., Dernoncourt, Franck., Salamon, Justin. (Aug. 24, 2023)

Interspeech 2023

Language-Guided Audio-Visual Source Separation via Trimodal Consistency

Tan, Reuben., Ray, Arijit., Plummer, Bryan., Salamon, Justin., Nieto, Oriol., Russell, Bryan., Saenko, Kate. (Jun. 18, 2023)

Highlight Paper (Top 10%)

Conference on Computer Vision and Pattern Recognition (CVPR)

Audio-Text Models Do Not Yet Leverage Natural Language

Wu, Ho-Hsiang., Nieto, Oriol., Bello, Juan., Salamon, Justin. (Jun. 4, 2023)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Automated Acoustic Monitoring Captures Timing and Intensity of Bird Migration

Doren, Benjamin., Lostanlen, Vincent., Cramer, Aurora., Salamon, Justin., Dokter, Adriaan., Kelling, Steve., Bello, Juan., Farnsworth, Andrew. (Dec. 14, 2022)

Journal of Applied Ecology

Filler Word Detection and Classification: A Dataset and Benchmark

Zhu, Ge., Caceres, Juan-Pablo., Salamon, Justin. (Sep. 18, 2022)

23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)

HEAR: Holistic Evaluation of Audio Representations

Turian, Joseph., Shier, Jordie., Khan, Humair., Raj, Bhiksha., Schuller, Björn., Steinmetz, Christian., Malloy, Colin., Tzanetakis, George., Velarde, Gissel., McNally, Kirk., Henry, Max., Pinto, Nicolas., Noufi, Camille., Clough, Christian., Herremans, Dorien., Fonseca, Eduardo., Engel, Jesse., Salamon, Justin., Esling, Philippe., Manocha, Pranay., Watanabe, Shinji., Jin, Zeyu., Bisk, Yonatan. (Jul. 20, 2022)

NeurIPS 2021

It’s Time for Artistic Correspondence in Music and Video

Surís, Dídac., Vondrick, Carl., Russell, Bryan., Salamon, Justin. (Jun. 19, 2022)

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Emotion Embedding Spaces for Matching Music to Stories

Won, Minz., Salamon, Justin., Bryan, Nicholas., Mysore, Gautham., Serra, Xavier. (Nov. 8, 2021)

Best Student Paper Award

International Society for Music Information Retrieval Conference (ISMIR)

Deep Embeddings and Section Fusion Improve Music Segmentation

Salamon, Justin., Nieto, Oriol., Bryan, Nicholas. (Nov. 8, 2021)

International Society for Music Information Retrieval Conference (ISMIR)

Who Calls the Shots? Rethinking Few-Shot Learning for Audio

Wang, Yu., Bryan, Nicholas., Salamon, Justin., Cartwright, Mark., Bello, Juan. (Oct. 18, 2021)

Best Audio Few-Shot Learning Paper Award

IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

Few-Shot Continual Learning for Audio Classification

Wang, Yu., Bryan, Nicholas., Cartwright, Mark., Bello, Juan., Salamon, Justin. (Jun. 8, 2021)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

What’s all the Fuss about Free Universal Sound Separation Data?

Wisdom, Scott., Erdogan, Hakan., Ellis, Daniel., Serizel, Romain., Turpault, Nicolas., Fonseca, Eduardo., Salamon, Justin., Seetharaman, Prem., Hershey, John. (Jun. 8, 2021)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Sound Event Detection and Separation: A Benchmark on DESED Synthetic Soundscapes

Turpault, Nicolas., Serizel, Romain., Wisdom, Scott., Erdogan, Hakan., Hershey, John., Fonseca, Eduardo., Seetharaman, Prem., Salamon, Justin. (Jun. 8, 2021)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

SONYC-UST-V2: An Urban Sound Tagging Dataset with Spatiotemporal Context

Cartwright, Mark., Cramer, Aurora., Méndez, Ana., Wang, Yu., Wu, Ho-Hsiang., Lostanlen, Vincent., Fuentes, Magdalena., Dove, Graham., Mydlarz, Charlie., Salamon, Justin., Nov, Oded., Bello, Juan. (Nov. 2, 2020)

Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE)

Improving Sound Event Detection in Domestic Environments using Sound Separation

Turpault, Nicolas., Wisdom, Scott., Erdogan, Hakan., Hershey, John., Serizel, Romain., Fonseca, Eduardo., Seetharaman, Prem., Salamon, Justin. (Nov. 2, 2020)

Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE)

Controllable Neural Prosody Synthesis

Morrison, Maxwell., Jin, Zeyu., Salamon, Justin., Bryan, Nicholas., Mysore, Gautham. (Oct. 26, 2020)

Interspeech 2020

Metric Learning vs Classification for Disentangled Music Representation Learning

Lee, Jongpil., Bryan, Nicholas., Salamon, Justin., Jin, Zeyu., Nam, Juhan. (Oct. 11, 2020)

International Society for Music Information Retrieval Conference (ISMIR)

Few-Shot Drum Transcription in Polyphonic Music

Wang, Yu., Salamon, Justin., Cartwright, Mark., Bryan, Nicholas., Bello, Juan. (Oct. 11, 2020)

International Society for Music Information Retrieval Conference (ISMIR)

Telling Left From Right: Learning Spatial Correspondence of Sight and Sound

Yang, Karren., Russell, Bryan., Salamon, Justin. (Jun. 14, 2020)

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Chirping up the Right Tree: Incorporating Biological Taxonomies into Deep Bioacoustic Classifiers

Cramer, Jason., Lostanlen, Vincent., Farnsworth, Andrew., Salamon, Justin., Bello, Juan. (May 4, 2020)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Sound Event Detection in Synthetic Domestic Environments

Serizel, Romain., Turpault, Nicolas., Shah, Ankit., Salamon, Justin. (May 4, 2020)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Disentangled Multidimensional Metric Learning For Music Similarity

Lee, Jongpil., Bryan, Nicholas., Salamon, Justin., Jin, Zeyu., Nam, Juhan. (May 4, 2020)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Few-Shot Sound Event Detection

Wang, Yu., Salamon, Justin., Bryan, Nicholas., Bello, Juan. (May 4, 2020)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Robust Sound Event Detection in Bioacoustic Sensor Networks

Lostanlen, Vincent., Salamon, Justin., Farnsworth, Andrew., Kelling, Steve., Bello, Juan. (Oct. 26, 2019)

PLoS ONE 14(10): e0214168, 2019. DOI: https://doi.org/10.1371/journal.pone.0214168

SONYC Urban Sound Tagging (SONYC-UST): A Multilabel Dataset from an Urban Acoustic Sensor Network

Cartwright, Mark., Méndez, Ana., Cramer, Jason., Lostanlen, Vincent., Dove, Graham., Wu, Ho-Hsiang., Salamon, Justin., Nov, Oded., Bello, Juan. (Oct. 25, 2019)

Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019)

Sound Event Detection in Domestic Environments with Weakly Labeled Data and Soundscape Synthesis

Turpault, Nicolas., Serizel, Romain., Shah, Ankit., Salamon, Justin. (Oct. 25, 2019)

Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019)

TriCycle: Audio Representation Learning from Sensor Network Data Using Self-Supervision

Cartwright, Mark., Cramer, Jason., Salamon, Justin., Bello, Juan. (Oct. 20, 2019)

IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

What’s Broken in Music Informatics Research? Three Uncomfortable Statements

Salamon, Justin. (Jun. 15, 2019)

Machine Learning for Music Discovery workshop, International Conference on Machine Learning (ICML)

News

New AI features make audio editing easier – and more accessible for everyone

Adobe Research interns hone their communication and coding skills in a summer of learning

Readability Research: This New Field Can Help Us All Read Better