Publications

Published November 8, 2021

Deep Embeddings and Section Fusion Improve Music Segmentation

International Society for Music Information Retrieval Conference (ISMIR)

Justin Salamon, Oriol Nieto, Nicholas J. Bryan
  • AI & Machine Learning
  • Audio

Published November 8, 2021

Controllable deep melody generation via hierarchical music representation

International Society for Music Information Retrieval Conference (ISMIR)

Shuqi Dai, Zeyu Jin, Celso Gomes, Roger B. Dannenberg
  • Audio

Published November 8, 2021

Emotion Embedding Spaces for Matching Music to Stories

International Society for Music Information Retrieval Conference (ISMIR)

Minz Won, Justin Salamon, Nicholas J. Bryan, Gautham Mysore, Xavier Serra
Best Student Paper Award
  • AI & Machine Learning
  • Audio

Published October 18, 2021

Who Calls the Shots? Rethinking Few-Shot Learning for Audio

IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

Yu Wang, Nicholas J. Bryan, Justin Salamon, Mark Cartwright, Juan Pablo Bello
Best Audio Few-Shot Learning Paper Award
  • AI & Machine Learning
  • Audio

Published October 18, 2021

Auto-DSP: Learning to Optimize Acoustic Echo Cancellers

IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

Jonah Casebeer, Nicholas J. Bryan, Paris Smaragdis
  • AI & Machine Learning
  • Audio

Published October 17, 2021

HiFi-GAN-2: Studio-quality speech enhancement via generative adversarial networks conditioned on acoustic features

IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

Jiaqi Su, Zeyu Jin, Adam Finkelstein
  • AI & Machine Learning
  • Audio

Published June 11, 2021

Personalized HRTF Modeling Using DNN-augmented BEM

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Mengfan Zhang, Jui-Hsien Wang, Doug L. James
  • AI & Machine Learning
  • Audio
  • Graphics (2D & 3D)

Published June 9, 2021

CDPAM: Contrastive learning for perceptual audio similarity

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Pranay Manocha, Zeyu Jin, Richard Zhang, Adam Finkelstein
  • AI & Machine Learning
  • Audio

Published June 9, 2021

Bandwidth Extension is All You Need

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Jiaqi Su, Yunyun Wang, Adam Finkelstein, Zeyu Jin
  • AI & Machine Learning
  • Audio

Published June 8, 2021

Sound Event Detection and Separation: A Benchmark on DESED Synthetic Soundscapes

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Nicolas Turpault, Romain Serizel, Scott Wisdom, Hakan Erdogan, John R. Hershey, Eduardo Fonseca, Prem Seetharaman, Justin Salamon
  • AI & Machine Learning
  • Audio

Published June 8, 2021

What’s all the Fuss about Free Universal Sound Separation Data?

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Scott Wisdom, Hakan Erdogan, Daniel P. W. Ellis, Romain Serizel, Nicolas Turpault, Eduardo Fonseca, Justin Salamon, Prem Seetharaman, John R. Hershey
  • AI & Machine Learning
  • Audio

Published June 8, 2021

Few-Shot Continual Learning for Audio Classification

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Yu Wang, Nicholas J. Bryan, Mark Cartwright, Juan Pablo Bello, Justin Salamon
  • AI & Machine Learning
  • Audio

Published June 6, 2021

Context-Aware Prosody Correction for Text-Based Speech Editing

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Max Morrison, Lucas Rencker, Zeyu Jin, Nicholas J. Bryan, Juan-Pablo Caceres, Bryan Pardo
  • AI & Machine Learning
  • Audio

Published June 6, 2021

Differentiable Signal Processing with Black-Box Audio Effects

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Marco A. Martínez Ramírez, Oliver Wang, Paris Smaragdis, Nicholas J. Bryan
  • AI & Machine Learning
  • Audio

Published December 11, 2020

Audio-Based Music Structure Analysis: Current Trends, Open Challenges, and Applications

Transactions of the International Society for Music Information Retrieval (TISMIR)

Oriol Nieto, Gautham Mysore, Cheng-i Wang, Jordan B. L. Smith, Jan Schlüter, Thomas Grill, Brian McFee
  • AI & Machine Learning
  • Audio

Published November 30, 2020

MakeItTalk: Speaker-Aware Talking Head Animation

SIGGRAPH Asia 2020

Yang Zhou, Dingzeyu Li, Eli Shechtman, Jose Echevarria, Xintong Han, Evangelos Kalogerakis
  • AI & Machine Learning
  • Audio
  • Graphics (2D & 3D)

Published October 26, 2020

A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences

Interspeech 2020

Pranay Manocha, Adam Finkelstein, Richard Zhang, Nicholas J. Bryan, Gautham Mysore, Zeyu Jin
  • AI & Machine Learning
  • Audio

Published October 26, 2020

Controllable Neural Prosody Synthesis

Interspeech 2020

Maxwell Morrison, Zeyu Jin, Justin Salamon, Nicholas J. Bryan, Gautham Mysore
  • Audio

Published October 26, 2020

HiFi-GAN: High-fidelity denoising and dereverberation based on speech deep features in adversarial networks

Interspeech 2020

Jiaqi Su, Zeyu Jin, Adam Finkelstein
  • AI & Machine Learning
  • Audio

Published October 11, 2020

Few-Shot Drum Transcription in Polyphonic Music

International Society for Music Information Retrieval Conference (ISMIR)

Yu Wang, Justin Salamon, Mark Cartwright, Nicholas J. Bryan, Juan Pablo Bello
  • AI & Machine Learning
  • Audio