Publications

Published November 30, 2020

MakeItTalk: Speaker-Aware Talking Head Animation

SIGGRAPH Asia 2020

Yang Zhou, Dingzeyu Li, Eli Shechtman, Jose Echevarria, Xintong Han, Evangelos Kalogerakis
  • AI & Machine Learning
  • Audio
  • Graphics (2D & 3D)

Published October 26, 2020

A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences

Interspeech 2020

Pranay Manocha, Adam Finkelstein, Richard Zhang, Nicholas J. Bryan, Gautham Mysore, Zeyu Jin
  • AI & Machine Learning
  • Audio

Published October 26, 2020

Controllable Neural Prosody Synthesis

Interspeech 2020

Maxwell Morrison, Zeyu Jin, Justin Salamon, Nicholas J. Bryan, Gautham Mysore
  • Audio

Published October 11, 2020

Few-Shot Drum Transcription in Polyphonic Music

International Society for Music Information Retrieval Conference (ISMIR)

Yu Wang, Justin Salamon, Mark Cartwright, Nicholas J. Bryan, Juan Pablo Bello
  • AI & Machine Learning
  • Audio

Published October 11, 2020

Metric Learning vs Classification for Disentangled Music Representation Learning

International Society for Music Information Retrieval Conference (ISMIR)

Jongpil Lee, Nicholas J. Bryan, Justin Salamon, Zeyu Jin, Juhan Nam
  • AI & Machine Learning
  • Audio

Published August 24, 2020

Unified Multisensory Perception: Weakly-Supervised Audio-Visual Video Parsing

ECCV 2020

Dingzeyu Li, Yapeng Tian, Chenliang Xu
  • AI & Machine Learning
  • Audio
  • Computer Vision, Imaging & Video

Published June 24, 2020

Deep Audio Prior: Learning Sound Source Separation from a Single Audio Mixture

CVPR 2020 Sight and Sound Workshop

Dingzeyu Li, Yapeng Tian, Chenliang Xu
  • AI & Machine Learning
  • Audio

Published June 14, 2020

Telling Left From Right: Learning Spatial Correspondence of Sight and Sound

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Karren Yang, Bryan Russell, Justin Salamon
  • AI & Machine Learning
  • Audio
  • Computer Vision, Imaging & Video

Published May 4, 2020

Impulse Response Data Augmentation and Deep Neural Networks For Blind Room Acoustic Parameter Estimation

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Nicholas J. Bryan
  • AI & Machine Learning
  • Audio

Published May 4, 2020

Few-Shot Sound Event Detection

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Yu Wang, Justin Salamon, Nicholas J. Bryan, Juan Pablo Bello
  • AI & Machine Learning
  • Audio

Published May 4, 2020

Disentangled Multidimensional Metric Learning For Music Similarity

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Jongpil Lee, Nicholas J. Bryan, Justin Salamon, Zeyu Jin, Juhan Nam
  • AI & Machine Learning
  • Audio

Published May 4, 2020

One-Shot Parametric Audio Production Style Transfer With Application to Frequency Equalization

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Stylianos I. Mimilakis, Nicholas J. Bryan, Paris Smaragdis
  • AI & Machine Learning
  • Audio

Published May 4, 2020

Attentive Modality Hopping Mechanism for Speech Emotion Recognition

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

David Seunghyun Yoon, Subhadeep Dey, Hwanhee Lee, Kyomin Jung
  • AI & Machine Learning
  • Audio
  • Natural Language Processing

Published May 4, 2020

Sound Event Detection in Synthetic Domestic Environments

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Romain Serizel, Nicolas Turpault, Ankit Shah, Justin Salamon
  • AI & Machine Learning
  • Audio

Published May 4, 2020

Chirping up the Right Tree: Incorporating Biological Taxonomies into Deep Bioacoustic Classifiers

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Jason Cramer, Vincent Lostanlen, Andrew Farnsworth, Justin Salamon, Juan Pablo Bello
  • AI & Machine Learning
  • Audio

Published March 23, 2020

Scene-Aware Audio Rendering via Deep Acoustic Analysis

IEEE VR Journal Track (TVCG)

Zhenyu Tang, Nicholas J. Bryan, Dingzeyu Li, Tim Langlois, Dinesh Manocha
  • AI & Machine Learning
  • AR, VR & 360 Photography
  • Audio

Published October 26, 2019

Robust Sound Event Detection in Bioacoustic Sensor Networks

PLoS ONE 14(10): e0214168, 2019. DOI: https://doi.org/10.1371/journal.pone.0214168

Vincent Lostanlen, Justin Salamon, Andrew Farnsworth, Steve Kelling, Juan Pablo Bello
  • AI & Machine Learning
  • Audio

Published October 25, 2019

SONYC Urban Sound Tagging (SONYC-UST): A Multilabel Dataset from an Urban Acoustic Sensor Network

Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019)

Mark Cartwright, Ana Elisa Mendez Mendez, Jason Cramer, Vincent Lostanlen, Graham Dove, Ho-Hsiang Wu, Justin Salamon, Oded Nov, Juan Pablo Bello
  • AI & Machine Learning
  • Audio

Published October 25, 2019

Sound Event Detection in Domestic Environments with Weakly Labeled Data and Soundscape Synthesis

Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019)

Nicolas Turpault, Romain Serizel, Ankit Shah, Justin Salamon
  • AI & Machine Learning
  • Audio

Published October 20, 2019

TriCycle: Audio Representation Learning from Sensor Network Data Using Self-Supervision

IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

Mark Cartwright, Jason Cramer, Justin Salamon, Juan Pablo Bello
  • AI & Machine Learning
  • Audio
1 2 3 6