Publications

Published November 30, 2020

MakeItTalk: Speaker-Aware Talking Head Animation

SIGGRAPH Asia 2020

Yang Zhou, Dingzeyu Li, Eli Shechtman, Jose Echevarria, Xintong Han, Evangelos Kalogerakis
  • AI & Machine Learning
  • Audio
  • Graphics (2D & 3D)

Published August 24, 2020

Unified Multisensory Perception: Weakly-Supervised Audio-Visual Video Parsing

ECCV 2020

Dingzeyu Li, Yapeng Tian, Chenliang Xu
  • AI & Machine Learning
  • Audio
  • Computer Vision, Imaging & Video

Published June 24, 2020

Deep Audio Prior: Learning Sound Source Separation from a Single Audio Mixture

CVPR 2020 Sight and Sound Workshop

Dingzeyu Li, Yapeng Tian, Chenliang Xu
  • AI & Machine Learning
  • Audio

Published May 4, 2020

Impulse Response Data Augmentation and Deep Neural Networks For Blind Room Acoustic Parameter Estimation

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Nicholas J. Bryan
  • AI & Machine Learning
  • Audio

Published May 4, 2020

Few-Shot Sound Event Detection

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Yu Wang, Justin Salamon, Nicholas J. Bryan, Juan Pablo Bello
  • AI & Machine Learning
  • Audio

Published May 4, 2020

Disentangled Multidimensional Metric Learning For Music Similarity

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Jongpil Lee, Nicholas J. Bryan, Justin Salamon, Zeyu Jin, Juhan Nam
  • AI & Machine Learning
  • Audio

Published May 4, 2020

One-Shot Parametric Audio Production Style Transfer With Application to Frequency Equalization

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Stylianos I. Mimilakis, Nicholas J. Bryan, Paris Smaragdis
  • AI & Machine Learning
  • Audio

Published March 23, 2020

Scene-Aware Audio Rendering via Deep Acoustic Analysis

IEEE VR Journal Track (TVCG)

Zhenyu Tang, Nicholas J. Bryan, Dingzeyu Li, Tim Langlois, Dinesh Manocha
  • AI & Machine Learning
  • AR, VR & 360 Photography
  • Audio

Published October 26, 2019

Robust Sound Event Detection in Bioacoustic Sensor Networks

PLoS ONE 14(10): e0214168, 2019. DOI: https://doi.org/10.1371/journal.pone.0214168

Vincent Lostanlen, Justin Salamon, Andrew Farnsworth, Steve Kelling, Juan Pablo Bello
  • AI & Machine Learning
  • Audio

Published October 25, 2019

SONYC Urban Sound Tagging (SONYC-UST): A Multilabel Dataset from an Urban Acoustic Sensor Network

Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019)

Mark Cartwright, Ana Elisa Mendez Mendez, Jason Cramer, Vincent Lostanlen, Graham Dove, Ho-Hsiang Wu, Justin Salamon, Oded Nov, Juan Pablo Bello
  • AI & Machine Learning
  • Audio

Published October 25, 2019

Sound Event Detection in Domestic Environments with Weakly Labeled Data and Soundscape Synthesis

Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019)

Nicolas Turpault, Romain Serizel, Ankit Shah, Justin Salamon
  • AI & Machine Learning
  • Audio

Published October 20, 2019

TriCycle: Audio Representation Learning from Sensor Network Data Using Self-Supervision

IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

Mark Cartwright, Jason Cramer, Justin Salamon, Juan Pablo Bello
  • AI & Machine Learning
  • Audio

Published October 19, 2019

Real-Time Lip Sync for Live 2D Animation

arXiv

Deepali Aneja, Wilmot Li
  • AI & Machine Learning
  • Audio
  • Graphics (2D & 3D)

Published September 20, 2019

Exploiting semi-supervised training through a dropout regularization in end-to-end speech recognition

Interspeech 2019

Subhadeep Dey, Petr Motlicek, Trung Bui, Franck Dernoncourt
  • AI & Machine Learning
  • Audio
  • Natural Language Processing

Published August 1, 2019

Text-based Editing of Talking-head Video

ACM Transactions on Graphics (Proc. SIGGRAPH'19)

Ohad Fried, Ayush Tewari, Michael Zollhofer, Adam Finkelstein, Eli Shechtman, Dan B Goldman, Kyle Genova, Zeyu Jin, Christian Theobalt, Maneesh Agarwala
  • AI & Machine Learning
  • Audio
  • Computer Vision, Imaging & Video

Published June 15, 2019

What’s Broken in Music Informatics Research? Three Uncomfortable Statements

Machine Learning for Music Discovery workshop, International Conference on Machine Learning (ICML)

Justin Salamon
  • AI & Machine Learning
  • Audio

Published May 4, 2019

VoiceAssist: Guiding Users to High-Quality Voice Recordings

ACM Conference on Human Factors in Computing Systems (CHI)

Prem Seetharaman, Gautham Mysore, Bryan Pardo, Paris Smaragdis, Celso Gomes
  • AI & Machine Learning
  • Audio
  • Human Computer Interaction

Published May 4, 2019

B-Script: Transcript-based B-roll Video Editing with Recommendations

ACM Conference on Human Factors in Computing Systems (CHI)

Bernd Huber, Hijung Valentina Shin, Bryan Russell, Oliver Wang, Gautham Mysore
  • Audio
  • Computer Vision, Imaging & Video
  • Human Computer Interaction
  • Natural Language Processing

Published May 3, 2019

Audible Panorama: Automatic Spatial Audio Generation for Panorama Imagery

ACM Conference on Human Factors and Computing Systems (SIGCHI)

Haikun Huang, Michael S. Solah, Dingzeyu Li, Lap-Fai Yu
  • AI & Machine Learning
  • AR, VR & 360 Photography
  • Audio
  • Computer Vision, Imaging & Video
  • Graphics (2D & 3D)
  • Human Computer Interaction

Published February 1, 2019

Learning Affective Correspondence between Music and Image

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

Gaurav Verma, Eeshan Gunesh Dhekane, Tanaya Guha
  • AI & Machine Learning
  • Audio
  • Computer Vision, Imaging & Video
1 2 3 6