Publications

Publication date: May 4, 2019

B-Script: Transcript-based B-roll Video Editing with Recommendations

ACM Conference on Human Factors in Computing Systems (CHI)

Bernd Huber, Hijung Valentina Shin, Bryan Russell, Oliver Wang, Gautham Mysore
  • Adobe Research icon Audio
  • Adobe Research icon Computer Vision, Imaging & Video
  • Adobe Research icon Human Computer Interaction
  • Adobe Research icon Natural Language Processing

Publication date: May 3, 2019

Audible Panorama: Automatic Spatial Audio Generation for Panorama Imagery

ACM Conference on Human Factors and Computing Systems (SIGCHI)

Haikun Huang, Michael S. Solah, Dingzeyu Li, Lap-Fai Yu
  • Adobe Research icon AI & Machine Learning
  • Adobe Research icon AR, VR & 360 Photography
  • Adobe Research icon Audio
  • Adobe Research icon Computer Vision, Imaging & Video
  • Adobe Research icon Graphics (2D & 3D)
  • Adobe Research icon Human Computer Interaction

Publication date: February 1, 2019

Learning Affective Correspondence between Music and Image

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

Gaurav Verma, Eeshan Gunesh Dhekane, Tanaya Guha
  • Adobe Research icon AI & Machine Learning
  • Adobe Research icon Audio
  • Adobe Research icon Computer Vision, Imaging & Video

Publication date: January 10, 2019

Recent Advances in Music Signal Processing

IEEE Signal Processing Magazine

Meinard Mueller, Bryan Pardo, Gautham Mysore, Vesa Valimaki
  • Adobe Research icon Audio

Publication date: December 3, 2018

Self-Supervised Generation of Spatial Audio for 360° Video

Neural Information Processing Systems (NIPS)

Pedro Morgado, Nuno Vasconcelos, Tim Langlois, Oliver Wang
  • Adobe Research icon AI & Machine Learning
  • Adobe Research icon Audio
  • Adobe Research icon Computer Vision, Imaging & Video

Publication date: September 9, 2018

Visually Indicated Sound Generation by Perceptually Optimized Classification

Proc. of the 1st Multimodal Learning and Applications Workshop (MULA 2018)

Kan Chen, Chuanxi Zhang, Chen Fang, Zhaowen Wang, Trung Bui, Ram Nevatia
Best Paper Award
  • Adobe Research icon AI & Machine Learning
  • Adobe Research icon Audio
  • Adobe Research icon Computer Vision, Imaging & Video

Publication date: September 2, 2018

A Framework for Speech Recognition Benchmarking

Proc. of Interspeech 2018

Franck Dernoncourt, Trung Bui, Walter Chang
  • Adobe Research icon AI & Machine Learning
  • Adobe Research icon Audio

Publication date: August 13, 2018

Scene-Aware Audio for 360° Videos

ACM Transactions on Graphics (SIGGRAPH)

Dingzeyu Li, Tim Langlois, Changxi Zheng
  • Adobe Research icon Audio
  • Adobe Research icon Computer Vision, Imaging & Video
  • Adobe Research icon Graphics (2D & 3D)
  • Adobe Research icon Human Computer Interaction

Publication date: August 8, 2018

Toward Wave-based Sound Synthesis for Computer Animation

ACM Transactions on Graphics (SIGGRAPH 2018)

Jui-Hsien Wang, Ante Qu, Tim Langlois, Doug James
  • Adobe Research icon Audio
  • Adobe Research icon Graphics (2D & 3D)

Publication date: August 3, 2018

Temporal extensions of Nonnegative Matrix Factorization

Book Chapter in Audio Source Separation and Speech Enhancement, Wiley, 2018

Cedric Fevotte, Paris Smaragdis, Nasser Mohammadiha, Gautham Mysore
  • Adobe Research icon AI & Machine Learning
  • Adobe Research icon Audio

Publication date: July 30, 2018

VisemeNet: Audio-Driven Animator-Centric Speech Animation

SIGGRAPH 2018

Yang Zhou, Zhan Xu, Chris Landreth, Evangelos Kalogerakis, Subhransu Maji, Karan Singh
  • Adobe Research icon AI & Machine Learning
  • Adobe Research icon Audio
  • Adobe Research icon Graphics (2D & 3D)

Publication date: June 18, 2018

Visual to Sound: Generating Natural Sound for Videos in the Wild

Proc. of CVPR 2018

Yipin Zhou, Zhaowen Wang, Chen Fang, Trung Bui, Tamara L. Berg
  • Adobe Research icon AI & Machine Learning
  • Adobe Research icon Audio
  • Adobe Research icon Computer Vision, Imaging & Video

Publication date: April 21, 2018

LoopMaker: Automatic Creation of Music Loops from Pre-recorded Music

ACM Conference on Human Factors in Computing Systems (CHI)

Kitty Shi, Gautham Mysore
  • Adobe Research icon Audio
  • Adobe Research icon Human Computer Interaction

Publication date: April 15, 2018

FFTNet: a Real-Time Speaker-Dependent Neural Vocoder

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Zeyu Jin, Adam Finkelstein, Gautham Mysore, Jingwan (Cynthia) Lu
  • Adobe Research icon AI & Machine Learning
  • Adobe Research icon Audio

Publication date: April 15, 2018

Blind Estimation of the Speech Transmission Index for Speech Quality Prediction

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Prem Seetharaman, Gautham Mysore, Paris Smaragdis, Bryan Pardo
  • Adobe Research icon AI & Machine Learning
  • Adobe Research icon Audio

Publication date: April 15, 2018

Crowdsourced Pairwise-comparison for Source Separation Evaluation

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Mark Cartwright, Bryan Pardo, Gautham Mysore
  • Adobe Research icon Audio

Publication date: March 11, 2018

A System for Personalized Music Medley Creation

ACM IUI Workshop on Intelligent Music Interfaces for Listening and Creation (MILC)

Kitty Shi, Gautham Mysore
  • Adobe Research icon Audio

Publication date: March 3, 2018

Dynamic Non-negative Models for Audio Source Separation

Book Chapter in Audio Source Separation, Springer

Paris Smaragdis, Gautham Mysore, Nasser Mohammadiha
  • Adobe Research icon AI & Machine Learning
  • Adobe Research icon Audio

Publication date: October 23, 2017

Re-visiting the Music Segmentation Problem with Crowdsourcing

International Society of Music Information Retrieval Conference (ISMIR)

Cheng-i Wang, Gautham Mysore, Shlomo Dubnov
  • Adobe Research icon Audio

Publication date: October 22, 2017

AutoDub: Automatic Redubbing for Voiceover Editing

ACM Symposium on User Interface Software and Technology (UIST)

Shrikant Venkataramani, Paris Smaragdis, Gautham Mysore
  • Adobe Research icon Audio
  • Adobe Research icon Human Computer Interaction
1 2 3 4 5 6 7 8