Gautham Mysore

Senior Principal Scientist

San Francisco

Gautham is a senior principal scientist and head of the CAVA (Co-Creation for Audio, Video, & Animation) Research organization in Adobe Research. Please visit his website for more about him.

Publications

Emotion Embedding Spaces for Matching Music to Stories

Won, Minz., Salamon, Justin., Bryan, Nicholas., Mysore, Gautham., Serra, Xavier. (Nov. 8, 2021)

Best Student Paper Award

International Society for Music Information Retrieval Conference (ISMIR)

Audio-Based Music Structure Analysis: Current Trends, Open Challenges, and Applications

Nieto, Oriol., Mysore, Gautham., Wang, Cheng-i., Smith, Jordan., Schlüter, Jan., Grill, Thomas., McFee, Brian. (Dec. 11, 2020)

Transactions of the International Society for Music Information Retrieval (TISMIR)

Controllable Neural Prosody Synthesis

Morrison, Maxwell., Jin, Zeyu., Salamon, Justin., Bryan, Nicholas., Mysore, Gautham. (Oct. 26, 2020)

Interspeech 2020

A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences

Manocha, Pranay., Finkelstein, Adam., Zhang, Richard., Bryan, Nicholas., Mysore, Gautham., Jin, Zeyu. (Oct. 26, 2020)

Interspeech 2020

VoiceAssist: Guiding Users to High-Quality Voice Recordings

Seetharaman, Prem., Mysore, Gautham., Pardo, Bryan., Smaragdis, Paris., Gomes, Celso. (May. 4, 2019)

ACM Conference on Human Factors in Computing Systems (CHI)

B-Script: Transcript-based B-roll Video Editing with Recommendations

Huber, Bernd., Shin, Hijung., Russell, Bryan., Wang, Oliver., Mysore, Gautham. (May. 4, 2019)

ACM Conference on Human Factors in Computing Systems (CHI)

Recent Advances in Music Signal Processing

Mueller, Meinard., Pardo, Bryan., Mysore, Gautham., Valimaki, Vesa. (Jan. 10, 2019)

IEEE Signal Processing Magazine

Temporal extensions of Nonnegative Matrix Factorization

Fevotte, Cedric., Smaragdis, Paris., Mohammadiha, Nasser., Mysore, Gautham. (Aug. 3, 2018)

Book Chapter in Audio Source Separation and Speech Enhancement, Wiley, 2018

LoopMaker: Automatic Creation of Music Loops from Pre-recorded Music

Shi, Kitty., Mysore, Gautham. (Apr. 21, 2018)

ACM Conference on Human Factors in Computing Systems (CHI)

FFTNet: a Real-Time Speaker-Dependent Neural Vocoder

Jin, Zeyu., Finkelstein, Adam., Mysore, Gautham., Lu, Jingwan. (Apr. 15, 2018)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Blind Estimation of the Speech Transmission Index for Speech Quality Prediction

Seetharaman, Prem., Mysore, Gautham., Smaragdis, Paris., Pardo, Bryan. (Apr. 15, 2018)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Crowdsourced Pairwise-comparison for Source Separation Evaluation

Cartwright, Mark., Pardo, Bryan., Mysore, Gautham. (Apr. 15, 2018)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

A System for Personalized Music Medley Creation

Shi, Kitty., Mysore, Gautham. (Mar. 11, 2018)

ACM IUI Workshop on Intelligent Music Interfaces for Listening and Creation (MILC)

Dynamic Non-negative Models for Audio Source Separation

Smaragdis, Paris., Mysore, Gautham., Mohammadiha, Nasser. (Mar. 3, 2018)

Book Chapter in Audio Source Separation, Springer

Re-visiting the Music Segmentation Problem with Crowdsourcing

Wang, Cheng-i., Mysore, Gautham., Dubnov, Shlomo. (Oct. 23, 2017)

International Society of Music Information Retrieval Conference (ISMIR)

AutoDub: Automatic Redubbing for Voiceover Editing

Venkataramani, Shrikant., Smaragdis, Paris., Mysore, Gautham. (Oct. 22, 2017)

ACM Symposium on User Interface Software and Technology (UIST)

VoCo: text-based insertion and replacement in audio narration

Jin, Zeyu., Mysore, Gautham., DiVerdi, Stephen., Lu, Jingwan., Finkelstein, Adam. (Jul. 31, 2017)

ACM Transactions on Graphics (SIGGRAPH)

Eulerian Video Magnification and Analysis

Wadhwa, Neal., Wu, Hao-Yu., Davis, Abe., Rubinstein, Michael., Shih, Eugene., Mysore, Gautham., Chen, Justin., Buyukozturk, Oral., Guttag, John., Freeman, William., Durand, Frédo. (Jan. 1, 2017)

Communications of the ACM

Analysis of Prosody Increment Induced by Pitch Accents for Automatic Emphasis Correction

Zhang, Yang., Mysore, Gautham., Berthouzoz, Floraine., Hasegawa-Johnson, Mark. (May. 24, 2016)

Speech Prosody

Structural Segmentation with the Variable Markov Oracle and Boundary Adjustment

Wang, Cheng-i., Mysore, Gautham. (Mar. 25, 2016)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Fast and Easy Crowdsourced Perceptual Audio Evaluation

Cartwright, Mark., Pardo, Bryan., Mysore, Gautham., Hoffman, Matt. (Mar. 20, 2016)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Equalization Matching of Speech Recordings in Real-World Environments

Germain, Francois., Mysore, Gautham., Fujioka, Tatako. (Mar. 20, 2016)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

CUTE: a Concatenative Method for Voice Conversion Using Exemplar-based Unit Selection

Jin, Zeyu., Finkelstein, Adam., DiVerdi, Stephen., Lu, Jingwan., Mysore, Gautham. (Mar. 1, 2016)

The 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Capture-Time Feedback for Recording Scripted Narration

Rubin, Steve., Berthouzoz, Floraine., Mysore, Gautham., Agrawala, Maneesh. (Nov. 8, 2015)

ACM Symposium on User Interface Software and Technology (UIST)

Can We Automatically Transform Speech Recorded on Common Consumer Devices in Real-World Environments into Professional Production Quality Speech? – A Dataset, Insights, and Challenges

Mysore, Gautham. (Aug. 1, 2015)

IEEE Signal Processing Letters

Speaker and Noise Independent Online Single Channel Speech Enhancement

Germain, Francois., Mysore, Gautham. (Apr. 19, 2015)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Speech Dereverberation using a Learned Speech Model

Liang, Dawen., Hoffman, Matt., Mysore, Gautham. (Apr. 19, 2015)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Efficient Manifold Preserving Audio Source Separation using Locality Sensitive Hashing

Kim, Minje., Smaragdis, Paris., Mysore, Gautham. (Apr. 19, 2015)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Lamello: Passive Acoustic Sensing for Tangible Input Components

Savage, Valkyrie., Head, Andrew., Hartmann, Bjorn., Goldman, Dan., Mysore, Gautham., Li, Wilmot. (Apr. 18, 2015)

ACM Conference on Human Factors in Computing Systems (CHI)

Stopping Criteria for Non-negative Matrix Factorization Based Supervised and Semi-Supervised Source Separation

Germain, Francois., Mysore, Gautham. (Oct. 1, 2014)

IEEE Signal Processing Letters

The Visual Microphone: Passive Recovery of Sound from Video

Davis, Abe., Rubinstein, Michael., Wadhwa, Neal., Mysore, Gautham., Durand, Fredo., Freeman, William. (Aug. 12, 2014)

ACM Transactions on Graphics (SIGGRAPH)

Exploiting Long-Term Temporal Dependencies in NMF using Recurrent Neural Networks with Application to Source Separation

Boulanger-Lewandowski, Nicolas., Mysore, Gautham., Hoffman, Matthew. (May. 9, 2014)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Speech Decoloration based on the Product-of-Filters Model

Liang, Dawen., Ellis, Daniel., Hoffman, Matthew., Mysore, Gautham. (May. 9, 2014)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Static and Dynamic Source Separation Using Nonnegative Factorizations: A unified view

Smaragdis, Paris., Fevotte, Cedric., Mysore, Gautham., Mohammadiha, Nasser., Hoffman, Matthew. (May. 1, 2014)

IEEE Signal Processing Magazine Special Issue on Source Separation and Applications

ISSE: An Interactive Source Separation Editor

Bryan, Nicholas., Mysore, Gautham., Wang, Ge. (Apr. 26, 2014)

ACM Human Factors in Computing Systems (CHI)

A Generative Product-of-Filters Model of Audio

Hoffman, M.., Mysore, Gautham. (Apr. 1, 2014)

Proceedings of the 2nd International Conference on Learning Representations (ICLR 2014), April 2014

Combining Modeling of Singing Voice and Background Music for Automatic Separation of Musical Mixtures

Rafii, Zafar., Germain, Francois., Sun, Dennis., Mysore, Gautham. (Nov. 4, 2013)

International Society of Music Information Retrieval Conference (ISMIR)

Source Separation of Polyphonic Music With Interactive User-Feedback on a Piano Roll Display

Bryan, Nicholas., Mysore, Gautham., Wang, Ge. (Nov. 4, 2013)

International Society for Music Information Retrieval Conference (ISMIR)

Content-Based Tools for Editing Audio Stories

Rubin, S.., Berthouzoz, F.., Mysore, Gautham., Li, Wilmot., Agrawala, M.. (Oct. 1, 2013)

Proceedings of the 26th Annual ACM Symposium on User interface Software and Technology

Speaker and Noise Independent Voice Activity Detection

Germain, Francois., Sun, Dennis., Mysore, Gautham. (Aug. 25, 2013)

Best Student Paper Award

Interspeech

An Efficient Posterior Regularized Latent Variable Model for Interactive Sound Source Separation

Bryan, Nicholas., Mysore, Gautham. (Jun. 16, 2013)

International Conference on Machine Learning (ICML)

Interactive Refinement Of Supervised And Semi-Supervised Sound Source Separation Estimates

Bryan, Nicholas., Mysore, Gautham. (May. 1, 2013)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Universal Speech Models for Speaker Independent Single Channel Source Separation

Sun, D.., Mysore, Gautham. (May. 1, 2013)

ICASSP 2013 – Proceedings of the 38th International Conference on Acoustics, Speech, and Signal Processing, May 2013

Interactive User-feedback for Audio Source Separation

Bryan, N.., Mysore, Gautham. (Mar. 19, 2013)

In Proc. IUI 2013 - IEEE International Conference on Automatic Face and Gesture Recognition , March 19 - 22. Presented in the Interactive Machine Learning Workshop, 2013.

UnderScore: Musical Underlays for Audio Stories

Rubin, S.., Berthouzoz, F.., Mysore, Gautham., Li, Wilmot., Agrawala, M.. (Oct. 1, 2012)

Proceedings of the 25th annual ACM Symposium on User interface Software and Technology

Language Informed Bandwidth Expansion

Han, Jinyu., Mysore, Gautham., Smaragdis, Paris. (Sep. 23, 2012)

IEEE International Workshop on Machine Learning for Signal Processing

Speech Enhancement by Online Non-negative Spectrogram Decomposition in Non-stationary Noise Environments

Duan, Zhiyao., Mysore, Gautham., Smaragdis, Paris. (Sep. 9, 2012)

Interspeech

Variational Inference in Non-negative Factorial Hidden Markov Models for Efficient Audio Source Separation

Mysore, Gautham., Sahani, Maneesh. (Jun. 26, 2012)

Proceedings of the International Conference on Machine Learning (ICML)

Clustering and Synchronizing Multi-camera Video via Landmark Cross-Correlation

Bryan, Nicholas., Smaragdis, Paris., Mysore, Gautham. (Mar. 25, 2012)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Following Musical Sources by Example

Smaragdis, Paris., Mysore, Gautham. (Mar. 25, 2012)

ICASSP - IEEE International Conference on Acoustics, Speech, and Signal Processing , March 2012

Noise-Robust Dynamic Time Warping Using PLCA Features

King, B.., Smaragdis, Paris., Mysore, Gautham. (Mar. 25, 2012)

ICASSP - IEEE International Conference on Acoustics, Speech, and Signal Processing , March 2012

Following Musical Sources by Example

Smaragdis, Paris., Mysore, Gautham. (Mar. 25, 2012)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Audio Imputation Using the Non-negative Hidden Markov Model

Han, J.., Mysore, Gautham., Pardo, B.. (Mar. 12, 2012)

LVA/ICA - International Conference on Latent Variable Analysis and Signal Separation, March 2012

Sound Recognition in Mixtures

Nam, J.., Mysore, Gautham., Smaragdis, Paris. (Mar. 12, 2012)

LVA/ICA - International Conference on Latent Variable Analysis and Signal Separation, March 2012

Online PLCA for Real-Time Semi-supervised Source Separation

Duan, Z.., Mysore, Gautham., Smaragdis, Paris. (Mar. 12, 2012)

LVA/ICA - International Conference on Latent Variable Analysis and Signal Separation, March 2012

A Non-negative Approach to Language Informed Speech Separation

Mysore, Gautham., Smaragdis, Paris. (Mar. 12, 2012)

LVA/ICA - International Conference on Latent Variable Analysis and Signal Separation, March 2012

A Convolutive Spectral Decomposition Approach to the Separation of Feedback from Target Speech

Mysore, Gautham., Smaragdis, Paris. (Sep. 18, 2011)

MLSP - IEEE International Workshop on Machine Learning for Signal Processing , September 2011

A Non-Negative Approach to Semi-Supervised Separation of Speech from Noise with the Use of Temporal Dynamics

Mysore, Gautham., Smaragdis, Paris. (May. 22, 2011)

ICASSP - IEEE International Conference on Acoustics, Speech, and Signal Processing , May 2011

Non-negative Hidden Markov Modeling of Audio with Application to Source Separation

Mysore, Gautham., Smaragdis, Paris., Raj, B.. (Sep. 27, 2010)

Best Student Paper Award

LVA/ICA - International Conference on Latent Variable Analysis and Signal Separation, September 2010

Separation by Humming: User Guided Sound Extraction from Monophonic Mixtures

Smaragdis, Paris., Mysore, Gautham. (Oct. 18, 2009)

WASPAA - IEEE Workshop on Applications of Signal Processing to Audio and Acoustics , October 2009

Multipitch Estimation using Sparse Impulse Distributions and Instrument Specific Priors

Mysore, Gautham., Smaragdis, Paris. (Jun. 18, 2009)

IMCL - International Conference on Machine Learning (ICML) Workshop on Sparse Methods for Music Audio , June 2009

Probabilistic Factorization of Non-Negative Data with Entropic Co-occurrence Constraints

Smaragdis, Paris., Shashanka, M.., Raj, B.., Mysore, Gautham. (Mar. 15, 2009)

ICA - International Conference on Independent Component Analysis and Signal Separation , March 2009

News

Bridging technology and musical creativity: ISMIR 2024

New Horizons for Audio Research at Adobe

Helping Vloggers Create Better Videos