Gautham Mysore

Principal Scientist

Creative Intelligence Lab, San Francisco

Gautham is a principal scientist and head of the Audio Research Group at Adobe Research in San Francisco. He is also an Adjunct Professor at Stanford University in the Center for Computer Research in Music and Acoustics (CCRMA). His research involves developing new machine learning and signal processing techniques for a wide variety of real-world audio applications.

He received his Ph.D. (CCRMA), M.A. (CCRMA), and M.S. (Electrical Engineering) from Stanford University. He was previously a visiting researcher at the Gatsby Computational Neuroscience Unit at University College London. He has also spent time at Microsoft Research and the Department of Electrical Communication Engineering at the Indian Institute of Science.

Please visit his personal website for a complete list of publications.

 

My Publications

FFTNet: a Real-Time Speaker-Dependent Neural Vocoder

Jin, Z., Finkelstein, A., Mysore, G., Lu, J. (Apr. 15, 2018)
The 43rd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP oral)

VoCo: Text-based Insertion and Replacement in Audio Narration

Jin, Z., Mysore, G., DiVerdi, S., Lu, J., Finkelstein, A. (Aug. 1, 2017)
ACM Transactions on Graphics (Proc. of SIGGRAPH 2017)

CUTE: a Concatenative Method for Voice Conversion Using Exemplar-based Unit Selection

Jin, Z., Finkelstein, A., DiVerdi, S., Lu, J., Mysore, G. (Mar. 1, 2016)
The 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

ISSE: An Interactive Source Separation Editor

Bryan, N., Mysore, G., Wang, G. (Apr. 26, 2014)
ACM Human Factors in Computing Systems (CHI)

A Generative Product-of-Filters Model of Audio

Hoffman, M., Mysore, G. (Apr. 1, 2014)
Proceedings of the 2nd International Conference on Learning Representations (ICLR 2014), April 2014

Source Separation of Polyphonic Music With Interactive User-Feedback on a Piano Roll Display

Bryan, N., Mysore, G., Wang, G. (Nov. 4, 2013)
International Society for Music Information Retrieval Conference (ISMIR)

Content-Based Tools for Editing Audio Stories

Rubin, S., Berthouzoz, F., Mysore, G., Li, W., Agrawala, M. (Oct. 1, 2013)
Proceedings of the 26th Annual ACM Symposium on User Interface Software and Technology

Interactive Refinement Of Supervised And Semi-Supervised Sound Source Separation Estimates

Bryan, N., Mysore, G. (May 1, 2013)
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Universal Speech Models for Speaker Independent Single Channel Source Separation

Sun, D., Mysore, G. (May 1, 2013)
ICASSP 2013 – Proceedings of the 38th International Conference on Acoustics, Speech, and Signal Processing, May 2013

Interactive User-feedback for Audio Source Separation

Bryan, N., Mysore, G. (Mar. 19, 2013)
In Proc. IUI 2013 - ACM International Conference on Intelligent User Interfaces, March 19-22, 2013. Presented in the Interactive Machine Learning Workshop.

UnderScore: Musical Underlays for Audio Stories

Rubin, S., Berthouzoz, F., Mysore, G., Li, W., Agrawala, M. (Oct. 1, 2012)
Proceedings of the 25th Annual ACM Symposium on User Interface Software and Technology

Clustering and Synchronizing Multi-camera Video via Landmark Cross-Correlation

Bryan, N., Smaragdis, P., Mysore, G. (Mar. 25, 2012)
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Following Musical Sources by Example

Smaragdis, P., Mysore, G. (Mar. 25, 2012)
ICASSP - IEEE International Conference on Acoustics, Speech, and Signal Processing, March 2012

Noise-Robust Dynamic Time Warping Using PLCA Features

King, B., Smaragdis, P., Mysore, G. (Mar. 25, 2012)
ICASSP - IEEE International Conference on Acoustics, Speech, and Signal Processing, March 2012

A Non-negative Approach to Language Informed Speech Separation

Mysore, G., Smaragdis, P. (Mar. 12, 2012)
LVA/ICA - International Conference on Latent Variable Analysis and Signal Separation, March 2012

Online PLCA for Real-Time Semi-supervised Source Separation

Duan, Z., Mysore, G., Smaragdis, P. (Mar. 12, 2012)
LVA/ICA - International Conference on Latent Variable Analysis and Signal Separation, March 2012

Sound Recognition in Mixtures

Nam, J., Mysore, G., Smaragdis, P. (Mar. 12, 2012)
LVA/ICA - International Conference on Latent Variable Analysis and Signal Separation, March 2012

Audio Imputation Using the Non-negative Hidden Markov Model

Han, J., Mysore, G., Pardo, B. (Mar. 12, 2012)
LVA/ICA - International Conference on Latent Variable Analysis and Signal Separation, March 2012

A Convolutive Spectral Decomposition Approach to the Separation of Feedback from Target Speech

Mysore, G., Smaragdis, P. (Sep. 18, 2011)
MLSP - IEEE International Workshop on Machine Learning for Signal Processing, September 2011

A Non-Negative Approach to Semi-Supervised Separation of Speech from Noise with the Use of Temporal Dynamics

Mysore, G., Smaragdis, P. (May 22, 2011)
ICASSP - IEEE International Conference on Acoustics, Speech, and Signal Processing, May 2011

Non-negative Hidden Markov Modeling of Audio with Application to Source Separation

Mysore, G., Smaragdis, P., Raj, B. (Sep. 27, 2010)
LVA/ICA - International Conference on Latent Variable Analysis and Signal Separation, September 2010 (Best Student Paper Award)

Separation by Humming: User Guided Sound Extraction from Monophonic Mixtures

Mysore, G., Smaragdis, P. (Oct. 18, 2009)
WASPAA - IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, October 2009

Multipitch Estimation using Sparse Impulse Distributions and Instrument Specific Priors

Mysore, G., Smaragdis, P. (Jun. 18, 2009)
ICML - International Conference on Machine Learning, Workshop on Sparse Methods for Music Audio, June 2009

Probabilistic Factorization of Non-Negative Data with Entropic Co-occurrence Constraints

Smaragdis, P., Shashanka, M., Raj, B., Mysore, G. (Mar. 15, 2009)
ICA - International Conference on Independent Component Analysis and Signal Separation, March 2009