Zeyu Jin

Research Scientist

San Francisco

Zeyu is a research scientist at Adobe Research in San Francisco. His research interests are at speech and music synthesis, deep learning, and human-computer interaction.

He received a Ph.D. degree in computer science from Princeton University adviced by Adam Finkelstein and M.S in music technology in Carnegie Mellon University. Between 2015 and 2017, he interned at Adobe for three times and presented his primary research project – VoCo – at Adobe MAX Sneaks (link to video) in 2016.

Publications

Controllable deep melody generation via hierarchical music representation

Dai, S., Jin, Z., Gomes, C., Dannenberg, R. (Nov. 8, 2021)

International Society for Music Information Retrieval Conference

HiFi-GAN-2: Studio-quality speech enhancement via generative adversarial networks conditioned on acoustic features

Su, J., Jin, Z., Finkelstein, A. (Oct. 17, 2021)

IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

CDPAM: Contrastive learning for perceptual audio similarity

Manocha, P., Jin, Z., Zhang, R., Finkelstein, A. (Jun. 9, 2021)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Bandwidth Extension is All You Need

Su, J., Wang, Y., Finkelstein, A., Jin, Z. (Jun. 9, 2021)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Context-Aware Prosody Correction for Text-Based Speech Editing

Morrison, M., Rencker, L., Jin, Z., Bryan, N., Caceres, J., Pardo, B. (Jun. 6, 2021)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences

Manocha, P., Finkelstein, A., Zhang, R., Bryan, N., Mysore, G., Jin, Z. (Oct. 26, 2020)

Interspeech 2020

Controllable Neural Prosody Synthesis

Morrison, M., Jin, Z., Salamon, J., Bryan, N., Mysore, G. (Oct. 26, 2020)

Interspeech 2020

Metric Learning vs Classification for Disentangled Music Representation Learning

Lee, J., Bryan, N., Salamon, J., Jin, Z., Nam, J. (Oct. 11, 2020)

International Society for Music Information Retrieval Conference (ISMIR)

Disentangled Multidimensional Metric Learning For Music Similarity

Lee, J., Bryan, N., Salamon, J., Jin, Z., Nam, J. (May. 4, 2020)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Text-based Editing of Talking-head Video

Fried, O., Tewari, A., Zollhofer, M., Finkelstein, A., Shechtman, E., Goldman, D., Genova, K., Jin, Z., Theobalt, C., Agarwala, M. (Aug. 1, 2019)

ACM Transactions on Graphics (Proc. SIGGRAPH'19)

FFTNet: a Real-Time Speaker-Dependent Neural Vocoder

Jin, Z., Finkelstein, A., Mysore, G., Lu, J. (Apr. 15, 2018)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

VoCo: text-based insertion and replacement in audio narration

Jin, Z., Mysore, G., DiVerdi, S., Lu, J., Finkelstein, A. (Jul. 31, 2017)

ACM Transactions on Graphics (SIGGRAPH)

CUTE: a Concatenative Method for Voice Conversion Using Exemplar-based Unit Selection

Jin, Z., Finkelstein, A., DiVerdi, S., Lu, J., Mysore, G. (Mar. 1, 2016)

The 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

News