Richard Zhang

Senior Research Scientist

San Francisco

I am a Senior Research Scientist at Adobe Research. My research is in improving Generative AI, with speed, quality, and controllability. I have broader interests in computer vision, machine learning, deep learning, graphics, and image processing. I was previously a PhD student at UC Berkeley, advised by Professor Alexei (Alyosha) Efros. I obtained my BS and MEng degrees from Cornell University in ECE. I received the Adobe Fellowship Award in 2017 and was named an Innovator Under 35 by MIT Tech Review in 2023.

My personal website is here.

Publications

From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Yin, Tianwei., Zhang, Qiang., Zhang, Richard., Freeman, William., Durand, Fredo., Shechtman, Eli., Huang, Xun. (Jun. 15, 2025)

#1 on VBench Leaderboard at submission

Conference on Computer Vision and Pattern Recognition (CVPR 2025)

VideoGigaGAN: Towards Detail-rich Video Super-Resolution

Xu, Yiran., Park, Taesung., Zhang, Richard., Zhou, Yang., Shechtman, Eli., Liu, Feng., Huang, Jia-Bin., Liu, Difan. (Jun. 13, 2025)

Conference on Computer Vision and Pattern Recognition (CVPR 2025)

Long-Context State-Space Video World Models

Po, Ryan., Nitzan, Yotam., Zhang, Richard., Chen, Berlin., Dao, Tri., Shechtman, Eli., Wetzstein, Gordon., Huang, Xun. (May. 26, 2025)

arxiv

Improved Distribution Matching Distillation for Fast Image Synthesis

Yin, Tianwei., Gharbi, Michaël., Park, Taesung., Zhang, Richard., Shechtman, Eli., Durand, Frédo., Freeman, William. (Dec. 9, 2024)

Oral

NeurIPS 2024

TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models

Wu, Zongze., Kolkin, Nick., Brandt, Jonathan., Zhang, Richard., Shechtman, Eli. (Oct. 4, 2024)

European Conference on Computer Vision (ECCV'24)

Diffusion2GAN: Distilling Diffusion Models into Conditional GANs

Kang, Minguk., Zhang, Richard., Barnes, Connelly., Paris, Sylvain., Kwak, Suha., Park, Jaesik., Shechtman, Eli., Zhu, Jun-Yan., Park, Taesung. (Oct. 3, 2024)

European Conference on Computer Vision (ECCV'24)

Lazy Diffusion Transformer for Interactive Image Editing

Nitzan, Yotam., Wu, Zongze., Zhang, Richard., Shechtman, Eli., Cohen-Or, Daniel., Park, Taesung., Gharbi, Michaël. (Oct. 2, 2024)

European Conference on Computer Vision (ECCV'24)

Mixture of Efficient Diffusion Experts Through Automatic Interval and Sub-Network Selection

Ganjdanesh, Alireza., Kang, Yan., Liu, Yuchen., Zhang, Richard., Lin, Zhe., Huang, Heng. (Oct. 1, 2024)

European Conference on Computer Vision (ECCV)

Editable Image Elements for Controllable Synthesis

Mu, Jiteng., Gharbi, Michaël., Zhang, Richard., Shechtman, Eli., Vasconcelos, Nuno., Wang, Xiaolong., Park, Taesung. (Oct. 1, 2024)

European Conference on Computer Vision (ECCV'24)

One-step Diffusion with Distribution Matching Distillation

Yin, Tianwei., Gharbi, Michaël., Zhang, Richard., Shechtman, Eli., Durand, Fredo., Freeman, William., Park, Taesung. (Jun. 21, 2024)

CVPR 2024

Personalized Residuals for Concept-Driven Text-to-Image Generation

Ham, Cusuh., Fisher, Matt., Hays, James., Kolkin, Nick., Liu, Yuchen., Zhang, Richard., Hinz, Tobias. (Jun. 19, 2024)

CVPR 2024

Image Neural Field Diffusion Models

Chen, Yinbo., Wang, Oliver., Zhang, Richard., Shechtman, Eli., Wang, Xiaolong., Gharbi, Michaël. (Jun. 18, 2024)

Highlight

CVPR 2024

Ablating Concepts in Text-to-Image Diffusion Models

Kumari, Nupur., Zhang, Bingliang., Wang, Sheng-Yu., Shechtman, Eli., Zhang, Richard., Zhu, Jun-Yan. (Oct. 5, 2023)

International Conference on Computer Vision (ICCV'23)

Scaling up GANs for Text-to-Image Synthesis

Kang, Minguk., Zhu, Jun-Yan., Zhang, Richard., Park, Jaesik., Shechtman, Eli., Paris, Sylvain., Park, Taesung. (Jun. 22, 2023)

Highlight

Computer Vision and Pattern Recognition (CVPR'23)

Multi-Concept Customization of Text-to-Image Diffusion

Kumari, Nupur., Zhang, Bingliang., Zhang, Richard., Shechtman, Eli., Zhu, Jun-Yan. (Jun. 22, 2023)

Computer Vision and Pattern Recognition (CVPR'23)

Domain Expansion of Image Generators

Nitzan, Yotam., Gharbi, Michaël., Zhang, Richard., Park, Taesung., Zhu, Jun-Yan., Cohen-Or, Daniel., Shechtman, Eli. (Jun. 21, 2023)

Computer Vision and Pattern Recognition (CVPR'23)

BlobGAN: Spatially Disentangled Scene Representations

Epstein, Dave., Park, Taesung., Zhang, Richard., Shechtman, Eli., Efros, Alexei. (Oct. 27, 2022)

European Conference on Computer Vision (ECCV'22)

Any-resolution Training for High-resolution Image Synthesis

Chai, Lucy., Gharbi, Michaël., Shechtman, Eli., Isola, Philip., Zhang, Richard. (Oct. 27, 2022)

European Conference on Computer Vision (ECCV'22)

3D-FM GAN: Towards 3D-Controllable Face Manipulation

Liu, Yuchen., Shu, Zhixin., Li, Yijun., Lin, Zhe., Zhang, Richard., Kung, Sun-Yuan. (Oct. 23, 2022)

European Conference on Computer Vision (ECCV)

ASSET: Autoregressive Semantic Scene Editing with Transformers at High Resolutions

Liu, Difan., Shetty, Sandesh., Hinz, Tobias., Fisher, Matt., Zhang, Richard., Park, Taesung., Kalogerakis, Evangelos. (Aug. 1, 2022)

ACM Transactions on Graphics (TOG)

GAN-Supervised Dense Visual Alignment (GANgealing)

Peebles, William., Zhu, Jun-Yan., Zhang, Richard., Torralba, Antonio., Efros, Alexei., Shechtman, Eli. (Jun. 24, 2022)

Computer Vision and Pattern Recognition (CVPR'22)

Ensembling Off-the-shelf Models for GAN Training

Kumari, Nupur., Zhang, Richard., Shechtman, Eli., Zhu, Jun-Yan. (Jun. 23, 2022)

Computer Vision and Pattern Recognition (CVPR'22)

Inverting and Editing Real Images with Spatially Varying and Automatic Latent Selection

Parmar, Gaurav., Li, Yijun., Lu, Jingwan., Zhang, Richard., Zhu, Jun-Yan., Singh, Krishna. (Jun. 19, 2022)

CVPR2022

Spatially-Adaptive Pixelwise Networks for Fast Image Translation

Shaham, Tamar., Gharbi, Michaël., Zhang, Richard., Shechtman, Eli., Michaeli, Tomer. (Jun. 23, 2021)

Computer Vision and Pattern Recognition (CVPR'21)

Few-shot Image Generation via Cross-domain Correspondence

Ojha, Utkarsh., Li, Yijun., Lu, Jingwan., Efros, Alexei., Lee, Yong., Shechtman, Eli., Zhang, Richard. (Jun. 23, 2021)

Computer Vision and Pattern Recognition (CVPR'21)

Ensembling with Deep Generative Views

Chai, Lucy., Zhu, Jun-Yan., Shechtman, Eli., Isola, Phillip., Zhang, Richard. (Jun. 22, 2021)

Computer Vision and Pattern Recognition (CVPR'21)

CDPAM: Contrastive learning for perceptual audio similarity

Manocha, Pranay., Jin, Zeyu., Zhang, Richard., Finkelstein, Adam. (Jun. 9, 2021)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Few-shot Image Generation via Self-adaptation

Li, Yijun., Zhang, Richard., Lu, Jingwan., Shechtman, Eli. (Dec. 6, 2020)

Neurips 2020

Swapping Autoencoder for Deep Image Manipulation

Park, Taesung., Zhu, Jun-Yan., Wang, Oliver., Lu, Jingwan., Shechtman, Eli., Efros, Alexei., Zhang, Richard. (Dec. 6, 2020)

Neurips 2020

A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences

Manocha, Pranay., Finkelstein, Adam., Zhang, Richard., Bryan, Nicholas., Mysore, Gautham., Jin, Zeyu. (Oct. 26, 2020)

Interspeech 2020

Contrastive Learning for Unpaired Image-to-Image Translation

Park, Taesung., Efros, Alexei., Zhang, Richard., Zhu, Jun-Yan. (Aug. 23, 2020)

European Conference on Computer Vision (ECCV)

Transforming and Projecting Images into Class-conditional Generative Networks

Huh, Minyoung., Zhang, Richard., Zhu, Jun-Yan., Paris, Sylvain., Hertzmann, Aaron. (Aug. 23, 2020)

Oral

European Conference on Computer Vision (ECCV)

Deep Parametric Shape Predictions using Distance Fields

Smirnov, Dmitiry., Fisher, Matt., Kim, Vladimir., Zhang, Richard., Solomon, Justin. (Jun. 1, 2020)

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

CNN-generated images are surprisingly easy to spot… for now

Wang, Sheng-Yu., Wang, Oliver., Zhang, Richard., Owens, Andrew., Efros, Alexei. (Jun. 1, 2020)

Oral

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Image Morphing with Perceptual Constraints and STN Alignment

Fish, Noa., Zhang, Richard., Perry, Lilach., Cohen-Or, Daniel., Shechtman, Eli., Barnes, Connelly. (May. 1, 2020)

Computer Graphics Forum (CGF)

Detecting Photoshopped Images by Scripting Photoshop

Wang, Sheng-Yu., Wang, Oliver., Owens, Andrew., Zhang, Richard., Efros, Alexei. (Oct. 31, 2019)

International Conference on Computer Vision (ICCV'19)

Interactive Sketch & Fill: Multiclass Sketch-to-Image Translation

Ghosh, Arnab., Zhang, Richard., Dokania, Puneet., Efros, Alexei., Torr, Philip., Wang, Oliver., Shechtman, Eli. (Oct. 29, 2019)

International Conference on Computer Vision (ICCV'19)

Making Convolutional Networks Shift-Invariant Again

Zhang, Richard. (Jun. 12, 2019)

International Conference on Machine Learning (ICML'19)

The Unreasonable Effectiveness of Deep Features as a Perceptual Metric

Zhang, Richard., Isola, Phillip., Efros, Alexei., Shechtman, Eli., Wang, Oliver. (Jun. 19, 2018)

IEEE Conference on Computer Vision and Pattern Recognition (CVPR'18)

Toward Multimodal Image-to-Image Translation

Zhu, Jun-Yan., Zhang, Richard., Pathak, Deepak., Darrell, Trevor., Efros, Alexei., Wang, Oliver., Shechtman, Eli. (Dec. 4, 2017)

Neural Information Processing Systems (NIPS'17)

News