Yuheng Li

Hi! I am a research scientist at Adobe Research.
Before joining Adobe, I got my PhD in Computer Science from University of Wisconsin-Madison in 2024, under the supervision Prof. Yong Jae Lee.

Generally, I am interested in controllable & multimodal image generation/ manipulation.
Feel free to contact me for collaboration.

Email  /  CV  /  Google Scholar



Research

Removing Distributional Discrepancies in Captions Improves Image-Text Alignment
Yuheng Li, Haotian Liu, Mu Cai, Yijun Li , Eli Shechtman, Zhe Lin, Yong Jae Lee, and Krishna Kumar Singh
Proceedings of the European Conference on Computer Vision (ECCV), 2024
[ProjectPage, Code, Paper]

Yo'LLaVA: Your Personalized Language and Vision Assistant
Thao Nguyen, Haotian Liu, Yuheng Li, Mu Cai, Utkarsh Ojha, Yong Jae Lee
arXiv, 2024
[ProjectPage, Code, Paper]

Improved Baselines with Visual Instruction Tuning (LLaVA-1.5)
Haotian Liu, Chunyuan Li, Yuheng Li, Yong Jae Lee
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024
[ProjectPage, Code, Paper]

Edit One for All: Interactive Batch Image Editing
Thao Nguyen, Utkarsh Ojha, Yuheng Li, Haotian Liu, Yong Jae Lee
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024
[ProjectPage, Code, Paper]

Generate Anything Anywhere in Any Scene
Yuheng Li, Haotian Liu, Yangming Wen, Yong Jae Lee
arxiv, 2023

Leveraging Large Language Models for Scalable Vector Graphics-Driven Image Understanding
Mu Cai*, Zeyi Huang*, Yuheng Li, Haohan Wang, and Yong Jae Lee
(*equal contribution)
arxiv, 2023

Visual Instruction Inversion: Image Editing via Visual Prompting
Thao Nguyen, Yuheng Li, Utkarsh Ojha, Yong Jae Lee
Neural Information Processing Systems (NeurIPS), 2023
[ProjectPage, Code, Paper]

What Knowledge Gets Distilled in Knowledge Distillation?
Utkarsh Ojha*, Yuheng Li*, Anirudh Sundara Rajan*, Yingyu Liang, Yong Jae Lee
(*equal contribution)
Neural Information Processing Systems (NeurIPS), 2023

GLIGEN: Open-Set Grounded Text-to-Image Generation
Yuheng Li, Haotian Liu, Qingyang Wu, Fangzhou Mu, Jianwei Yang, Jianfeng Gao, Chunyuan Li*, Yong Jae Lee*
(*equal advising)
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023
[arXiv] [code] [Project Page] [Demo] [Youtube]

Towards Universal Fake Image Detectors that Generalize Across Generative Models
Utkarsh Ojha*, Yuheng Li*, Yong Jae Lee
(*equal contribution)
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023

Delving Deeper into Anti-aliasing in ConvNets
Xueyan Zou, Fanyi Xiao, Zhiding Yu, Yuheng Li, and Yong Jae Lee
International Journal of Computer Vision (IJCV), 2022

Contrastive Learning for Diverse Disentangled Foreground Generation
Yuheng Li, Yijun Li, Jingwan Lu, Eli Shechtman, Yong Jae Lee, Krishna Kumar Singh
Proceedings of the European Conference on Computer Vision (ECCV), 2022

GIRAFFE HD: A High-Resolution 3D-aware Generative Model
Yang Xue, Yuheng Li, Krishna Kumar Singh, Yong Jae Lee
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022
[arXiv] [code]

Collaging Class-specific GANs for Semantic Image Synthesis
Yuheng Li, Yijun Li, Jingwan Lu, Eli Shechtman, Yong Jae Lee, Krishna Kumar Singh
IEEE International Conference on Computer Vision (ICCV), 2021
[arXiv] [project]

PartGAN: Unsupervised Part Decomposition for Image Generation and Segmentation
Yuheng Li, Krishna Kumar Singh, Yong Jae Lee
British Machine Vision Conference (BMVC), 2021

MixNMatch: Multifactor Disentanglement and Encoding for Conditional Image Generation
Yuheng Li, Krishna Kumar Singh, Utkarsh Ojha, Yong Jae Lee
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020
[arXiv] [code]