X-Fusion: Introducing New Modality to Frozen Large Language Models
Sicheng Mo, Thao Nguyen, Xun Huang, Siddharth Srinivasan Iyer, Yijun Li, Yuchen Liu, Abhishek Tandon, Eli Shechtman, Krishna Kumar Singh, Yong Jae Lee, Bolei Zhou, Yuheng Li
IEEE International Conference on Computer Vision (ICCV), 2025
🏆 Best paper at CVPR 2025 Workshop: "Transformers for Vision (T4V)"
[Project Page, Code, Paper]