About Me

I am current a third-year Ph.D. student (Sep. 2022 - Jun. 2027, expected) in the School of Information Science and Technology, Fudan University, supervised by Prof. Tao Chen. I am also fortunate to work closely with Dr. Bo Zhang from Shanghai AI Lab. Before this, I obtained my Bachelor’s degree in Electronic Engineering also from Fudan University (Sep. 2018 - Jun. 2022). I work in the fields of deep learning and computer vision, with particular focuses on 3D perception, transfer learning, multi-modal LLM. My research pursues to develop vision-language systems that possess the capacity to comprehend, reason, and envision the physical world and explore using AI for scientific discovery.

🔥 News

2025.7: 🎉🎉 SPOT is accepted by IEEE T-PAMI 2025.
2025.6: 🎉🎉 Two papers (Lumina Image 2.0 and Chimera) are accepted by ICCV 2025. One is about text-to-image generation, the other is about multimodal reasoning.
2025.5: 🎉🎉 Two papers (SurveyForge and Dolphin) are accepted by ACL 2025. Both are introduced for accelerate scientific research.
2025.5: 🎉🎉 We release NovelSeek, a unified closed-loop multi-agent framework for Automatic Scientific Research.
2025.2: 🎉🎉 One paper (CST-Stereo) is accepted by CVPR 2025. CST-Stereo introduce a unified self-training framework for iterative-based stereo matching models.
2024.12: 🎉🎉 One paper (GeoX) is accepted by ICLR 2025. GeoX reveals the large potential of formalized visual-language pre-training in enhancing geometric problem-solving abilities.
2024.12: 🎉🎉 One paper (AIOStereo) is accepted by AAAI 2025. AIOStereo can transfer knowledge from multiple vision foundation models into a single stereo matching model flexibly.
2024.10: 🎉🎉 I recieve the national scholarship.
2024.09: 🎉🎉 Two papers (AdaptiveDiffusion and 3DET-Mamba) are accepted by NeurIPS 2024. One is about training-free acceleration of diffusion model, another is about mamba architecture in 3D detection.
2024.07: 🎉🎉 One paper (Reg-TTA3D) is accepted by ECCV 2024. We explore test-time adaptive 3d object detection for the first time.
2024.01: 🎉🎉 One paper (ReSimAD) is accepted by ICLR 2024. We propose a zero-shot generalization framework by reconstructing mesh and simulating target point clouds.
2023.09: 🎉🎉 One Paper (AD-PT) is accepted by NeurIPS 2023.We explore 3D pre-training pipeline to obtain backbones with strong generalization capability.
2023.02: 🎉🎉 Two Papers (Bi3D and Uni3D) are accepted by CVPR 2023. One is about active domain adaptation for 3D object detection, another is about multi-dataset training for 3d object detection.
2022.07: 🎉🎉 One Paper (HelixFormer) is accepted by ACM'MM 2022. We explore Transformer architecture on few-shot fine-grained classification task.

📝 Publications & Preprints

ICCV 2025

Chimera: Improving Generalist Model with Domain-Specific Experts

Tianshuo Peng^*, Mingsheng Li^*, Jiakang Yuan, Hongbin Zhou, Renqiu Xia, Renrui Zhang, Lei Bai, Song Mao, Bin Wang, Aojun Zhou, Botian Shi, Tao Chen, Bo Zhang, Xiangyu Yue

Jiakang Yuan (袁家康)

About Me

🔥 News

📝 Publications & Preprints

📖 Educations

💬 Invited Talks

💻 Internships

📝 Academic Services

Jiakang Yuan
(袁家康)