Hello, everybody!

Hey, this is Xuehui Wang. I'm a PhD student of computer science at Shanghai Jiao Tong University, China, supervised by Prof.Wei Shen and this is my fourth year. I wanna spend more time exploring keys of artificial intelligience. Hope I can bring some solid works to the community of computer vision! Recently, I mainly focus on the development of vision-language models, especially how to adopt VLMs into the field of image understanding as well as how to enable a GUI agent to operate on personal computers and mobile phones.

I am a dog person. Labrador Retriever, Golden Retriever and Border Collie are my favorites. I wish I can have a cute dog after obtaining the PhD degree. I spend my free time on BiliBili, a video sharing platform like YouTube, and musics. I'm a hardcore fan of DaoYueShe(盗月社食遇记). I also enjor driving (I have a XPENG G6 car that can drive itself) and playing Nintendo Switch, which brings great time for me.

After a struggle with anxiety disorder, I believe that health is the greatest fortune, and knowledge is our permanent pursuit.

Education & Intern

Shanghai AI LaboratoryShanghai AI Laboratory
Shanghai Jiao Tong UniversityShanghai Jiao Tong University
Tencent, Youtu LabTencent, Youtu Lab
Sensetime, MIGSensetime, MIG
Sun Yat-sen UniversitySun Yat-sen University
Shandong UniversityShandong University

Research Intern

@Supervisor: Dr. Xue Yang & Dr. Wenhai Wang
April 2023 - Now
Shanghai, China
  • I'm an intern member of OpenGVLab, which is lead by Dr. Wenhai Wang, the Young Research Scientist of AILab, and Prof. Jifeng Dai.
  • Currently, I actively paticipate in the development of InternVL-2.5, InternVL-3 series models.
  • I also lead the development of an evaluation tool for computer use, which is important to evaluate the capability of VLMs throughly.

Latest News

  • Mar, 2025 One paper has been accepted to ICLR 2025.
  • Mar, 2025 One collaborative have been accepted to AAAI 2025.
  • Apr, 2024 Two papers have been accepted to ECCV 2024.
  • Dec, 2023 Two collaborative papers have been accepted to TIP and AAAI 2024.
  • Apr, 2023 I begin my fourth internship at Shanghai AI Lab.
  • Feb, 2023 Our survey has been accepted to TPAMI
  • Jan, 2023 One collaborative paper about weakly-supervised rotated object detection has been accepted to ICLR 2023!
  • Aug, 2022 One collaborative paper about instance segmentation has been accepted to Neurocomputing (JCR Q2)!
  • Jul, 2022 One collaborative paper about night image restoration has been accepted to ECCV 2022!
  • Mar, 2022 One collaborative paper about generative adversarial networks has been accepted to TMM (JCR Q1)!
  • Mar, 2022 One paper about instance segmentation has been accepted to CVPR 2022!

Recent Papers

Expanding performance boundaries of open-source multimodal models with model, data, and test-time scaling
Arxiv, 2025
ArxivInternVL
Zhe Chen, Weiyun Wang, Yue Cao, Yangzhou Liu, Zhangwei Gao, etc.
FLoRA: Maintaining Structural Integrity in Parameter Spaces for Parameter Efficient Fine-tuning
IEEE International Conference on Learning Representation (ICLR), 2025
ICLR
Chongjie Si*, Xuehui Wang*, Xue Yang, Zhengqin Xu, Qingyun Li, Jifeng Dai, Yu Qiao, Xiaokang Yang, Wei Shen#
Tendency-driven mutual exclusivity for weakly supervised incremental semantic segmentation
European Conference on Computer Vision (ECCV), 2024
ECCV
Chongjie Si*, Xuehui Wang*, Xiaokang Yang, Wei Shen#

Academic Services

Reviewer for Conferences
Service Image
CVPR 2023
Service Image
CVPR 2024
Service Image
CVPR 2025
Service Image
ICCV 2023
Service Image
ICCV 2025
Service Image
ECCV 2022
Service Image
ICLR 2024
Service Image
ICLR 2025
Service Image
NeurIPS 2024
Service Image
NeurIPS 2025
Service Image
CVPR 2023
Service Image
CVPR 2024
Service Image
CVPR 2025
Service Image
ICCV 2023
Service Image
ICCV 2025
Service Image
ECCV 2022
Service Image
ICLR 2024
Service Image
ICLR 2025
Service Image
NeurIPS 2024
Service Image
NeurIPS 2025
Service Image
CVPR 2023
Service Image
CVPR 2024
Service Image
CVPR 2025
Service Image
ICCV 2023
Service Image
ICCV 2025
Service Image
ECCV 2022
Service Image
ICLR 2024
Service Image
ICLR 2025
Service Image
NeurIPS 2024
Service Image
NeurIPS 2025
Service Image
CVPR 2023
Service Image
CVPR 2024
Service Image
CVPR 2025
Service Image
ICCV 2023
Service Image
ICCV 2025
Service Image
ECCV 2022
Service Image
ICLR 2024
Service Image
ICLR 2025
Service Image
NeurIPS 2024
Service Image
NeurIPS 2025