Hello, everybody!

Hey, this is Xuehui Wang. I'm a PhD student in computer science at Shanghai Jiao Tong University, China, supervised by Prof. Wei Shen, and this is my final year. I want to spend more time exploring the keys to artificial intelligence. I hope to bring some solid work to the computer vision community! Recently, I have mainly focused on the development of vision-language models, especially how to apply VLMs to image understanding and how to enable GUI agents to operate personal computers and mobile phones.

I am a dog person. Labrador Retrievers, Golden Retrievers, and Border Collies are my favorites. I hope to have a cute dog after obtaining my PhD degree. I spend my free time on BiliBili, a video-sharing platform like YouTube, and on music. I'm a hardcore fan of DaoYueShe (盗月社食遇记). I also enjoy driving (I have an XPENG G6 that can drive itself) and playing Nintendo Switch, both of which I have a great time with.

After a struggle with an anxiety disorder, I believe that health is the greatest fortune and knowledge is our permanent pursuit.

Education & Internships

Shanghai AI Laboratory

Shanghai Jiao Tong University

Tencent, Youtu Lab

Sensetime, MIG

Sun Yat-sen University

Shandong University

Research Intern

@Supervisor: Dr. Xue Yang & Dr. Wenhai Wang & Prof. Jifeng Dai

Apr 2023 - Sep 2025

Shanghai, China

✓I'm an intern member of OpenGVLab, which is led by Dr. Wenhai Wang, the Young Research Scientist of AILab, and Prof. Jifeng Dai.
✓Currently, I actively participate in the development of the InternVL-2.5 and InternVL-3 model series.
✓I also lead the development of MMBench-GUI, an evaluation tool for computer use, which is important for thoroughly evaluating the capabilities of VLMs.

Latest News

Sep, 2025 Two papers have been accepted to NeurIPS 2025.
Jul, 2025 We released a new benchmark (MMBench-GUI) for GUI agents.
May, 2025 One paper has been accepted to ICCV 2025.
Mar, 2025 One paper has been accepted to ICLR 2025.
Mar, 2025 One collaborative paper has been accepted to AAAI 2025.
Apr, 2024 Two papers have been accepted to ECCV 2024.
Dec, 2023 Two collaborative papers have been accepted to TIP and AAAI 2024.
Apr, 2023 I began my fourth internship at Shanghai AI Lab.
Feb, 2023 Our survey has been accepted to TPAMI.
Jan, 2023 One collaborative paper about weakly-supervised rotated object detection has been accepted to ICLR 2023!
Aug, 2022 One collaborative paper about instance segmentation has been accepted to Neurocomputing (JCR Q2)!
Jul, 2022 One collaborative paper about night image restoration has been accepted to ECCV 2022!
Mar, 2022 One collaborative paper about generative adversarial networks has been accepted to TMM (JCR Q1)!
Mar, 2022 One paper about instance segmentation has been accepted to CVPR 2022!