Hello, everybody!
Hey, this is Xuehui Wang. I'm a PhD student of computer science at Shanghai Jiao Tong University, China, supervised by Prof.Wei Shen and this is my last year. I wanna spend more time exploring keys of artificial intelligience. Hope I can bring some solid works to the community of computer vision! Recently, I mainly focus on the development of vision-language models, especially how to adopt VLMs into the field of image understanding as well as how to enable a GUI agent to operate on personal computers and mobile phones.
I am a dog person. Labrador Retriever, Golden Retriever and Border Collie are my favorites. I wish I can have a cute dog after obtaining the PhD degree. I spend my free time on BiliBili, a video sharing platform like YouTube, and musics. I'm a hardcore fan of DaoYueShe(盗月社食遇记). I also enjor driving (I have a XPENG G6 car that can drive itself) and playing Nintendo Switch, which brings great time for me.
After a struggle with anxiety disorder, I believe that health is the greatest fortune, and knowledge is our permanent pursuit.
Education & Intern
Research Intern
@Supervisor: Dr. Xue Yang & Dr. Wenhai Wang & Prof. Jifeng Dai- ✓I'm an intern member of OpenGVLab, which is lead by Dr. Wenhai Wang, the Young Research Scientist of AILab, and Prof. Jifeng Dai.
- ✓Currently, I actively paticipate in the development of InternVL-2.5, InternVL-3 series models.
- ✓I also lead the development of an evaluation tool, termed as MMBench-GUI, for computer use, which is important to evaluate the capability of VLMs throughly.
Latest News
Sep, 2025Two paper has been accepted to NeurIPS 2025.Jul, 2025We release a new benchmark (MMBench-GUI) for GUI Agent.May, 2025One paper has been accepted to ICCV 2025.Mar, 2025One paper has been accepted to ICLR 2025.Mar, 2025One collaborative have been accepted to AAAI 2025.Apr, 2024Two papers have been accepted to ECCV 2024.Dec, 2023Two collaborative papers have been accepted to TIP and AAAI 2024.Apr, 2023I begin my fourth internship at Shanghai AI Lab.Feb, 2023Our survey has been accepted to TPAMIJan, 2023One collaborative paper about weakly-supervised rotated object detection has been accepted to ICLR 2023!Aug, 2022One collaborative paper about instance segmentation has been accepted to Neurocomputing (JCR Q2)!Jul, 2022One collaborative paper about night image restoration has been accepted to ECCV 2022!Mar, 2022One collaborative paper about generative adversarial networks has been accepted to TMM (JCR Q1)!Mar, 2022One paper about instance segmentation has been accepted to CVPR 2022!
Recent Papers


