Mengcheng Lan

About Me

I am a Senior Researcher at Tencent, Singapore. I received my Ph.D. in May 2026 from S-Lab, Nanyang Technological University, where I was advised by Prof. Yiping Ke, Kelly and co-advised by Dr. Wayne Zhang.

My research centers on building multimodal agents that perceive, reason about, and act in the world. I currently focus on GUI agents — systems that operate graphical user interfaces the way people do — as a step toward general-purpose, action-capable intelligence. More broadly, I work at the intersection of vision–language learning and multimodal large language models, with earlier contributions to open-vocabulary and multimodal semantic segmentation.

I am always glad to connect with fellow researchers and to chat about potential collaborations. Feel free to reach out.

Experiences

Jan 2026 - Present Senior Researcher, Tencent, Singapore.
Apr 2025 - Dec 2025 Research Intern, ByteDance/TikTok, Singapore. Working with Dr. Zilong Huang and Dr. Song Bai.
Aug 2022 - Jan 2023 Research Associate, CIL, Nanyang Technological University, Singapore. Working with Prof. Yiping Ke, Kelly.
Jul 2020 - Jul 2022 Research Assistant, GAP Lab, The Chinese University of Hong Kong, Shenzhen. Working with Prof. Xiaoguang Han.

Publications [Google Scholar]

Let ViT Speak: Generative Language-Image Pre-training
Proceedings of the 19th European Conference on Computer Vision, ECCV 2026
Yan Fang*, Mengcheng Lan*, Zilong Huang, Weixian Lei, Yunqing Zhao, Yujie Zhong, Yingchen Yu, Qi She, Yao Zhao, Yunchao Wei.
[Paper] [Code]

Text4Seg++: Advancing Image Segmentation via Generative Language Modeling
IEEE Transactions on Pattern Analysis and Machine Intelligence, TPAMI 2026
Mengcheng Lan, Chaofeng Chen, Jiaxing Xu, Zongrui Li, Yiping Ke, Xudong Jiang, Yingchen Yu, Yunqing Zhao, Song Bai.
[Paper] [Code]

Text4Seg: Reimagining Image Segmentation as Text Generation
Proceedings of the Thirteenth International Conference on Learning Representations, ICLR 2025
Mengcheng Lan, Chaofeng Chen, Yue Zhou, Jiaxing Xu, Yiping Ke, Xinjiang Wang, Litong Feng, Wayne Zhang.
[Paper] [Code]

ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation
Proceedings of the 18th European Conference on Computer Vision, ECCV 2024
Mengcheng Lan, Chaofeng Chen, Yiping Ke, Xinjiang Wang, Litong Feng, Wayne Zhang.
[Paper] [Code]

ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference
Proceedings of the 18th European Conference on Computer Vision, ECCV 2024
Mengcheng Lan, Chaofeng Chen, Yiping Ke, Xinjiang Wang, Litong Feng, Wayne Zhang.
[Paper] [Code]

Learning to Discover Knowledge: A Weakly-Supervised Partial Domain Adaptation Approach
IEEE Transactions on Image Processing, TIP 2024
Mengcheng Lan, Min Meng, Jun Yu, Jigang Wu.
[Paper] [Code]

SmooSeg: Smoothness Prior for Unsupervised Semantic Segmentation
Proceedings of the 37th Conference on Neural Information Processing Systems, NeurIPS 2023
Mengcheng Lan, Xinjiang Wang, Yiping Ke, Jiaxing Xu, Litong Feng, Wayne Zhang.
[Paper] [Code]

MIMO Is All You Need: A Strong Multi-In-Multi-Out Baseline for Video Prediction
Proceedings of the AAAI Conference on Artificial Intelligence, AAAI 2023
Shuliang Ning*, Mengcheng Lan*, Yanran Li, Chaofeng Chen, Qian Chen, Xunlai Chen, Xiaoguang Han, Shuguang Cui.
[Paper] [Code]

Awards

First Grade Scholarship, 2017-2020
National Scholarship, 2019
Outstanding Student of Guangdong Province, 2020

Services

Conference Reviewer:

ICML, ICLR, NeurIPS, CVPR, ICCV, ECCV, AAAI

Journal Reviewer:

Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
International Journal of Computer Vision (IJCV)
Neurocomputing
Engineering Applications of Artificial Intelligence (EAAI)

Teaching

[2023/24 1st semester]: SC2002/CE2002/CZ2002 Object Oriented Design & Programming ~ Teaching Assistant

[2023/24 2nd semester]: SC2008/CE3005/CZ3006 Computer Networks ~ Teaching Assistant