About Me

I am a Senior Researcher at Tencent, Singapore. I received my Ph.D. in May 2026 from S-Lab, Nanyang Technological University, where I was advised by Prof. Yiping Ke, Kelly and co-advised by Dr. Wayne Zhang.

My research centers on building multimodal agents that perceive, reason about, and act in the world. I currently focus on GUI agents — systems that operate graphical user interfaces the way people do — as a step toward general-purpose, action-capable intelligence. More broadly, I work at the intersection of vision–language learning and multimodal large language models, with earlier contributions to open-vocabulary and multimodal semantic segmentation.

I am always glad to connect with fellow researchers and to chat about potential collaborations. Feel free to reach out.

Experiences

  • Jan 2026 - Present Senior Researcher, Tencent Singapore.
  • Apr 2025 - Dec 2025 Research Intern, ByteDance/TikTok, Singapore. Working with Dr. Zilong Huang and Dr. Song Bai
  • Aug 2022 - Jan 2023 Research Associate, CIL, Nanyang Technological University, Singapore. Working with Prof. Yiping Ke, Kelly
  • Jul 2020 - Jul 2022 Research Assistant, GAP Lab, The Chinese University of Hong Kong, Shenzhen. Working with Prof. Xiaoguang Han

Publications [Google Scholar]

Let ViT Speak: Generative Language-Image Pre-training
Proceedings of the 19th European Conference on Computer Vision, ECCV 2026  
Yan Fang*, Mengcheng Lan*, Zilong Huang, Weixian Lei, Yunqing Zhao, Yujie Zhong, Yingchen Yu, Qi She, Yao Zhao, Yunchao Wei.
[Paper] [Code]

Text4Seg++: Advancing Image Segmentation via Generative Language Modeling
IEEE Transactions on Pattern Analysis and Machine Intelligence, TPAMI 2026  
Mengcheng Lan, Chaofeng Chen, Jiaxing Xu, Zongrui Li, Yiping Ke, Xudong Jiang, Yingchen Yu, Yunqing Zhao, Song Bai.
[Paper] [Code]

Text4Seg: Reimagining Image Segmentation as Text Generation
Proceedings of the Thirteenth International Conference on Learning Representations, ICLR 2025  
Mengcheng Lan, Chaofeng Chen, Yue Zhou, Jiaxing Xu, Yiping Ke, Xinjiang Wang, Litong Feng, Wayne Zhang.
[Paper] [Code]

ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation
Proceedings of the 18th European Conference on Computer Vision, ECCV 2024  
Mengcheng Lan, Chaofeng Chen, Yiping Ke, Xinjiang Wang, Litong Feng, Wayne Zhang.
[Paper] [Code]

ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference
Proceedings of the 18th European Conference on Computer Vision, ECCV 2024  
Mengcheng Lan, Chaofeng Chen, Yiping Ke, Xinjiang Wang, Litong Feng, Wayne Zhang.
[Paper] [Code]

Learning to Discover Knowledge: A Weakly-Supervised Partial Domain Adaptation Approach
IEEE Transactions on Image Processing, TIP 2024  
Mengcheng Lan, Min Meng, Jun Yu, Jigang Wu.
[Paper] [Code]

SmooSeg: Smoothness Prior for Unsupervised Semantic Segmentation
Proceedings of the 37th Conference on Neural Information Processing Systems, NeurIPS 2023  
Mengcheng Lan, Xinjiang Wang, Yiping Ke, Jiaxing Xu, Litong Feng, Wayne Zhang.
[Paper] [Code]

MIMO is all you need: a strong multi-in-multi-out baseline for video prediction
Proceedings of the AAAI Conference on Artificial Intelligence, AAAI 2023  
Shuliang Ning*, Mengcheng Lan*, Yanran Li, Chaofeng Chen, Qian Chen, Xunlai Chen, Xiaoguang Han, Shuguang Cui.
[Paper] [Code]

Awards

  • First Grade Scholarship, 2017-2020
  • National Scholarship, 2019
  • Outstanding Student of Guangdong Province , 2020

Services

Conference Reviewer:
  • ICML, ICLR, NeurIPS, CVPR, ICCV, ECCV, AAAI
Journal Reviewer:
  • Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
  • International Journal of Computer Vision (IJCV)
  • Neurocomputing
  • Engineering Applications of Artificial Intelligence (EAAI)

Teaching

  • [2023/24 1nd semester]: SC2002/CE2002/CZ2002 Object Oriented Design & Programming ~ Teaching Assistant
  • [2023/24 2nd semester]: SC2008/CE3005/CZ3006 Computer Networks ~ Teaching Assistant