About Me
I am a Senior Researcher at Tencent, Singapore. I received my Ph.D. in May 2026 from S-Lab, Nanyang Technological University, where I was advised by Prof. Yiping Ke, Kelly and co-advised by Dr. Wayne Zhang.
My research centers on building multimodal agents that perceive, reason about, and act in the world. I currently focus on GUI agents — systems that operate graphical user interfaces the way people do — as a step toward general-purpose, action-capable intelligence. More broadly, I work at the intersection of vision–language learning and multimodal large language models, with earlier contributions to open-vocabulary and multimodal semantic segmentation.
I am always glad to connect with fellow researchers and to chat about potential collaborations. Feel free to reach out.
Experiences
- Jan 2026 - Present: Senior Researcher, Tencent, Singapore.
- Apr 2025 - Dec 2025: Research Intern, ByteDance/TikTok, Singapore. Worked with Dr. Zilong Huang and Dr. Song Bai.
- Aug 2022 - Jan 2023: Research Associate, CIL, Nanyang Technological University, Singapore. Worked with Prof. Yiping Ke, Kelly.
- Jul 2020 - Jul 2022: Research Assistant, GAP Lab, The Chinese University of Hong Kong, Shenzhen. Worked with Prof. Xiaoguang Han.
Publications [Google Scholar]
Awards
- First Grade Scholarship, 2017-2020
- National Scholarship, 2019
- Outstanding Student of Guangdong Province, 2020
Services
Conference Reviewer:
- ICML, ICLR, NeurIPS, CVPR, ICCV, ECCV, AAAI
Journal Reviewer:
- Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
- International Journal of Computer Vision (IJCV)
- Neurocomputing
- Engineering Applications of Artificial Intelligence (EAAI)
Teaching
- [2023/24 1st semester]: SC2002/CE2002/CZ2002 Object Oriented Design & Programming ~ Teaching Assistant
- [2023/24 2nd semester]: SC2008/CE3005/CZ3006 Computer Networks ~ Teaching Assistant