About Me
I am currently a Researcher at Kuaishou Technology, focusing on cutting-edge research in Computer Vision and Natural Language Processing. My current research interests include Multimodal Large Language Models (MLLMs), Formal Theorem Proving, and AI Agents.
I received my Ph.D. degree from the Multimedia and Human Understanding Group (MHUG) at the Department of Information Engineering and Computer Science, University of Trento, Italy, in 2022. I was supervised by Prof. Nicu Sebe and Dr. Bruno Lepri, and my thesis defense committee included Vittorio Murino, Zhengyou Zhang, and Elisa Ricci.
Before my doctoral studies, I earned my B.Eng. degree in Photogrammetry and Remote Sensing (2015) and M.Eng. degree in Pattern Recognition and Intelligent System (2018) from Wuhan University, China.
We are actively recruiting interns for long-term positions. If you are interested in research opportunities, please feel free to send your resume to my email!
Research Experience
Research focus: MLLMs, Formal Theorem Proving, and Agents
Research focus: Image Generation and Enhancement (GANs and Diffusion Models)
Mentors: Dr. Linchao Bao and Dr. Wei Bi.
Research focus: GANs, Image Domain Translation
Mentors: Prof. Nicu Sebe and Dr. Bruno Lepri.
Research focus: Deep Learning, GANs, Cross-modal Representations, Image Domain Translation
Mentors: Dr. Wei Bi and Dr. Xiaojiang Liu.
Research focus: Deep Learning, Neural Dialogue Generation
Mentor: Prof. Jian Yao.
Research focus: Deep Learning, Remote Sensing
Selected Publications
Conference Papers
Journal Articles
Recent News
We released Leanabell-Prover-V2 for verifier-integrated reasoning via RL.
We released SeqPE for universal positional encoding.
We released LCoT2Tree for uncovering structural patterns in Long CoT.
We released CrEval for evaluating text creativity across diverse domains.
We released the UNITE framework for Multimodal Information Retrieval.
One paper, MCTS-VCB, was accepted to the ACL main conference.
We released Capybara-VL and Capybara-Omni, our efficient MLLMs, at the ICLR 2025 SCI-FM workshop.
We released Leanabell-Prover, which achieves a state-of-the-art 59.8% pass@32 on MiniF2F-test.
Academic Services
Conference Reviews
- ICML 2025
- AAAI 2025
- CVPR 2024, 2023, 2022, 2021
- NeurIPS 2025, 2024, 2023, 2022
- ACM MM 2025, 2024, 2023, 2022, 2021, 2020
- ACL/EMNLP 2025, 2024
- ECCV 2024, 2022
- ICCV 2023, 2021
- IJCAI 2022, 2021
Journal Reviews
- IEEE TPAMI
- International Journal of Computer Vision (IJCV)
- IEEE Transactions on Industrial Informatics (TII)
- IEEE GRSL
- IEEE J-STARS
- IEEE TNNLS
- Machine Vision and Applications (MVAP)
- IEEE TMM
- Pattern Recognition Letters (PRL)
- Information Fusion