Han Peng
I am a Staff Research Scientist at Ant Group, where I am growing a team on post-training, reasoning, and agent capabilities for multimodal foundation models.
Our recent work includes Ming-Flash-Omni, a 100B-parameter open-source model that can read, hear, speak, and see.
Previously, I was a Machine Learning Engineer at Google. I received my PhD in Physics from the University of Oxford, where I also did postdoctoral research at the Visual Geometry Group (VGG). My work at Oxford on AI for brain imaging set a world record.
🚀 We’re Hiring!
Our team is expanding. We are looking for passionate Research Interns and Full-time Researchers to tackle fundamental challenges in AGI. This is an opportunity to work on frontier research and see your work deployed at scale. If you’re excited by our work, please send your CV to: penghan (dot) peng (at) antgroup (dot) com.
For a comprehensive list of my publications and professional experience, please visit: