I'm an AI researcher and engineer with 7+ years across the field, spanning both research and production. My work is backed by a PhD in Computer Science earned in China and 100+ citations on Google Scholar in generative modeling and computer vision. My background also includes startup experience building AI apps, and earlier years as a software engineer with team leadership — experience that still shapes how I build and ship.
Today I fine-tune large language and vision-language models and deploy inference services that stay fast and stable at scale. I'm at my best closing the gap between research and product — turning a promising result into a system people can actually rely on.
Although primarily an engineering role, covering system design through deployment and production, it also involved research: I successfully defended postdoctoral research on computer vision, hyperbolic deep learning, and interpretable AI, and published peer-reviewed papers.
Engineering work included, but was not limited to:
Fine-tuned generative diffusion models using Stable Diffusion and LoRA for conditional face synthesis with improved cultural and social diversity.
Built and deployed an AI onboarding assistant using LangChain, GPT-4, RAG, ChromaDB, and FastAPI, and fine-tuned LLaMA for paper summarization and QA.
AI Research Engineer · 2017–2022
Shanghai ERC of Big Data · Shanghai, China
Built VAE- and GAN-based models for image editing, makeup transfer, and masked-face inpainting.
Shipped real-time emotion- and face-recognition systems with PyTorch, OpenCV, and Docker.
Delivered forecasting models with XGBoost and scikit-learn; advanced facial-attractiveness research and released a labeled dataset.
Earlier: Software Engineering & Team Leadership · 2006–2014