About Me

Hey there 👋🏻! I’m Zian (Andy) Zheng, an AI enthusiast on a quest for challenges and unexplored horizons.

  • As a CS MMath student at the University of Waterloo, I’m always eager to push boundaries and embrace novel ideas.

  • Beyond my studies, I thrive on adrenaline-fueled outdoor pursuits like skydiving, scuba diving, free diving, climbing, and kayaking.

Whether it’s conquering algorithms or braving the wild, I embrace every chance to grow, learn, and adapt.


Research Interests

I’m generally interested in LLMs, VLMs, and AIGC. Here are a few topics I’m exploring, along with what I’ve done so far:


1. Making AI models (LLMs / VLMs / diffusion models) more efficient

  • OpenMoE: Explored the training dynamics of the routing mechanism in Mixture-of-Experts (MoE) models, aiming for better load balancing during inference.
    → Findings: context-independent specialization, early routing convergence, and end-stage drop.

  • AdaVocab: Sparse vocabulary activation for next-token prediction, which speeds up small language model (SLM) inference.

  • TheMatrix: (my contribution) Inference optimization for real-time, interactive video generation with diffusion models. [demo] → This could be a future game engine (without rewards), or an efficient video world model (with them) :)
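
To make the load-balancing concern in the OpenMoE item concrete, here is a toy sketch. It is not OpenMoE's routing code. It assumes top-1 routing with a fixed per-expert capacity, and the names `route_with_capacity` and `load_fractions` are hypothetical. One plausible reading of "end-stage drop" is that, when a popular expert fills up, the tokens that get dropped are the ones late in the sequence, which is what this sketch shows.

```python
from collections import Counter


def route_with_capacity(expert_ids, num_experts, capacity):
    """Assign tokens to experts in sequence order, dropping overflow.

    expert_ids: the expert index the router picked for each token
                (assumed top-1 routing; illustrative input only).
    capacity:   max tokens each expert may accept (assumed fixed).
    Returns (kept, dropped): lists of token positions.
    """
    load = [0] * num_experts
    kept, dropped = [], []
    for pos, expert in enumerate(expert_ids):
        if load[expert] < capacity:
            load[expert] += 1
            kept.append(pos)
        else:
            # The expert is already full, so this later token is dropped.
            dropped.append(pos)
    return kept, dropped


def load_fractions(expert_ids, num_experts):
    """Fraction of all tokens routed to each expert (before any dropping)."""
    counts = Counter(expert_ids)
    total = len(expert_ids)
    return [counts.get(e, 0) / total for e in range(num_experts)]


# Expert 0 is over-subscribed: 5 of 8 tokens, with capacity 3.
choices = [0, 0, 1, 0, 0, 2, 0, 1]
kept, dropped = route_with_capacity(choices, num_experts=3, capacity=3)
fractions = load_fractions(choices, num_experts=3)
```

With these inputs only positions 4 and 6 are dropped, both in the second half of the sequence, while the load fractions show expert 0 receiving well over its even share.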


2. “Simulate with AI, learn from AI, and deploy in the real world”

A vision I’m passionate about: simulate any environment with world models, supervise policy learning with AI feedback (e.g., from VLMs), and finally generalize to the real world.

Roadmap (in my mind):

  • ✅ Built action-controllable world models (e.g. Matrix, Cosmos)
  • 🔄 Wrapping world model + VLM reward into a Gym-style RL env (in progress)
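
As a rough picture of the Gym-style wrapper in the second roadmap item, here is a minimal sketch. It assumes the world model is a callable `(obs, action) -> next_obs` and the VLM reward is a callable `(obs, action, next_obs) -> float`. Both stand-ins below are toy functions, and `WorldModelEnv` is a hypothetical name, not code from the actual project. The `reset`/`step` return shapes follow the Gymnasium convention. A fuller version would presumably subclass `gymnasium.Env` and declare observation and action spaces.

```python
from typing import Any, Callable, Dict, Tuple


class WorldModelEnv:
    """Gym-style env: a world model supplies transitions, a reward fn scores them."""

    def __init__(
        self,
        world_model: Callable[[Any, Any], Any],
        reward_fn: Callable[[Any, Any, Any], float],
        initial_obs: Any,
        max_steps: int = 10,
    ) -> None:
        self._world_model = world_model
        self._reward_fn = reward_fn
        self._initial_obs = initial_obs
        self._max_steps = max_steps
        self._obs = initial_obs
        self._t = 0

    def reset(self) -> Tuple[Any, Dict]:
        """Start a new episode; returns (observation, info)."""
        self._obs = self._initial_obs
        self._t = 0
        return self._obs, {}

    def step(self, action: Any) -> Tuple[Any, float, bool, bool, Dict]:
        """Returns (observation, reward, terminated, truncated, info)."""
        next_obs = self._world_model(self._obs, action)
        reward = float(self._reward_fn(self._obs, action, next_obs))
        self._obs = next_obs
        self._t += 1
        terminated = False  # this sketch defines no task-specific end state
        truncated = self._t >= self._max_steps  # time-limit cutoff
        return next_obs, reward, terminated, truncated, {}


# Toy stand-ins (assumptions): an integer "world" and a distance-to-goal reward.
def toy_world_model(obs: int, action: int) -> int:
    return obs + action


def toy_vlm_reward(obs: int, action: int, next_obs: int) -> float:
    return -abs(next_obs - 5)


env = WorldModelEnv(toy_world_model, toy_vlm_reward, initial_obs=0, max_steps=3)
obs, info = env.reset()
first = env.step(2)
second = env.step(2)
third = env.step(1)
```

Keeping the world model and the reward function as plain callables means either one can be swapped (for example, a learned reward model in place of direct VLM scoring) without touching the environment loop.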

Challenges I’m working on:

  • Fast world model inference: Pipelined and parallelized DiT, VAE, and post-processors in TheMatrix project.
  • AI-generated rewards: Can VLMs give direct feedback, or label preferences for training reward models? Still exploring.

3. Making human–AI interaction better

It’s not just research: I like to build tools too! As a heavy LLM/VLM user, I hope to build better interfaces for collaborating with AI.

Some thoughts:

  • Let users segment, reuse, and compose dialogue context more effectively
  • Turn chat history into a personalized knowledge base for reinforcing or sharing knowledge

Miscellaneous

  • Fun academic trivia: I’m Newton’s 18th-gen academic great-great-…-grandstudent. The academic family tree is here. Sadly, I don’t think I inherited much of his math genius — sorry, academic ancestors 😅
  • I am an extreme sports lover and adrenaline junkie. You can call me ‘Tri-diver Andy’ (skydiver, freediver, scuba diver).
  • I like writing poems (in English/Chinese). For example, here is one of my poems about life and yet-to-be-discovered love.