Max W.F. Ku
I’m a first-year PhD student in Computer Science at the University of Waterloo, Faculty of Mathematics, where I’m fortunate to be advised by Prof. Wenhu Chen.
My research lies at the intersection of generative AI, visual content creation, and model interpretability. I’m particularly interested in making visual generation and editing (images, videos, and beyond) more controllable, interpretable, and usable in creative applications.
At the heart of my work is a simple but ambitious goal:
To make generative visuals fully controllable across science, communication, and creative applications.
While visuals remain my core focus, I’m increasingly curious about how they can integrate with physical reasoning and scientific understanding. I believe controllability in generative models should go beyond aesthetics, extending to physical coherence and alignment with how we perceive the world.
My work spans:
- Controllable Editing and Generation (I prioritize editing over generation)
- Multimodal Agentic Systems (Visuals + X)
- Interpretability and Explainable AI
- Creative Applications in Entertainment, Education, and Science
news
Jun 15, 2025 | Achieved a total of 1000 citations. |
---|---|
Jun 11, 2025 | DisProtEdit got accepted to 2025 ICML GenBio workshop and FM4LS workshop. |
Jun 02, 2025 | Joined NVIDIA Deep Imagination Research as an intern for Summer 2025. |
May 15, 2025 | TheoremExplainAgent got accepted to ACL 2025 Main (Oral)! |
Nov 03, 2024 | AnyV2V got accepted to TMLR 2024! |
latest posts
Feb 25, 2025 | Paper Review - Audio-Visual Related Research (WIP) |
---|---|
Jan 14, 2025 | Implementing RAG for Code Library Documentation |
Jan 08, 2025 | Evaluating Protein Transfer Learning with TAPE |