I am a 1st-year Ph.D. student at the Hong Kong University of Science and Technology (HKUST), supervised by Prof. Jun Zhang. I received my B.E. degree from Beihang University. I also work as a research intern at SenseTime Research, closely with Dr. Ruihao Gong. Previously, I have interned at Microsoft Research Asia and SenseTime Research. My research interest focuses on efficient vision and language generative models.

Selected Publications

* indicates equal contribution, 📧 indicates corresponding author

Preprint
sym

MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping

Yushi Huang, Zining Wang, Zhihang Yuan📧, Yifu Ding, Ruihao Gong, Jinyang Guo, Xianglong Liu, Jun Zhang📧

Paper Abstract
Preprint
sym

QVGen: Pushing the Limit of Quantized Video Generative Models

Yushi Huang, Ruihao Gong📧, Jing Liu, Yifu Ding, Chengtao Lv, Haotong Qin, Jun Zhang📧

Paper Abstract
TPAMI 2025
sym

Temporal Feature Matters: A Framework for Diffusion Model Quantization sym

Yushi Huang, Ruihao Gong, Xianglong Liu📧, Jing Liu, Yuhang Li, Jiwen Lu, Dacheng Tao

Paper Code Abstract
ICML 2025
sym

HarmoniCa: Harmonizing Training and Inference for Better Feature Caching in Diffusion Transformer Acceleration sym

Yushi Huang*, Zining Wang*, Ruihao Gong📧, Jing Liu, Xinjie Zhang, Jinyang Guo, Xianglong Liu, Jun Zhang📧

Paper Code Abstract
CVPR 2024 Highlight
sym

TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models sym

Yushi Huang*, Ruihao Gong*, Jing Liu, Tianlong Chen, Xianglong Liu📧

Paper Code Abstract Project Page

Projects

Toolkit
LightCompress

LightCompress is an off-the-shelf compression suite for AIGC models (LLMs, VLMs, diffusion, etc.) that packages SOTA quantization, sparsification, and deployment best practices to shrink models while preserving accuracy.

Services

  • Conference Reviews: NeurIPS, ICLR, ICML, COLM, AAAI, CVPR.

Educations

  • 2025.02 - Now, Ph.D. in Electronic Computer and Engineering, the Hong Kong University of Science and Technology.
  • 2020.09 - 2024.06, B.Eng. in Computer Science and Engineering, Shenyuan Honors College, Beihang University.

Internships

  • 2025.02 - Now, SenseTime Research.
  • 2024.12 - 2025.02, Microsoft Research Asia.
  • 2023.05 - 2024.12, SenseTime Research.