I am a first-year Ph.D. student at the Hong Kong University of Science and Technology (HKUST), supervised by Prof. Jun Zhang. I received my B.Eng. degree from Beihang University. I am also a research intern at SenseTime Research, where I work closely with Dr. Ruihao Gong. Previously, I interned at Microsoft Research Asia and SenseTime Research. My research focuses on efficient vision and language generative models.
Selected Publications
* indicates equal contribution, 📧 indicates corresponding author

MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping
Yushi Huang, Zining Wang, Zhihang Yuan📧, Yifu Ding, Ruihao Gong, Jinyang Guo, Xianglong Liu, Jun Zhang📧

QVGen: Pushing the Limit of Quantized Video Generative Models
Yushi Huang, Ruihao Gong📧, Jing Liu, Yifu Ding, Chengtao Lv, Haotong Qin, Jun Zhang📧

Temporal Feature Matters: A Framework for Diffusion Model Quantization
Yushi Huang, Ruihao Gong, Xianglong Liu📧, Jing Liu, Yuhang Li, Jiwen Lu, Dacheng Tao

Yushi Huang*, Zining Wang*, Ruihao Gong📧, Jing Liu, Xinjie Zhang, Jinyang Guo, Xianglong Liu, Jun Zhang📧

TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models
Yushi Huang*, Ruihao Gong*, Jing Liu, Tianlong Chen, Xianglong Liu📧
Projects

LightCompress is an off-the-shelf compression suite for AIGC models (LLMs, VLMs, diffusion, etc.) that packages SOTA quantization, sparsification, and deployment best practices to shrink models while preserving accuracy.
Services
- Conference Reviewer: NeurIPS, ICLR, ICML, COLM, AAAI, CVPR.
Education
- 2025.02 - Now, Ph.D. in Electronic and Computer Engineering, The Hong Kong University of Science and Technology.
- 2020.09 - 2024.06, B.Eng. in Computer Science and Engineering, Shenyuan Honors College, Beihang University.
Internships
- 2025.02 - Now, SenseTime Research.
- 2024.12 - 2025.02, Microsoft Research Asia.
- 2023.05 - 2024.12, SenseTime Research.