研究
-
Zeyu Zhu, Gang Li, Peisong Wang, Zitao Mo, Minnan Pei, Zhuoran Song, Xiaoyao Liang, Jian Cheng. "DALI: A Workload-Aware Offloading Framework for Efficient MoE Inference on Local PCs." [arXiv], 2026.
-
Zeyu Zhu, Gang Li, Minnan Pei, Zitao Mo, Peihuan Ni, Peisong Wang, Tielong Liu and Jian Cheng. "KL-MoE: A Hierarchical MoE Pruning Framework Exploiting KL Divergence". [DAC], 2026.
-
Siting Wang, Minnan Pei, Luoyang Sun, Cheng Deng, Kun Shao, Zheng Tian, Haifeng Zhang, Jun Wang. "SpatialViz-Bench: Automatically Generated Spatial Visualization Reasoning Tasks for MLLMs". [ICLR], 2026.
-
Yuanhui Wang, Kunlong Liu, Minnan Pei, Zhangming Li, Peisong Wang and Qinghao Hu. "MemeBQ: Memory Efficient Binary Quantization of LLMs". [AAAI], 2026.
-
Peihuan Ni, Zitao Mo, Tielong Liu, Hongli Wen, Zeyu Zhu, Minnan Pei, Junwen Si, Weifan Guan, Peisong Wang, Qinghao Hu, Gang Li and Jian Cheng. "APEX: Integer-only Non-linear Function Approximation for Efficient Cross-Modal Inference". [DATE], 2026.
-
Tielong Liu, Gang Li, Zitao Mo, Zeyu Zhu, Minnan Pei and Jian Cheng. "Boosting the Performance of Tree-Based Speculative Decoding of LLMs on FPGAs". [DATE], 2026.
-
Minnan Pei, Gang Li, Junwen Si, Zeyu Zhu, Zitao Mo, Peisong Wang, Zhuoran Song, Xiaoyao Liang, Jian Cheng. "GCC: A 3DGS Inference Architecture with Gaussian-Wise and Cross-Stage Conditional Processing". [MICRO], 2025.
-
Jiahao Cui, Ruoxin Xiao, Shiyuan Fang, Minnan Pei, Yixuan Yu. "Encoding feature supervised UNet++: Redesigning Supervision for liver and tumor segmentation". [arxiv], 2022.