Publications
All Publications
Comprehensive list — accepted papers, under-review submissions, and in-preparation work. For the most current bibliographic information, please refer to my Google Scholar profile.
26 Total
15 Published
11 In Preparation
Conference Paper
- ICSE2020
- VLDB2024
- In Preparation2026Imitate Optimal Policy: Prevail and Induce Action Collapse in Policy Gradient Zhongzhu Zhou, Yibo Yang, Ziyan Chen, Fengxiang Bie, Haojun Xia, Xiaoxia Wu, Robert Wu, Ben Athiwaratkun, Bernard Ghanem, Shuaiwen Leon Song
- NeurIPS2024
- USENIX ATC2024Quant-LLM: Accelerating the Serving of Large Language Models via FP6-Centric Algorithm-System Co-Design on Modern GPU Haojun Xia, Zhen Zheng, Xiaoxia Wu, Shiyang Chen, Zhewei Yao, Stephen Youn, Arash Bakhtiari, Michael Wyatt, Donglin Zhuang, Zhongzhu Zhou, Olatunji Ruwase, Yuxiong He, Shuaiwen Leon Song
- ICML2025
- In Preparation2026Understanding and Steering the Cognitive Behaviors of Reasoning Models at Test-Time Zhenyu Zhang, Xiaoxia Wu, Zhongzhu Zhou, Qingyang Wu, Yineng Zhang, Pragaash Ponnusamy, Harikaran Subbaraj, Jue Wang, Shuaiwen Leon Song, Ben Athiwaratkun
- ICLR2026CARE: Covariance-Aware and Rank-Enhanced Decomposition for Enabling Multi-Head Latent Attention Zhongzhu Zhou, Fengxiang Bie, Ziyan Chen, Zhenyu Zhang, Yibo Yang, Junxiong Wang, Ben Athiwaratkun, Xiaoxia Wu, Shuaiwen Leon Song
Efficient inference through covariance-aware low-rank decomposition.
- MLSys2026KITTY: Accurate and Efficient 2-bit KV Cache Quantization with Channel-wise Precision Boost Haojun Xia, Xiaoxia Wu, Jisen Li, Robert Wu, Junxiong Wang, Jue Wang, Chenxi Li, Aman Singhal, Alay Shah, Alpay Ariyak, Donglin Zhuang, Zhongzhu Zhou, Ben Athiwaratkun, Zhen Zheng, Shuaiwen Leon Song
- ICML2026Aurora: When RL Meets Adaptive Speculative Training — A Unified Training-Serving System Junxiong Wang*, Fengxiang Bie*, Jisen Li, Zhongzhu Zhou, Zelei Shao, Yubo Wang, Yinghui Liu, Qingyang Wu, Avner May, Sri Yanamandra, Ce Zhang, Tri Dao, Percy Liang, Ben Athiwaratkun, Shuaiwen Leon Song, Chenfeng Xu, Xiaoxia Wu
- Together AI Blog2026CoderForge-Preview: SOTA Open Dataset for Training Efficient Coding Agents Alpay Ariyak*, Junda Zhang, Junxiong Wang, Shang Zhu, Federico Bianchi, Sanjana Srivastava, Ashwinee Panda, Siddhant Bharti, Chenfeng Xu, John Heo, Xiaoxia Shirley Wu, James Zou, Percy Liang, Leon Song, Ce Zhang, Ben Athiwaratkun, Zhongzhu Zhou*, Qingyang Wu*
- Ph.D. Thesis · USYD2026Efficient Compression Algorithm-System Co-Design for Large-Scale Model Training and Inference Zhongzhu Zhou
Doctor of Philosophy (Engineering) thesis at The University of Sydney.
- In Preparation2026Hierarchical Performance Isolation for Distributed LLM Zhongzhu Zhou et al.
- In Preparation2026SQUEEZE THINK: Multi-Model Orchestration for Efficient Recursive Self-Aggregation Zhongzhu Zhou et al.
- In Preparation2026Bio-Inspired LLM-Based Multiagent Systems Zhongzhu Zhou et al.
- In Preparation2026AgentGo: Agent Self-Guided Optimized Program Scheduling for Tool-Using Large Language Models Zhongzhu Zhou et al.
- In Preparation2026Introspective Diffusion Language Models Zhongzhu Zhou et al.
- In Preparation2026SAW-INT4: System-Aware 4-Bit KV-Cache Quantization for Real-World LLM Serving Zhongzhu Zhou et al.
- In Preparation2026Taylor-Calibrate: Principled Initialization for Hybrid Linear Attention Distillation Zhongzhu Zhou et al.
- In Preparation2026OSCAR: Offline Spectral Covariance-Aware Rotation for 2-bit KV Cache Quantization Zhongzhu Zhou et al.
Journal Paper
- Sensors (MDPI)2021Binary Neural Network for Automated Visual Surface Defect Detection Wenzhe Liu, Jiehua Zhang, Zhou Su, Zhongzhu Zhou, Li Liu
Special Issue on Intelligent Sensing and Monitoring for Industrial Process.
- IEEE TPAMI2025RenAIssance: A Survey into AI Text-to-Image Generation in the Era of Large Models Fengxiang Bie, Yibo Yang, Zhongzhu Zhou, Adam Ghanem, Minjia Zhang, Zhewei Yao, Xiaoxia Wu, Connor Holmes, Pareesa Golnari, David A. Clifton, Yuxiong He, Dacheng Tao, Shuaiwen Leon Song
Survey Papers track.
Preprint Paper
- arXiv2023DeepSpeed-Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scales Zhewei Yao, Reza Yazdani Aminabadi, Olatunji Ruwase, Samyam Rajbhandari, Xiaoxia Wu, Ammar Ahmad Awan, Jeff Rasley, Minjia Zhang, Conglong Li, Connor Holmes, Zhongzhu Zhou, Michael Wyatt, Molly Smith, Lev Kurilenko, Heyang Qin, Masahiro Tanaka, Shuai Che, Shuaiwen Leon Song, Yuxiong He
- arXiv2023DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies Shuaiwen Leon Song, Bonnie Kruft, Minjia Zhang, Conglong Li, Shiyang Chen, Chengming Zhang, Masahiro Tanaka, Xiaoxia Wu, Jeff Rasley, Ammar Ahmad Awan, Connor Holmes, Martin Cai, Adam Ghanem, Zhongzhu Zhou, et al.
Book
- Tianjin Univ. PressC Language Programming (in Chinese) Xuemao Zhou, Wei Yi, Zhongzhu Zhou
ISBN: 9787561847251.
Patent
- Chinese Patent2020Kubernetes 用户态应用中基于虚拟文件系统的小文件存储优化系统 Liang Du, Guixin Guo, Kangyou Zhong, Yunfei Du, Yutong Lu, Zhongzhu Zhou
Application CN202010195318.5, publication CN111475469A/B.