Zhongzhu Zhou / Charlie Zhou

/ZHONG-JOO JOH/

  • Senior Research Scientist
    Turbo Team
    Together AI
  • Ph.D.
    School of Computer Science, Faculty of Engineering
    The University of Sydney
  • B.Eng. (Hons)
    School of Computer Science and Engineering
    Sun Yat-sen University

Office: Room 408, J12/1 Cleveland St, Darlington NSW 2008

Email: zhongzhu.zhou [at] sydney.edu.au, zhouzhzh8 [at] mail2.sysu.edu.cn

"Let everything happen to you. Beauty and terror. Just keep going. No feeling is final." — Rainer Maria Rilke

Who am I?

I am a Senior Research Scientist on the Turbo Team at Together AI, supervised by Ben Athiwaratkun.

I pursued my Ph.D. at the School of Computer Science, Faculty of Engineering, The University of Sydney, supervised by Prof. Shuaiwen Leon Song. I have been fortunate to intern at Dolby, the DeepSpeed team at Microsoft, the Weixin Group at Tencent, and Microsoft (China), contributing to projects on building machine learning systems. I was also a research associate at the School of Computer Science and Engineering, Sun Yat-sen University, from 2019 to 2022, under the supervision of Prof. Dan Huang and Yutong Lu. I received my B.Eng. degree from the School of Computer Science and Engineering, Sun Yat-sen University, in 2019.

Research Highlights

My research spans efficient machine learning and systems, from model pretraining quality to efficient algorithms and system co-design. It bridges emerging ML/LLM methods and real-world applications, improving both productivity (usable, robust stacks) and performance (throughput, memory, cost-efficiency). I work across academia and industry (Together AI), with an emphasis on LLM efficiency and scalable training infrastructure. My research is supported by Together AI.

Feel free to email me if our research interests align.

(🔥 indicates the projects I am leading)

Efficient ML Algorithm

  • Imitate Optimal Policy: Prevail and Induce Action Collapse in Policy Gradient (Efficient training) 🔥
  • Understanding and Steering the Cognitive Behaviors of Reasoning Models at Test-Time (Efficient inference) 🔥
  • I-DLM: Introspective Diffusion Language Models (Efficient inference)
  • Squeeze Evolve: Unified Multi-Model Orchestration for Verifier-Free Evolution (Efficient inference)
  • MixOfSpeculator: Mix-Architecture Speculator Design (Efficient inference) 🔥
  • Taylor-Calibrate: Principled Initialization for Hybrid Linear Attention Distillation (Efficient training) 🔥
  • Bio-Inspired LLM-Based Multiagent Systems (Efficient inference)
  • Tail Likelihood Reinforcement Learning (Efficient training)
  • Scaling Law of Speculative Decoding (Efficient inference)

Efficient ML System

  • XoRL (RL training system) 🔥
  • Hierarchical Performance Isolation for Distributed LLM (Agent system)
  • AgentGO (Agent system) 🔥
  • Smart KV (Agent system)
  • Universal KV System (Agent system)

Quantization

  • SAW-INT4: System-Aware 4-Bit KV-Cache Quantization for Real-World LLM Serving
  • OSCAR: Offline Spectral Covariance-Aware Rotation for 2-bit KV Cache Quantization (2026) 🔥

Modeling

  • CoderForge-Preview (TogetherAI Blog)
  • Loop Diffusion

For the full categorized list across all five research themes, see the Research page.

Featured Projects

Recent News

See the full archive for every announcement.

Work

Together AI

Hybrid & San Francisco, United States

Research Consultant / Senior Research Scientist May 2024 - Present

Dolby

Sydney, Australia

Research Intern Mar 2024 - Sep 2024

DeepSpeed Team, Microsoft

Sydney, Australia & Remote

Research Intern Mar 2023 - Feb 2024

Weixin Group, Tencent Holdings Ltd.

Champaign, IL, US & Guangzhou, China

Research Intern Jul 2018 - Jul 2020

Microsoft (China) Co., Ltd.

Guangzhou, China

Project Assistant to Senior Cloud Architect Sep 2018 - Feb 2019

Education

The University of Sydney (USYD)

Sydney, Australia

Doctor of Philosophy (Ph.D.) Oct 2022 - Feb 2026

  • Cumulative GPA: 4.0/4.0 (High Distinction)
  • Progress Evaluation: satisfactory or excellent, USYD, 2023, 2024, 2025
  • APR Intern Program Scholarship (SC3600), USYD, 2024
  • The Jingdong Technology (JD) Co Ltd Research Scholarship in Artificial Intelligence, USYD, 2022

The University of Sydney (USYD)

Sydney, Australia

Visiting Scholar Mar 2022 - Oct 2022

  • Future System Architecture (FSA) Lab, under Prof. Shuaiwen Song.

Sun Yat-sen University (SYSU)

Guangzhou, China

Research Associate Sep 2019 - Mar 2022

  • Cumulative GPA: 3.41/4.0
  • SYSU Overseas Visiting and Collaborative Research Program Funding Plan, SYSU, 2021
  • The Third Class Scholarship x 3, SYSU, 2020, 2021, 2022
  • The Second Class Scholarship (Top 15% of the major), SYSU, 2019

University of Illinois Urbana-Champaign (UIUC)

Remote & Champaign, IL, US

Summer Session Student Jun 2018 - Sep 2018

  • Illinois Computer Science Summer Research Program, UIUC, 2018

Sun Yat-sen University (SYSU)

Guangzhou, China

Bachelor of Engineering in Computer Science and Technology Sep 2015 - Jun 2019

  • Overall GPA: 3.9/4.0
  • National Scholarship (Top 1 of the major), China, 2016
  • Research Honor Degree, SYSU, 2019
  • The First Class Scholarship (Top 5% of the major) x 2, SYSU, 2015-2016, 2017-2018
  • The Second Class Scholarship (Top 15% of the major), SYSU, 2016-2017
  • Meritorious Winner, COMAP's Mathematical Contest in Modeling, United States, 2017
  • The Second Prize, The Chinese Mathematics Competitions, 2016
  • The Third Prize, The Chinese Mathematics Competitions, 2017
  • The Third Prize, ACM-ICPC, SYSU, 2017
  • The Second Prize, Student Innovation Software Development Competition, SYSU, 2017
  • The Third Prize, Microsoft Hackathon, South China, 2018

Selected Publications

Selected accepted conference and venue-published papers. For the complete list — including journals, preprints, under-review submissions, in-preparation work, books, and patents — see the Publications page or my Google Scholar profile.

Selected Talks

Selected Awards

Professional Service

Conference Participation

Teaching

COMP3520: Operating Systems Internals — Tutor, The University of Sydney, Fall 2023.

Certification

Skills

Programming Languages: Pascal (11 yrs), C (11 yrs), C++ (11 yrs), Python (6 yrs), HTML, CSS, JavaScript (6 yrs), Java (6 yrs), SQL (6 yrs), Bash (6 yrs), LaTeX (5 yrs), Matlab (5 yrs), CUDA (5 yrs), R (4 yrs), Go (4 yrs), Triton (2 yrs)

Systems and Infrastructure: MPI/OpenMPI (6 yrs), Linux Kernel (5 yrs), Distributed/Parallel File Systems (Lustre, HDFS) (5 yrs), Kubernetes, Kubernetes Scheduler, Kubernetes SR-IOV (5 yrs), Docker (5 yrs), Hadoop (4 yrs), Spark (4 yrs), YARN (4 yrs), Mesos (4 yrs), NVLink (2 yrs), NVSHMEM (2 yrs), Tensor Core, CUDA Core Programming (2 yrs)

Machine Learning and AI: TensorFlow (5 yrs), PyTorch (4 yrs), TorchServe (4 yrs), TensorBoard (4 yrs), Ray (4 yrs), JAX (2 yrs), Triton (2 yrs), DeepSpeed (1 yr), Hugging Face (1 yr), Reinforcement Learning (4 yrs), CNN, RNN, ResNet, Attention Block, UNet, Transformer, ViT (5 yrs), Neural Architecture Search (3 yrs), Diffusion Models (1 yr), GPT-2/3/4 (1 yr), RLHF (1 yr), VeOmni (1 yr), TorchTitan (1 yr), FSDP (1 yr), ZeRO 1/2/3 (1 yr)

Databases and Storage: MySQL (6 yrs), Oracle SQL (6 yrs), MongoDB (6 yrs), PostgreSQL (6 yrs), Redis (4 yrs), Hive SQL (3 yrs)

Front-end Development: PHP (6 yrs), Vue.js (6 yrs), ReactJS (6 yrs), ASP.NET (6 yrs), jQuery (6 yrs), AngularJS (6 yrs), Apache (6 yrs), MeteorJS (6 yrs)

Back-end Development: Spring Boot (6 yrs), Django (6 yrs), Flask (6 yrs), Node.js (6 yrs), Express (6 yrs), REST API Design (6 yrs), CI/CD

Mobile Development: Android Studio (6 yrs, Java/Kotlin), Xcode (6 yrs, Swift/Objective-C), React Native (6 yrs, cross-platform), Flutter (6 yrs, cross-platform)

Web Crawling & Testing: Urllib (6 yrs), BeautifulSoup (6 yrs), Scrapy (6 yrs), Requests (6 yrs), JSON (6 yrs), Selenium (6 yrs), Pytest (6 yrs), JUnit (6 yrs)

Version Control & Build Systems: Git (8 yrs), Gradle (6 yrs, Android/Java), Maven (6 yrs, Java), npm (6 yrs, JavaScript), pip (6 yrs, Python)

Development Tools & Libraries: Airflow (2 yrs), Kafka (2 yrs), Elasticsearch (2 yrs), OpenCV (5 yrs), Pandas (5 yrs), NumPy (5 yrs), SciPy (5 yrs), NLTK (5 yrs), Matplotlib (5 yrs), Seaborn (5 yrs), Azure Data Factory (2 yrs), AWS (2 yrs), Google Cloud Platform (2 yrs)

Extracurricular

Fitness: Fencing (6 yrs), Jogging (7 yrs), Bodybuilding (6 yrs, Hongxing Fitness Club Outstanding Student), Table Tennis (11 yrs), Badminton (11 yrs).

Leisure: Web & mobile application development (5 yrs), Saxophone (9 yrs), Magic (1 yr), Video Games (collection of 500+ PS5 games).

Volunteer

HPCA 2026 Conference Volunteer (Registration & On-site Operations) Feb 2026

  • Supported on-site conference operations: registration check-in, badge distribution, attendee guidance; coordinated with organizers for smooth session flow and timely room transitions.
  • Assisted speakers and session chairs with logistics (A/V setup, timekeeping, schedule updates) and handled ad-hoc issues to maintain a professional and welcoming conference experience.

SYSU School of Computer Science and Engineering, Student Union — Vice President Jul 2016 - Jul 2017

  • Mentored incoming freshmen and helped them acclimate to the university environment, promoting a sense of belonging through inclusive campus activities and events.

Changjun High School Volunteer Jul 2013 - Jul 2014

  • Enhanced the nursing-home experience by engaging with elderly residents, preparing and serving fresh fruit, and maintaining a clean and sanitary environment for their wellbeing.