AsyncFlow: An Asynchronous Streaming RL Framework for Efficient LLM Post-Training

Zhenyu Han, Ansheng You, Haibo Wang, Kui Luo, Guang Yang, Wenqi Shi, Menglong Chen, Sicheng Zhang, Zeshun Lan, Chunshi Deng, Huazhong Ji, Wenjie Liu, Yu Huang, Yixiang Zhang, Chenyi Pan, Jing Wang, Xin Huang, Chunsheng Li, Jianping Wu

2025-07-04

Summary

This paper introduces AsyncFlow, a system that makes post-training of large language models faster and more efficient by using asynchronous streaming and smart data management techniques.

What's the problem?

Post-training large language models, especially with reinforcement learning, can be slow and inefficient. Tasks often sit idle waiting for data from each other, workloads are distributed unevenly across hardware, and existing frameworks are tightly coupled to specific training software, which limits flexibility and scalability.

What's the solution?

The researchers built AsyncFlow around a distributed data storage system that streams training data continuously, paired with an asynchronous workflow that overlaps different training tasks to avoid idle time. They also designed the system to be modular and independent of any specific training engine, making it easier to customize and scale.
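The streaming-and-overlap idea can be sketched with a simple producer-consumer queue: one thread generates rollouts while another trains on them as they arrive, instead of the two phases alternating in lockstep. This is a minimal illustrative sketch, not AsyncFlow's actual implementation; all names and data shapes below are hypothetical.

```python
import queue
import threading

# A bounded queue stands in for AsyncFlow's distributed data store:
# it lets rollout generation and training run concurrently.
rollout_queue = queue.Queue(maxsize=8)
NUM_ROLLOUTS = 20

def generate_rollouts():
    """Producer: stream samples into the queue as soon as each is ready."""
    for step in range(NUM_ROLLOUTS):
        sample = {"step": step, "tokens": [step] * 4}  # placeholder rollout
        rollout_queue.put(sample)
    rollout_queue.put(None)  # sentinel: generation finished

def train_on_stream(results):
    """Consumer: process samples as they arrive, with no batch-level barrier."""
    while True:
        sample = rollout_queue.get()
        if sample is None:
            break
        results.append(sample["step"])  # stand-in for a gradient update

results = []
producer = threading.Thread(target=generate_rollouts)
consumer = threading.Thread(target=train_on_stream, args=(results,))
producer.start(); consumer.start()
producer.join(); consumer.join()
print(len(results))  # 20: every streamed sample was consumed
```

Because the queue is bounded, the producer naturally slows down if training falls behind, a simple form of the load balancing the paper describes.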

Why it matters?

This matters because it helps improve the speed and scalability of fine-tuning large language models, enabling faster updates and better performance in real-world applications like chatbots, translators, and other AI tools that rely on large models.

Abstract

AsyncFlow is an asynchronous streaming RL framework that improves efficiency in the post-training phase of LLMs through distributed data management, dynamic load balancing, and decoupled architecture.