Reinforcement Learning in Vision: A Survey

Weijia Wu, Chen Gao, Joya Chen, Kevin Qinghong Lin, Qingwei Meng, Yiming Zhang, Yuke Qiu, Hong Zhou, Mike Zheng Shou

2025-08-12

Reinforcement Learning in Vision: A Survey

Summary

This paper talks about recent progress in using reinforcement learning (RL) for visual tasks, where AI agents learn to make decisions by trial and error based on what they see. It reviews different strategies for improving these agents' learning processes, the main themes in this field, and how researchers measure the agents' performance.

What's the problem?

The problem is that teaching AI agents to understand and act based on visual information is very challenging. The environment they interact with is complex and often changes, and it's hard for the AI to learn effective behaviors just from images or videos. Many existing methods struggle with learning efficiently and adapting to these visual tasks.

What's the solution?

The paper surveys various policy optimization strategies, meaning ways to help the AI agents learn better actions from visual data. It discusses key aspects like how to represent the visual environment, how to reward the agents, and how to test their abilities. It also highlights the main challenges still unsolved and suggests directions for future research.

Why it matters?

This matters because visual reinforcement learning could enable smarter robots and AI systems that can see and interact with the world more like humans do. Improving these methods will help build AI that can perform complex tasks in real-time, from robots in factories to autonomous vehicles and interactive software, making technology more useful and responsive.

Abstract

This survey synthesizes recent advancements in visual reinforcement learning, covering policy optimization strategies, thematic pillars, and evaluation protocols, while highlighting open challenges.

View Paper