Web-CogReasoner: Towards Knowledge-Induced Cognitive Reasoning for Web Agents
Yuhan Guo, Cong Guo, Aiwen Sun, Hongliang He, Xinyu Yang, Yue Lu, Yingji Zhang, Xuntao Guo, Dong Zhang, Jianzhuang Liu, Jiang Duan, Yijia Xiao, Liangjian Wen, Hai-Ming Xu, Yong Dai
2025-08-07
Summary
This paper talks about Web-CogReasoner, a new system designed to make web agents smarter by helping them learn and think like humans do when using the web. It breaks down what they need to know and how they process information into clear stages, using a special kind of reasoning called knowledge-driven Chain-of-Thought.
What's the problem?
The problem is that existing web agents often struggle because they don't have a strong foundation of knowledge or a clear way to think through complex web tasks step-by-step. They might guess or give unclear answers instead of making decisions based on solid information and logical thinking.
What's the solution?
The solution was to create a framework that separates learning facts, concepts, and procedures, then uses this knowledge to guide the agent's reasoning in a structured way. The Web-CogReasoner uses screenshots and structural data from web pages to understand its environment, then plans and takes actions based on transparent reasoning steps. The system is trained on a specially made dataset that teaches the agent in stages, ensuring it learns both the facts and how to use them.
Why it matters?
This matters because it helps build web agents that can handle complicated tasks more reliably and explain their decisions better. With improved reasoning and knowledge, these agents can assist users more effectively, making interactions with websites smoother, smarter, and easier.
Abstract
A framework for web agents decomposes their capabilities into knowledge content learning and cognitive processes, using a structured dataset and a novel reasoning framework to enhance generalization and performance.