VideoGameQA-Bench: Evaluating Vision-Language Models for Video Game Quality Assurance
Mohammad Reza Taesiri, Abhijay Ghildyal, Saman Zadtootaghaj, Nabajeet Barman, Cor-Paul Bezemer
2025-05-23
Summary
This paper introduces VideoGameQA-Bench, a new benchmark for testing how well AI models that understand both images and text (vision-language models) can help check and improve the quality of video games.
What's the problem?
Making sure video games work correctly and look good is a huge job. Current AI models aren't usually tested on the kinds of tasks needed for video game quality assurance, so it's hard to know whether they can really help game developers.
What's the solution?
The researchers created VideoGameQA-Bench, a purpose-built set of tests that measures how well these vision-language models handle the specific challenges of video game quality assurance, such as spotting bugs or checking graphics.
Why does it matter?
This benchmark helps game developers identify which AI tools are actually useful for building better, more reliable games, and it could make game testing faster and more accurate.
Abstract
This paper introduces VideoGameQA-Bench, a benchmark for assessing Vision-Language Models on video game quality assurance tasks.