ViMRHP: A Vietnamese Benchmark Dataset for Multimodal Review Helpfulness Prediction via Human-AI Collaborative Annotation
Truc Mai-Thanh Nguyen, Dat Minh Nguyen, Son T. Luu, Kiet Van Nguyen
2025-05-14
Summary
This paper talks about ViMRHP, a new dataset made to help computers figure out which online reviews are actually helpful, using both text and images, and focusing on the Vietnamese language.
What's the problem?
The problem is that it's hard for AI to accurately judge which reviews are useful, especially for Vietnamese content, because there haven't been enough good datasets that include both words and pictures and are checked by humans.
What's the solution?
The researchers created a special dataset by having both humans and AI work together to label reviews for how helpful they are. They then tested how well AI models could use this data to predict helpfulness, comparing the results to what humans decided.
Why it matters?
This matters because it can make online shopping and information searching better for Vietnamese speakers by helping people quickly find the most useful reviews, and it also helps improve AI technology for languages that don't have as many resources.
Abstract
A Vietnamese multimodal review helpfulness prediction dataset (ViMRHP) is introduced, optimized using AI for annotation, and evaluated for performance in comparison to human-verified data.