AdamMeme is an adaptive, agent-based evaluation framework that assesses how well multimodal large language models understand harmful memes, iteratively updating its test data with challenging samples.

This paper introduces AdamMeme, a new way to test how well AI models can judge whether a meme is harmful. A system of cooperating agents keeps updating the tests with harder memes to challenge the models.

AdamMeme: Adaptively Probe the Reasoning Capacity of Multimodal Large Language Models on Harmfulness

Summary

What's the problem?

What's the solution?

Why does it matter?

Abstract