Posted on 2025/12/13
AI Agent Evaluation Analyst
Mindrift
Bayan Lepas, Penang, Malaysia
Full Description
Job Overview
Mindrift is seeking a curious and intellectually proactive AI Agent Evaluation Analyst.
The ideal candidate will have excellent analytical thinking, strong attention to detail, and good communication skills.
About the Project
We're looking for quality assurance specialists to validate and improve complex task structures, policy logic, and agent evaluation frameworks. The work includes:
• Reviewing evaluation tasks and scenarios for logic, completeness, and realism
• Identifying inconsistencies, missing assumptions, or unclear decision points
• Helping define clear expected behaviors (gold standards) for AI agents
Requirements
• Excellent analytical thinking: Can reason about complex systems, scenarios, and logical implications
• Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements
• Familiarity with structured data formats: Can read, but not necessarily write, JSON/YAML
• Ability to assess scenarios holistically: What's missing, what's unrealistic, what might break?
• Good communication and clear writing (in English) to document your findings
