Freelance Agent Evaluation Engineer

Mindrift · London, UK

part_time mid tech £79,921
Apply through theHRkey →

Please submit your CV in English and indicate your level of English proficiency. Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment. What this opportunity involves We're building a dataset to evaluate AI coding agents — how well a model handles real-world developer tasks. You'll create challenging tasks and evaluation criteria within reali…

Posted 31 May 2026 · ref 56830