AI Red-Teamer — Adversarial AI Testing - Remote job Software Development at Mercor

Sep 30, 2025

Full time | Software Development | USA, UK, Canada | Cybersecurity

Job Category : Software Development

Job Type : Full time

Company : Mercor

Candidate Required Location : USA, UK, Canada

This description summarizes our understanding of the job posting. Click the 'Apply' button to find out more.

Role Description

This role involves building a pod of AI Red-Teamers who probe AI models with adversarial inputs, surface vulnerabilities, and generate the red-team data that makes AI safer for customers.

Red-team AI models and agents: jailbreaks, prompt injections, misuse cases, exploits
Generate high-quality human data: annotate failures, classify vulnerabilities, and flag systemic risks
Apply structure: follow taxonomies, benchmarks, and playbooks to keep testing consistent
Document reproducibly: produce reports, datasets, and attack cases customers can act on
Flex across projects: support different customers, from LLM jailbreaks to socio-technical abuse testing

Qualifications

Prior red-teaming experience (AI adversarial work, cybersecurity, socio-technical probing) is highly recommended; OR
Extensive AI background/education that equips you to learn red-teaming fast
Curious and adversarial: instinctively push systems to breaking points
Structured: use frameworks or benchmarks, not just random hacks
Communicative: explain risks clearly to technical and non-technical stakeholders
Adaptable: thrive on moving across projects and customers

Nice-to-Have Specialties

Adversarial ML: jailbreak datasets, prompt injection, RLHF/DPO attacks, model extraction
Cybersecurity: penetration testing, exploit development, reverse engineering
Socio-technical risk: harassment/disinfo probing, abuse analysis
Creative probing: psychology, acting, writing for unconventional adversarial thinking

What Success Looks Like

Uncover vulnerabilities automated tests miss
Deliver reproducible artifacts that strengthen customer AI systems
Expand evaluation coverage: more scenarios tested, fewer surprises in production
Mercor customers trust the safety of their AI because you’ve already probed it like an adversary

Why Join Mercor

Build experience in human-data-driven AI red-teaming at the frontier of safety
Play a direct role in making AI systems more robust, safe, and trustworthy

💰 Salary: $54 - $111 USD hourly
