AI Red Team Specialist (Contract)

Mercor, Greater São Paulo Area
Remote
AI Summary

Mercor seeks an AI Red Team Specialist to test conversational AI models through adversarial techniques. Responsibilities include generating human data, documenting findings, and working independently. Requires prior red teaming experience and native English/Brazilian Portuguese fluency.

Key Highlights
Red team conversational AI models using jailbreaks and prompt injections.
Generate high-quality human data by annotating failures and classifying vulnerabilities.
Work independently and asynchronously to improve AI model performance.
Technical Skills Required
AI adversarial work, Cybersecurity, Socio-technical probing, Adversarial ML, Creative probing
Benefits & Perks
$29/hour compensation
Remote work

Job Description


About The Job

Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.

Position: AI Red Team Specialist

Type: Full-time or Part-time Contract Work

Compensation: $29/hour

Location: Remote; restricted to candidates in the USA and Brazil

Role Responsibilities

  • Red team conversational AI models and agents by executing jailbreaks, prompt injections, and misuse cases.
  • Generate high-quality human data by annotating failures, classifying vulnerabilities, and flagging systemic risks.
  • Apply structure by following taxonomies, benchmarks, and playbooks to maintain consistent testing.
  • Document reproducibly by producing reports, datasets, and attack cases that customers can act on.
  • Work independently and asynchronously to meet deadlines while improving AI model performance.

Qualifications

Must-Have

  • Prior red teaming experience in AI adversarial work, cybersecurity, or socio-technical probing.
  • Native-level fluency in English and Brazilian Portuguese.
  • Strong communication skills to explain risks clearly to technical and non-technical stakeholders.

Preferred

  • Experience with Adversarial ML, Cybersecurity, and socio-technical risk analysis.
  • Skills in creative probing, including psychology, acting, and writing for unconventional adversarial thinking.

Compensation & Legal

  • Hourly contractor
  • Competitive rates commensurate with experience

Application Process (takes 20–30 minutes to complete)

  • Upload resume
  • AI interview based on your resume
  • Submit form

Resources & Support

  • For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome
  • For any help or support, reach out to: support@mercor.com

PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.
