Interested in this position?
Upload your resume and we'll match you with this and other relevant opportunities.
Upload Your ResumeAbout This Role
Join Mercor as an AI Red Team Specialist to identify vulnerabilities in conversational AI models, generate high-quality human data, and document findings to improve model performance for leading AI research labs.
Responsibilities
- Red team conversational AI models and agents to identify jailbreaks, prompt injections, misuse cases, and bias exploitation
- Generate high-quality human data by annotating failures, classifying vulnerabilities, and flagging systemic risks
- Apply structure by following taxonomies, benchmarks, and playbooks to maintain consistent testing
- Document reproducibly by producing reports, datasets, and attack cases that customers can act on
- Work independently and asynchronously to meet deadlines while improving AI model performance
Requirements
- Prior red teaming experience in AI adversarial work, cybersecurity, or socio-technical probing
- Native-level fluency in English and Chinese (Mandarin)
- Strong communication skills to explain risks to technical and non-technical stakeholders
- Ability to thrive on moving across projects and customers
Nice to Have
- Experience in Adversarial ML: jailbreak datasets, prompt injection, RLHF/DPO attacks, model extraction
- Background in Cybersecurity: penetration testing, exploit development, reverse engineering
- Expertise in socio-technical risk: harassment/disinfo probing, abuse analysis, conversational AI testing
- Creative probing skills: psychology, acting, writing for unconventional adversarial thinking
Skills
AI
*
Cybersecurity
*
Penetration Testing
*
Conversational AI
*
prompt injection
*
jailbreaks
*
Red Teaming
*
Adversarial ML
*
Exploit development
*
Reverse engineering
*
* Required skills
About Mercor
Mercor connects elite creative and technical talent with leading AI research labs.
Technology
View all jobs at Mercor →
Related Searches
Similar Jobs
Data Science Scientist
Active Remote
Mercor
·
$56 - $77
Python
SQL
Snowflake
BigQuery
+6 more
1 week ago
Software Engineer – Codepath Expert
Active Remote
Mercor
·
$90 - $105
Python
C++
MySQL
1 week ago
Frontend Engineer
Active Remote
Mercor
·
$70 - $80
JavaScript
REACT
TypeScript
Vanilla JS
1 week ago
Engineering Program Coordinator
Active Remote
Mercor
·
$65 - $80
1 week ago
Content Analyst
Active Remote
Mercor
·
$30 - $35
1 week ago