Role Title: Red Team Lead (Offensive Cybersecurity) Role Type: Contractor Location: Remote micro1 is engaging Red Team Leads (Offensive Cybersecurity) to contribute expertise to a customer's critical cybersecurity project.
AI Jailbreak & Prompt-Injection Security Expert
Job description
Role Title: AI Jailbreak & Prompt-Injection Security Expert
Role Type: Contractor
Location: Remote
micro1 is engaging AI Jailbreak & Prompt-Injection Security Experts to contribute to a cutting-edge customer initiative focused on AI safety and robustness. In this role, you'll apply your expertise to help train next-generation AI systems. Your work will shape how models learn, reason, and perform through high-quality, real-world input. No prior experience in AI is required — your domain knowledge is what matters.
Scope of Work
- Design and implement advanced methodologies for evaluating AI system safety, focusing on ethical jailbreaks, LLM red teaming, prompt injection, and tool-use abuse scenarios.
- Create comprehensive cross-domain elicitation strategies to uncover multi-turn and complex adversarial bypass patterns in AI models.
- Develop, maintain, and update regression test suites that systematically test for jailbreak susceptibility and prompt-injection vulnerabilities.
- Construct robust evaluation frameworks that stress-test AI models against real-world adversarial threats, aiming to enhance overall system robustness.
- Collaborate with technical stakeholders to translate security findings into actionable improvements for model safety and risk mitigation.
- Document methodologies, findings, and best practices in clear, well-structured written reports and presentations for both technical and non-technical audiences.
Preferred Qualifications
- 5+ years of expertise in adversarial machine learning, LLM red teaming, AI safety evaluation, or a closely related security domain; 8–20 years preferred for senior contributors.
- Proven experience researching, testing, or uncovering vulnerabilities related to ethical jailbreaks, prompt injection, tool-use abuse, or adversarial AI attacks.
- Advanced degree (PhD, MS) in computer science, cybersecurity, machine learning, or a relevant discipline, or equivalent operational/professional background.
- High credibility and recognition within the AI security or adversarial ML community—such as published research, open-source tools, or conference presentations.
- Exceptional written and verbal communication skills, with a strong focus on clear documentation and collaborative problem-solving.
- Prior participation in multi-disciplinary projects or cross-functional AI safety initiatives is a plus.
- Familiarity with current LLM architectures, prompt engineering techniques, and security assessment tools is highly desirable.
You will be redirected to the company's website to complete your application.