Remote opportunity for experienced iOS engineers to evaluate AI-generated mobile applications, review architecture and implementation decisions, and contribute to frontier AI coding model benchmarking projects.
Systems Engineer (Coding Agent Experience)
Job description
About the Role
Mercor is partnering with a leading AI research lab to support a Frontier Code Agents project.
This role focuses on evaluating and improving frontier AI coding models through realistic systems engineering workflows and structured technical assessments.
Contributors apply professional systems engineering expertise to review, compare, and improve AI-generated system designs, implementations, and engineering decisions.
Key Responsibilities
Evaluate AI Coding Agents
- Review AI-generated code, systems designs, architecture decisions, and infrastructure implementations
Technical Review & Analysis
- Identify bugs, edge cases, performance bottlenecks, and failure modes
- Evaluate system reliability, scalability, and maintainability
Systems Engineering Assessment
- Apply real-world engineering judgment to scenarios involving distributed systems, networking, operating systems, storage systems, infrastructure software, and database internals
Required Experience
- Minimum 2 years of professional systems engineering experience
- Experience with distributed systems, networking, operating systems, storage, or database internals
AI Coding Tools
Cursor, Claude Code, Codex, Windsurf, Gemini CLI, or similar tools.
Important Note
Mercor currently cannot support H1-B or STEM OPT candidates.
Compensation
- $400 per accepted task (~$85/hr effective rate)
About Mercor
Mercor partners with leading AI labs and enterprises to train and evaluate frontier AI systems using expert human knowledge.
You will be redirected to the company's website to complete your application.