Evaluate and improve frontier AI coding models by reviewing realistic software engineering tasks, debugging outputs, and designing advanced coding evaluations.
Senior Software Code Reviewer
Job description
Compensation: $40 - $300/deliverable
About Vetto
Vetto is a tech company focused on building and scaling high-quality datasets for artificial intelligence systems. The company works at the intersection of human expertise and AI, ensuring that models are trained on technically accurate, well-defined, and realistic data.
Projects support the training and evaluation of Large Language Models (LLMs), where technical rigor and correctness are critical.
About the Project
This project focuses on the technical review and validation of coding tasks used to train AI models.
Coding challenges are generated automatically by AI, and the role involves evaluating the quality of those challenges.
Core evaluation questions include:
- Are the requirements well defined?
- Do the associated tests correctly validate the requirements?
Tests are treated as the source of truth in this context. A mistake in a task or its tests can propagate into AI systems at scale, so accuracy is critical.
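As an illustration of the two evaluation questions above, here is a minimal sketch in Python. The task ("return the unique items of a list, preserving first-occurrence order"), the function names, and the test helpers are all hypothetical and are not taken from the project's actual tasks. The sketch shows how a test can pass while failing to validate the stated requirement.

```python
# Hypothetical task: return the unique items of a list,
# preserving the order in which each item first appears.

def dedupe_correct(items):
    """Reference behavior: keeps first-occurrence order."""
    seen = set()
    result = []
    for item in items:
        if item not in seen:
            seen.add(item)
            result.append(item)
    return result


def dedupe_wrong(items):
    """Plausible but wrong solution: removes duplicates, loses the order."""
    return sorted(set(items))


def weak_test(dedupe):
    """Checks only membership, so it never exercises the ordering requirement."""
    return set(dedupe([3, 1, 3, 2, 1])) == {1, 2, 3}


def strict_test(dedupe):
    """Checks the exact output, including first-occurrence order."""
    return dedupe([3, 1, 3, 2, 1]) == [3, 1, 2]


if __name__ == "__main__":
    for name, solution in [("correct", dedupe_correct), ("wrong", dedupe_wrong)]:
        print(name, "weak:", weak_test(solution), "strict:", strict_test(solution))
```

In this sketch the weak test accepts both solutions, while the strict test accepts only the one that meets the requirement. A reviewer would flag the weak test as not validating the ordering requirement.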
Supported Languages
Tasks may involve reviewing code written in:
- Python
- JavaScript / TypeScript
- Go
- Rust
- Java
Responsibilities
- Evaluate whether coding tasks are clearly and correctly defined
- Verify that tests accurately indicate whether the problem has been solved
- Determine whether solutions that pass tests genuinely solve the intended problem
- Apply strict technical criteria and quality standards consistently across tasks
Required Profile
This role is designed for mid-level and senior software engineers with real professional experience.
Technical Requirements
- Professional experience in software development within production environments
- Experience reviewing and evaluating code written by other engineers
- Strong understanding of automated testing and of how tests validate behavior
- Experience contributing to or working with open-source projects
- Strong attention to detail and technical judgment
- Comfortable working fully in English (reading and writing)
Nice to Have
- Experience with Test-Driven Development (TDD)
- Familiarity with large or complex codebases
Project Details
This is expert-level, task-based technical work focused on:
- Analysis
- Validation
- Technical judgment
This role is not focused on producing production code.
Selection Process
Phase 1 — Application
Candidates submit their profile through the application form.
Evaluation is based on:
- Professional background
- Open-source contributions
- Screening responses
There are no interviews.
Phase 2 — Technical Exercise
Shortlisted candidates receive a technical exercise consisting of real coding tasks in the languages they selected.
The exercise simulates the actual work by requiring candidates to:
- Review generated code
- Assess whether solutions correctly solve the problem
- Validate whether associated tests are accurate and meaningful
Performance on the exercise is the primary selection criterion.
Candidates who pass both phases are onboarded directly into the project.
Final Notes
This role focuses on technical judgment, rigor, and responsibility.
Ideal candidates are comfortable:
- Challenging unclear problem definitions
- Questioning tests that pass but are logically incorrect
- Acting as a technical quality gate for AI training data
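To make "tests that pass but are logically incorrect" concrete, the following is a small hedged sketch in Python. The leap-year task, the buggy solution, and the generated test are hypothetical examples invented for illustration. The only real library call is `calendar.isleap` from the Python standard library, used here as an independent oracle.

```python
import calendar


def is_leap_year_buggy(year):
    """Hypothetical generated solution: ignores the century rule."""
    return year % 4 == 0


def generated_test(is_leap_year):
    """Hypothetical generated test. It passes against the buggy solution,
    but its expectation for 1900 is wrong under the Gregorian rules."""
    return is_leap_year(2024) is True and is_leap_year(1900) is True


def oracle_mismatches(is_leap_year, years):
    """Cross-check a solution against an independent oracle and
    return the years on which the two disagree."""
    return [y for y in years if is_leap_year(y) != calendar.isleap(y)]


if __name__ == "__main__":
    print("generated test passes:", generated_test(is_leap_year_buggy))
    print("oracle mismatches:",
          oracle_mismatches(is_leap_year_buggy, [1900, 2000, 2023, 2024, 2100]))
```

Here the generated test is green, yet the independent check shows that the solution and the test share the same wrong assumption. Comparing against a trusted reference is one possible review technique among several. Reading the requirement and checking each assertion by hand is another.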