Evaluate documents for AI training quality and accuracy. High-paying remote role ($80–$160/hr).
Generalist Expert (AI Response Evaluator)
Job description
About Mercor
Mercor partners with leading AI labs and enterprises to train frontier models using human expertise. You will collaborate with top researchers and contribute to improving next-generation AI systems.
Role Overview
We are seeking Generalist Experts to evaluate AI-generated responses and provide structured, high-quality written feedback.
In this role, you will assess the quality, reasoning, and accuracy of AI outputs, ensuring they meet high standards across a wide range of topics. Your work will directly improve the reliability and performance of advanced AI systems.
Key Responsibilities
- Evaluate AI-generated responses for:
  - Accuracy
  - Reasoning quality
  - Clarity and completeness
- Identify gaps, inconsistencies, and weak logic
- Provide structured, well-reasoned written feedback
- Apply evaluation rubrics consistently and accurately
- Maintain high attention to detail in all assessments
Required Skills & Qualifications
- Strong analytical and critical thinking skills
- Excellent written communication abilities
- Ability to identify nuance, implicit meaning, and reasoning gaps
- High attention to detail and consistency
- Ability to work independently and follow structured guidelines
- Must not rely on AI writing tools during evaluation
Eligibility
- Native or near-native English proficiency required
- Candidates based in the United States, United Kingdom, Canada, Australia, or New Zealand are preferred
Work Details
- Fully remote
- Flexible schedule
- Contractor engagement
- Ongoing AI evaluation projects
Additional Notes
- Weekly payments via Stripe or Wise
- Projects may vary in duration based on performance and needs
- No visa sponsorship available