Generalist — Real World Understanding

Mercor

LLM Evaluator (English) Contractor
Remote (Global) | $34 – $40/hr | May 14, 2026

Job description

Mercor is seeking analytically minded generalists to help train AI systems on real-world reasoning and visual understanding tasks.

This role focuses on evaluating AI performance across:

  • Multi-modal reasoning
  • Real-world interpretation
  • Spatial reasoning
  • Common-sense problem solving
  • Ambiguous scenario analysis

Contributors will work on challenging AI evaluation tasks that require flexible thinking and the ability to reason through situations that do not fit neatly into a single academic or professional domain.

Key Responsibilities

  • Evaluate AI-generated responses and reasoning across:

    • Visual understanding tasks
    • Real-world scenarios
    • Ambiguous reasoning challenges
    • Multi-modal problem-solving workflows
  • Assess AI performance for:

    • Common-sense reasoning
    • Logical consistency
    • Spatial understanding
    • Contextual interpretation
  • Analyze situations that require:

    • Flexible thinking
    • Judgment under ambiguity
    • General reasoning abilities
  • Provide structured feedback to improve AI reasoning and evaluation systems

  • Apply critical thinking to identify:

    • Weak reasoning
    • Missing context
    • Incorrect assumptions
    • Incomplete interpretations
  • Collaborate remotely with AI research and evaluation teams

  • Contribute to improving frontier AI understanding of real-world environments and human reasoning patterns

Required Skills & Qualifications

  • Strong analytical and reasoning abilities

  • Comfort working with:

    • Ambiguous tasks
    • Open-ended problems
    • Real-world interpretation challenges
  • Strong:

    • Critical thinking
    • Intellectual curiosity
    • Written communication
    • Attention to detail
  • Ability to evaluate visual and contextual information thoughtfully

  • Ability to work independently in remote environments

  • Strong general problem-solving abilities across multiple domains

Preferred Qualifications

  • Recent graduate from a highly rigorous or selective academic institution

  • Background involving:

    • Research
    • Analytical coursework
    • Interdisciplinary problem solving
  • Familiarity with:

    • AI systems
    • Visual reasoning tasks
    • Human evaluation workflows

Contract & Payment Information

  • Independent contractor engagement

  • Fully remote and flexible schedule

  • Weekly payments through:

    • Stripe
    • Wise
  • Projects may be extended, shortened, or concluded early based on project needs and performance.

Please note:

  • H-1B and STEM OPT candidates are not currently supported

About Mercor

Mercor partners with leading AI labs and enterprises to train frontier AI systems using human expertise.

Contributors collaborate with researchers and AI teams to improve next-generation reasoning, perception, and real-world understanding capabilities in AI systems.
