Enterprise AI Agent Users

Mercor

Agent System Evaluator Contractor Short-term
Remote (Global) $30/hr June 27, 2026

Job description

We are seeking individuals with direct, hands-on experience using enterprise or company-specific AI agent interfaces.

In this role, you will contribute insights based on real-world usage of internal AI tools, helping improve how enterprise AI systems are evaluated, understood, and refined.

Key Responsibilities

Real-World AI System Usage

  • Provide insights based on actual usage of enterprise AI agent systems
  • Describe workflows involving company-specific AI interfaces
  • Highlight strengths, limitations, and usability patterns

Workflow Analysis

  • Explain how AI agents are used in day-to-day professional tasks
  • Identify friction points, inefficiencies, and failure modes
  • Compare different AI tool behaviors across enterprise environments

Feedback Contribution

  • Provide structured feedback on AI agent performance
  • Contribute observations that help improve model evaluation and training
  • Share contextual understanding of real enterprise deployment environments

Ideal Qualifications

  • Direct experience using enterprise AI tools, copilots, or internal AI agent systems
  • Familiarity with company-specific or proprietary AI interfaces
  • Ability to clearly describe workflows and system interactions
  • Strong written communication skills in English

Work Expectations

  • Fully remote, flexible contractor role
  • Asynchronous work with no fixed hours
  • Tasks may involve describing real-world AI usage patterns and experiences

Compensation

  • $30 per hour

Contract & Payment Terms

  • Independent contractor engagement
  • Flexible schedule
  • Weekly payments via Stripe or Wise
  • Projects may be extended or adjusted based on demand and performance

Important Note

Mercor currently cannot support:

  • H-1B candidates
  • STEM OPT candidates

About Mercor

Mercor partners with leading AI labs and enterprises to improve and evaluate frontier AI systems using real-world user expertise.

This role helps bridge the gap between production AI agent usage and model evaluation by incorporating insights from actual enterprise environments.

