About Mercor
Mercor partners with leading AI labs and enterprises to train frontier models using human expertise. You will collaborate with top researchers and contribute to improving next-generation AI systems in your domain.
Role Overview
Mercor is seeking detail-oriented Search Generalist Experts to support a high-impact project with a leading AI research lab.
In this role, you will evaluate and improve how advanced AI systems perform on real-world search and browsing tasks. You will assess model outputs across a wide range of queries and contribute to structured evaluation workflows that help train, benchmark, and refine frontier AI systems.
This role is ideal for strong generalists who are skilled researchers, clear writers, and comfortable making nuanced quality judgments at scale.
Key Responsibilities
- Evaluate AI-generated search responses for factual accuracy, helpfulness, clarity, completeness, and overall quality
- Assess whether models use search appropriately and whether search queries are well-formed and effective
- Compare model responses side by side and provide concise, defensible rationales
- Write and refine prompts, golden answers, rubric criteria, and edge cases
- Apply project guidelines consistently across ambiguous, multi-step, and real-world search tasks
- Identify recurring failure modes and escalate unclear cases or rubric gaps
- Participate in calibration, QA, and feedback loops to maintain quality standards
Ideal Qualifications
- Excellent written English and strong online research skills
- Strong judgment when synthesizing information from multiple sources
- Ability to distinguish factual accuracy from fluency or style
- High attention to detail and ability to follow structured guidelines
- Self-directed and reliable in a remote work environment
Preferred Qualifications
- Experience in search quality, fact-checking, content evaluation, QA, or annotation
- Familiarity with search evaluation concepts (factuality, helpfulness, severity, comparisons, tool use)
- Experience with LLM evaluation workflows or human data projects
- Multilingual skills are a plus
- Bachelor’s degree preferred (advanced degree or strong experience is a plus)
Work Details
- Fully remote and flexible
- Work involves real-world search evaluation and AI model improvement tasks
Contract & Payment Terms
- Independent contractor engagement
- Flexible schedule — work on your own time
- Weekly payments via Stripe or Wise
- Projects may be extended, shortened, or concluded early based on performance and business needs
- No access to confidential or proprietary external data required
Additional Notes
- We consider all qualified applicants without regard to legally protected characteristics
- Reasonable accommodations are available upon request
- Unable to support H1-B or STEM OPT candidates at this time