Review simulated research environments and AI-generated scientific reasoning for a leading AI lab. Seeking Mechanical Engineering and Materials Science experts with research experience.
Arches — Simulated Research Reviewer (MechE & Mat-Sci)
Job description
Mercor is seeking domain experts in Mechanical Engineering and Materials Science to evaluate simulated research environments and AI agent trajectories.
Contributors will review AI-generated research workflows and outputs, assessing performance across multiple dimensions including technical accuracy, methodology, reasoning quality, and scientific rigor.
This project supports frontier AI research by helping evaluate how effectively AI systems conduct and communicate technical research.
Responsibilities
Review simulated research environments and AI agent workflows
Evaluate agent-generated outputs for:
- Technical accuracy
- Scientific rigor
- Methodological soundness
- Research quality
- Logical consistency
Assess research processes and decision-making trajectories
Identify strengths, weaknesses, and potential errors in AI-generated work
Provide structured evaluations and feedback
Contribute domain expertise to improve AI research benchmarks and evaluation frameworks
Ideal Background
Candidates should have strong expertise in one or more of the following areas:
Mechanical Engineering
- Engineering analysis
- Design and manufacturing
- Mechanics
- Thermodynamics
- Fluid mechanics
- Materials engineering
- Mechanical systems
Materials Science
- Material characterization
- Material properties
- Metallurgy
- Polymers
- Ceramics
- Composites
- Materials testing
- Applied materials research
Evaluation Focus Areas
Reviews may include assessment of:
- Scientific reasoning
- Experimental design
- Research methodology
- Data interpretation
- Technical correctness
- Evidence-based conclusions
- Overall research quality
Compensation
- $50–90/hour
- Compensation may vary based on expertise and project requirements
Contract & Payment Terms
- Independent contractor engagement
- Fully remote work
- Flexible schedule
- Weekly payments through Stripe or Wise
- Projects may be extended, shortened, or concluded depending on business needs and performance
Important Note
Mercor currently cannot support:
- H1-B candidates
- STEM OPT candidates
About Mercor
Mercor partners with leading AI labs and enterprises to train and improve frontier AI systems using human expertise.
Contributors work directly on projects that help shape the next generation of AI technologies while collaborating with researchers and domain specialists.
You will be redirected to the company's website to complete your application.