Remote opportunity for fluent Spanish (Mexico) and English speakers to evaluate, transcribe, annotate, and benchmark audio content used to train advanced AI language models. This short-term contract role involves transcription, evaluation, and quality assurance tasks focused on Mexican Spanish audio data.
Spanish (Spain) Audio Generalist Evaluator Expert
Job description
Mercor is seeking a Spanish (Spain) Audio Generalist Evaluator Expert to contribute to a high-impact AI audio research project with a leading research laboratory.
In this role, you will work on transcription, annotation, evaluation, and quality assurance tasks that help train and benchmark advanced language models.
This is a structured short-term engagement designed for candidates with strong analytical, academic, linguistic, or research backgrounds who are fluent in both Spanish (Spain) and English.
The work focuses on transforming complex audio and visual information into accurate, well-structured written outputs while helping improve AI performance in Spanish-language audio tasks.
Job Responsibilities
Transcribe and Optimize Audio & Video
Listen to, analyze, and transcribe audio and video content in Spanish (Spain)
Produce high-quality written outputs in Spanish
Complete supporting tasks in English when required
Ensure:
- Accuracy
- Clarity
- Consistency
- Compliance with formatting requirements
Capture linguistic nuances including:
- Tone
- Intent
- Formal and informal register
- Regional expressions
- Contemporary Peninsular Spanish usage
Define and Document Evaluation Standards
- Establish standards for high-quality AI responses
- Develop:
- Evaluation rubrics
- Grading guidelines
- Review standards
in both Spanish and English.
- Document linguistic edge cases and language-specific considerations
- Identify:
- Grammatical complexities
- Idiomatic expressions
- Regional language variations
- Common evaluation challenges
Conduct Model Testing and Grading
Test AI-generated outputs
Evaluate responses based on:
- Accuracy
- Completeness
- Fluency
- Instruction following
- Clarity
Provide structured feedback to improve model performance
Support Benchmarking and Quality Assurance
Participate in:
- QA reviews
- Benchmark creation
- Evaluation cycles
Ensure consistency before datasets are integrated into official benchmarks
Collaborate with project teams to improve task design and resolve ambiguities
Minimum Qualifications
Strong writing, editing, and critical thinking skills
Ability to work independently and meet deadlines
Native or near-native fluency in:
- Spanish (Spain)
- English
Strong familiarity with:
- Peninsular Spanish
- Regional vocabulary
- Spanish accents
- Contemporary language usage across Spain
Ability to accurately transcribe and analyze Spanish-language audio content
Availability of 10–20 hours per week
Preferred Qualifications
College student or recent graduate
Background in:
- Linguistics
- Humanities
- Social Sciences
- Journalism
- Translation
- Localization
- Technical disciplines
Experience with:
- Transcription
- Annotation
- Localization
- AI evaluation
- Research workflows
Familiarity with differences between:
- Peninsular Spanish
- Latin American Spanish
Interest in:
- Artificial Intelligence
- Language Models
- Applied Research
Application Process
- Complete a short AI-led interview (approximately 15 minutes)
- Successful candidates will be onboarded and invited to begin project work
About Mercor
Mercor partners with leading AI laboratories and enterprises to train frontier AI models using human expertise.
Contributors work directly on projects that improve advanced AI systems while collaborating with researchers and domain experts across multiple fields.
You will be redirected to the company's website to complete your application.