About xAI
xAI’s mission is to create AI systems that can accurately understand the universe and support humanity’s pursuit of knowledge. The team is highly technical, fast-moving, and focused on engineering excellence, with a flat structure that values initiative and strong execution.
Role Overview
As an AI Tutor specializing in Hebrew and multilingual audio, you will help train and refine AI systems (including Grok) to perform better in voice interactions, speech recognition, and multilingual audio understanding.
Your work will involve annotating, evaluating, and improving audio data to enhance AI performance across languages, accents, and real-world conditions.
Key Responsibilities
- Label, annotate, and evaluate multilingual audio data (voice recordings, speech samples, etc.)
- Provide high-quality recordings and inputs for AI training
- Ensure audio data reflects natural speech, accurate pronunciation, and cultural context
- Analyze speech elements such as tone, accent, rhythm, and intonation
- Collaborate with technical teams to improve audio processing and annotation workflows
- Help refine tools and processes for efficient audio data handling
Requirements
- Native-level proficiency in Hebrew
- Strong English proficiency (minimum B2 level)
- Ability to analyze speech nuances such as accents, pronunciation, and intonation
- Experience transcribing audio with high accuracy
- Comfort recording voice samples and working with audio data
- Strong attention to detail and analytical thinking
- Ability to work independently and make decisions on ambiguous audio inputs
- Strong communication and organizational skills
Preferred Qualifications
- Background in linguistics, phonetics, speech science, or related fields
- Experience with audio datasets, annotation workflows, or AI training data
- Familiarity with transcription standards and handling speech variations
- Experience in voice-related work (voice acting, podcasting, recording, etc.)
- Portfolio of voice samples or annotated audio work
Work Details
- Work Type: Fully remote
- Engagement: Flexible (full-time, part-time, or contract)
- Schedule: Flexible hours depending on project scope
- Typical workload may average around 10+ hours per week (not fixed)
Compensation & Benefits
- U.S. Pay Range: $35 – $45 per hour
- Benefits vary depending on role type and location
- May include health insurance, 401(k), and paid leave for eligible U.S. roles
Additional Notes
- Open to international candidates (subject to eligibility)
- No visa sponsorship available
- Requires compatible device (Chromebook, macOS 11+, or Windows 10+)
- Strong focus on audio quality, linguistic accuracy, and real-world speech understanding