Technical Data Delivery Lead

Pareto

Computer Science & Engineering Full-time
Remote
$120,000 – $160,000
March 19, 2026

Job Description

About the Role

Pareto AI is seeking a Technical Data Delivery Lead to design, operate, and optimize advanced AI data pipelines for leading AI research labs.

This is a highly technical role focused on building and improving data workflows used in training large language models (LLMs). You will work directly with AI researchers and engineering teams to develop scalable systems for data collection, evaluation, and model improvement.

Responsibilities

Pipeline Architecture

  • Design end-to-end pipelines for:
  • RLHF (Reinforcement Learning from Human Feedback)
  • SFT (Supervised Fine-Tuning)
  • Red-teaming and model evaluation
  • Define annotation schemas, rubrics, and quality frameworks
  • Identify risks and optimize workflows before deployment

Advertisement

Agentic System Deployment

  • Build and deploy AI agents to automate pipeline tasks
  • Develop systems for:
  • Quality control
  • Expert routing
  • Output validation
  • Throughput monitoring
  • Work with engineering teams to integrate agentic workflows

Quality Systems

  • Define and maintain data quality standards
  • Implement:
  • Inter-rater reliability metrics
  • Calibration systems
  • Statistical sampling methods
  • Design automated quality checks and validation layers

Client Collaboration

  • Work directly with AI researchers and technical stakeholders
  • Translate research requirements into operational workflows
  • Communicate performance metrics and risks clearly

Advertisement

Research Integration

  • Stay updated on advancements in:
  • LLM training methodologies
  • Evaluation frameworks
  • AI data tooling
  • Integrate new approaches into active pipelines

Requirements

  • Strong proficiency in Python and SQL
  • Deep understanding of:
  • LLM training (RLHF, SFT)
  • Prompt engineering and output behavior
  • Experience with agentic frameworks such as:
  • LangChain
  • DSPy
  • AutoGen
  • Experience designing or managing data/ML pipelines
  • Strong written communication and documentation skills
  • Ability to operate in fast-paced, evolving environments

Preferred Qualifications

Advertisement

  • Experience with:
  • RL environments and red-teaming workflows
  • Data engineering or ML research support
  • Production-level AI agent systems
  • Knowledge of:
  • Inter-rater reliability
  • Annotation quality frameworks
  • Experience in client-facing or technical program roles

Compensation

  • $120,000 – $160,000 annually
  • Equity included
  • Remote US-based role

About Pareto AI

Pareto AI builds human-in-the-loop data pipelines that power the next generation of AI systems. The company collaborates with leading AI organizations, including research teams from Anthropic, Character.AI, and top universities, to create scalable and ethical AI training infrastructure.

Pareto focuses on combining human expertise with AI systems to improve model performance, reliability, and real-world applicability.

Apply Now

You will be redirected to the company's website to complete your application.

Job Summary

Company Pareto
Location Remote
Type Full-time
Category Computer Science & Engineering
Salary $120,000 – $160,000

Share This Job