Freelance Business Operations Expert AI Evaluator

Uber AI Solutions is Uber’s marketplace connecting freelancers with Generative AI researchers. We’re inviting experienced business operations professionals to collaborate on tasks at the frontier of GenAI. This is a paid, remote freelance opportunity designed for independent contractors who want to apply their business operations expertise to help shape how AI handles complex organizational, logistical, and strategic language.

What you’ll work on

  • Apply your business operations expertise to define ground truth and refine LLM performance.
  • Draft, review, and refine complex, multi-constraint prompts to ensure they are logically sound, precisely structured, and optimized for high-level AI reasoning.
  • Compose "golden responses" that serve as the authoritative target for AI performance, requiring exceptional reasoning, factual precision, and structural clarity.
  • Analyze model-generated responses against established quality guidelines to measure alignment with target expectations.
  • Identify logical fallacies, subtle nuances, and edge cases to ensure every response meets the highest professional standard.

Engagement details

  • Location: Remote (United States)
  • Type: Freelance / Independent Contractor (1099)

Who we’re looking for

  • MBA, MS in Management, PMP certification, or Six Sigma Black Belt from an accredited institution.
  • 3+ years of work experience in operational strategy, supply chain management, or process improvement.
  • Prior data annotation or prompt-writing experience (strongly preferred).

Ideal backgrounds

  • Operations managers and directors.
  • Management consultants and business analysts.
  • Project and program managers (PMP).
  • Supply chain strategists and logistics coordinators.

Why this matters

Your expertise will guide how AI systems handle the nuances of business language. By evaluating and refining business prompts and responses, you’ll help ensure that AI is not only accurate but also clear, safe, and operationally sound for real-world application.