Talent.com
RLHF Contractor (AI Model Training)

HIREXE • WorkFromHome, Limpopo, South Africa
15 hours ago
Job description

About the Role

We are seeking contractors to support Reinforcement Learning from Human Feedback (RLHF) projects, a core process for training and improving large language models (LLMs). You will provide high-quality feedback on AI-generated responses, helping refine model outputs for accuracy, usefulness, safety, and alignment with human expectations. This is a remote, flexible role suited to detail-oriented individuals who are comfortable working independently.

Responsibilities

  • Review and evaluate AI-generated responses for correctness, clarity, and helpfulness.
  • Rank multiple AI outputs by quality and relevance according to provided guidelines.
  • Provide structured written feedback to guide model improvements.
  • Follow detailed annotation instructions with consistency and accuracy.
  • Participate in calibration sessions to align with evolving project standards.
  • Meet productivity and quality benchmarks (e.g., number of tasks per hour, accuracy scores).

Qualifications

  • Strong command of English (reading and writing).
  • Ability to analyze text critically and identify errors in logic, tone, or factuality.
  • Comfortable working with evolving guidelines and ambiguous edge cases.
  • Reliable internet connection and ability to work remotely.
  • Prior experience with content moderation, annotation, writing, editing, or teaching is a plus.
  • Familiarity with AI, machine learning, or NLP concepts is beneficial but not required.

What We Offer

  • Flexible schedule: Work from anywhere.
  • Pay: Competitive hourly or project-based compensation.
  • Impact: Contribute to training cutting-edge AI systems used by millions.
  • Growth: Gain experience in the rapidly growing AI/ML ecosystem.

Seniority level: Associate
Employment type: Full-time
Job function: Engineering and Information Technology
Industries: Software Development
