Senior Python Engineer for LLM Data Generation and Evaluation

Turing India
Remote
Apply
AI Summary

Join a leading AI company to enhance Large Language Models (LLMs) by generating and evaluating high-quality data. Collaborate with top research teams and work on real-world AI challenges.

Key Highlights
Generate and evaluate data for fine-tuning and benchmarking LLMs
Design prompts, analyze model outputs, and provide detailed feedback
Collaborate with cross-functional teams to ensure high-quality data and evaluations
Technical Skills Required
Python Testing Debugging Async Programming
Benefits & Perks
Remote & Flexible work schedule
Collaboration with leading AI research teams
Work on real-world AI challenges

Job Description


Role Overview:

We’re looking for experienced Python engineers to collaborate with one of the world’s top Large Language Model (LLM) companies. Your work will directly help improve how AI models think, reason, and code.


In this role, you’ll generate and evaluate high-quality data used to fine-tune and benchmark LLMs. You’ll design prompts, analyze model outputs, write Python solutions, and provide detailed feedback that guides model improvements. This is a unique opportunity to contribute to the next generation of AI systems—without needing to train or build the models yourself.


What You’ll Do:

  • Write and maintain clean, efficient Python code for AI training and evaluation.
  • Evaluate and compare model responses as part of RLHF (Reinforcement Learning with Human Feedback).
  • Create and refine datasets for SFT (Supervised Fine-Tuning).
  • Develop reasoning-based feedback to enhance model accuracy and alignment.
  • Collaborate with cross-functional teams to ensure high-quality data and evaluations.


Requirements:

  • 3+ years of strong Python development experience.
  • Solid understanding of testing, debugging, async programming, and software best practices.
  • Excellent written and verbal communication in English.


Offer Details:

  • Commitment: Minimum 20 hrs/week (options for 20, 30, or 40 hrs).
  • Time Zone: 4-hour overlap with PST.
  • Contract: 1-month contractor role (no paid leave).


Perks:

  • Remote & Flexible → Work from anywhere, on your schedule.
  • Collaborate with leading LLM and AI research teams.
  • Work on real-world AI challenges shaping the future of intelligent systems.


About Turing:

Based in San Francisco, California, Turing is the world’s leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing supports customers in two ways: first, by accelerating frontier research with high-quality data, advanced training pipelines, plus top AI researchers who specialize in software engineering, logical reasoning, STEM, multilinguality, multimodality, and agents; and second, by applying that expertise to help enterprises transform AI from proof of concept into proprietary intelligence with systems that perform reliably, deliver measurable impact, and drive lasting results on the P&L.


After applying, you will receive an email with a login link. Please use that link to access the portal and complete your profile.


Know amazing talent? Refer them at turing.com/referrals, and earn money from your network.


Subscribe our newsletter

New Things Will Always Update Regularly