Technical Lead for LLM Evaluation and Training Datasets

Turing • India

Remote

AI Summary

We are looking for an experienced software engineer at the tech lead level to contribute to LLM evaluation and training datasets. The role involves hands-on software engineering work, including development environment automation, issue triaging, and evaluating test coverage and quality. The candidate will set up and configure code repositories, analyze GitHub issues, and modify codebases to assess LLM performance.

Key Highlights

Technical lead for LLM evaluation and training datasets

Hands-on software engineering work with development environment automation, issue triaging, and test coverage evaluation

Setting up and configuring code repositories and analyzing GitHub issues

Key Responsibilities

Analyze and triage GitHub issues across trending open-source libraries

Set up and configure code repositories, including Dockerization and environment setup

Evaluate unit test coverage and quality

Modify and run codebases locally to assess LLM performance in bug-fixing scenarios

Technical Skills Required

Git, Docker, Python

Benefits & Perks

Work in a fully remote environment

Opportunity to work on cutting-edge AI projects with leading LLM companies

Nice to Have

Previous participation in LLM research or evaluation projects

Experience building or testing developer tools or automation agents

Job Description

About the project: We are building LLM evaluation and training datasets to train LLMs to work on realistic software engineering problems. One of our approaches in this project is to build verifiable SWE tasks from public repository histories, using a synthetic approach with a human in the loop, while expanding dataset coverage to different types of tasks in terms of programming language, difficulty level, etc.

About the Role:

We are looking for experienced software engineers (tech lead level) who are familiar with high-quality public GitHub repositories and can contribute to this project. This role involves hands-on software engineering work, including development environment automation, issue triaging, and evaluating test coverage and quality.

Why Join Us?

Turing is one of the world’s fastest-growing AI companies accelerating the advancement and deployment of powerful AI systems. You’ll be at the forefront of evaluating how LLMs interact with real code, influencing the future of AI-assisted software development. This is a unique opportunity to blend practical software engineering with AI research.

What does the day-to-day look like?

Analyze and triage GitHub issues across trending open-source libraries.
Set up and configure code repositories, including Dockerization and environment setup.
Evaluate unit test coverage and quality.
Modify and run codebases locally to assess LLM performance in bug-fixing scenarios.
Collaborate with researchers to design and identify repositories and issues that are challenging for LLMs.

Opportunities to lead a team of junior engineers to collaborate on projects.

Required Skills:

At least 3 years of overall experience.
Strong experience with at least one of the following languages: Python
Proficiency with Git, Docker, and basic software pipeline setup.
Ability to understand and navigate complex codebases.
Comfortable running, modifying, and testing real-world projects locally.
Experience contributing to or evaluating open-source projects is a plus.

Nice to Have:

Previous participation in LLM research or evaluation projects.
Experience building or testing developer tools or automation agents.

Perks of Freelancing With Turing:
Work in a fully remote environment.
Opportunity to work on cutting-edge AI projects with leading LLM companies.

Offer Details:

Commitment required: At least 4 hours per day and a minimum of 20 hours per week, with 4 hours of overlap with PST. (There are three time-commitment options: 20 hrs/week, 30 hrs/week, or 40 hrs/week.)
Employment type: Contractor assignment (no medical/paid leave)
Duration of contract: 3 months (expected start date is next week)

After applying, you will receive an email with a login link. Please use that link to access the portal and complete your profile.

Know amazing talent? Refer them at turing.com/referrals, and earn money from your network.

Job Overview

Posted Date: Jun 24, 2026

Employment Type: Contract

Experience Level: Associate

Location: India

Category: Programming

Company: Turing
