Software Engineer (Data Engineer / Data Science) - SWE Bench Evaluation
Design and implement data pipelines for benchmark-driven evaluation of AI systems. Work with structured and unstructured datasets to ensure data quality and integrity. Collaborate with researchers to develop challenging data engineering tasks for AI benchmarking.
Job Description
About The Company
Based in San Francisco, California, Turing is the world's leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing supports its clients by accelerating frontier research through high-quality data, advanced training pipelines, and top-tier AI researchers specializing in coding, reasoning, STEM, multilinguality, multimodality, and agents. Additionally, Turing helps enterprises transform AI from proof-of-concept to proprietary intelligence by developing reliable systems that deliver measurable impact and drive lasting results on the P&L. The company's innovative approach and commitment to excellence position it as a pioneer in the AI industry, fostering collaboration between cutting-edge research and practical enterprise solutions.
About The Role
We are seeking experienced Software Engineers (SWE Bench - Data Engineer / Data Science) to join our team and contribute to benchmark-driven evaluation projects focused on real-world data engineering and data science workflows. In this role, you will work hands-on with production-like datasets, designing and implementing data pipelines, performing data processing and analysis, and supporting experiments that evaluate the performance of advanced AI systems. The ideal candidate will possess a strong foundation in data engineering and data science, with the ability to work across various stages of data preparation, analysis, and modeling within complex codebases. This position offers an exciting opportunity to collaborate with top researchers and engineers to develop meaningful benchmarks that push the boundaries of AI technology.
Qualifications
The ideal candidate will have a minimum of three years of experience as a Data Engineer, Data Scientist, or Software Engineer with a focus on data workflows. Proficiency in Python is essential, particularly for data processing, analysis, and model-related tasks. Demonstrable experience working with structured and unstructured data, coupled with a solid understanding of machine learning and data science fundamentals, is required. Candidates should have the ability to navigate and modify complex, real-world codebases and produce clean, reusable, and well-documented code. Strong problem-solving skills, especially in algorithmic or data-intensive problems, are vital. Excellent communication skills in English, both spoken and written, are also necessary for effective collaboration within cross-functional teams.
Responsibilities
- Work with structured and unstructured datasets to support SWE Bench-style evaluation tasks, ensuring data quality and integrity.
- Design, build, and validate data pipelines used in benchmarking and evaluation workflows to facilitate accurate and reproducible results.
- Perform data processing, analysis, feature engineering, and validation to support various data science use cases.
- Write, run, and modify Python scripts to process data and support experimental workflows locally, ensuring efficiency and reliability.
- Evaluate data quality, transformations, and outputs for correctness, reproducibility, and adherence to project standards.
- Create clean, well-documented, and reusable data workflows that can be integrated into benchmarking frameworks.
- Participate in code reviews to maintain high standards of code quality, readability, and maintainability.
- Collaborate with researchers and engineers to design challenging, real-world data engineering and data science tasks for AI evaluation systems.
Benefits & Perks
Working as a freelancer with Turing offers the flexibility of a fully remote environment, allowing you to work from anywhere. You will have the opportunity to engage with cutting-edge AI projects alongside leading language model companies, expanding your expertise and professional network. Additionally, Turing provides a platform for continuous learning and growth through exposure to innovative technologies and methodologies. The role offers competitive engagement terms, flexible working hours, and the chance to contribute to impactful AI solutions that shape the future of technology.
Equal Opportunity
Turing is committed to fostering an inclusive environment where all qualified individuals have equal opportunity for employment. We value diversity and are dedicated to creating a workplace that respects and celebrates differences. We do not discriminate based on race, color, religion, gender, gender identity or expression, sexual orientation, national origin, age, disability, or any other protected status. Our goal is to ensure that every team member feels valued, supported, and empowered to contribute their best.