Machine Learning Scientist

Strativ Group • United State
Remote
Apply
AI Summary

Lead the design, development, and training of large, generative deep learning systems for small molecule drug discovery. Develop and maintain training pipelines, evaluate model quality, and collaborate with colleagues. Strong Python and PyTorch experience required.

Key Highlights
Design and implement large co-folding model architectures
Develop and maintain training pipelines
Collaborate with ML and software colleagues
Key Responsibilities
Design and implement improvements to large co-folding model architectures
Build and maintain training pipelines that support reliable, rapid, and large-scale experimentation
Develop and maintain benchmarking and evaluation workflows
Technical Skills Required
Python PyTorch Deep Learning
Benefits & Perks
Competitive compensation package
Fully remote role
Preference for East Coast US candidates
Nice to Have
Geometric deep learning
Equivariant neural network architectures
Protein-ligand modelling

Job Description


About the Company

A well-funded, AI-native biotech on a mission to deliver better medicines through innovation in AI-based drug discovery. The company has developed proprietary deep learning technology for protein-ligand co-folding and structure prediction, and is now scaling that technology to power the next generation of small molecule drug discovery programmes. This is a high-impact scientific role sitting at the intersection of cutting-edge ML research and real-world therapeutic application.



The Role

We are looking for a Machine Learning Scientist to lead the design, development, training, and benchmarking of the next version of our co-folding model - a large, generative deep learning system that predicts how small molecules bind to protein targets. You will own the full model development lifecycle, from architecture design through to training at scale and deployment-ready evaluation.

The role is well suited to those who thrives building systems that are reproducible, scalable, and impactful in real discovery workflows. Candidates at all experience levels are welcome - title and scope will be calibrated to experience and expected impact.



Key Responsibilities

Model Development

  • Design and implement improvements to large co-folding model architectures and training objectives for small molecule drug discovery
  • Build and maintain training pipelines that support reliable, rapid, and large-scale experimentation
  • Train models on many-GPU clusters, including debugging instability, throughput bottlenecks, and failed runs
  • Support periodic retraining and iterative model improvement

Evaluation & Quality

  • Develop and maintain benchmarking and evaluation workflows that track model quality, runtime, and cost
  • Validate on external benchmarks while prioritising internal discovery performance

Engineering & Collaboration

  • Write high-quality research and engineering code: refactor, package, test, and document ML components to support team velocity
  • Collaborate with ML and software colleagues to enable model deployment
  • Mentor interns through technical guidance, code reviews, and best practices in ML experimentation



Required Qualifications

  • PhD in ML, Computer Science, or a computational STEM discipline - or equivalent industry experience demonstrating comparable depth
  • Strong Python and PyTorch experience, including end-to-end implementation and training of deep learning models
  • Demonstrated experience applying deep learning to 3D or physical-science problems (e.g. structural biology, protein modelling, computational chemistry, scientific
  • ML)Strong engineering habits: reproducible experimentation, clean code, testing, and performance-minded debugging
  • Comfort working with modern ML infrastructure (e.g. Docker, CUDA, Kubernetes, experiment tracking tools)
  • Generative modelling: diffusion models, flow matching, stochastic interpolants, or related frameworks
  • Protein-ligand modelling, structure prediction, or structure-based drug discovery (SBDD) workflows
  • Geometric deep learning and/or equivariant neural network architectures
  • Multi-GPU and distributed training at scale
  • HPC or large-scale training operations
  • Docking, molecular dynamics, or other biomolecular simulation methods



What’s on Offer

  • Fully remote role with a preference for East Coast US candidates
  • Competitive compensation package aligned to experience and impact

Similar Jobs

Explore other opportunities that match your interests

Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Not Applicable

Jobgether

United State

Senior Software Engineer, AI/ML

Machine Learning
•
1d ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Mid-Senior level

ImagineX

United State

Enterprise AI Field Engineer

Machine Learning
•
1d ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Mid-Senior level

zaddy solutions

United State

Subscribe our newsletter

New Things Will Always Update Regularly