Senior Applied Research Engineer

sgi Spain
Relocation
AI Summary

Join a high-growth AI company building cutting-edge foundation models. Work on large-scale model training, efficiency, and productionisation, with a focus on improving the performance and efficiency of large-scale training workloads.

Key Responsibilities
Profile end-to-end distributed training runs to identify bottlenecks
Improve the efficiency and reliability of large-scale training jobs
Design and implement model scaling, parallelisation, and memory optimisation techniques
Technical Skills Required
Python, PyTorch, Triton, CUDA, NCCL, MPI, C++
Benefits & Perks
Highly competitive salary
Benefits
Equity
Relocation support for Barcelona
Hybrid working
Nice to Have
Knowledge of PyTorch and Triton internals
Experience with C++ and CUDA

Job Description


Senior Applied Research Engineer | Barcelona, Spain


We’re partnering with a high-growth AI company building cutting-edge foundation models designed to solve complex enterprise decision-making problems.

This is an opportunity to join a deeply technical team working on large-scale model training, efficiency, and productionisation. The environment is highly research-driven, but with a strong focus on real-world performance, scalability, and impact.


The role

You’ll work on the performance and efficiency of large-scale training workloads, helping to improve how advanced models are trained, scaled, optimised, and served in production.

This role sits at the intersection of research and systems engineering, with a strong focus on distributed training, profiling, memory optimisation, and model efficiency.

This is a chance to work on genuinely hard problems in foundation model development with meaningful real-world applications.


What you’ll be doing

  • Profile end-to-end distributed training runs to identify bottlenecks across compute, GPU memory, and inter-GPU communication
  • Improve the efficiency and reliability of large-scale training jobs, including contributing to architectural decisions and developing Triton/CUDA kernels where needed
  • Design and implement model scaling, parallelisation, and memory optimisation techniques for training workloads with very large context lengths
  • Partner closely with ML Researchers to diagnose inefficiencies, ensure new ideas scale effectively, and share best practice around model performance
  • Support the productionisation and serving of models from the research side, including improving inference efficiency through techniques such as quantisation


📍 Barcelona, Spain - hybrid working [Firm will offer relocation support for Barcelona]

💶 Highly competitive salary + benefits + equity

⏱️ Permanent Role


What you bring

  • Strong understanding of modern ML architectures and large-scale training pipelines
  • Experience running distributed training jobs across multi-GPU systems
  • Advanced profiling and debugging skills across CPU, GPU, memory, latency, and inter-GPU communication
  • Strong Python skills
  • Experience with model scaling and parallelisation approaches, including tensor and pipeline parallelism
  • Familiarity with NCCL, MPI, and distributed communication primitives - highly desirable
  • Knowledge of PyTorch and Triton internals - highly desirable
  • Experience with C++ and CUDA - highly desirable


If you are interested in this role, please respond directly to this advert with your updated CV or email it to chantelle.smith@sourcegroupinternational.com

