Join a high-growth AI company building cutting-edge foundation models. Work on large-scale model training, efficiency, and productionization. Improve the performance and efficiency of large-scale training workloads.
Key Highlights
Key Responsibilities
Technical Skills Required
Benefits & Perks
Nice to Have
Job Description
Senior Applied Research Engineer | Barcelona, Spain
We’re partnering with a high-growth AI company building cutting-edge foundation models designed to solve complex enterprise decision-making problems.
This is an opportunity to join a deeply technical team working on large-scale model training, efficiency, and productionisation. The environment is highly research-driven, but with a strong focus on real-world performance, scalability, and impact.
The role
You’ll work on the performance and efficiency of large-scale training workloads, helping to improve how advanced models are trained, scaled, optimised, and served in production.
This role sits at the intersection of research and systems engineering, with a strong focus on distributed training, profiling, memory optimisation, and model efficiency.
This is the chance to work on genuinely hard problems in foundation model development with meaningful real-world application.
What you’ll be doing
Looking to advance your Machine Learning & AI career with relocation support? Explore Machine Learning & AI Jobs with Relocation Packages that include comprehensive packages to help you move and settle in your new role.
- Profile end-to-end distributed training runs to identify bottlenecks across compute, GPU memory, and inter-GPU communication
- Improve the efficiency and reliability of large-scale training jobs, including contributing to architectural decisions and developing Triton/CUDA kernels where needed
- Design and implement model scaling, parallelisation, and memory optimisation techniques for very large context training workloads
- Partner closely with ML Researchers to diagnose inefficiencies, ensure new ideas scale effectively, and share best practice around model performance
- Support the productionisation and serving of models from the research side, including improving inference efficiency through techniques such as quantisation
📍 Barcelona, Spain - hybrid working [Firm will offer relocation support for Barcelona]
💶 Highly competitive salary + benefits + equity
⏱️ Permanent Role
What you bring:
Discover our full range of relocation jobs with comprehensive support packages to help you relocate and settle in your new location.
- Strong understanding of modern ML architectures and large-scale training pipelines
- Experience running distributed training jobs across multi-GPU systems
- Advanced profiling and debugging skills across CPU, GPU, memory, latency, and inter-GPU communication
- Strong Python skills
- Experience with model scaling and parallelisation approaches, including tensor and pipeline parallelism
- Familiarity with NCCL, MPI, and distributed communication primitives - highly desirable
- Knowledge of PyTorch and Triton internals - highly desirable
- Experience with C++ and CUDA - highly desirable
If you are interested in this role, please respond directly to this advert with your updated CV or email it to chantelle.smith@sourcegroupinternational.com
Similar Jobs
Explore other opportunities that match your interests
Deep Learning Engineer for Language Technologies
Barcelona Supercomputing Cente...
European Tech Recruit
AI Platforms Engineer