Senior Machine Learning Engineer - Inference Optimization

linuxrecruit European Union
Remote
Apply
AI Summary

Join a fast-growing tech unicorn building a next-generation AI platform for developers. As a Senior ML Engineer, you'll work on inference optimization, pushing performance in the real world. Ideal candidate has hands-on experience with Python, ML inference optimization, and tools like vLLM, Triton, SGLang, and TensorRT.

Key Highlights
Fully remote across Europe
Brand new team
Genuine 0→1 product build
Key Responsibilities
Deep in inference optimization work
Reducing latency
Improving model initialisation times
Building distributed systems
Technical Skills Required
Python ML inference optimization vLLM Triton SGLang TensorRT
Benefits & Perks
Fully remote work
Fast-growing tech unicorn

Job Description


A serious 0→1 build. Brand new team. Fully remote across Europe. That alone already says a lot.


Whether you’re working from a beach in Barcelona or a cottage in the Cotswolds, your location is secondary to the code you ship. This role isn't about tracking desk hours; it's about the impact of what you’re building. I’m looking for a Senior ML Engineer specialising in inference optimisation....


This is a chance to join a fast growing tech unicorn building a next generation AI platform for developers. The company originally made its name by helping engineering teams dramatically optimise cloud spend. That same mindset is now being applied to AI infrastructure, helping developers build, deploy and scale AI powered features faster, more efficiently, and at a significantly lower cost.


You’ll be joining early in a brand new team, working on a genuine 0→1 product build where engineering quality and performance really matter. This isn’t about academic benchmarks or theoretical work, it’s about solving real production problems at scale.


Day to day, you’ll be deep in inference optimisation work using tools like vLLM, Triton, SGLang, and TensorRT. The focus is on pushing performance in the real world: reducing latency, improving model initialisation times and building distributed systems that make high performance AI both accessible and cost efficient.


If you're someone whose is an expert in Python with proven experience in ML inference optimisation. You’ll ideally have hands on experience tuning inference engines and working with production scale systems using tools like vLLM, Triton, SGLang, and TensorRT, then this is for you!


If you’d like to find out more, you can apply below or email ethan.farrell@linuxrecruit.co.uk


Similar Jobs

Explore other opportunities that match your interests

Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Director

mvp match

European Union

MLOps Engineer

Machine Learning
4w ago
Visa Sponsorship Relocation Remote
Job Type Contract
Experience Level Mid-Senior level

puritas group

European Union

AI Expert / Advisor (NLP and ML)

Machine Learning
1h ago
Visa Sponsorship Relocation Remote
Job Type Part-time
Experience Level Mid-Senior level

talantum

Nigeria

Subscribe our newsletter

New Things Will Always Update Regularly