Senior ML Engineer - Generative AI & LLM Systems

ninetwothree ai studio • Spain

Remote

Apply

AI Summary

Design, optimize, and deploy production-grade LLM applications and predictive analytics for global clients. Architect robust AI features, evaluation frameworks, and agentic workflows using modern cloud infrastructure. Requires 3+ years of ML engineering experience with Python, AWS, and strong product delivery mindset.

Key Highlights

Production LLM and generative AI systems for enterprise clients

Full-stack integration with serverless architecture (AWS Lambda)

Ownership from prototype to production with ambiguous problem solving

Evaluation frameworks for AI quality, safety, and business impact

Key Responsibilities

Architect and build AI features: design and implement robust classical ML and generative AI solutions, balancing agentic architectures with deterministic pipelines

Evaluate AI quality: design and maintain evaluation frameworks to measure reliability, safety, and business impact before and after deployment

Integrate and deploy AI capabilities: partner with full-stack developers and DevOps to integrate into client web and mobile applications using serverless architecture or API endpoints

Optimize for production: refine prompts, system instructions, and chunking strategies to balance accuracy, latency, token consumption, and data privacy

Develop traditional predictive analytics: clean and process unstructured or historical client data to train/fine-tune custom algorithms for forecasting, classification, or anomaly detection

Collaborate and communicate: participate in client discovery sessions, translate business requirements into technical scopes, and demo prototypes to stakeholder teams

Maintain engineering excellence: conduct code reviews, implement validation patterns for AI outputs, and contribute templates or runbooks to internal knowledge base

Technical Skills Required

Python AWS Machine Learning

Benefits & Perks

Annual paid vacation: 20 days per year (increasing to 25 days)

Paid sick leave

10 national holidays

2 company days off

Well-being budget

Maternity/paternity leave

Professional development course reimbursement

Hardware provision

Positive engineering culture

Job Description

🔹 100% remote | 🌎 Global team | ⏳ Full-time

NineTwoThree AI Studio is a premier product design, engineering, and marketing firm specializing in custom AI, web, and mobile applications for established brands and funded startups. We are based in Massachusetts but with an American and European staff and a strong, collaborative remote culture.

We're a team that loves doing good work with great people. Our relatively small size keeps us fast and nimble. The wealth of knowledge, experience and talent paired with proven recipes and best practices allows us to find opportunities to help new products succeed.

With a portfolio of over 150 launched products over 13 years, NineTwoThree has garnered recognition as a top AI agency in the U.S., earning accolades such as inclusion in the Inc. 5000 list for four consecutive years and being named among the top 50 AI firms alongside industry leaders like Microsoft, NVIDIA, and IBM. We've built AI and ML tech for big brands like Consumer Reports, FanDuel, and Nara, as well as startups in legal tech, logistics, education, and more.

Role Overview

As an ML Engineer at NineTwoThree AI Studio, you will sit at the intersection of production-grade software engineering, advanced natural language processing, and client delivery. We build custom, high-impact AI systems for brands and startups across diverse industries (such as healthcare, logistics, and fintech).

Instead of siloed academic research, this role demands a product-minded builder. You will design, optimize, and deploy robust LLM applications, custom predictive analytics, and agentic workflows directly into our clients' software ecosystems, taking absolute ownership of features from prototype to production.

Technology Stack

Core Frameworks & Arch: Transformer models, modern LLM APIs (Anthropic Claude, OpenAI, AWS Bedrock, etc.), Open-Source LLMs
Orchestration & Agentic Design: Experience designing LLM workflows, agentic systems, or retrieval pipelines using frameworks such as Langchain, LangGraph, LlamaIndex, or equivalent approaches
Data & Search: Vector databases (Pinecone, pgvector, Milvus, Qdrant, etc.), SQL, and data engineering pipelines
Traditional ML: Supervised and Unsupervised learning (Classification, Regression, Anomaly Detection)
Cloud & Infrastructure: AWS (Lambda, SageMaker, Bedrock, EC2) and modern DevOps/retraining pipelines
Languages: Production-grade Python

Responsibilities

Architect & Build AI Features: Design and implement robust classical ML and generative AI solutions, striking the right balance between autonomous agentic architectures and deterministic pipelines
Evaluate: Design and maintain evaluation frameworks to measure AI quality, reliability, safety, and business impact before and after deployment
Integrate & Deploy: Partner closely with full-stack developers and DevOps to seamlessly integrate AI capabilities into client web and mobile applications using serverless architecture (e.g., AWS Lambda) or API endpoints

Interested in remote work opportunities in Development & Programming? Discover Development & Programming Remote Jobs featuring exclusive positions from top companies that offer flexible work arrangements.

Optimize for Production: Refine prompts, system instructions, and chunking strategies to balance accuracy, latency, token consumption, and data privacy
Traditional Predictive Analytics: Clean and process unstructured or historical client data to train/fine-tune custom algorithms for specific business problems (such as forecasting, classification, or anomaly detection)
Collaborate & Communicate: Actively participate in client discovery sessions, translate ambiguous business requirements into viable technical scopes, and demo prototypes directly to stakeholder teams
Maintain Engineering Excellence: Engage in constructive code reviews, implement rigorous validation patterns to test AI outputs, and contribute templates or runbooks to our internal AI knowledge base

Requirements

Requirements

Technical Experience

Proven Track Record: 3+ years of experience engineering software with a strong focus on machine learning and natural language processing
LLM & Generative AI Mastery: In-depth understanding of modern LLM architectures, context window mechanics, semantic search techniques, and the limitations of generative systems. Ability to identify when a deterministic solution is preferable to an LLM or agent-based solution
Production experience: Experience building and operating production AI systems, including monitoring, evaluation, debugging, and iterative improvement
Evaluation experience: Understanding of evaluation methodologies for LLM-based systems, including retrieval quality, hallucination detection, and task-specific performance measurement. Ability to reason about tradeoffs between quality, latency, cost, reliability, and engineering complexity
Python & SQL Proficiency: Exceptional Python coding skills and the ability to query, clean, and structure data efficiently
Cloud Infrastructure: Hands-on experience deploying ML or API services within cloud ecosystems, preferably AWS
Ownership: Comfortable taking ownership of ambiguous problems from initial discovery through production deployment and ongoing support

Product & Team Capabilities

Browse our curated collection of remote jobs across all categories and industries, featuring positions from top companies worldwide.

Ambiguity to Execution: Ability to drop into a completely new industry vertical, understand its data constraints, and spin up a working proof-of-concept within a few weeks
The "Product Engineer" Mindset: Passion for seeing things ship and understanding why something is being built from a business value standpoint, not just what is being built
Communication: Fluent written and spoken English. Comfortable interacting with client stakeholders and breaking down technical workflows into clear concepts
Adaptability: Eagerness to experiment with and evaluate fast-emerging AI development tools, models, and frameworks
Education: Bachelor's or Master's degree in Computer Science, Engineering, Data Science, or a related field (or equivalent practical experience)

Benefits

What We Offer

Annual paid vacation: 20 days off per year during the first 3 years, increasing to 25 days in later years
Paid sick leave, 10 national holidays, and 2 company days off
Well-being budget
Maternity/paternity leave
Reimbursement of expenses for professional development courses and certifications (up to 100% in agreement with Manager)
Hardware upon business needs
Strong positive engineering culture, a tightly-knit team of professionals with a good sense of humor

What's The Process

We value your time and ours and make the process fast and easy. Our interview process takes the following steps: a short interview with the HR, 2nd technical interview with ML Engineer and CTO (optional), 3rd live-coding interview, Offer.

Job Overview

Posted Date Jun 26, 2026

Employment Type Full-time

Experience Level Not Applicable

Location Spain

Category Programming

Company ninetwothree ai studio

Mentioned Skills

Similar Jobs

Explore other opportunities that match your interests

Senior Angular Developer

Programming

•

4h ago

Visa Sponsorship Relocation Remote

Job Type Full-time

Experience Level Not Applicable

Infortec Consultores

Spain

Senior Power Platform Developer - Spain

Programming

•

19h ago

Visa Sponsorship Relocation Remote

Job Type Full-time

Experience Level Associate

Jobgether

Spain

Senior Java Developer - Enterprise Data Platforms

Programming

•

1d ago

Visa Sponsorship Relocation Remote

Job Type Full-time

Experience Level Not Applicable

Jobgether

Spain

Senior ML Engineer - Generative AI & LLM Systems

Key Highlights

Key Responsibilities

Technical Skills Required

Benefits & Perks

Job Description

Job Overview

Mentioned Skills

Industries

Similar Jobs

Senior Angular Developer

Infortec Consultores

Senior Power Platform Developer - Spain

Jobgether

Senior Java Developer - Enterprise Data Platforms

Jobgether

Subscribe our newsletter