Senior ML Engineer - Generative AI & LLM Systems

Remote
AI Summary

Design, optimize, and deploy production-grade LLM applications and predictive analytics for global clients. Architect robust AI features, evaluation frameworks, and agentic workflows using modern cloud infrastructure. Requires 3+ years of ML engineering experience with Python and AWS, and a strong product delivery mindset.

Key Highlights
Production LLM and generative AI systems for enterprise clients
Full-stack integration with serverless architecture (AWS Lambda)
Ownership from prototype to production with ambiguous problem solving
Evaluation frameworks for AI quality, safety, and business impact
Technical Skills Required
Python, AWS, Machine Learning

Job Description


🔹 100% remote | 🌎 Global team | ⏳ Full-time

NineTwoThree AI Studio is a premier product design, engineering, and marketing firm specializing in custom AI, web, and mobile applications for established brands and funded startups. We are based in Massachusetts, with American and European staff and a strong, collaborative remote culture.

We're a team that loves doing good work with great people. Our relatively small size keeps us fast and nimble. Our wealth of knowledge, experience, and talent, paired with proven recipes and best practices, allows us to find opportunities to help new products succeed.

With a portfolio of over 150 launched products over 13 years, NineTwoThree has garnered recognition as a top AI agency in the U.S., earning accolades such as inclusion in the Inc. 5000 list for four consecutive years and being named among the top 50 AI firms alongside industry leaders like Microsoft, NVIDIA, and IBM. We've built AI and ML tech for big brands like Consumer Reports, FanDuel, and Nara, as well as startups in legal tech, logistics, education, and more.

Role Overview

As an ML Engineer at NineTwoThree AI Studio, you will sit at the intersection of production-grade software engineering, advanced natural language processing, and client delivery. We build custom, high-impact AI systems for brands and startups across diverse industries (such as healthcare, logistics, and fintech).

Instead of siloed academic research, this role demands a product-minded builder. You will design, optimize, and deploy robust LLM applications, custom predictive analytics, and agentic workflows directly into our clients' software ecosystems, taking absolute ownership of features from prototype to production.

Technology Stack

  • Core Frameworks & Arch: Transformer models, modern LLM APIs (Anthropic Claude, OpenAI, AWS Bedrock, etc.), Open-Source LLMs
  • Orchestration & Agentic Design: Experience designing LLM workflows, agentic systems, or retrieval pipelines using frameworks such as LangChain, LangGraph, LlamaIndex, or equivalent approaches
  • Data & Search: Vector databases (Pinecone, pgvector, Milvus, Qdrant, etc.), SQL, and data engineering pipelines
  • Traditional ML: Supervised and Unsupervised learning (Classification, Regression, Anomaly Detection)
  • Cloud & Infrastructure: AWS (Lambda, SageMaker, Bedrock, EC2) and modern DevOps/retraining pipelines
  • Languages: Production-grade Python

Responsibilities

  • Architect & Build AI Features: Design and implement robust classical ML and generative AI solutions, striking the right balance between autonomous agentic architectures and deterministic pipelines
  • Evaluate: Design and maintain evaluation frameworks to measure AI quality, reliability, safety, and business impact before and after deployment
  • Integrate & Deploy: Partner closely with full-stack developers and DevOps to seamlessly integrate AI capabilities into client web and mobile applications using serverless architecture (e.g., AWS Lambda) or API endpoints
  • Optimize for Production: Refine prompts, system instructions, and chunking strategies to balance accuracy, latency, token consumption, and data privacy
  • Traditional Predictive Analytics: Clean and process unstructured or historical client data to train/fine-tune custom algorithms for specific business problems (such as forecasting, classification, or anomaly detection)
  • Collaborate & Communicate: Actively participate in client discovery sessions, translate ambiguous business requirements into viable technical scopes, and demo prototypes directly to stakeholder teams
  • Maintain Engineering Excellence: Engage in constructive code reviews, implement rigorous validation patterns to test AI outputs, and contribute templates or runbooks to our internal AI knowledge base

Requirements

Technical Experience

  • Proven Track Record: 3+ years of experience engineering software with a strong focus on machine learning and natural language processing
  • LLM & Generative AI Mastery: In-depth understanding of modern LLM architectures, context window mechanics, semantic search techniques, and the limitations of generative systems. Ability to identify when a deterministic solution is preferable to an LLM or agent-based solution
  • Production experience: Experience building and operating production AI systems, including monitoring, evaluation, debugging, and iterative improvement
  • Evaluation experience: Understanding of evaluation methodologies for LLM-based systems, including retrieval quality, hallucination detection, and task-specific performance measurement. Ability to reason about tradeoffs between quality, latency, cost, reliability, and engineering complexity
  • Python & SQL Proficiency: Exceptional Python coding skills and the ability to query, clean, and structure data efficiently
  • Cloud Infrastructure: Hands-on experience deploying ML or API services within cloud ecosystems, preferably AWS
  • Ownership: Comfortable taking ownership of ambiguous problems from initial discovery through production deployment and ongoing support

Product & Team Capabilities

  • Ambiguity to Execution: Ability to drop into a completely new industry vertical, understand its data constraints, and spin up a working proof-of-concept within a few weeks
  • The "Product Engineer" Mindset: Passion for seeing things ship and understanding why something is being built from a business value standpoint, not just what is being built
  • Communication: Fluent written and spoken English. Comfortable interacting with client stakeholders and breaking down technical workflows into clear concepts
  • Adaptability: Eagerness to experiment with and evaluate fast-emerging AI development tools, models, and frameworks
  • Education: Bachelor's or Master's degree in Computer Science, Engineering, Data Science, or a related field (or equivalent practical experience)

Benefits

What We Offer

  • Annual paid vacation: 20 days off per year during the first 3 years, increasing to 25 days in later years
  • Paid sick leave, 10 national holidays, and 2 company days off
  • Well-being budget
  • Maternity/paternity leave
  • Reimbursement of expenses for professional development courses and certifications (up to 100%, subject to agreement with your manager)
  • Hardware provided according to business needs
  • Strong positive engineering culture, a tightly-knit team of professionals with a good sense of humor

What's The Process

We value your time and ours, so we keep the process fast and easy. Our interview process has the following steps: (1) a short interview with HR, (2) a technical interview with an ML Engineer and the CTO (optional), (3) a live-coding interview, and (4) an offer.
