Senior AI Engineer - Voice AI and Browser Automation

karumi (yc f25) • United State

Visa Sponsorship

Apply

AI Summary

Join our US-based AI engineering team to build cutting-edge voice AI and browser automation systems. Work with large language models and design voice experiences. Contribute to real-time AI pipelines with a focus on reliability and innovation.

Key Highlights

Build and optimize voice AI systems using speech-to-text and text-to-speech models

Design browser agents that navigate, understand, and interact with web applications

Implement browser automation with computer vision and DOM understanding

Engineer prompt systems and LLM workflows for consistent, intelligent behavior

Integrate multimodal AI - combining voice, vision, and language understanding

Technical Skills Required

Python LLMs (OpenAI, Anthropic, or open-source models) Speech AI (STT/TTS systems like Deepgram, ElevenLabs, Whisper) Browser automation (Playwright, Puppeteer, Selenium) Computer vision Async programming and real-time systems

Benefits & Perks

Meaningful equity stake in a backed, fast-growing company

Visa sponsorship available

Gym

Job Description

The Opportunity

Join our AI engineering team in the US to build the core intelligence behind our platform. You'll work at the intersection of voice AI, browser automation, and large language models - creating agents that can listen, speak, navigate interfaces, and interact naturally with users in real-time.

This role combines cutting-edge AI with practical systems work. You'll design voice experiences, build browser agents that understand and control web applications, and optimize LLM behavior for production reliability. We ship working AI features that solve real problems, balancing innovation with pragmatic constraints.

We sponsor visas for qualified candidates.

Core Responsibilities

Build and optimize voice AI systems using speech-to-text and text-to-speech models
Design browser agents that navigate, understand, and interact with web applications
Implement browser automation with computer vision and DOM understanding
Engineer prompt systems and LLM workflows for consistent, intelligent behavior
Create evaluation frameworks to measure voice quality, agent accuracy, and user experience
Integrate multimodal AI - combining voice, vision, and language understanding
Build real-time AI pipelines where latency and reliability are critical
Monitor and improve AI system performance in production environments

Technical Requirements

Production experience with LLMs (OpenAI, Anthropic, or open-source models)
Hands-on work with speech AI (STT/TTS systems like Deepgram, ElevenLabs, Whisper)
Experience with browser automation (Playwright, Puppeteer, Selenium) or computer vision
Strong Python skills with async programming and real-time systems
Understanding of prompt engineering, retrieval systems, and agent frameworks
Ability to debug complex AI behaviors and build observability tools
Software engineering fundamentals for production AI systems

Nice to Have

Experience building autonomous agents or multi-step AI workflows
Knowledge of computer vision for UI understanding and visual grounding
Fine-tuning or training language models for specialized tasks
Real-time audio processing and streaming architectures
Background in NLP, machine learning research, or AI systems

Why Karumi

Meaningful equity stake in a backed, fast-growing company\
Work on cutting-edge voice AI and browser agents in production\
Shape how AI systems interact with users and software interfaces\
Small team with direct impact on core product capabilities\
Gym\
Visa sponsorship available

Job Overview

Posted Date Dec 04, 2025

Employment Type Full-time

Experience Level Entry level

Location United State

Category Machine Learning

Company karumi (yc f25)

Senior AI Engineer - Voice AI and Browser Automation

Key Highlights

Technical Skills Required

Benefits & Perks

Job Description

Job Overview

Mentioned Skills

Industries

Senior AI Engineer - Voice AI and Browser Automation

Key Highlights

Technical Skills Required

Benefits & Perks

Job Description

Job Overview

Mentioned Skills

Industries

Subscribe our newsletter