Applied Research Engineer - Video Understanding
We are seeking an Applied Research Engineer to build high-performance pipelines and infrastructure to understand video with precision at internet scale. The role requires 5+ years of experience in computer vision or audio processing, strong Python skills, and hands-on experience with PyTorch. The ideal candidate will have a strong ownership mindset, clear communication skills, and experience building large-scale multimodal systems.
Key Highlights
Key Responsibilities
Technical Skills Required
Benefits & Perks
Nice to Have
Job Description
Dear applicants, please keep in mind that applications without provided salary expectations and active LN profile will not be considered.
Hope for your understanding.
Location: San Francisco, CA
Employment Type: Full-Time ONSITE
Visa Sponsorship: H-1B, O-1, OPT supported
We are an AI research lab focused exclusively on video data. Video represents the dominant digital medium globally — powering creativity, communication, gaming, AR/VR, robotics, and beyond. The biggest bottleneck in advancing these systems is high-quality training data at scale.
Our team combines:
- Exabyte-scale video infrastructure
- Novel video understanding techniques
- Large-scale multimodal datasets
As an Applied Research Engineer, you will build high-performance pipelines and infrastructure to understand video with precision at internet scale.
This role sits between research and production:
Searching for Development & Programming roles that provide visa sponsorship? Connect with international employers through Development & Programming Jobs with Visa Sponsorship opportunities actively seeking talented professionals.
- Not purely academic research
- Not pure infrastructure engineering
- You will work on ambiguous, open-ended problems in:
- Computer Vision
- Audio Processing
- Multimodal (video + text + audio) systems
What You’ll Do
- Build scalable pipelines for video understanding
- Work with large models and APIs, optimizing inference performance
- Apply pre- and post-processing techniques to improve model precision
- Implement parallelization, pipelining, and inference optimization strategies
- Occasionally fine-tune models where needed
- Break down customer-level requirements into technical building blocks
- Write clean, production-ready Python code
- Collaborate with customers and external research teams
- Contribute to the evolution of next-generation video datasets
Explore our comprehensive directory of visa sponsorship jobs from employers worldwide who are ready to sponsor talented international professionals.
- 5+ years experience in computer vision or audio processing
- Strong Python skills
- Hands-on experience with PyTorch (or similar ML frameworks)
- Experience working with large models or model APIs
- Ability to optimize inference pipelines
- Clear communication skills (technical + external stakeholders)
- Strong ownership mindset
- In-person presence in San Francisco
- Experience building large-scale multimodal systems
- Startup experience (early hire)
- Open-source contributions
- Published research (bonus, not required)
- Demonstrated performance optimization work
- Passion for video / media technologies
Interested in opportunities specifically in United State? Discover our dedicated Visa Sponsorship Jobs in United State page featuring roles from top employers in this location.
- Initial Screen
- Technical Discussion with CTO
- Deep Technical Interview
- Conversation with CEO
- On-site
- Offer
Similar Jobs
Explore other opportunities that match your interests
pulserise technologies
snaplii
Engineering Manager - AI Agents