Lead Machine Learning Systems Engineer

neuramorphic United State
Relocation
Apply
AI Summary

Design and optimize machine learning inference pipelines for edge devices. Develop high-performance runtimes for GPU and ARM architectures. Work closely with research and platform teams to deploy production-ready AI systems.

Key Highlights
Design and optimize machine learning inference pipelines
Develop high-performance runtimes for GPU and ARM architectures
Work closely with research and platform teams
Key Responsibilities
Design and optimize machine learning inference pipelines for edge devices
Develop high-performance runtimes for GPU and ARM architectures
Optimize deep learning models for ultra-low-latency inference
Integrate model pipelines with frameworks such as PyTorch
Improve memory efficiency, GPU utilization, and power consumption
Work closely with the research and platform teams to deploy production-ready AI systems
Technical Skills Required
C++ Python GPU programming (CUDA) PyTorch TensorFlow ONNX TensorRT ML compilers Linux-based systems
Benefits & Perks
Relocation assistance
Nice to Have
Experience with edge AI or embedded systems
Experience with ARM architectures or NVIDIA Jetson devices
Experience building ML runtimes or compilers
Background in robotics, computer vision, or real-time AI systems

Job Description


About Neuramorphic


Neuramorphic is building the infrastructure for the next generation of intelligent systems. Our NeuraTensor™ SDK enables developers to run AI models directly on edge devices such as cameras, robots, and industrial systems—without cloud latency or privacy risks.

From autonomous vehicles to industrial robotics, our technology allows real-time AI inference where milliseconds matter.

We are looking for a Lead ML Systems Engineer to help design and optimize the core inference engine powering the NeuraTensor platform.


Responsibilities

  • Design and optimize machine learning inference pipelines for edge devices
  • Develop high-performance runtimes for GPU and ARM architectures
  • Optimize deep learning models for ultra-low-latency inference
  • Integrate model pipelines with frameworks such as PyTorch
  • Improve memory efficiency, GPU utilization, and power consumption
  • Work closely with the research and platform teams to deploy production-ready AI systems


Requirements

  • 5+ years of experience in machine learning systems, AI infrastructure, or high-performance computing
  • Strong programming skills in C++ and Python
  • Experience with GPU programming (CUDA preferred)
  • Experience with ML frameworks such as PyTorch or TensorFlow
  • Familiarity with ONNX, TensorRT, or ML compilers
  • Experience working with Linux-based systems


Nice to Have

  • Experience with edge AI or embedded systems
  • Experience with ARM architectures or NVIDIA Jetson devices
  • Experience building ML runtimes or compilers
  • Background in robotics, computer vision, or real-time AI systems


Job Types: Full-time, Part-time


Benefits:

  • Relocation assistance


Application Question(s):

  • Do you have professional experience with C++ for performance-critical systems?
  • Work Location: In person

Similar Jobs

Explore other opportunities that match your interests

Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Mid-Senior level

Eaton

United State

Senior Systems Engineer - Guidance, Navigation, and Control (GNC)

Programming
1h ago

Premium Job

Sign up is free! Login or Sign up to view full details.

•••••• •••••• ••••••
Job Type ••••••
Experience Level ••••••

fetchjobs.co

United State
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Mid-Senior level

presales collective

United State

Subscribe our newsletter

New Things Will Always Update Regularly