Senior Python Engineer for AI Infrastructure Platform

ClearML Ireland
Remote
AI Summary

Design and maintain the client-side Python SDK for ClearML's AI infrastructure platform. Develop integrations with leading AI libraries and ensure seamless functionality across multiple platforms.

Key Highlights
Design and maintain the client-side Python SDK
Develop automatic instrumentation for code execution, parameters, and metrics
Build and maintain deep integrations with leading AI libraries
Manage the build lifecycle and create Python wheels and eggs
Troubleshoot complex client-side issues
Technical Skills Required
Python, PyTorch, TensorFlow, Keras, Hugging Face, vLLM, scikit-learn, metaclasses, descriptors, monkey patching, AST manipulation, Cython, PyBind11, GitHub Actions
Benefits & Perks
Fully-remote work
Open-source company
Personal growth and accountability
Opportunity to work on a high-impact project

Job Description


About the job


At ClearML, our mission is to make infrastructure management effortless across every phase of the AI lifecycle -- from building and training AI models to large-scale production. Our AI infrastructure platform is trusted by more than 2,000 organizations, and AI builders and IT teams use it to power everything from early-stage R&D to mission-critical public sector and enterprise-grade AI pipelines.


We’re growing quickly and looking for curious, self-driven individuals who are excited to shape the future of AI and the infrastructure that powers it. Our customers are tackling some of the world’s most important challenges -- revolutionizing healthcare, discovering new medicines, securing global finance, protecting national security, and preserving our planet’s ecosystems.


About the Role


At ClearML, our SDK is the bridge between data scientists and our infrastructure. We are seeking a Senior Python Engineer to work on clearml, our open-source client-side library. This is not a standard backend role; you won't be tuning databases. Instead, you will build the "magic" that allows ClearML to automatically track experiments, create wheels for PyPI, and ensure our code runs smoothly in diverse user environments.


You will dive deep into Python internals and build integrations with the latest AI frameworks (PyTorch, TensorFlow, vLLM, etc.). If you love metaprogramming, understanding how imports work under the hood, and packaging libraries for thousands of users, this role is for you.


ClearML is a fully remote, open-source company that promotes independence, personal growth, and accountability. We look for people who thrive in environments where the technology they develop quickly affects the work and productivity of people around the world. If you like getting quick feedback from real-world users, rapidly shipping features and technologies to production, and seeing the difference you make, ClearML is the place for you!


Responsibilities


  • Core SDK Development: Design and maintain the client-side Python SDK, ensuring high performance and backward compatibility across Python versions.
  • Automatic Instrumentation: Develop logic to automatically capture code execution, parameters, and metrics using advanced Python techniques (monkey patching, decorators, introspection).
  • ML Framework Integration: Build and maintain deep integrations with leading AI libraries (PyTorch, TensorFlow, Keras, Hugging Face, vLLM, scikit-learn).
  • Packaging & Distribution: Manage the build lifecycle, create and optimize Python wheels and eggs, and manage releases to PyPI and Conda.
  • Cross-Platform Reliability: Ensure the SDK functions seamlessly across Linux, macOS, and Windows environments.
  • Debug User Environments: Troubleshoot complex client-side issues related to multiprocessing, serialization (pickle/dill), and memory management.


Requirements


  • 8+ years of professional software development experience.
  • Deep Python Expertise: You understand metaclasses, descriptors, and how Python handles imports and memory.
  • Packaging Experience: Proven experience building and packaging Python libraries (setup.py, poetry, building wheels) and publishing to public repositories (PyPI).
  • Runtime Modification: Strong familiarity with monkey patching, AST manipulation, or writing custom import hooks.
  • AI Ecosystem Knowledge: Familiarity with the internals of frameworks such as PyTorch, TensorFlow, vLLM, or llama.cpp.
  • Collaboration: Experience working in a remote, distributed team and contributing to code reviews.
  • Excellent communication skills and ability to work independently or as part of a team.


Advantages



  • Active contributor to major open-source Python projects.
  • Experience with C-extensions (Cython, PyBind11).
  • Experience with CI/CD pipelines for library publishing (GitHub Actions).


