Datacentre Operations Engineering Lead

Trust In SODA β€’ Greater Paris Metropolitan Region
Relocation
Apply
AI Summary

Join a rapidly growing GPU-as-a-Service provider as a Datacentre Operations Engineering Lead. You will lead day-to-day datacentre operations and service delivery, supporting site commissioning and operational readiness activities for new infrastructure deployments. This role requires strong experience within Datacentre Operations, Infrastructure Operations, or Critical Environment Operations.

Key Highlights
Lead day-to-day datacentre operations and service delivery
Support site commissioning and operational readiness activities
Provide technical leadership and coaching to engineering teams
Key Responsibilities
Lead day-to-day datacentre operations and service delivery
Support site commissioning and operational readiness activities for new infrastructure deployments
Provide technical leadership and coaching to engineering teams
Manage critical incidents and drive rapid issue resolution
Establish and maintain operational procedures, standards, and runbooks
Technical Skills Required
NVIDIA Grace Blackwell architecture Large-scale GPU clusters AI and High-Performance Computing (HPC) infrastructure InfiniBand networking Enterprise storage and networking platforms
Benefits & Perks
Relocation support available
Competitive compensation
Significant autonomy and responsibility
Nice to Have
Experience supporting GPU-based infrastructure
NVIDIA ecosystem exposure
AI or High-Performance Computing (HPC) environments

Job Description


Datacentre Operations Engineering Lead |

πŸ“ Paris, France (On-site) | Relocation Support Available


A rapidly growing GPU-as-a-Service provider is seeking two experienced Datacentre Operations Engineering Leads to help deliver and operate one of Europe's most ambitious AI infrastructure deployments.


Based at the Data4 campus in Paris-Saclay, you will play a key role in the deployment and operation of an approximately 8,000 GPU NVIDIA Grace Blackwell cluster, supporting next-generation AI and High-Performance Computing (HPC) workloads. Representing a strategic investment of approximately €400 million, this flagship European deployment will form the foundation for future AI datacentre expansion across EMEA.


This is an opportunity to join a business operating at the forefront of GPU infrastructure, AI compute and large-scale datacentre operations, while taking ownership of a highly visible, mission-critical environment.


The Opportunity

As Datacentre Operations Engineering Lead, you will act as the senior technical authority on site, providing hands-on leadership across commissioning, operations, incident management, vendor engagement and service delivery.


Working alongside a team of Datacentre Operations Engineers, HPC SREs, Network Engineering and Infrastructure Strategy teams, you will establish operational standards, drive reliability, and ensure the successful transition from deployment through to production operations.

This role combines technical depth, operational ownership and leadership responsibility within a fast-paced and rapidly scaling organisation.


Key Responsibilities

  • Lead day-to-day datacentre operations and service delivery.
  • Support site commissioning and operational readiness activities for new infrastructure deployments.
  • Provide technical leadership, coaching and mentorship to engineering teams.
  • Manage critical incidents and drive rapid issue resolution.
  • Establish and maintain operational procedures, standards and runbooks.
  • Manage vendor relationships, support contracts and escalation processes.
  • Drive capacity planning across power, cooling and space requirements.
  • Oversee hardware lifecycle management, asset tracking and operational documentation.
  • Work closely with Infrastructure, HPC SRE, Network Engineering and Platform teams.
  • Ensure compliance with operational, security and governance requirements.
  • Support future EMEA datacentre expansion initiatives and operational improvements.


Technical Environment

  • NVIDIA Grace Blackwell architecture
  • Large-scale GPU clusters
  • AI and High-Performance Computing (HPC) infrastructure
  • InfiniBand networking
  • Enterprise storage and networking platforms
  • Large-scale datacentre operations
  • Capacity management and infrastructure planning
  • High-density compute environments


What We're Looking For

  • Strong experience within Datacentre Operations, Infrastructure Operations or Critical Environment Operations.
  • Previous leadership experience within mission-critical environments.
  • Proven ownership of operational performance, reliability and SLA delivery.
  • Hands-on experience supporting enterprise infrastructure, compute, networking and storage platforms.
  • Experience working within large-scale datacentre, colocation or hyperscale environments.
  • Strong understanding of incident management, operational processes and SLA-driven environments.
  • Experience managing third-party vendors and service providers.
  • Excellent troubleshooting and problem-solving skills.
  • Strong communication and stakeholder management capabilities.
  • Fluent French (minimum B2, ideally C1+) and English.
  • Existing right to work within France or the European Union.


Highly Desirable

  • Experience supporting GPU-based infrastructure.
  • NVIDIA ecosystem exposure.
  • AI or High-Performance Computing (HPC) environments.
  • InfiniBand networking experience.
  • Colocation or hyperscale datacentre experience.
  • Large-scale infrastructure deployment or commissioning projects.
  • Experience supporting multi-cage, multi-floor or high-density compute environments.
  • Experience with large-scale AI infrastructure deployments.


Why Apply?

  • Lead one of Europe's largest AI infrastructure deployments.
  • Work with cutting-edge NVIDIA Grace Blackwell technology.
  • Take ownership of an approximately 8,000 GPU AI cluster.
  • Play a foundational role within a rapidly scaling organisation.
  • Significant autonomy, responsibility and technical influence.
  • Exposure to next-generation AI, GPU and HPC technologies.
  • Opportunity to shape operational standards for future EMEA deployments.
  • Competitive compensation and relocation support available.


Additional Information

  • Full-time onsite role based in Paris-Saclay, France.
  • Shift and on-call participation may be required.
  • Permanent, contractor and EOR arrangements may be considered.
  • Relocation support available for suitable candidates.
  • Immediate hiring requirement, with deployment milestones scheduled throughout the year.


Interested in learning more? Apply today for a confidential discussion.


Similar Jobs

Explore other opportunities that match your interests

Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Mid-Senior level

Trust In SODA

Greater Paris Metropolitan Region
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Not Applicable

Raise

Canada
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Not Applicable

Torch Technologies, Inc.

United State

Subscribe our newsletter

New Things Will Always Update Regularly