Senior GCP DevOps HPC Engineer

Remote
Apply
AI Summary

Seeking a Senior GCP DevOps HPC Engineer to lead cloud-based HPC environments. Responsibilities include migrating on-prem SLURM clusters to GCP, designing scalable architectures, and optimizing high-performance workloads. Requires 5+ years of HPC experience and strong GCP, Terraform, and Ansible skills.

Key Highlights
Lead SLURM HPC cluster migrations from on-prem to GCP.
Design, build, and operate secure, scalable HPC architectures in GCP.
Optimize SLURM scheduling, workload performance, and resource utilization.
Technical Skills Required
GCP DevOps HPC SLURM MPI OpenMP Terraform Ansible Python Bash Spack GCE VPC Cloud Storage Singularity Docker
Benefits & Perks
Fully remote role
Collaborative, engineering-led culture
Strong technical ownership

Job Description


GCP DevOps HPC Engineer (Senior)

About the Role

We’re hiring a Senior GCP DevOps HPC Engineer to join a high-performing engineering team working on large-scale, cloud-based HPC environments. This role is ideal for an experienced HPC engineer who enjoys leading complex migrations, designing scalable architectures, and optimising high-performance workloads in Google Cloud Platform (GCP).


You’ll take ownership of migrating on-prem SLURM HPC clusters to GCP, while acting as a technical authority across HPC, DevOps, and cloud infrastructure.


What You’ll Be Doing

  • Lead end-to-end migrations of SLURM-based HPC clusters from on-prem to GCP
  • Design, build, and operate secure, scalable HPC architectures in the cloud
  • Optimise SLURM scheduling, workload performance, and resource utilisation
  • Automate cluster deployment and operations using Terraform, Ansible, Python, and Bash
  • Manage HPC software stacks using Spack
  • Deploy and support parallel workloads using MPI, OpenMP, and related frameworks
  • Troubleshoot performance issues and drive continuous optimisation
  • Collaborate with engineering teams and stakeholders in a fully remote environment


What We’re Looking For

Essential

  • 5+ years’ experience in HPC environments (SLURM, MPI, parallel workloads)
  • Strong Linux systems expertise in performance-critical environments
  • Hands-on experience running or migrating HPC workloads in the cloud (GCP preferred)
  • Solid experience with Terraform and Ansible
  • Strong scripting skills (Python, Bash)
  • Deep understanding of GCP services (GCE, VPC, Cloud Storage)


Nice to Have

  • GCP certifications (DevOps / Cloud Engineer)
  • Experience with Preemptible VMs and cloud cost-optimisation strategies
  • HPC performance profiling and debugging tools
  • Containers in HPC (Singularity, Docker)
  • Exposure to Spark or big data tooling


Why Apply

  • Work on complex, high-impact HPC systems at scale
  • Influence architecture and technical decisions
  • Fully remote role based in Spain
  • Collaborative, engineering-led culture with strong technical ownership


Interested? Apply directly or message me to learn more.


Similar Jobs

Explore other opportunities that match your interests

Director of Solutions Engineering, EMEA

Devops
1d ago

Premium Job

Sign up is free! Login or Sign up to view full details.

•••••• •••••• ••••••
Job Type ••••••
Experience Level ••••••

Grafana Labs

Spain

Senior DataOps Engineer

Devops
4d ago

Premium Job

Sign up is free! Login or Sign up to view full details.

•••••• •••••• ••••••
Job Type ••••••
Experience Level ••••••

dLocal

Spain
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Mid-Senior level

smartcore ag

Spain

Subscribe our newsletter

New Things Will Always Update Regularly