Seeking a Senior GCP DevOps HPC Engineer to lead work on cloud-based HPC environments. Responsibilities include migrating on-prem SLURM clusters to GCP, designing scalable architectures, and optimising high-performance workloads. Requires 5+ years of HPC experience and strong GCP, Terraform, and Ansible skills.
Job Description
GCP DevOps HPC Engineer (Senior)
About the Role
We’re hiring a Senior GCP DevOps HPC Engineer to join a high-performing engineering team working on large-scale, cloud-based HPC environments. This role is ideal for an experienced HPC engineer who enjoys leading complex migrations, designing scalable architectures, and optimising high-performance workloads in Google Cloud Platform (GCP).
You’ll take ownership of migrating on-prem SLURM HPC clusters to GCP, while acting as a technical authority across HPC, DevOps, and cloud infrastructure.
What You’ll Be Doing
- Lead end-to-end migrations of SLURM-based HPC clusters from on-prem to GCP
- Design, build, and operate secure, scalable HPC architectures in the cloud
- Optimise SLURM scheduling, workload performance, and resource utilisation
- Automate cluster deployment and operations using Terraform, Ansible, Python, and Bash
- Manage HPC software stacks using Spack
- Deploy and support parallel workloads using MPI, OpenMP, and related frameworks
- Troubleshoot performance issues and drive continuous optimisation
- Collaborate with engineering teams and stakeholders in a fully remote environment
What We’re Looking For
Essential
- 5+ years’ experience in HPC environments (SLURM, MPI, parallel workloads)
- Strong Linux systems expertise in performance-critical environments
- Hands-on experience running or migrating HPC workloads in the cloud (GCP preferred)
- Solid experience with Terraform and Ansible
- Strong scripting skills (Python, Bash)
- Deep understanding of GCP services (GCE, VPC, Cloud Storage)
Nice to Have
- GCP certifications (DevOps / Cloud Engineer)
- Experience with Preemptible (Spot) VMs and cloud cost-optimisation strategies
- HPC performance profiling and debugging tools
- Containers in HPC (Singularity, Docker)
- Exposure to Spark or big data tooling
Why Apply
- Work on complex, high-impact HPC systems at scale
- Influence architecture and technical decisions
- Fully remote role based in Spain
- Collaborative, engineering-led culture with strong technical ownership
Interested? Apply directly or message me to learn more.