Senior Site Reliability Engineer (Cloud Infrastructure)

Agility Partners United State
Remote
Apply
AI Summary

The Senior Site Reliability Engineer will design, implement, and maintain reliable, scalable cloud infrastructure within a modernizing banking environment. Responsibilities include automation, deployment workflows, observability, and engineering support systems across distributed cloud-native platforms. The ideal candidate brings strong cloud, infrastructure-as-code, containerization, and troubleshooting expertise.

Key Highlights
Build and maintain reliable, scalable cloud infrastructure
Automate deployment workflows and improve observability
Collaborate with engineering teams to troubleshoot production issues
Key Responsibilities
Maintain cloud infrastructure across AWS, GCP, or Azure using Terraform, Ansible, and related automation tools
Support and enhance Kubernetes, Docker, and containerized application environments to improve deployment reliability and scalability
Strengthen monitoring, logging, alerting, and incident response processes across distributed systems
Troubleshoot production issues within cloud platforms, networks, and applications in collaboration with engineering teams
Drive automation initiatives and platform-wide reliability improvements
Technical Skills Required
AWS GCP Azure Terraform Ansible Kubernetes Docker Go Python Bash JavaScript Prometheus Grafana ELK/EFK CloudWatch
Benefits & Perks
Fully remote work

Job Description


**FULLY REMOTE**


Summary:

The Site Reliability Engineer will help build and maintain reliable, scalable cloud infrastructure within a modernizing banking environment. This role focuses on automation, deployment workflows, observability, and engineering support systems across distributed cloud‑native platforms. The ideal candidate brings strong cloud, infrastructure‑as‑code, containerization, and troubleshooting expertise, with opportunities to influence platform maturity and reliability strategy.


Responsibilities:

  • Build and maintain cloud infrastructure across AWS, GCP, or Azure using Terraform, Ansible, and related automation tools.
  • Support and enhance Kubernetes, Docker, and containerized application environments to improve deployment reliability and scalability.
  • Strengthen monitoring, logging, alerting, and incident response processes across distributed systems.
  • Troubleshoot production issues within cloud platforms, networks, and applications in collaboration with engineering teams.
  • Drive automation initiatives and platform-wide reliability improvements.


Qualifications:

  • 5+ years of experience in Site Reliability Engineering, Systems Engineering, DevOps, Software Engineering, or Platform Engineering.
  • Experience with AWS, GCP, or Azure cloud environments.
  • Hands-on experience with Terraform at an intermediate level or higher.
  • Strong scripting or programming experience in Go, Python, Bash, JavaScript, or similar languages.
  • Proficiency with Docker, Kubernetes, and containerized application platforms.
  • Strong troubleshooting capabilities in distributed and production systems.
  • Familiarity with monitoring and observability tools such as Prometheus, Grafana, ELK/EFK, CloudWatch, or similar.


Reasons to Love It:

  • Opportunity to influence platform reliability and cloud maturity across a modernizing enterprise environment.
  • Work closely with engineering and cross-functional teams on meaningful, production-impacting initiatives.
  • Stable long-term opportunity within a highly visible infrastructure and platform organization.
  • Hands-on exposure to modern cloud-native tools, automation practices, and scalable architectures.


**FULLY REMOTE**


Similar Jobs

Explore other opportunities that match your interests

Amazon Connect Engineer

Devops
4h ago
Visa Sponsorship Relocation Remote
Job Type Contract
Experience Level Entry level

Oliver James

United State
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Mid-Senior level

ocho

United State

Cloud Engineer III

Devops
5h ago
Visa Sponsorship Relocation Remote
Job Type Contract
Experience Level Mid-Senior level

net2source (n2s)

United State

Subscribe our newsletter

New Things Will Always Update Regularly