Senior Systems Operations Engineer (Remote)

remotehunter • United State
Remote
Apply
AI Summary

Seeking a Senior Systems Operations Engineer to lead cloud infrastructure design, management, and scaling on AWS. Responsibilities include advancing IaC, integrating AI tools, and ensuring platform reliability and cost efficiency. Requires 5+ years of experience in cloud infrastructure, AWS, Kubernetes, and IaC tools like Terraform/OpenTofu. This fully remote role focuses on building scalable, secure, and resilient systems for a digital music distribution platform.

Key Highlights
Lead cloud infrastructure design, management, and scaling on AWS.
Advance Infrastructure-as-Code (IaC) practices and integrate AI-enhanced operational tools.
Ensure platform reliability, cost efficiency, and developer experience through automation and internal developer portals.
Key Responsibilities
Design and manage scalable, highly available cloud infrastructure on AWS.
Develop and maintain disaster recovery plans using AWS backup and replication features.
Collaborate with engineering and security teams to improve infrastructure health and scalability.
Design reusable Terraform/OpenTofu modules and lead IaC migration and adoption.
Implement IaC testing strategies and manage Bitbucket pipelines for multi-environment deployments.
Integrate AI tools to enhance monitoring, incident response, and automation.
Define service level objectives, lead incident response, and conduct blameless postmortems.
Implement chaos engineering and build monitoring solutions with CloudWatch and Datadog.
Develop automation scripts to reduce manual work.
Build and lead the implementation of an Internal Developer Portal to improve developer experience.
Drive cost optimization initiatives and monitor AWS resource usage.
Lead infrastructure projects, communicate strategic impact, and mentor junior engineers.
Maintain infrastructure documentation and operational runbooks.
Technical Skills Required
AWS Kubernetes Terraform OpenTofu Linux/Unix administration Shell scripting Python Go Bitbucket Pipelines Jenkins Prometheus Grafana CloudWatch Datadog
Benefits & Perks
Fully remote

Job Description


About Our Client

The organization operates in the digital music distribution industry, providing a platform that distributes music to major streaming services including Spotify, Apple Music, and YouTube. It addresses the challenge of efficiently delivering new music releases to global audiences by managing a large-scale cloud infrastructure that supports a high volume of music distribution.

The platform plays a central role in the music ecosystem, facilitating the majority of new music releases today through its technology.


About the Opportunity

The Senior Systems Operations Engineer is a key technical leader within the Systems Operations team, responsible for designing, managing, and scaling the cloud infrastructure that supports the organization's platform. This role focuses on advancing infrastructure-as-code practices, integrating AI-enhanced operational tools, and improving reliability and cost efficiency.

The position is fully remote and requires cross-team collaboration to ensure scalable, secure, and resilient infrastructure that supports the company's strategic objectives.


Responsibilities

  • Design and manage scalable, highly available cloud infrastructure on AWS
  • Develop and maintain disaster recovery plans using AWS backup and replication features
  • Collaborate with engineering and security teams to improve infrastructure health and scalability
  • Design reusable Terraform/OpenTofu modules and lead IaC migration and adoption
  • Implement IaC testing strategies and manage Bitbucket pipelines for multi-environment deployments
  • Integrate AI tools to enhance monitoring, incident response, and automation
  • Define service level objectives, lead incident response, and conduct blameless postmortems
  • Implement chaos engineering and build monitoring solutions with CloudWatch and Datadog
  • Develop automation scripts to reduce manual work
  • Build and lead the implementation of an Internal Developer Portal to improve developer experience
  • Drive cost optimization initiatives and monitor AWS resource usage
  • Lead infrastructure projects, communicate strategic impact, and mentor junior engineers
  • Maintain infrastructure documentation and operational runbooks


Requirements

  • Bachelor's degree in Computer Science, IT, or a related field, or equivalent experience
  • 5+ years in systems operations, platform engineering, or DevOps focused on cloud infrastructure and containers
  • Proven production experience with AWS services and Kubernetes
  • 5+ years of hands-on experience with Infrastructure as Code tools, especially Terraform or OpenTofu
  • Strong Linux/Unix administration and shell scripting skills
  • Proficiency in Python, Go, or similar languages
  • Experience with CI/CD pipelines for infrastructure deployments (e.g., Bitbucket Pipelines, Jenkins)
  • Experience with monitoring and observability tools such as Prometheus, Grafana, CloudWatch, or Datadog


Compensation

The pay range and compensation package for this role will be determined based on the candidate's experience, skills, and other relevant factors.


Equal Opportunity Statement

Our client is an equal opportunity employer. They celebrate diversity and are committed to creating an inclusive environment for all employees. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, or national origin.


Note: RemoteHunter is not the Employer of Record (EOR) for this role. Our purpose is to connect exceptional candidates with leading employers. We help job seekers worldwide discover roles that match their goals and guide them to complete their full application directly through the hiring company's career page or ATS.


Similar Jobs

Explore other opportunities that match your interests

Senior Data Engineer

Devops
•
3h ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Mid-Senior level

ActiveSoft, Inc

United State
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Not Applicable

Jobgether

United State

Senior AWS DevOps Engineer (Cloud Engineer)

Devops
•
23h ago

Premium Job

Sign up is free! Login or Sign up to view full details.

•••••• •••••• ••••••
Job Type ••••••
Experience Level ••••••

hirenza

United State

Subscribe our newsletter

New Things Will Always Update Regularly