Seeking a Senior Systems Operations Engineer to lead cloud infrastructure design, management, and scaling on AWS. Responsibilities include advancing IaC, integrating AI tools, and ensuring platform reliability and cost efficiency. Requires 5+ years of experience in cloud infrastructure, AWS, Kubernetes, and IaC tools like Terraform/OpenTofu. This fully remote role focuses on building scalable, secure, and resilient systems for a digital music distribution platform.
Key Highlights
Key Responsibilities
Technical Skills Required
Benefits & Perks
Job Description
About Our Client
The organization operates in the digital music distribution industry, providing a platform that distributes music to major streaming services including Spotify, Apple Music, and YouTube. It addresses the challenge of efficiently delivering new music releases to global audiences by managing a large-scale cloud infrastructure that supports a high volume of music distribution.
The platform plays a central role in the music ecosystem, facilitating the majority of new music releases today through its technology.
About the Opportunity
The Senior Systems Operations Engineer is a key technical leader within the Systems Operations team, responsible for designing, managing, and scaling the cloud infrastructure that supports the organization's platform. This role focuses on advancing infrastructure-as-code practices, integrating AI-enhanced operational tools, and improving reliability and cost efficiency.
The position is fully remote and requires cross-team collaboration to ensure scalable, secure, and resilient infrastructure that supports the company's strategic objectives.
Responsibilities
- Design and manage scalable, highly available cloud infrastructure on AWS
- Develop and maintain disaster recovery plans using AWS backup and replication features
- Collaborate with engineering and security teams to improve infrastructure health and scalability
- Design reusable Terraform/OpenTofu modules and lead IaC migration and adoption
- Implement IaC testing strategies and manage Bitbucket pipelines for multi-environment deployments
- Integrate AI tools to enhance monitoring, incident response, and automation
- Define service level objectives, lead incident response, and conduct blameless postmortems
- Implement chaos engineering and build monitoring solutions with CloudWatch and Datadog
- Develop automation scripts to reduce manual work
- Build and lead the implementation of an Internal Developer Portal to improve developer experience
- Drive cost optimization initiatives and monitor AWS resource usage
- Lead infrastructure projects, communicate strategic impact, and mentor junior engineers
- Maintain infrastructure documentation and operational runbooks
Interested in remote work opportunities in Devops? Discover Devops Remote Jobs featuring exclusive positions from top companies that offer flexible work arrangements.
Requirements
- Bachelor's degree in Computer Science, IT, or a related field, or equivalent experience
- 5+ years in systems operations, platform engineering, or DevOps focused on cloud infrastructure and containers
- Proven production experience with AWS services and Kubernetes
- 5+ years of hands-on experience with Infrastructure as Code tools, especially Terraform or OpenTofu
- Strong Linux/Unix administration and shell scripting skills
- Proficiency in Python, Go, or similar languages
- Experience with CI/CD pipelines for infrastructure deployments (e.g., Bitbucket Pipelines, Jenkins)
- Experience with monitoring and observability tools such as Prometheus, Grafana, CloudWatch, or Datadog
Browse our curated collection of remote jobs across all categories and industries, featuring positions from top companies worldwide.
Compensation
The pay range and compensation package for this role will be determined based on the candidate's experience, skills, and other relevant factors.
Equal Opportunity Statement
Our client is an equal opportunity employer. They celebrate diversity and are committed to creating an inclusive environment for all employees. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, or national origin.
Note: RemoteHunter is not the Employer of Record (EOR) for this role. Our purpose is to connect exceptional candidates with leading employers. We help job seekers worldwide discover roles that match their goals and guide them to complete their full application directly through the hiring company's career page or ATS.
Similar Jobs
Explore other opportunities that match your interests
ActiveSoft, Inc
Jobgether
Senior AWS DevOps Engineer (Cloud Engineer)