Senior Site Reliability Engineer (Remote)

Jobgether • United Arab Emirates
Remote
Apply
AI Summary

Seeking a Senior Site Reliability Engineer for a next-generation AI-driven employment platform. Responsibilities include designing and operating scalable, secure, and reliable cloud-native infrastructure. Requires senior-level SRE/DevOps experience with Kubernetes and AWS, and a strong understanding of infrastructure-as-code and CI/CD.

Key Highlights
Core role in a next-generation AI-driven employment platform.
Focus on reliability, scalability, and security of distributed systems.
Hands-on engineering with significant influence on architecture and operations.
Key Responsibilities
Design and maintain scalable infrastructure-as-code solutions using tools like Terraform and Kubernetes.
Support platform evolution by improving automation and deployment workflows.
Build and operate observability systems including monitoring, logging, and alerting.
Lead incident response, postmortems, and reliability improvements to ensure high system availability.
Embed security and compliance practices into infrastructure and operational workflows.
Optimize system performance, reliability, and cloud costs through continuous analysis and tuning.
Eliminate operational toil by developing automation tools and scalable processes.
Partner with product and platform teams to improve APIs, deployment systems, and developer experience.
Technical Skills Required
Terraform Kubernetes AWS GitHub Actions GitLab Bash
Benefits & Perks
Competitive salary
Fully remote work
Flexible scheduling
Async-first culture
Equity or stock option opportunities
Flexible paid time off
Generous parental leave policies
Learning and development budget
Home office and equipment support
Mental health and wellness support services

Job Description


This position is listed on behalf of a partner company, who manages all applications and next steps. Our partner is looking for a Senior Site Reliability Engineer (Remote Build) based in United Arab Emirates.

This role sits at the core of a next-generation platform enabling AI-driven services across global employment infrastructure.

You will be responsible for ensuring the reliability, scalability, and security of systems that power complex integrations across multiple countries and compliance frameworks.

The environment is highly distributed, async-first, and built for engineers who thrive in ownership and autonomy.

You will design and operate infrastructure that supports Kubernetes-based deployments, cloud-native services, and agentic workflows at scale.

This is a hands-on engineering role where you will directly influence system architecture, operational practices, and developer experience.

You will collaborate closely with engineering, product, and security teams to ensure performance, cost efficiency, and operational excellence across all services.

Accountabilities

  • You will design and maintain scalable infrastructure-as-code solutions using tools like Terraform and Kubernetes, ensuring robust, repeatable, and secure deployments across environments. You will also support platform evolution by improving automation and deployment workflows.
  • You will build and operate observability systems including monitoring, logging, and alerting, while leading incident response, postmortems, and reliability improvements to ensure high system availability.
  • You will embed security and compliance practices into infrastructure and operational workflows, ensuring adherence to global regulatory requirements while minimizing friction for engineering teams.
  • You will optimize system performance, reliability, and cloud costs through continuous analysis and tuning of infrastructure and workloads across distributed systems.
  • You will eliminate operational toil by developing automation tools and scalable processes that reduce manual intervention and improve engineering efficiency.
  • You will partner with product and platform teams to improve APIs, deployment systems, and developer experience, ensuring infrastructure supports long-term scalability and maintainability.

Requirements

  • You bring senior-level experience in Site Reliability Engineering, DevOps, or Systems Engineering, with a proven track record of operating production systems at scale in cloud environments. You are comfortable owning reliability end-to-end.
  • You have deep hands-on expertise with Kubernetes and AWS, including networking, compute, storage, and managed services, and understand how to operate resilient distributed systems.
  • You are highly proficient with infrastructure-as-code tools such as Terraform and apply software engineering principles to infrastructure design and management.
  • You have strong experience with CI/CD pipelines and deployment automation using tools like GitHub Actions, GitLab, or similar, including rollback strategies and safe deployment practices.
  • You are comfortable working with Linux systems, debugging production issues, writing scripts (especially in Bash), and understanding system-level behavior.
  • You are an effective communicator who can translate complex infrastructure concepts into clear explanations, documentation, and runbooks for both technical and non-technical stakeholders.

Benefits

  • Competitive salary aligned with global benchmarks and experience level
  • Fully remote work with flexible scheduling and async-first culture
  • Equity or stock option opportunities depending on role eligibility
  • Flexible paid time off and generous parental leave policies
  • Learning and development budget to support continuous growth
  • Home office and equipment support to set up your workspace
  • Mental health and wellness support services
  • Opportunities to work on globally distributed, high-impact infrastructure systems

How Jobgether Works

We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.

We appreciate your interest and wish you the best!

Why Apply Through Jobgether?

Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses and identifying potential inconsistencies or verification signals in application materials based on available information. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.


Similar Jobs

Explore other opportunities that match your interests

AI Engineering Lead

Devops
•
11h ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Mid-Senior level

Jobgether

United Arab Emirates

Senior Cloud Network Engineer

Devops
•
2d ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Not Applicable

Jobgether

United Arab Emirates
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Not Applicable

ssc hr solutions

United Arab Emirates

Subscribe our newsletter

New Things Will Always Update Regularly