Senior Site Reliability Engineer

hired • United Arab Emirates
Remote
Apply
AI Summary

Join a high-performing team as a Senior Site Reliability Engineer to ensure the performance, reliability, and scalability of mission-critical infrastructure. Design, implement, and maintain scalable infrastructure using Linux, Kubernetes, and Prometheus. Collaborate closely with development and operations teams to deliver seamless deployments and high system uptime.

Key Highlights
Design, implement, and maintain scalable infrastructure
Monitor system health and analyze performance metrics
Collaborate with development and operations teams
Key Responsibilities
Design, implement, and maintain scalable infrastructure
Monitor system health, analyze performance metrics, and proactively address bottlenecks or potential failures
Automate operational processes to minimize manual intervention and increase system reliability
Respond swiftly to incidents, conduct root cause analysis, and drive continuous improvements in incident response procedures
Collaborate closely with development and operations teams to deliver seamless deployments and high system uptime
Technical Skills Required
Linux Kubernetes Prometheus
Benefits & Perks
Competitive payout
Remote work
Equal opportunity employer

Job Description


  • Role: Senior Site Reliability Engineer (Remote)
  • Location: Remote (Work from Anywhere)
  • Payout: Competitive


Role Overview:

One of our clients, a global leader in the Technology industry, is seeking a skilled Site Reliability Engineer to join their team as a contractor. This role involves ensuring the performance, reliability, and scalability of mission-critical infrastructure. As a Site Reliability Engineer, you will play a pivotal role in architecting, monitoring, and enhancing robust systems supporting innovative applications.


Key Responsibilities:

• Design, implement, and maintain scalable infrastructure using Linux, Kubernetes, and Prometheus.

• Monitor system health, analyze performance metrics, and proactively address bottlenecks or potential failures.

• Automate operational processes to minimize manual intervention and increase system reliability.

• Respond swiftly to incidents, conduct root cause analysis, and drive continuous improvements in incident response procedures.

• Collaborate closely with development and operations teams to deliver seamless deployments and high system uptime.


Required Skills & Qualifications:

• Deep expertise in Linux, Kubernetes, and Prometheus.

• Strong understanding of system monitoring, performance analysis, and automated process automation.

• Excellent problem-solving and troubleshooting skills with the ability to analyze complex system issues.

• Experience with incident response, root cause analysis, and continuous improvement.

• Strong collaboration and communication skills with ability to work with cross-functional teams.


More About the Opportunity:

As a Site Reliability Engineer, you will be part of a high-performing team that is passionate about delivering innovative solutions. This is a fantastic opportunity to work with cutting-edge technologies and make a significant impact on the company's infrastructure.


Equal Opportunity Employer:

We hire based on skills and expertise. All qualified candidates are welcome regardless of background, experience, or prior employment history. Applications are reviewed solely on demonstrated technical ability and qualifications.


Apply Now!


Similar Jobs

Explore other opportunities that match your interests

Visa Sponsorship Relocation Remote
Job Type Temporary
Experience Level Mid-Senior level

Prolific

United Arab Emirates

AWS Cloud Engineer (Remote)

Devops
•
2d ago
Visa Sponsorship Relocation Remote
Job Type Part-time
Experience Level Entry level

jobs ai

United Arab Emirates

Cloud Infrastructure Specialist

Devops
•
3d ago
Visa Sponsorship Relocation Remote
Job Type Part-time
Experience Level Entry level

onboard

United Arab Emirates

Subscribe our newsletter

New Things Will Always Update Regularly