Site Reliability Engineer - Production Systems Expert

humanitapp • United States

Remote

AI Summary

Experienced Site Reliability Engineer needed for a remote contract opportunity. 3+ years of SRE, DevOps, or production engineering experience required. Proficient with observability stacks, Linux systems, and container orchestration.

Key Highlights

Remote contract opportunity

3+ years of SRE, DevOps, or production engineering experience

Proficient with observability stacks, Linux systems, and container orchestration

Key Responsibilities

Author and review complex, realistic scenarios grounded in production incidents

Cover root cause analysis, monitoring and alerting, capacity planning, and post-incident remediation

Help evaluate and train AI models that reason about system failures and operational best practices

Technical Skills Required

Prometheus

Grafana

Datadog

PagerDuty

Linux systems

Networking (TCP/IP, DNS, load balancing)

Container orchestration (Kubernetes, Docker)

Infrastructure-as-code (Terraform, Pulumi, CloudFormation)

CI/CD pipelines

Benefits & Perks

$100-$160/hr

Remote work

Contract opportunity

Job Description

HumaniT is referring experienced Site Reliability Engineers to a remote contract opportunity with a platform trusted by leading AI labs and Fortune 10 companies.

Role: Site Reliability Engineer — Production Systems Expert

Type: Independent Contractor | Fully Remote

Location: United States only

Rate: $100–$160/hr

Start Date: Late March, with additional openings in April

Who this is for:

— 3+ years of SRE, DevOps, or production engineering experience at a big tech company or leading startup

— Experience serving in on-call rotations managing Tier 1/Tier 2 production services with meaningful SLA requirements

— Proficient with observability stacks: Prometheus, Grafana, Datadog, PagerDuty, or equivalent

— Deep knowledge of Linux systems, networking (TCP/IP, DNS, load balancing), and container orchestration (Kubernetes, Docker)

— Hands-on with infrastructure-as-code (Terraform, Pulumi, CloudFormation) and CI/CD pipelines

— Strong debugging skills from application-level tracing to kernel-level diagnostics

What you will do:

— Author and review complex, realistic scenarios grounded in production incidents

— Cover root cause analysis, monitoring and alerting, capacity planning, and post-incident remediation

— Help evaluate and train AI models that reason about system failures and operational best practices

This project is currently in a pilot phase — participants are expected to be highly engaged with project leadership.

Applications are reviewed on a rolling basis.

Explore more opportunities at humanitapp.com

#SRE #SiteReliabilityEngineering #DevOps #RemoteWork #AIResearch #NowHiring

Job Overview

Posted Date Mar 29, 2026

Employment Type Contract

Experience Level Mid-Senior level

Location United States

Hourly Rate $100–$160/hr

Category Programming

Company humanitapp
