DevOps/Platform Engineer

The Judge Group, United States
Remote
AI Summary

Join our fully remote engineering team as a DevOps/Platform Engineer to build a seamless developer experience and maintain a resilient, scalable cloud environment. Automate everything, reduce toil, and ensure our systems are observable and reliable. Bridge the gap between development and operations.

Key Highlights

  • Kubernetes Orchestration
  • Infrastructure as Code (IaC)
  • Observability & Reliability

Key Responsibilities

  • Manage, scale, and troubleshoot Azure Kubernetes Service (AKS) clusters
  • Use Terraform to provision and manage the Azure environment
  • Configure Datadog for deep system insights

Technical Skills Required

  • Azure Expertise
  • Kubernetes Troubleshooting
  • Terraform
  • Datadog
  • Kafka
  • GitHub Actions
  • ArgoCD

Benefits & Perks

  • Remote work
  • Flexible work arrangement

Job Description


Role Overview

We are looking for a DevOps / Platform Engineer to join our fully remote engineering team. In this role, you will be the backbone of our infrastructure, focusing on building a seamless developer experience and maintaining a resilient, scalable cloud environment.

You aren’t just "managing servers"—you’re architecting the platforms that allow our product teams to ship high-quality code with confidence and speed.


The Role & Responsibilities

As a core member of the Platform team, you will bridge the gap between development and operations. Your mission is to automate everything, reduce toil, and ensure our systems are observable and reliable.

Key Responsibilities

  • Kubernetes Orchestration
      • Manage, scale, and troubleshoot Azure Kubernetes Service (AKS) clusters
      • Ensure containerized workloads are healthy and efficiently bin-packed onto nodes
  • Infrastructure as Code (IaC)
      • Treat infrastructure as software
      • Use Terraform to provision and manage the Azure environment
  • Deployment Excellence
      • Design and maintain CI/CD pipelines using GitHub Actions
      • Implement GitOps-based CD using ArgoCD
  • Observability & Reliability
      • Configure Datadog for deep system insights
      • Build dashboards and proactive monitors
      • Track DORA metrics and SLOs to measure engineering health
  • Data Streaming
      • Manage Kafka operations (via Strimzi or equivalent)
      • Ensure event-driven systems remain performant and reliable
  • Cloud Governance
      • Oversee Azure services including:
          • Key Vault (secret management)
          • Container Registry (ACR)
          • Storage Accounts
          • Service Bus


What You Bring to the Table

  • Azure Expertise
      • Deep understanding of the Azure ecosystem, including networking and security
  • Kubernetes Troubleshooting
      • Strong knowledge of AKS internals, networking, ingress, and resource optimization
  • Automation Mindset
      • Proficiency with Terraform
      • Comfort with Git-centric workflows and automation-first practices
  • Observability Focus
      • Experience creating actionable Datadog alerts and defining SLOs
  • Collaboration
      • Ability to work closely with developers to explain platform behavior and system interactions


Our Tech Stack

  • Cloud Provider: Azure (AKS, Key Vault, Service Bus, ACR)
  • Infrastructure as Code: Terraform
  • CI/CD: GitHub Actions, ArgoCD (GitOps)
  • Monitoring & Reliability: Datadog (DORA metrics, SLOs, APM)
  • Messaging / Streaming: Kafka (Strimzi)

