Senior Datadog Architect

Pomeroy United State
Remote
Apply
AI Summary

Design, implement, and optimize enterprise-level observability solutions across cloud, container, mainframe, and hybrid environments. Lead the architecture, deployment, governance, and optimization of the Datadog platform. Collaborate with cross-functional teams to drive end-to-end observability.

Key Highlights
Architect end-to-end Datadog observability frameworks
Design and architect integrations for Mainframe platforms
Deploy Datadog agents, integrations, and custom checks across large environments
Technical Skills Required
Datadog Azure Kubernetes Docker Terraform Python Bash APM Metrics Logs RUM Synthetics SIEM/Security Monitoring
Benefits & Perks
Competitive pay up to $75/hr W2
Fully remote role
Long-term 12+ month contract with extension potential

Job Description


Datadog Architect (Remote – US Only)

12+ Month Contract | Up to $75/hr W2 | US Citizenship Required


**3rd party resumes will not be considered**


We are seeking an experienced Datadog Architect to design, implement, and optimize enterprise-level observability solutions across cloud, container, mainframe, and hybrid environments. This fully remote contract role supports a long-term transformation of monitoring and reliability capabilities across a large, distributed ecosystem.


Overview

As the Datadog Architect, you will lead the architecture, deployment, governance, and optimization of the Datadog platform. This role works closely with Cloud, DevOps, SRE, Application, and Security teams to drive end-to-end observability across Azure, Kubernetes, microservices, and on-prem systems. You will define monitoring standards, automate instrumentation, establish tagging governance, and ensure visibility across mission-critical services.


Key Responsibilities


Architecture & Design

  • Architect end-to-end Datadog observability frameworks across Azure, Kubernetes, containers, and on-prem workloads.
  • Define SLIs/SLOs, dashboards, alerting strategy, tagging governance, and monitoring standards.
  • Design and architect end-to-end integrations for Mainframe platforms.
  • Build scalable log ingestion pipelines, retention strategies, and cost-optimized indexing models.
  • Develop patterns for APM instrumentation across microservices, serverless, and distributed systems.


Implementation & Optimization

  • Deploy Datadog agents, integrations, and custom checks across large, complex environments.
  • Configure APM, RUM, Logs, SIEM, Synthetics, NPM, and CI/CD Observability.
  • Partner with DevOps, SRE, Cloud, and Application teams to ensure complete service visibility.
  • Continuously optimize Datadog usage, retention, indexing, and billing for cost efficiency.


Governance & Best Practices

  • Establish enterprise-wide observability standards, tagging frameworks, alerting guardrails, and onboarding guidelines.
  • Build reusable Terraform modules and automation tooling for Datadog deployment.
  • Ensure alignment with security and compliance requirements.
  • Provide training, mentoring, and enablement on dashboards, logs, traces, and alerting best practices.


Troubleshooting & Insights

  • Lead root cause analysis (RCA) using metrics, traces, logs, and event correlation.
  • Collaborate with engineering teams to enhance system reliability, resilience, and performance.
  • Identify and close gaps in monitoring coverage and observability maturity.


Required Skills & Experience

  • 12+ years in Observability, Monitoring, SRE, DevOps, or Cloud Engineering.
  • 6+ years of hands-on experience with Datadog in enterprise environments.
  • Strong understanding of microservices, distributed systems, and cloud-native architectures.
  • Expertise with Kubernetes, Docker, and cloud platforms (Azure preferred; AWS/GCP a plus).
  • Proficiency with Infrastructure as Code, especially Terraform.
  • Deep experience with APM, Metrics, Logs, RUM, Synthetics, SIEM/Security Monitoring.
  • Strong command of dashboards, alerting, service maps, event correlation, and notebooks within Datadog.
  • Scripting experience in Python, Bash, or similar languages.
  • Excellent communication, analytical, and problem-solving skills.
  • US Citizenship required.


Preferred Qualifications

  • Datadog Certifications (APM, Log Management, Observability, etc.)
  • Experience with Retail observability environments.
  • CI/CD observability across GitHub Actions, Jenkins, GitLab CI, etc.
  • Background in Performance Engineering, Reliability Engineering, or Platform Engineering.


Why Apply?

  • Fully remote role supporting a major enterprise observability program
  • Long-term 12+ month contract with extension potential
  • Competitive pay up to $75/hr W2
  • High-impact role shaping monitoring strategy across cloud and mainframe platforms


Subscribe our newsletter

New Things Will Always Update Regularly