Observability Engineer

Jobs via Dice • United State
Remote
Apply
AI Summary

Design and implement end-to-end observability solutions to improve system reliability, performance, and visibility. Work closely with DevOps, SRE, and engineering teams. Build scalable observability frameworks.

Key Highlights
Design and implement observability solutions
Work with DevOps, SRE, and engineering teams
Build scalable observability frameworks
Key Responsibilities
Design and implement end-to-end observability solutions
Build dashboards and alerts to monitor system health and performance
Work closely with DevOps, SRE, and engineering teams to improve system reliability
Analyze system performance and troubleshoot production issues
Implement distributed tracing for microservices architectures
Optimize monitoring tools and reduce alert fatigue
Ensure high availability and scalability of observability platforms
Automate monitoring and alerting processes
Technical Skills Required
Prometheus Grafana ELK Stack (Elasticsearch, Logstash, Kibana) Datadog New Relic Distributed tracing tools (Jaeger, Zipkin, OpenTelemetry) Microservices architecture and cloud platforms (AWS/Azure/Google Cloud Platform) CI/CD pipelines and DevOps practices Scripting languages (Python, Bash, Go) Containerization (Docker, Kubernetes)
Benefits & Perks
100% Remote Work
Flexible working hours
Competitive salary/package
Opportunity to work on scalable, modern cloud systems
Nice to Have
Experience in Site Reliability Engineering (SRE) practices
Familiarity with Infrastructure as Code (Terraform, CloudFormation)
Experience with AIOps or advanced monitoring analytics
Certification in cloud platforms (AWS/Azure/Google Cloud Platform)
Exposure to security monitoring and compliance tools

Job Description


Dice is the leading career destination for tech experts at every stage of their careers. Our client, DevApps IT, is seeking the following. Apply via Dice today!

Job Title: Observability Engineer

Location: Remote

Experience: 5+ Years

Role Summary:

We are looking for a skilled Observability Engineer to design, implement, and maintain monitoring, logging, and tracing solutions across distributed systems. The ideal candidate will help improve system reliability, performance, and visibility by building scalable observability frameworks.

Key Responsibilities:

  • Design and implement end-to-end observability solutions (metrics, logs, traces)
  • Build dashboards and alerts to monitor system health and performance
  • Work closely with DevOps, SRE, and engineering teams to improve system reliability
  • Analyze system performance and troubleshoot production issues
  • Implement distributed tracing for microservices architectures
  • Optimize monitoring tools and reduce alert fatigue
  • Ensure high availability and scalability of observability platforms
  • Automate monitoring and alerting processes

Required Skills:

  • Strong experience with observability tools like Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana), Datadog, or New Relic
  • Hands-on experience with distributed tracing tools (Jaeger, Zipkin, OpenTelemetry)
  • Good understanding of microservices architecture and cloud platforms (AWS/Azure/Google Cloud Platform)
  • Experience with CI/CD pipelines and DevOps practices
  • Proficiency in scripting languages like Python, Bash, or Go
  • Knowledge of containerization (Docker, Kubernetes)

Preferred Qualifications:

  • Experience in Site Reliability Engineering (SRE) practices
  • Familiarity with Infrastructure as Code (Terraform, CloudFormation)
  • Strong problem-solving and debugging skills
  • Excellent communication skills in a remote work environment

Nice to Have:

  • Experience with AIOps or advanced monitoring analytics
  • Certification in cloud platforms (AWS/Azure/Google Cloud Platform)
  • Exposure to security monitoring and compliance tools

Benefits:

  • 100% Remote Work
  • Flexible working hours
  • Competitive salary/package
  • Opportunity to work on scalable, modern cloud systems

Similar Jobs

Explore other opportunities that match your interests

Senior Platform Engineer

Devops
•
4h ago
Visa Sponsorship Relocation Remote
Job Type Contract
Experience Level Mid-Senior level

CBTS

United State
Visa Sponsorship Relocation Remote
Job Type Contract
Experience Level Mid-Senior level

Zillion Technologies, Inc.

United State

Cloud Systems Engineer

Devops
•
5h ago
Visa Sponsorship Relocation Remote
Job Type Contract
Experience Level Associate

Pride Health

United State

Subscribe our newsletter

New Things Will Always Update Regularly