Design and implement end-to-end observability solutions to improve system reliability, performance, and visibility. Work closely with DevOps, SRE, and engineering teams. Build scalable observability frameworks.
Key Highlights
Key Responsibilities
Technical Skills Required
Benefits & Perks
Nice to Have
Job Description
Dice is the leading career destination for tech experts at every stage of their careers. Our client, DevApps IT, is seeking the following. Apply via Dice today!
Job Title: Observability Engineer
Location: Remote
Experience: 5+ Years
Role Summary:
We are looking for a skilled Observability Engineer to design, implement, and maintain monitoring, logging, and tracing solutions across distributed systems. The ideal candidate will help improve system reliability, performance, and visibility by building scalable observability frameworks.
Key Responsibilities:
- Design and implement end-to-end observability solutions (metrics, logs, traces)
- Build dashboards and alerts to monitor system health and performance
- Work closely with DevOps, SRE, and engineering teams to improve system reliability
- Analyze system performance and troubleshoot production issues
- Implement distributed tracing for microservices architectures
- Optimize monitoring tools and reduce alert fatigue
- Ensure high availability and scalability of observability platforms
- Automate monitoring and alerting processes
Interested in remote work opportunities in Devops? Discover Devops Remote Jobs featuring exclusive positions from top companies that offer flexible work arrangements.
- Strong experience with observability tools like Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana), Datadog, or New Relic
- Hands-on experience with distributed tracing tools (Jaeger, Zipkin, OpenTelemetry)
- Good understanding of microservices architecture and cloud platforms (AWS/Azure/Google Cloud Platform)
- Experience with CI/CD pipelines and DevOps practices
- Proficiency in scripting languages like Python, Bash, or Go
- Knowledge of containerization (Docker, Kubernetes)
- Experience in Site Reliability Engineering (SRE) practices
- Familiarity with Infrastructure as Code (Terraform, CloudFormation)
- Strong problem-solving and debugging skills
- Excellent communication skills in a remote work environment
Browse our curated collection of remote jobs across all categories and industries, featuring positions from top companies worldwide.
- Experience with AIOps or advanced monitoring analytics
- Certification in cloud platforms (AWS/Azure/Google Cloud Platform)
- Exposure to security monitoring and compliance tools
- 100% Remote Work
- Flexible working hours
- Competitive salary/package
- Opportunity to work on scalable, modern cloud systems
Similar Jobs
Explore other opportunities that match your interests
CBTS
Zillion Technologies, Inc.