Design and build a next-generation observability ecosystem, leading architecture of unified telemetry pipelines, and collaborating with engineering teams to integrate observability into CI/CD workflows and production systems. Strong technical expertise in DevOps, cloud infrastructure, and observability engineering required. 8+ years of experience in DevOps, SRE, or observability engineering roles.
Key Highlights
Key Responsibilities
Technical Skills Required
Benefits & Perks
Job Description
This position is posted by Jobgether on behalf of a partner company. We are currently looking for a DevOps/Observability Engineer in Canada.
This role is focused on designing and building a next-generation observability ecosystem that enables deep visibility across large-scale, distributed cloud environments. You will lead the architecture of unified telemetry pipelines, ensuring logs, metrics, and traces are efficiently collected, processed, and analyzed. Working within a modern AWS-based infrastructure, you will leverage OpenTelemetry, Kubernetes, and industry-leading monitoring tools to enhance system reliability and performance. The environment is highly technical, cloud-native, and centered on automation, scalability, and continuous improvement. You will collaborate closely with engineering teams to integrate observability into CI/CD workflows and production systems. This position offers the opportunity to shape enterprise-wide monitoring standards and directly influence operational excellence at scale.
Accountabilities
In this role, you will design, implement, and evolve a unified observability platform that supports large-scale distributed systems and ensures operational visibility across environments.
- Architect and implement end-to-end observability pipelines using OpenTelemetry, Prometheus, Grafana, and related tooling in AWS environments
- Design scalable log, metric, and trace collection strategies, including cross-account AWS telemetry integration and centralized monitoring frameworks
- Build and optimize log aggregation, filtering, and routing systems, including integrations with Splunk and other enterprise tools
- Develop advanced alerting, dashboards, and monitoring solutions using PromQL, CloudWatch, and Alertmanager
- Implement Infrastructure as Code using Terraform to deploy and manage observability and cloud infrastructure components
- Support Kubernetes-based observability across EKS/ECS environments, ensuring full-stack visibility and reliability
- Drive cost optimization initiatives by improving telemetry efficiency, storage strategies, and data filtering approaches
- Collaborate with engineering and platform teams to embed observability into deployment pipelines and production systems
Interested in remote work opportunities in Devops? Discover Devops Remote Jobs featuring exclusive positions from top companies that offer flexible work arrangements.
- 8+ years of experience in DevOps, SRE, or observability engineering roles
- Strong expertise in AWS cloud services and multi-account observability architectures
- Hands-on experience with OpenTelemetry, Prometheus, Grafana, Splunk, and CloudWatch
- Strong proficiency with Infrastructure as Code tools, particularly Terraform
- Advanced programming/scripting skills (Python, Go, or similar) for automation and tooling
- Experience with Kubernetes (EKS) and containerized environments (Docker, ECS)
- Deep understanding of logging, metrics, tracing, and distributed system observability principles
- Strong analytical, problem-solving, and systems-thinking abilities with a focus on scalability and reliability
- Excellent communication skills and ability to work in cross-functional, fast-paced engineering teams
- Competitive compensation aligned with experience and market benchmarks
- Fully remote work setup across Canada
- Opportunity to work on large-scale, cloud-native systems and cutting-edge observability platforms
- Exposure to advanced AI, cloud, and distributed engineering environments
- Career growth within a high-performance, innovation-driven engineering culture
- Collaborative and knowledge-sharing work environment with global teams
- Continuous learning opportunities and access to modern DevOps and cloud technologies
- Inclusive and flexible work culture supporting work-life balance.
Browse our curated collection of remote jobs across all categories and industries, featuring positions from top companies worldwide.
We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.
We appreciate your interest and wish you the best!
Why Apply Through Jobgether?
Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
Similar Jobs
Explore other opportunities that match your interests
breed staffing
Cloud and Infrastructure Engineer
Jobgether