Job Description
At BairesDev®, we've been leading the way in technology projects for over 15 years. We deliver cutting-edge solutions to giants like Google and the most innovative startups in Silicon Valley.
Our diverse team of 4,000+ people, composed of the world's Top 1% of tech talent, works remotely on roles that drive significant impact worldwide.
When you apply for this position, you're taking the first step in a process that goes beyond the ordinary. We aim to align your passions and skills with our vacancies, setting you on a path to exceptional career development and success.
Observability Engineer at BairesDev
As an Observability Engineer, you will design and operate observability infrastructure, including metrics, logs, and traces, to provide engineering teams with deep visibility into production systems. You will bridge the gap between complex distributed systems and actionable insights, ensuring high availability and performance through robust monitoring strategies.
What You'll Do:
- Design, implement, and manage enterprise-wide observability platforms using Prometheus and Grafana.
- Standardize telemetry data collection across distributed systems using OpenTelemetry.
- Implement and optimize full-stack monitoring, APM, and log management solutions with Datadog.
- Collaborate with engineering teams to define meaningful SLOs, SLIs, and alerting strategies.
- Build and maintain scalable infrastructure for long-term metrics storage and real-time dashboarding.
- Drive the adoption of observability best practices to improve incident detection and resolution times.
What we are looking for:
- 4+ years of experience in Infrastructure, Site Reliability Engineering, or Observability.
- Proven expertise in designing and operating observability infrastructure for metrics, logs, and traces.
- Proficiency with monitoring and visualization tools such as Prometheus and Grafana.
- Hands-on experience implementing distributed tracing using OpenTelemetry.
- Experience optimizing enterprise observability platforms like Datadog to provide system visibility.
- Advanced proficiency in English.
How we make your work (and your life) easier:
- 100% remote work (from anywhere).
- Excellent compensation in USD or, if you prefer, your local currency.
- Hardware and software setup for you to work from home.
- Flexible hours: create your own schedule.
- Paid parental leave, vacation, and national holidays.
- Innovative and multicultural work environment: collaborate and learn from the global Top 1% of talent.
- Supportive environment with mentorship, promotions, skill development, and diverse growth opportunities.
Apply now and become part of a global team where your unique talents can truly thrive!