Senior Observability Engineer

BairesDev Latin America
Remote

Job Description


At BairesDev®, we've been leading the way in technology projects for over 15 years. We deliver cutting-edge solutions to giants like Google and the most innovative startups in Silicon Valley.


Our diverse 4,000+ team, composed of the world's Top 1% of tech talent, works remotely on roles that drive significant impact worldwide.


When you apply for this position, you're taking the first step in a process that goes beyond the ordinary. We aim to align your passions and skills with our vacancies, setting you on a path to exceptional career development and success.


Observability Engineer at BairesDev


As an Observability Engineer, you will design and operate observability infrastructure, including metrics, logs, and traces, to provide engineering teams with deep visibility into production systems. You will bridge the gap between complex distributed systems and actionable insights, ensuring high availability and performance through robust monitoring strategies.


What You'll Do:


- Design, implement, and manage enterprise-wide observability platforms using Prometheus and Grafana.

- Standardize telemetry data collection across distributed systems using OpenTelemetry.

- Implement and optimize full-stack monitoring, APM, and log management solutions with Datadog.

- Collaborate with engineering teams to define meaningful SLOs, SLIs, and alerting strategies.

- Build and maintain scalable infrastructure for long-term metrics storage and real-time dashboarding.

- Drive the adoption of observability best practices to improve incident detection and resolution times.


What we are looking for:


- 4+ years of experience in Infrastructure, Site Reliability Engineering, or Observability.

- Proven expertise in designing and operating observability infrastructure for metrics, logs, and traces.

- Proficiency with monitoring and visualization tools such as Prometheus and Grafana.

- Hands-on experience implementing distributed tracing using OpenTelemetry.

- Experience optimizing enterprise observability platforms like Datadog to provide system visibility.

- Advanced proficiency in English.


How we make your work (and your life) easier:


- 100% remote work (from anywhere).

- Excellent compensation in USD or, if you prefer, your local currency.

- Hardware and software setup for you to work from home.

- Flexible hours: create your own schedule.

- Paid parental leave, vacation, and national holidays.

- Innovative and multicultural work environment: collaborate with and learn from the global Top 1% of talent.

- Supportive environment with mentorship, promotions, skill development, and diverse growth opportunities.


Apply now and become part of a global team where your unique talents can truly thrive!

