Grafana Monitoring and Alerting Engineer

Remote
Apply
AI Summary

We are looking for an experienced Grafana Monitoring & Alerting Engineer to ensure the reliability, accuracy, and performance of monitoring systems across distributed environments. The ideal candidate will have strong expertise in Prometheus, Node Exporters, and monitoring infrastructures. This is a part-time contract position.

Key Highlights
Install, configure, and maintain monitoring agents (Node Exporters, Prometheus)
Perform regular health checks, upgrades, and validation of monitoring agents
Support and troubleshoot Grafana dashboard issues, including data source connectivity
Deploy monitoring agents using automation tools
Provide detailed troubleshooting, root cause analysis, and documentation
Technical Skills Required
Linux Windows Prometheus Grafana Kubernetes (k8s) Node Exporters
Benefits & Perks
100% remote work
Part-time contract (20hrs/week)
Contract duration: 6 months

Job Description


Role: Grafana Monitoring/Alerting Support

Position Type: Part-Time Contract (20hrs/week)

Contract Duration: 6 months

Work Hours: EST

Location: 100% Remote


We are looking for an experienced Grafana Monitoring & Alerting Engineer with strong expertise in Prometheus, Node Exporters, and monitoring infrastructures. The ideal candidate will ensure the reliability, accuracy, and performance of monitoring systems across distributed environments.


Key Responsibilities

  • Install, configure, and maintain monitoring agents (Node Exporters, Prometheus).
  • Perform regular health checks, upgrades, and validation of monitoring agents.
  • Support and troubleshoot Grafana dashboard issues, including data source connectivity.
  • Ensure operational dashboards remain functional and up to date.
  • Deploy monitoring agents using automation tools.
  • Validate Prometheus configurations, service availability, and alerting logic.
  • Provide detailed troubleshooting, root cause analysis, and documentation.
  • Maintain technical documentation and contribute to knowledge transfer.


Required Skills & Experience

  • Strong Linux/Windows administration experience.
  • Hands-on understanding of Prometheus and Grafana setup and workflows.
  • Experience working with Kubernetes (k8s) environments.
  • Strong troubleshooting, analytical, and RCA (Root Cause Analysis) skills.
  • Ability to resolve installation, connectivity, and configuration issues.
  • Experience supporting dashboards and monitoring pipelines in production.


Preferred Skills

  • Experience with automation tools for agent deployment.
  • Knowledge of monitoring best practices and alerting optimization.


Subscribe our newsletter

New Things Will Always Update Regularly