Manage and optimize IT infrastructure, focusing on automation and Kubernetes administration. Provide second-line operational support and implement automation strategies. Troubleshoot complex technical issues and maintain system reliability.
Job Description
Job Title: Senior Systems Administrator
Location: Remote (NA, EU)
Department: IT Operations
Job Type: Full-Time
Job Summary
The Senior Systems Administrator will play a key role in managing and optimizing IT infrastructure, with a strong focus on automation and Kubernetes administration. As part of the second-line support team, this role will be responsible for maintaining system reliability, troubleshooting escalated issues, and implementing automation strategies using Ansible and Terraform. The ideal candidate will have experience in cloud-native environments, infrastructure as code (IaC), and IT operations best practices.
Key Responsibilities
- Systems Administration & Support: Provide second-line operational support for critical IT systems, diagnosing and resolving complex technical issues to ensure high availability and performance.
- Automation & Infrastructure as Code: Design, implement, and maintain automation workflows using Ansible and Terraform to streamline deployments and system management.
- Kubernetes Administration: Manage Kubernetes clusters, ensuring scalability, security, and performance, while troubleshooting containerized workloads.
- Cloud & Virtualization Management: Administer cloud environments (AWS, Azure, or GCP) and on-prem virtualization platforms (VMware, OpenStack).
- Monitoring & Incident Response: Implement and maintain monitoring solutions to proactively detect and resolve system issues, responding to incidents in line with SLAs.
- Security & Compliance: Enforce security best practices, conduct system hardening, and manage patching processes to mitigate risks.
- Process Improvement: Continuously refine operational workflows, leveraging automation and scripting to enhance efficiency.
- Collaboration & Documentation: Work closely with cross-functional teams, providing technical guidance and maintaining thorough documentation of systems and processes.
Required Qualifications & Experience
- 5+ years of experience in IT operations, systems administration, and automation.
- Strong experience in Linux and Windows server administration.
- Proficiency in Ansible and Terraform for automation and infrastructure provisioning.
- Experience with Kubernetes administration, container orchestration, and troubleshooting.
- Familiarity with cloud platforms (AWS, Azure, or GCP) and virtualization technologies (VMware, OpenStack).
- Experience with networking and storage, and with system monitoring tools (e.g., Prometheus, Grafana, ELK stack, Datadog).
- Strong scripting skills (Bash, Python, or PowerShell) for automation and system management.
- Ability to troubleshoot complex system issues and support production environments.
- Experience working in an ITIL-based service management framework is a plus.
- Preferred certifications: CKA (Certified Kubernetes Administrator), RHCE, AWS/Azure certifications, ITIL Foundation.
Working Conditions
- Fully remote role with occasional travel for team meetings or data center visits.
- Participation in on-call rotations to support 24/7 infrastructure operations.