Manychat is seeking an experienced Senior Site Reliability Engineer to manage cloud resources, harden Kubernetes clusters, and shape a more reliable and developer-friendly platform.
Key Highlights
Technical Skills Required
Benefits & Perks
Job Description
Who We Are 🌍
We help creators get more out of every conversation with Instagram-focused automations and support for other channels like Messenger, WhatsApp, and TikTok. The result? Better engagement, more sales, and real, sustainable growth.
With a diverse team of 350+ people spread across three continents, we’re building the leading Chat Marketing platform that is used — and loved — by more than 1.5 million customers worldwide.
Who We're Looking For 🌟
We’re looking for a Senior Site Reliability Engineer who thrives at the crossroads of classic Linux and AWS infrastructure and modern Site Reliability Engineering. This is a high-impact, hybrid role designed for someone who can manage cloud resources, harden Kubernetes clusters, and shape a more reliable and developer-friendly platform.
We need you not just to maintain but to rethink and evolve our infrastructure, balancing hands-on operations with strategic improvements that future-proof our growing AI product landscape.
You’ll take over key responsibilities from our current Infra Lead who is transitioning to a software-focused role, giving you immediate ownership and space to shine.
WHY THE ROLE IS SPECIAL 💡
You won’t be a cog in a massive SRE org. You’ll be the bridge between Infrastructure and Engineering, shaping how we scale Kubernetes, how we approach platform reliability, and how developers ship fast without fear. You’ll get autonomy, ownership, and a smart, humble team excited to learn with you.
What You’ll Do 🤖
- Maintain and harden AWS infrastructure (EC2, ALB/NLB, WAF, IAM, CloudWatch)
- Operate and evolve our EKS clusters powering Python-based AI services
- Migrate existing services to Kubernetes using Terraform and Helm
- Codify infrastructure with Terraform and manage host-level automation via Ansible
- Build and improve CI/CD pipelines with GitHub Actions
- Own observability efforts: Prometheus, Grafana, alerting, and on-call readiness
- Support OS-level patching, certs, WAF rules, and general infra hygiene
- Partner with engineers to guide best practices and drive platform reliability
- Create clean, maintainable infrastructure documentation and playbooks
- Occasionally support rare off-hours incidents (don’t worry, really rare)
Looking to advance your Devops career with relocation support? Explore Devops Jobs with Relocation Packages that include comprehensive packages to help you move and settle in your new role.
- 5+ years of experience managing Linux in production (Ubuntu, Amazon Linux)
- Strong experience with Kubernetes (ideally EKS), Helm, and Terraform
- Comfort with running and debugging Python workloads in containers
- Solid understanding of networking, IAM, and cloud security best practices
- Hands-on Nginx experience (Ingress and reverse proxy setups)
- Excellent communication skills; you can explain complex infra to devs clearly
- Strong Ansible skills beyond the basics
- PostgreSQL or Amazon RDS tuning and operations experience
- Deep understanding of observability tools (Prometheus, Grafana, Loki, etc.)
- Familiarity with PHP production environments
- Experience with TDD, CI/CD best practices, and agile development
- Any previous SRE-like exposure such as building resilience, automation, or incident tooling
Discover our full range of relocation jobs with comprehensive support packages to help you relocate and settle in your new location.
We care deeply about your growth, well-being, and comfort:
- 🌍 Hybrid onboarding to start work remotely and relocation support for you and your family.
- 💙 Comprehensive health insurance for both you and your family.
- 📚 Professional development budget for conference tickets, online courses, and other relevant resources to help you grow.
- 🫶 Flexible benefits package to tailor perks that matters most for you.
- 🪴 Hybrid work and generous leave options to prioritize your work-life balance.
- 🍽️ In-office perks, including free meals and snacks.
- 🤝 Company-funded sport activities, annual offsites and team-building events.
This commitment is also reflected through our candidate experience. If you have individual needs that may require an accommodation during the interview process, please indicate this in your application. We will do our best to provide assistance throughout your interview process to ensure you’re set up for success.
With my application, I accept the Manychat Privacy Policy.
Similar Jobs
Explore other opportunities that match your interests
Senior Platform Engineer
stx group
Senior Cloud DevOps Engineer
elevation group
DevOps Engineer - Fullstack Developer