Site Reliability Engineer

crunchafi United State
Remote
Apply
AI Summary

Crunchafi is seeking a Site Reliability Engineer to ensure the availability, performance, and scalability of our cloud-based SaaS platform. The ideal candidate will design, build, and maintain scalable and resilient infrastructure on Microsoft Azure. This role requires deep Azure cloud expertise, a strong background in infrastructure-as-code and incident management, and a passion for eliminating toil through automation.

Key Highlights
Design, build, and maintain scalable and resilient infrastructure on Microsoft Azure
Develop and maintain comprehensive monitoring, alerting, and observability systems
Lead incident response and on-call rotations
Key Responsibilities
Design, build, and maintain scalable and resilient infrastructure on Microsoft Azure
Define and track service level objectives (SLOs), service level indicators (SLIs), and error budgets
Build and maintain comprehensive monitoring, alerting, and observability systems
Lead incident response and on-call rotations
Develop and maintain CI/CD pipelines using GitHub Actions
Manage and optimize Azure Kubernetes Service (AKS) clusters
Collaborate with software engineering teams to embed reliability into application architecture
Technical Skills Required
Microsoft Azure Azure Kubernetes Service (AKS) GitHub Actions Terraform Bicep ARM templates Python Go Bash PowerShell C#
Benefits & Perks
Competitive salary
Health, dental, and vision plans
401(k) Retirement savings plan for US-based employees
Nice to Have
Experience operating SaaS platforms in accounting, financial services, or B2B environments
Experience with chaos engineering practices and tools
Familiarity with microservices and event-driven architecture patterns

Job Description


Job Title:  Site Reliability Engineer

Reports to: VP of Engineering

Type:  Full time, salaried

Location:  Remote; with occasional travel requirements to Milwaukee, WI



About Crunchafi 

Crunchafi (formerly LeaseCrunch) is revolutionizing the world of accounting with easy-to-use, cloud-based solutions designed to simplify complex financial data management. Our products empower CPA firms and financial professionals by streamlining lease accounting, data extraction, and cash flow forecasting, helping them deliver strategic value faster and more efficiently. Trusted by over 750 firms and more than 27,000 companies, Crunchafi combines cutting-edge technology with expert support to power the future of accounting. 



Our Team

Crunchafi is made up of passionate, forward-thinking professionals committed to transforming the accounting industry. Our team is dedicated to providing innovative solutions that simplify accounting processes and provide actionable financial insights. We value collaboration, creativity, humor, and a shared vision of improving the accounting profession through technology. 



Why Join Us?

We are looking for talented individuals to join our growing team and contribute to our mission of empowering CPA firms and financial professionals. At Crunchafi, you’ll be part of a dynamic, collaborative environment where your ideas are valued, and your growth is supported. We offer a rewarding work/life balance, opportunities for professional development, and a chance to make a real impact in the world of accounting. 

About This Role

Crunchafi is looking for a Site Reliability Engineer to ensure the availability, performance, and scalability of our cloud-based SaaS platform. This role bridges software engineering and operations — you will build and maintain the infrastructure, observability, and automation that keep our systems running reliably at scale. The ideal candidate brings deep Azure cloud expertise, a strong background in infrastructure-as-code and incident management, and a passion for eliminating toil through automation.

Responsibilities

  • Design, build, and maintain scalable and resilient infrastructure on Microsoft Azure to support production SaaS workloads
  • Define and track service level objectives (SLOs), service level indicators (SLIs), and error budgets to drive reliability decisions
  • Build and maintain comprehensive monitoring, alerting, and observability systems to ensure early detection of issues
  • Develop and maintain CI/CD pipelines using GitHub Actions to enable safe, rapid, and repeatable deployments
  • Lead incident response and on-call rotations, conduct blameless post-incident reviews, and drive follow-up action items to completion
  • Automate operational tasks and eliminate toil through scripting, infrastructure-as-code, and self-healing systems
  • Manage and optimize Azure Kubernetes Service (AKS) clusters, container orchestration, and related networking and storage configurations
  • Collaborate with software engineering teams to embed reliability into application
  • architecture, including capacity planning, load testing, and chaos engineering
  • Maintain and improve infrastructure-as-code using tools such as Terraform, Bicep, or ARM templates
  • Partner cross-functionally with Product, Support, and Quality to reduce friction and accelerate delivery

Qualifications

  • 5+ years of professional experience in site reliability engineering, DevOps, or infrastructure engineering roles
  • Strong hands-on experience with Microsoft Azure cloud services (AKS, Azure SQL, App Services, Virtual Networks, Azure Monitor, etc.)
  • Proficiency in at least one programming or scripting language (Python, Go, Bash, PowerShell, or C#)
  • Experience designing and managing CI/CD pipelines using GitHub Actions, Azure DevOps, or equivalent
  • Hands-on experience with containerization and orchestration technologies (Docker, Kubernetes)
  • Demonstrated experience with infrastructure-as-code tools (e.g. Bicep + ARM templates)
  • Strong understanding of networking fundamentals, DNS, load balancing, and TLS/SSL management
  • Experience with monitoring and observability platforms (Azure Monitor, Alerts, App Insights, Seq, etc.)
  • Proven track record of managing production incidents, conducting post-mortems, and driving reliability improvements
  • Exceptional analytical, interpersonal, and communication skills

Preferred Qualifications

  • Experience operating SaaS platforms in accounting, financial services, or B2B environments
  • Experience with chaos engineering practices and tools
  • Familiarity with microservices and event-driven architecture patterns
  • Background in capacity planning, performance tuning, and cost optimization on Azure
  • Experience with security hardening, compliance frameworks, or SOC 2 readiness
  • Azure certifications (AZ-104, AZ-400, AZ-500, or equivalent) are a plus

Benefits

  • Competitive salary
  • Health, dental, and vision plans
  • 401(k) Retirement savings plan for US-based employees
  • 100% remote work environment, with occasional travel for in-person company and/or team meetings
  • Unlimited PTO
  • Significant professional development growth opportunities
  • Dynamic and inclusive company culture with real commitment to our values

Similar Jobs

Explore other opportunities that match your interests

Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Mid-Senior level

chatgpt jobs

United State

AWS Cloud Engineer

Devops
3h ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Mid-Senior level

Qualified Recruiter, LLC

United State

Network Security Engineer

Devops
10h ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Entry level

sentrilite

United State

Subscribe our newsletter

New Things Will Always Update Regularly