S

DevOps Engineer (L4)

sundayy • United State
Remote
Apply
AI Summary

Join our Cloud Platform team as a DevOps Engineer (L4) to design, build, and maintain shared cloud infrastructure. Collaborate with cross-functional teams to implement infrastructure improvements and uphold platform standards. Influence technical decisions and promote platform standards.

Key Highlights
Design, deploy, and operate Kubernetes clusters
Enhance AWS infrastructure and network architecture
Develop and maintain infrastructure-as-code and GitOps workflows
Key Responsibilities
Design, deploy, and operate a fleet of Kubernetes clusters
Enhance AWS infrastructure and network architecture
Develop and maintain infrastructure-as-code and GitOps workflows
Technical Skills Required
Kubernetes AWS Terraform
Benefits & Perks
Competitive compensation package
Retirement plans
Comprehensive health coverage

Job Description


About The Company

Upstart is a leading AI-driven lending marketplace dedicated to transforming the borrowing experience for all Americans. Our mission is to significantly reduce the cost and complexity associated with obtaining credit by leveraging advanced artificial intelligence, data analytics, and innovative technology. We partner with banks and credit unions to expand access to affordable credit, utilizing a platform that performs over one million predictions per borrower based on more than 1,800 signals. This approach enables smarter, fairer lending decisions that benefit millions of customers, helping them achieve financial progress with clarity and confidence. As a digital-first organization, Upstart promotes flexibility and inclusivity, empowering employees to work remotely while fostering a culture of collaboration, innovation, and impact. Our commitment to diversity, equity, and inclusion is integral to our success as we work toward a future where credit is accessible to everyone.

About The Role

We are seeking a highly skilled DevOps Engineer (L4) to join our Cloud Platform team within the Reliability organization. In this role, you will be instrumental in designing, building, and maintaining our shared cloud infrastructure that supports all product and machine learning workloads. Your expertise will help us scale our platform to meet increasing demands while ensuring high availability, reliability, security, and cost efficiency. You will collaborate closely with Site Reliability Engineering (SRE), Delivery, Information Security, and Product/ML teams to implement infrastructure improvements, streamline developer workflows, and uphold platform standards. This position offers an exciting opportunity to influence the architecture and operational excellence of a critical infrastructure that powers the entire organization. The role is fully remote, providing flexibility to work from anywhere within the United States or Canada, with periodic in-person collaboration sessions.

Qualifications

  • Bachelor's degree in Computer Science, Engineering, Mathematics, or a related field, or equivalent practical experience
  • Minimum of 3+ years of professional experience operating Kubernetes in production environments
  • Proficiency with AWS infrastructure, including VPC design, networking, and IAM
  • Hands-on experience with infrastructure-as-code tools such as Terraform or AWS CDK
  • Experience implementing GitOps workflows using tools like ArgoCD or similar
  • Strong understanding of cluster networking, storage, and RBAC in Kubernetes
  • Ability to influence technical decisions across teams and promote platform standards

Responsibilities

  • Design, deploy, and operate a fleet of Kubernetes (EKS) clusters across production, staging, and ephemeral environments, ensuring high reliability and availability
  • Enhance AWS infrastructure and network architecture, including VPCs, subnets, and IAM policies, to support scalable multi-team workloads
  • Develop and maintain infrastructure-as-code and GitOps workflows using Terraform, CDK, and ArgoCD
  • Define and monitor Service Level Objectives (SLOs), analyze incidents, and implement systemic fixes to improve platform reliability and performance
  • Participate in on-call rotations, lead incident response efforts, and conduct post-incident reviews to drive continuous improvement
  • Collaborate with cross-functional teams to implement high-impact infrastructure changes and establish platform standards
  • Simplify platform usage, reduce operational toil, and enable faster development cycles for product and ML teams
  • Optimize resource utilization and contribute to cost-efficiency initiatives across cloud infrastructure

Benefits

  • Competitive compensation package, including base salary, bonuses, and quarterly equity grants
  • Retirement plans with company matching contributions (e.g., 401(k) or Group Retirement Savings Plan)
  • Comprehensive health coverage, including medical, dental, and vision plans
  • Health Savings Account (HSA) contributions for eligible employees
  • Paid time off, sick leave, and holidays aligned with local regulations
  • Paid parental and family leave to support major life events
  • Employee Stock Purchase Plan (ESPP) with discounted stock options (US only)
  • Wellness resources, mental health support via Employee Assistance Program (EAP), and financial planning tools
  • Onsite perks such as catered meals and stocked micro-kitchens at select office locations
  • Flexible remote work arrangements with periodic in-person collaboration sessions

Equal Opportunity

Upstart is an Equal Opportunity Employer. We are committed to fostering an inclusive environment where all employees and applicants are treated with respect and dignity. We do not discriminate based on race, color, religion, sex, national origin, age, disability, sexual orientation, gender identity, or any other protected characteristic. We welcome candidates from diverse backgrounds and are dedicated to providing reasonable accommodations throughout the hiring process.

Similar Jobs

Explore other opportunities that match your interests

Senior Site Reliability Engineer - Government Cloud

Devops
•
14h ago

Premium Job

Sign up is free! Login or Sign up to view full details.

•••••• •••••• ••••••
Job Type ••••••
Experience Level ••••••

Tines

United State
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Mid-Senior level

eitacies inc.

United State

Computer Vision Engineer (AI)

Devops
•
1d ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Mid-Senior level

cura label technologies

United State

Subscribe our newsletter

New Things Will Always Update Regularly