Cloud Infrastructure and SRE Engineer with GCP Experience
We are seeking a Cloud Infrastructure and SRE Engineer with GCP experience to build and operate shared infrastructure and paved paths that help product teams deliver securely, reliably, and quickly. The ideal candidate will have experience operating production cloud platforms and services with an SRE mindset. This is a long-term position with opportunities for career growth.
Job Description
Title: Cloud Infrastructure & SRE Engineer with GCP Experience (only W2 Position - No C2C Accepted)
Description: STG is an SEI CMMI Level 5 company with several Fortune 500 and State Government clients. STG has an opening for a Cloud Infrastructure & SRE Engineer with GCP Experience.
Please note that this project assignment is with our own direct clients. We do not go through any vendors. STG only does business with direct end clients. This is expected to be a long-term position. STG will provide immigration and permanent residency sponsorship assistance to those candidates who need it.
Position Description:
Cloud Infrastructure & SRE Engineer

# Overview

Platform Engineering builds and operates shared infrastructure and paved paths that help product teams deliver securely, reliably, and quickly. This role leans toward cloud infrastructure, DevOps, and Site Reliability Engineering (SRE), with strong software development skills.

## What you'll do

- Design, build, and operate cloud infrastructure and platform capabilities (networking, compute, Kubernetes, CI/CD, secrets, certificates, identity).
- Define and improve reliability using service-level indicators (SLIs), service-level objectives (SLOs), and error budgets.
- Implement observability (metrics, logs, traces) with actionable alerting focused on user impact.
- Create self-service workflows and automation (infrastructure as code, GitOps, build/release pipelines) that reduce toil.
- Improve security and compliance through least-privilege access, secure defaults, policy-as-code, and continuous hardening.
- Participate in on-call rotation, incident response, and post-incident reviews; drive systemic fixes and runbook quality.
- Partner with application teams to improve deployability, resilience, and cost efficiency (capacity planning, autoscaling, graceful degradation).

## What we're looking for

### Required

- Experience operating production cloud platforms and services (e.g., GCP/AWS/Azure) with an SRE mindset.
- Strong fundamentals in Linux, networking, distributed systems, and debugging complex production issues.
- Proficiency with infrastructure as code and automation (e.g., Terraform, Helm/Kustomize, GitOps tooling).
- Experience with containers and orchestration (Docker, Kubernetes) and modern CI/CD.
- Programming and scripting ability (e.g., Go, Python, Java, TypeScript) to build tooling and automate workflows.
- Clear communication, effective incident leadership, and a customer-focused approach to platform work.

### Preferred

- Experience defining SLIs/SLOs and implementing SLO-based alerting and dashboards.
- Observability platform experience (e.g., Prometheus/Grafana, OpenTelemetry, centralized logging).
- Policy-as-code and supply chain security (e.g., OPA/Rego, SLSA concepts, SBOMs, artifact signing).
- Experience building golden paths (container images, templates, reference architectures, paved pipelines) adopted by multiple teams.
- Cost optimization experience (FinOps practices, capacity forecasting, right-sizing, multi-tenant platform controls).

## How we work

- Automate first: eliminate repeatable manual work; measure and reduce toil.
- Reliability is a feature: design for failure with timeouts, retries with jitter, idempotency, and graceful degradation.
- Small, safe changes: incremental delivery, clear rollback strategies, and continuous improvement.
- Engineering excellence: design reviews, blameless postmortems, and strong documentation/runbooks.

## What success looks like

- Platform capabilities are easy to adopt, well-documented, and measurably reduce lead time for change.
- Reliability improves over time (SLO attainment, reduced incident frequency/severity, faster MTTR).
- Security posture improves via secure-by-default patterns and automated controls.
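For readers less familiar with the SLO and error-budget vocabulary used in this description, the arithmetic can be sketched roughly as follows. This is a minimal illustration only: the function name, the 99.9% target, and the request counts are assumptions chosen for the example, not figures or tooling from this role.

```python
def error_budget(slo_target: float, total_requests: int, failed_requests: int) -> dict:
    """Summarize error-budget usage for a request-based SLO.

    slo_target is the fraction of requests that must succeed (e.g. 0.999).
    All names and numbers here are illustrative assumptions.
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total_requests < 0 or not 0 <= failed_requests <= total_requests:
        raise ValueError("request counts are inconsistent")

    # The budget is the number of failures the SLO tolerates in this window.
    allowed_failures = (1.0 - slo_target) * total_requests
    # The SLI is the observed success ratio (defined as 1.0 with no traffic).
    sli = 1.0 - failed_requests / total_requests if total_requests else 1.0
    # Fraction of the budget already spent; values above 1.0 mean the SLO is missed.
    consumed = failed_requests / allowed_failures if allowed_failures else 0.0

    return {
        "sli": sli,
        "allowed_failures": allowed_failures,
        "budget_consumed": consumed,
        "slo_met": sli >= slo_target,
    }


if __name__ == "__main__":
    # Hypothetical window: 1,000,000 requests, 250 failures, 99.9% target.
    print(error_budget(0.999, 1_000_000, 250))
```

In this hypothetical window the target tolerates about 1,000 failed requests, so 250 failures would use roughly a quarter of the budget.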
Skills Required:
Cloud Infrastructure, Python, GCP, Platform Support, Kubernetes
1. Cloud Infrastructure Expectation: A candidate has provisioned and operated production-grade infrastructure on a major cloud provider. For example, they designed a multi-region GCP network topology using VPCs, subnets, firewall rules, and Cloud NAT, managed with Terraform and deployed via a GitOps pipeline. They understand networking primitives, IAM boundaries, and compute options, and can explain tradeoffs between managed services vs. self-hosted.
2. Python Expectation: A candidate has written production Python tooling or automation. For example, a script that queries the GCP Asset Inventory API to identify over-provisioned IAM bindings, generates a report, and opens a Jira ticket for remediation. Code is structured, testable (pytest), and handles errors and retries gracefully. Not just glue scripts, but maintainable tools used by a team.
3. GCP Expectation: A candidate has hands-on experience operating GCP services in a real platform context. For example, running workloads on Cloud Run, using Workload Identity for pod-level IAM, configuring policies, managing secrets in Secret Manager, and setting up VPC Service Controls. They can reason about GCP-specific reliability and security patterns, not just surface-level console familiarity.
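As a rough illustration of the kind of structured, testable tooling described in the Python expectation above, the sketch below flags broad role bindings and wraps a flaky call in retries with jitter. It is a minimal sketch under stated assumptions: the policy is assumed to be already fetched as a dict in the standard IAM policy JSON shape (`bindings`, `role`, `members`), `BROAD_ROLES`, `find_broad_bindings`, and `call_with_retries` are hypothetical names, and the Asset Inventory query and Jira ticket steps are deliberately left out.

```python
import random
import time
from typing import Callable, TypeVar

T = TypeVar("T")

# Assumption for this sketch: these basic roles count as over-provisioned.
BROAD_ROLES = frozenset({"roles/owner", "roles/editor"})


def find_broad_bindings(policy: dict, broad_roles=BROAD_ROLES) -> list[tuple[str, str]]:
    """Return sorted (role, member) pairs whose role is in broad_roles."""
    findings = []
    for binding in policy.get("bindings", []):
        role = binding.get("role", "")
        if role in broad_roles:
            for member in binding.get("members", []):
                findings.append((role, member))
    return sorted(findings)


def call_with_retries(
    fn: Callable[[], T],
    attempts: int = 4,
    base_delay: float = 0.5,
    max_delay: float = 8.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call fn, retrying on ConnectionError with exponential backoff and full jitter."""
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(attempts):
        try:
            return fn()
        except ConnectionError:
            if attempt == attempts - 1:
                raise  # out of attempts: surface the last error to the caller
            # Full jitter: sleep a random time up to the capped exponential delay.
            cap = min(max_delay, base_delay * 2 ** attempt)
            sleep(random.uniform(0, cap))
    raise AssertionError("unreachable")


if __name__ == "__main__":
    # Hypothetical policy data for illustration only.
    sample = {
        "bindings": [
            {"role": "roles/owner", "members": ["user:alice@example.com"]},
            {"role": "roles/viewer", "members": ["user:bob@example.com"]},
        ]
    }
    for role, member in call_with_retries(lambda: find_broad_bindings(sample)):
        print(f"{member} holds {role}")
```

Keeping the policy analysis as a pure function and injecting `sleep` into the retry helper is one way to make both pieces easy to cover with pytest without network access or real delays.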
Education Required:
- Bachelor's degree or equivalent qualification in computer science, engineering or related disciplines
The Cloud Infrastructure & SRE Engineer with GCP Experience position is based in Dearborn, MI. It is a great opportunity to work in a corporate environment that supports personal career growth.
Resume Submittal Instructions: Interested/qualified candidates should email their Word-formatted resumes to Vasavi Konda - vasavi.konda(.@)stgit.com and/or contact @(Two-Four-Eight) Seven-One-Two - Six-Seven-Two-Five (@248.712.6725). In the subject line of the email please include: First and Last Name: Cloud Infrastructure & SRE Engineer with GCP Experience.
For more information about STG, please visit us at www.stgit.com.
Sincerely,
Vasavi Konda | Recruiting Specialist
"Opportunities don't happen, you create them."
Systems Technology Group (STG)
3001 W. Big Beaver Road, Suite 500
Troy, Michigan 48084
Phone: @(Two-Four-Eight) Seven-One-Two - Six-Seven-Two-Five: @248.712.6725(O)
Email: vasavi.konda(.@)stgit.com