We are seeking a Head of Infrastructure to own end-to-end infrastructure, freeing software engineers to ship features rapidly while ensuring reliability, performance, and security. This role is responsible for designing, building, and operating Azure cloud infrastructure. The ideal candidate will have 8+ years of experience operating production infrastructure at scale.
Job Description
We are working with a new, fully remote market maker in prediction markets. It is funded by a tier 1 operator to build next-generation trading systems and draws on that operator's extensive industry IP and resources. The founding team is made up of industry veterans with a vision for the future. They are building a lean, fully remote, senior team focused on high-performance systems, pragmatic engineering, and rapid iteration.
They are looking for a Head of Infrastructure to own end-to-end infrastructure, freeing software engineers to ship features rapidly while ensuring reliability, performance, and security. This is the first dedicated infrastructure leadership hire and a force multiplier for the entire engineering team.
Responsibilities:
- Design, build, and operate Azure cloud infrastructure supporting low-latency integrations with operator systems and external exchanges.
- Run Kubernetes (cluster sizing, autoscaling, multi-environment deployments).
- Stand up and optimize Kafka (event streaming), Redis (caching), and Postgres (OLTP).
- Implement CI/CD, secrets management, environment isolation, and infrastructure-as-code (Terraform/Bicep).
- Own observability (Prometheus, Grafana, OpenTelemetry), incident response, SLOs/SLIs, and on-call practices.
- Design and manage secure cloud networking (VNets, peering, private endpoints, DNS, firewall/NSGs) and connectivity with the operator.
- Drive security and compliance across identity, segmentation, secrets, and OS baselines in Azure.
- Lead performance engineering for market data ingestion and order routing.
- Partner with engineering on service boundaries, data contracts, and platform primitives.
- Manage cost, capacity, reliability, and availability while scaling the infrastructure function.
Requirements:
- 8+ years operating production infrastructure at scale.
- Deep cloud expertise (Azure preferred; AWS/GCP backgrounds welcome).
- Hands-on production experience with Kubernetes, Kafka, Redis, and Postgres.
- Strong networking, security, identity (e.g., Azure AD, now Microsoft Entra ID), and infrastructure-as-code skills.
- Proven ability to build reliable, observable platforms with strong CI/CD; experience with Prometheus, Grafana, OpenTelemetry, or similar.
- Cloud networking fundamentals (VNet design, private endpoints, DNS, firewall rules).
- Performance tuning for latency- and throughput-sensitive systems.
- Strong collaboration skills translating product/trading needs into platform capabilities.
- Active, hands-on use of AI tooling (infra-as-code generation, log analysis, automation, incident triage).
Nice to Have:
- Experience in trading, sports betting, exchanges, financial markets, or other real-time systems.
- Event-driven architecture or stream processing experience.
- SRE leadership or reliability program experience.