Cloud and Platform Engineer (Hosting Operations) responsible for managing and supporting Azure GovCloud infrastructure, OpenShift clusters, Oracle database environments, and integration layers. Requires experience with Azure, Linux, Kubernetes, and Oracle databases. Must be able to troubleshoot across infrastructure, platform, and integration layers simultaneously.
Job Description
Cloud & Platform Engineer (Hosting Operations)
Location: Fully Remote (U.S.)
Employment Type: Direct Hire
Compensation: $160,000 - $200,000 depending on experience
Clearance: Public Trust clearance and active PIV card highly desired
About the Role
You will own the hosting and infrastructure operations of a large-scale federal financial system running in Azure Government Cloud. This is not a greenfield build. It is a complex, live production environment that processes real-time financial transactions across multiple federal agency systems every day. Your job is to keep it running, make it better, and help the team gain full operational control during a high-stakes transition period.
The architecture is a hybrid stack: Azure IaaS infrastructure, OpenShift container platform, Oracle databases with Data Guard and replication, enterprise integration services (ESB, SFTP, connections to external financial systems), and Control-M batch job orchestration. You will work across all of these layers. If something breaks at 2 AM between the integration tier and the database, you need to know where to look and how to fix it.
What You Will Do
Infrastructure & Cloud Operations
Manage and support Azure GovCloud infrastructure including VMs, storage, and networking. Maintain system availability and performance against SLA targets. Troubleshoot infrastructure issues that impact application and integration layers.
OpenShift & Platform Operations
Operate and support OpenShift clusters: nodes, pods, configurations. Monitor and troubleshoot containerized services and ESB workloads. Support platform-level changes, deployments, and restarts.
Database & Data Support
Support Oracle database environments including OLTP, standby, and replication. Assist with backup, restore, and data validation during incidents. Enable production debugging through controlled data access.
Integration & File Transfer
Support SFTP and file transfer pipelines through VLTrader and related systems. Troubleshoot integration failures across internal and external systems including FPDS and payroll interfaces. Validate data movement and interface execution.
Batch Processing & Scheduling
Monitor and support Control-M batch job execution. Troubleshoot failed or delayed batch processes. Ensure critical financial processing cycles complete on time.
Monitoring & Incident Response
Use Dynatrace, Splunk, and Azure Monitor to detect and diagnose issues. Participate in incident response and root cause analysis. Coordinate with application and business teams during outages.
Security & Access
Support patching, vulnerability remediation, and compliance activities. Manage access through enterprise tools (AD groups, jump boxes, privileged access). Ensure adherence to least-privilege and separation-of-duties principles.
Change & Release Support
Support infrastructure aspects of deployments and system changes. Coordinate maintenance windows and system restarts. Ensure changes follow proper approval and rollback procedures.
Disaster Recovery
Support DR readiness across Azure regions. Participate in failover testing and validation. Assist in meeting RTO/RPO targets.
What You Bring
- Experience with Azure, specifically GovCloud or enterprise-scale environments
- Strong Linux (RHEL) system administration skills
- Experience operating Kubernetes or OpenShift container platforms in production
- Familiarity with Oracle databases and enterprise data systems
- Track record supporting production systems in a 24/7 environment
- Hands-on experience with Splunk and Dynatrace for monitoring and log analysis
- Ability to troubleshoot across infrastructure, platform, and integration layers simultaneously
What Sets You Apart
- Experience with federal systems or regulated environments (VA, DoD)
- Familiarity with Control-M or enterprise job schedulers
- Experience with secure file transfer systems (SFTP, AS2, VLTrader)
- Understanding of integration architectures: ESB, APIs, batch interfaces
- Experience with DevOps tooling (Jenkins, Helm, Puppet)
- Familiarity with RMF, ATO, or federal security compliance processes
What Success Looks Like
The system stays stable and available during a critical transition period. Issues get identified and resolved quickly with clear ownership. Integrations and batch processes execute reliably. Monitoring gives the team real visibility into system health. And the organization gains confident operational control of a complex, mission-critical environment.