Cloud and Platform Engineer (Hosting Operations)

elios talent • United States
Remote
AI Summary

The Cloud and Platform Engineer (Hosting Operations) manages and supports Azure GovCloud infrastructure, OpenShift clusters, Oracle database environments, and integration layers. The role requires experience with Azure, Linux, Kubernetes, and Oracle databases, plus the ability to troubleshoot across infrastructure, platform, and integration layers simultaneously.

Key Responsibilities
Manage and support Azure GovCloud infrastructure
Operate and support OpenShift clusters
Support Oracle database environments
Troubleshoot infrastructure issues
Monitor and troubleshoot containerized services
Support platform-level changes
Deploy and restart systems
Monitor and support Control-M batch job execution
Troubleshoot failed or delayed batch processes
Ensure critical financial processing cycles complete on time
Technical Skills Required
Azure, Linux, Kubernetes, OpenShift, Oracle databases, Splunk, Dynatrace, Control-M, ESB, SFTP, APIs, batch interfaces
Benefits & Perks
$160,000 - $200,000 annual salary
Fully remote work
Nice to Have
Public Trust clearance and active PIV card (highly desired)
Experience with federal systems or regulated environments
Familiarity with Control-M or enterprise job schedulers
Experience with secure file transfer systems
Understanding of integration architectures

Job Description


Cloud & Platform Engineer (Hosting Operations)


Location: Fully Remote (U.S.)

Employment Type: Direct Hire

Compensation: $160,000 - $200,000 depending on experience

Clearance: Public Trust clearance and active PIV card highly desired


About the Role

You will own the hosting and infrastructure operations of a large-scale federal financial system running in Azure Government Cloud. This is not a greenfield build. It is a complex, live production environment that processes real-time financial transactions across multiple federal agency systems every day. Your job is to keep it running, make it better, and help the team gain full operational control during a high-stakes transition period.


The architecture is a hybrid stack: Azure IaaS infrastructure, OpenShift container platform, Oracle databases with Data Guard and replication, enterprise integration services (ESB, SFTP, connections to external financial systems), and Control-M batch job orchestration. You will work across all of these layers. If something breaks at 2 AM between the integration tier and the database, you need to know where to look and how to fix it.


What You Will Do

Infrastructure & Cloud Operations

Manage and support Azure GovCloud infrastructure including VMs, storage, and networking. Maintain system availability and performance against SLA targets. Troubleshoot infrastructure issues that impact application and integration layers.

OpenShift & Platform Operations

Operate and support OpenShift clusters: nodes, pods, configurations. Monitor and troubleshoot containerized services and ESB workloads. Support platform-level changes, deployments, and restarts.

Database & Data Support

Support Oracle database environments including OLTP, standby, and replication. Assist with backup, restore, and data validation during incidents. Enable production debugging through controlled data access.

Integration & File Transfer

Support SFTP and file transfer pipelines through VLTrader and related systems. Troubleshoot integration failures across internal and external systems including FPDS and payroll interfaces. Validate data movement and interface execution.

Batch Processing & Scheduling

Monitor and support Control-M batch job execution. Troubleshoot failed or delayed batch processes. Ensure critical financial processing cycles complete on time.

Monitoring & Incident Response

Use Dynatrace, Splunk, and Azure Monitor to detect and diagnose issues. Participate in incident response and root cause analysis. Coordinate with application and business teams during outages.

Security & Access

Support patching, vulnerability remediation, and compliance activities. Manage access through enterprise tools (AD groups, jump boxes, privileged access). Ensure adherence to least-privilege and separation-of-duties principles.

Change & Release Support

Support infrastructure aspects of deployments and system changes. Coordinate maintenance windows and system restarts. Ensure changes follow proper approval and rollback procedures.

Disaster Recovery

Support DR readiness across Azure regions. Participate in failover testing and validation. Assist in maintaining RTO/RPO objectives.


What You Bring

  • Experience with Azure, specifically GovCloud or enterprise-scale environments
  • Strong Linux (RHEL) system administration skills
  • Experience operating Kubernetes or OpenShift container platforms in production
  • Familiarity with Oracle databases and enterprise data systems
  • Track record supporting production systems in a 24/7 environment
  • Hands-on experience with Splunk and Dynatrace for monitoring and log analysis
  • Ability to troubleshoot across infrastructure, platform, and integration layers simultaneously


What Sets You Apart

  • Experience with federal systems or regulated environments (e.g., VA, DoD)
  • Familiarity with Control-M or enterprise job schedulers
  • Experience with secure file transfer systems (SFTP, AS2, VLTrader)
  • Understanding of integration architectures: ESB, APIs, batch interfaces
  • Experience with DevOps tooling (Jenkins, Helm, Puppet)
  • Familiarity with RMF, ATO, or federal security compliance processes


What Success Looks Like

The system stays stable and available during a critical transition period. Issues get identified and resolved quickly with clear ownership. Integrations and batch processes execute reliably. Monitoring gives the team real visibility into system health. And the organization gains confident operational control of a complex, mission-critical environment.

