We're hiring a senior Hardware Operations Lead to ensure reliability and performance of GPU-based compute infrastructure. This role is hands-on and critical to maintaining uptime across AI workloads. The ideal candidate will have strong hands-on experience with server hardware and data center environments.
Key Highlights
Key Responsibilities
Technical Skills Required
Benefits & Perks
Nice to Have
Job Description
Location: Multiple (Buffalo, Houston, San Antonio, Abernathy, Barber Lake)
Relocation Support Provided.
Compensation: $100k–$150k + equity
About the Role
We’re hiring a senior Hardware Operations Lead to ensure reliability and performance of GPU-based compute infrastructure. This role is hands-on and critical to maintaining uptime across AI workloads.
Looking to advance your Development & Programming career with relocation support? Explore Development & Programming Jobs with Relocation Packages that include comprehensive packages to help you move and settle in your new role.
What You’ll Do
- Act as L3 escalation point for server and hardware-related incidents
- Troubleshoot and resolve complex hardware issues (servers, storage, GPU nodes)
- Perform break-fix, component replacement, and system validation
- Support deployments, upgrades, and system provisioning
- Ensure accuracy of hardware assets and configurations across the environment
- Collaborate with engineering teams to maintain performance across GPU clusters
- Participate in on-call rotations supporting 24/7 operations
Discover our full range of relocation jobs with comprehensive support packages to help you relocate and settle in your new location.
Interested in relocating to United State? Check out our comprehensive Relocation Jobs in United State page with detailed relocation packages and benefits.
What We’re Looking For
- Strong hands-on experience with server hardware and data center environments
- Experience troubleshooting at BIOS, OS, and hardware levels
- Familiarity with Linux systems and infrastructure environments
- Ability to operate as an escalation owner in high-availability environments
- Experience supporting GPU infrastructure or AI workloads is a strong plus
Similar Jobs
Explore other opportunities that match your interests
Rise Technical
Systems Engineer II - Modeling, Simulation, and Analysis