Senior Hardware Operations Lead

Intelletec United State
Relocation
Apply
AI Summary

We're hiring a senior Hardware Operations Lead to ensure reliability and performance of GPU-based compute infrastructure. This role is hands-on and critical to maintaining uptime across AI workloads. The ideal candidate will have strong hands-on experience with server hardware and data center environments.

Key Highlights
Act as L3 escalation point for server and hardware-related incidents
Troubleshoot and resolve complex hardware issues
Collaborate with engineering teams to maintain performance across GPU clusters
Key Responsibilities
Act as L3 escalation point for server and hardware-related incidents
Troubleshoot and resolve complex hardware issues
Perform break-fix, component replacement, and system validation
Support deployments, upgrades, and system provisioning
Ensure accuracy of hardware assets and configurations across the environment
Collaborate with engineering teams to maintain performance across GPU clusters
Technical Skills Required
Linux systems Server hardware Data center environments GPU infrastructure AI workloads
Benefits & Perks
$100k–$150k + equity
Relocation Support Provided
Nice to Have
Experience supporting GPU infrastructure or AI workloads

Job Description


Location: Multiple (Buffalo, Houston, San Antonio, Abernathy, Barber Lake)

Relocation Support Provided.

Compensation: $100k–$150k + equity


About the Role

We’re hiring a senior Hardware Operations Lead to ensure reliability and performance of GPU-based compute infrastructure. This role is hands-on and critical to maintaining uptime across AI workloads.


What You’ll Do

  • Act as L3 escalation point for server and hardware-related incidents
  • Troubleshoot and resolve complex hardware issues (servers, storage, GPU nodes)
  • Perform break-fix, component replacement, and system validation
  • Support deployments, upgrades, and system provisioning
  • Ensure accuracy of hardware assets and configurations across the environment
  • Collaborate with engineering teams to maintain performance across GPU clusters
  • Participate in on-call rotations supporting 24/7 operations


What We’re Looking For

  • Strong hands-on experience with server hardware and data center environments
  • Experience troubleshooting at BIOS, OS, and hardware levels
  • Familiarity with Linux systems and infrastructure environments
  • Ability to operate as an escalation owner in high-availability environments
  • Experience supporting GPU infrastructure or AI workloads is a strong plus


Similar Jobs

Explore other opportunities that match your interests

AI Integration Consultant

Programming
1h ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Mid-Senior level

Rise Technical

United State

Systems Engineer II - Modeling, Simulation, and Analysis

Programming
1h ago

Premium Job

Sign up is free! Login or Sign up to view full details.

•••••• •••••• ••••••
Job Type ••••••
Experience Level ••••••

fetchjobs.co

United State

Cyber Talent Management Organization (CTMO) Liaison Program Manager

Programming
1h ago

Premium Job

Sign up is free! Login or Sign up to view full details.

•••••• •••••• ••••••
Job Type ••••••
Experience Level ••••••

joint activities llp

United State

Subscribe our newsletter

New Things Will Always Update Regularly