Join RedLine Performance Solutions as an HPC System Administrator to provide operational support for HPC clusters in Dayton, OH. This role requires strong technical skills and excellent communication abilities. The ideal candidate will have experience with Linux systems administration, HPC cluster system administration, and scripting languages.
Job Description
RedLine Performance Solutions (RedLine) has been in the HPC solutions engineering services business for over 26 years and is consistently determined to keep the "bar of excellence" quite high for new hires. This enables RedLine to accomplish what other firms cannot and promotes a high level of staff retention. We offer services ranging from full life cycle HPC systems engineering to remote managed services to HPC program analysis. We are looking for an HPC System Administrator to join us.
The HPC System Administrator will provide operational support for HPC clusters located in Dayton, OH. Operations run 24x7, so there will be a rotational on-call requirement. In addition to operationally supporting the on-site HPC environment, the HPC System Administrator will engage with the customer and participate in the evolution and maintenance of the technical infrastructure. The administrator will be responsible for relaying insights from this customer engagement to the RedLine Program Manager and working together to translate customer needs into actionable project tasks.
IMPORTANT:
The position is at the customer site in Dayton, OH. As such, remote work is not viable for this role, but relocation may be considered. The administrator functions as the lead point of contact for day-to-day operations and real-time problem resolution. Active DoD Top Secret security clearance and relevant technical certifications (e.g., Linux+, Security+) are mandatory requirements for this position. This full-time (W-2) position offers a full benefits package including paid time off, 401k match, and health care benefits.
Required Skills:
- 7 or more years of Linux systems administration, preferably in a Red Hat and/or Rocky environment
- Strong knowledge of TCP/IP networking
- 5 or more years of HPC cluster system administration experience, preferably with Dell clusters
- Strong experience in Bash, Perl, and Python scripting in a version-controlled environment using Git
- Experience with job scheduling software (e.g., Slurm, PBS)
- Experience with cluster automation tools (e.g., xCAT, HPCM, Bright Cluster Manager)
- Experience with parallel filesystems (e.g., Lustre)
- Experience with high-speed interconnects (e.g., InfiniBand)
- Strong verbal and written communication skills, with the ability to coordinate among multiple team members in remote locations across several disparate projects
- Strong organizational skills
Nice to Have:
- Experience with systems engineering in addition to systems administration
- Red Hat Certification (e.g., RHCSA, RHCE)
- Server automation experience (e.g., Puppet, Foreman, Ansible)
- Experience with MPI technologies
- Experience with Warewulf cluster management and provisioning
- Experience with Weka parallel file systems
- Optimization experience with GPU-based HPC clusters