Candidate will be working on a 24x7 team monitoring HPC systems for failures and performing critical interventions to keep platforms operating at the highest efficiency with minimal impact to customers. Candidates will be tasked with isolating hardware failures on failing systems to the component level and performing corrective maintenance and repairs. Candidates will perform specialized retest and reclaim experiments on failed modules to determine root causes and trends for failure while utilizing Linux bash and perl scripts to provide program monitoring capability.
The position is a night shift with hours of 7pm-7am Sunday, Monday, Tuesday and every other Saturday.
This is the perfect entry level position for someone just starting out and looking to gain more experience.
The candidate must have experience with an understanding of the concepts, procedures and guidelines to solve highly complex problems in the maintenance and hardware/software network infrastructure. Experience performing system set-up, experiments and diagnostics to evaluate printed circuit board exchanges, and troubleshoot and make component repairs based on test results. Knowledge and experience of LINUX Operations Systems, Knowledge and experience in electronics component repair. Ability to communicate and work well in an effective team environment. Ability to lift ~50lbs.
A TS/SCI security clearance with polygraph.
An associates degree OR 2 years of similar experience