Current SECRET Security Clearance Required - On-Site at Armold AFB, TN (Tullahoma)
PLEASE DO NOT SUBMIT unless you have this SECRET Security Clearance!
Job Description:
We are seeking a highly skilled and experienced High Performance Computing (HPC) expert to join our dynamic team. The ideal candidate will have a minimum of 6-10 years of experience in HPC, with a Bachelor's degree in Computer Science, Information Technology, or a related field.
Preferred a DoD SECRET Security Clearance.
Must be a U.S. Citizen
Work location: On Site at Armold AFB, TN (Tullahoma)
Responsibilities:
- Design, setup, and maintain large scale Linux clusters for high performance computing applications.
- Troubleshoot and resolve complex technical issues that may arise within the HPC environment.
- Collaborate with cross-functional teams to ensure seamless integration of HPC systems into broader IT infrastructure.
- Develop, implement, and manage policies and procedures for HPC system administration.
- Monitor system performance and make recommendations for improvements or upgrades as needed.
- Stay abreast of the latest developments in HPC technology and apply this knowledge to improve our systems.
- Provide technical support and guidance to less experienced team members.
- Participate in project planning, execution, and post-mortem analysis for HPC initiatives.
- Contribute to the continuous improvement of our IT infrastructure by identifying areas for optimization and implementing solutions.
- Adhere to all company policies and procedures, as well as relevant industry standards and best practices.
Qualifications:
- Minimum 6-10 years of experience in HPC system administration.
- Bachelor's degree in Computer Science, Information Technology, or a related field.
- Strong knowledge of Linux operating systems and large scale cluster management tools (e.g., Slurm, LSF).
- Proficiency in scripting languages such as Python, Perl, or Bash.
- Experience with parallel computing paradigms and high-performance interconnects (e.g., InfiniBand, RoCE).
- Strong problem-solving skills and the ability to work effectively under pressure.
- Excellent communication skills, both written and verbal.
- Ability to work collaboratively in a team environment as well as independently.
- Familiarity with cloud computing platforms (e.g., AWS, Azure) is a plus.
- Proactive approach to learning new technologies and staying current with industry trends.