HPC Systems Administrator

Company: ACL Digital
Location: Clinton Township, OH, 43224
Posted: April 29, 2025

Description:

Job Title: HPC Administrator

Location: Columbus, OH

Duration: 3+ Months

Position Overview:

We are seeking an experienced HPC Administrator to manage, maintain, and optimize our high-performance computing environment. This role covers both the physical and software aspects of our HPC cluster: installing and maintaining software applications, managing operating systems, monitoring hardware, and moving and storing data efficiently between source and destination systems.

Key Responsibilities:

- Install, configure, and maintain software applications and operating systems on HPC clusters.

- Manage the physical infrastructure of the HPC environment, including monitoring, troubleshooting, and performing hardware upgrades and maintenance.

- Provide efficient, secure data storage, including managing storage infrastructure and ensuring reliable data movement between systems.

- Collaborate with researchers, developers, and IT teams to identify and implement solutions that improve HPC performance and reliability.

- Ensure compliance with organizational security standards and best practices for HPC systems.

- Document system configurations, procedures, and policies clearly and comprehensively.

Requirements:

- Bachelor's degree in Computer Science, Engineering, or a related technical field; equivalent experience may be substituted.

- Demonstrated experience managing and maintaining HPC clusters or large-scale computing environments.

- Proficiency in Linux administration and troubleshooting.

- Experience with job scheduling systems (e.g., Slurm, PBS, Torque).

- Familiarity with data storage systems, parallel file systems (e.g., Lustre, GPFS), and data transfer technologies.

- Strong scripting skills (Bash, Python, etc.) for automation and management tasks.

- Excellent problem-solving skills, attention to detail, and the ability to manage multiple tasks simultaneously.
