ADDITIONAL JOB DETAILS:
Provides daily support for the HPC environment with a focus on Red Hat Enterprise, Ubuntu, and other *nix operations both on-premises and within cloud computing platforms.
Develops and updates procedures and guidelines to install, patch, configure, customize, troubleshoot, upgrade, integrate, and maintain Red Hat Enterprise, Ubuntu, other *nix operating systems, and related software.
Researches, analyzes and resolves problems, providing root-cause analysis for Red Hat Enterprise, Ubuntu, and other *nix operating systems.
Proactively seeks information and utilizes analytical and creative problem-solving skills along with standard processes and technologies resulting in secure use of systems, applications, and infrastructure.
Demonstrates quality service and accountability in the process of resolving requests, supporting daily operations, and ensuring system stability that results in accurate, timely, and efficient solutions and data as evidenced by meeting customer needs.
Learns and keeps current with HPC technologies, such as backups, job scheduling, and parallel file system management.
Manages physical hardware in the on-premises datacenter.
Requires occasional on-site presence, including lifting and reaching into tight spaces.
Must be able to lift 1U servers (approx. 50 lbs) for installation and servicing.
MINIMUM QUALIFICATIONS:
Seven years of operating systems analysis/administration experience OR a bachelor's degree and five years of operating systems analysis/administration experience.
PREFERRED QUALIFICATIONS:
Bachelor's degree with a scientific computing or data science emphasis.
Three or more years of HPC systems experience, including:
- Extensive command-line systems administration/use.
- LDAP and user and group account administration.
- Building (from source code), installing, maintaining and troubleshooting application-level Linux and scientific software.
- Installation of Linux operating system on a variety of hardware platforms.
- Use of various Linux package management systems.
- Linux file system management and networking stack.
Experience writing and debugging Python and Bash scripts for system administration.
Experience with automation tools such as Ansible.
Experience with log analytic tools such as Splunk.
Experience with applications such as R, NumPy/pandas, and MATLAB.
Experience with InfiniBand networking.
Experience using virtualization software in a Linux environment.
Excellent interpersonal, oral and written communication skills.
Exact compensation may vary based on several factors, including skills, experience, and education.
Benefit packages for this role will start on the 31st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401(k) retirement account access with employer matching. Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.