MRI Technologies logo

HPC Linux System Administrator

MRI Technologies
25 days ago
Full-time
On-site
Houston, Texas, United States
Administrative & Support

MRI Technologies has an exciting opportunity for an HPC Linux System Administrator on the JETS II contract at NASA Johnson Space Center. You will support the Flight Sciences Laboratory (FSL), one of JSC's primary computing facilities-over 700 machines, 26,000 cores, and 10+ petabytes of storage serving more than 1,000 users. The analyses running on FSL infrastructure support nearly every major NASA program, including International Space Station (ISS), Orion, Space Launch System (SLS), Commercial Crew, Human Landing System (HLS), Moon Base, and many others.

You will work with a team of HPC System Administrators to build and maintain all FSL services. This will include High Performance Compute (HPC) administration, high-end Linux workstation administration, high-speed networking, and high-speed parallel filesystem administration. A core part of the role is bringing a deep understanding of container technologies and how best they can be used in an HPC environment-FSL uses containers so users can bring their own environments (often older OS versions or older package stacks) into the cluster. Day-to-day tasks include investigating system problems, proactively monitoring system health, and working with FSL users to make sure they can support the NASA human spaceflight mission.

What We Are Looking For

Requirements:

  • Typically requires a bachelor's degree or equivalent certification in a related area, with a minimum of 5 years of experience in the field or in a related area
  • Linux system administration
  • System configuration management
  • Container-based HPC workflows
  • Complex CI/CD workflows
  • HPC job scheduler administration
  • Experience with large-scale system administration, with 2 of those years in HPC administration
  • Demonstrated problem-solving, planning, and communication skills
  • Ability to work effectively in a team environment

Preferences:

  • RedHat-based systems
  • Container technology such as Docker, Podman, and Apptainer
  • Lustre high-speed parallel filesystems
  • InfiniBand high-speed networking
  • Ansible / Foreman for configuration management
  • SLURM resource manager
  • SPACK software manager
  • Log consolidation and monitoring
  • Git/GitLab and software development (CI/CD)
  • Johnson Space Center campus network
  • NASA security mechanisms (security plans, POAMs, ATOs, Risk Assessments)

This position has been posted at multiple levels. Depending on your experience and business needs, we may consider candidates at any level for which the position is advertised.

Benefits and Perks

We offer a comprehensive benefits package including medical, dental, vision, company paid life and disability insurance, paid time off, and 401(k). You'll also enjoy a 9/80 work schedule (every other Friday off, when applicable), and the chance to work in one of JSC's most critical computing environments supporting human spaceflight.

Proof of U.S. Citizenship or U.S. Permanent Residency is a requirement for this position.

MRI Technologies is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or protected veteran status.

As we are a Federal Contractor, most positions require the employee to obtain and maintain a U.S. Government background investigation. MRI also completes a pre-screening background check for anyone offered employment.