HPC Linux System Administrator
Houston, Texas, United States
Are you passionate about human space exploration, understanding the origins of the universe, and working with a dedicated and diverse team to make a difference? If you are, we need you!
We need your talent, teamwork, and energy to help us achieve great things that inspire people all over the globe. We need you to bring creative ideas and diverse backgrounds to help us envision, shape, and deliver systems that will enable the exploration of space while benefiting people here on Earth. We are excited about what we do, and we need you on our team as we take on exciting challenges for NASA’s pursuits in deep space exploration. As NASA’s largest engineering solutions provider, we work together with NASA at centers across the United States.
We have an exciting opportunity for an HPC Linux System Administrator to join the team with Oceaneering, a teammate company. This position will work on the Amentum JETS II contract and will support the Flight Sciences Laboratory (FSL).
The FSL is one of JSC's primary computing labs and hosts a wide variety of analyses, which support almost all of the major programs at NASA including the International Space Station (ISS), Orion, Space Launch System (SLS), Commercial Crew Program, Lunar Gateway, Human Landing System (HLS), and many others. The FSL systems currently comprise over 700 machines, 26,000 cores, and over 10 PB of storage, which serve more than 1,000 users.
The HPC Linux System Administrator will:
- Work with a team of System Administrators to build and maintain all FSL services.
- Perform High Performance Computing (HPC) administration.
- Perform high-end Linux workstation administration.
- Support high-speed networking.
- Perform high-speed parallel filesystem administration.
- Oversee high-speed parallel filesystem administration and job scheduler administration.
- Investigate system problems.
- Proactively monitor system health.
- Work with FSL users to make sure they can support the NASA human spaceflight mission.
Requisition Qualifications:
This position has been posted at multiple levels. Depending on the candidate's experience, requirements, and business needs, we reserve the right to consider candidates at any level for which this position has been advertised.
- Typically requires a bachelor’s degree or equivalent certification in a related area and a minimum of 5 years of experience in the field or in a related area.
- Experience in the following areas is needed for this position:
  - Linux system administration
  - HPC job scheduler administration
  - System configuration management
  - High-speed parallel file storage administration
  - Monitoring and alerting
- Demonstrated problem solving, planning, and communication skills
- Ability to work in a team environment
Requisition Preferences:
- Strong skills in administration of parallel filesystems like Lustre or GPFS and strong skills administering the SLURM job scheduling system
- Familiarity with the following is preferred but not required for the position:
  - Red Hat-based systems
  - Lustre high-speed parallel filesystems
  - InfiniBand
  - Provisioners (xCAT, Warewulf)
  - Ansible / Foreman
  - SLURM resource manager
  - Spack package manager
  - Log consolidation and monitoring
  - Git/GitLab and software development (CI/CD)
  - Johnson Space Center campus network
  - NASA security mechanisms (security plans, POAMs, ATOs, Risk Assessments)
Why Join Our Team? In addition to exciting career opportunities, we also have:
- Excellent personal and professional career growth
- 9/80 work schedule (every other Friday off), when applicable
- Onsite cafeteria (breakfast & lunch)
- Much, much more!
For more information on our partnership with NASA at Johnson Space Center (JSC), please visit www.wehavespaceforyou.com
- Proof of U.S. Citizenship or US Permanent Residency may be a requirement for this position.
- Must be able to complete a U.S. government background investigation.
- Management has the prerogative to select at any level for which the position is advertised.
Essential Functions
Work Environment: Generally, an office environment, but can involve inside or outside work depending on task.
Physical Requirements: Work may involve sitting or standing for extended periods (90% of time). May require lifting and carrying up to 25 lbs. (5% of time).
Equipment and Machines: Standard office equipment (PC, telephone, printer, etc.).
Attendance: Regular attendance in accordance with established work schedule is critical. Ability to work outside normal schedule and adjust schedule to meet peak periods and surge requirements.
Other Essential Functions: Professional behavior that enhances productivity and promotes teamwork and cooperation. Grooming and dress must be appropriate for the position and must not impose a safety risk/hazard to the employee or others.