HPC System Admin

  • PTS00005N
  • On Site
  • Bristol, England, United Kingdom
  • Corsham, England, United Kingdom
  • Full time
We'll inspire and empower you to deliver your best work so you can evolve, grow and succeed - today and into tomorrow. With more than 55,000 people in 40+ countries, working here offers an exciting range of opportunities to develop your career within a supportive and diverse team who always strive to do the right thing for our people, clients and communities.

People are our greatest asset, and we offer a competitive package to retain and attract the best talent.

In addition to the benefits you’d expect, UK employees also receive free single medical cover and digital GP service, family-friendly benefits such as enhanced parental leave pay and free membership of employee assistance and parental programmes, plus reimbursement towards relevant professional development and memberships. We also give back to our communities through our Collectively program which incorporates matched-funding, paid volunteering time and charitable donations.

We are seeking an experienced High-Performance Computing (HPC) Engineer to join our team in supporting and advancing our high-performance computing infrastructure. The ideal candidate will have a deep understanding of Linux-based environments, HPC workload management, hardware troubleshooting, and user support. This role is critical to maintaining the performance, security, and reliability of our HPC environment.

Key Responsibilities:

  • Manage and maintain HPC clusters utilising Slurm workload manager.
  • Perform system administration tasks on RHEL and other Linux-based operating systems.
  • Support, configure, and maintain license servers, including FlexLM and RLM.
  • Implement and support license-aware scheduling in Slurm.
  • Deploy and provision nodes using xCAT and Ansible.
  • Diagnose and resolve hardware issues in HPC environments.
  • Provide technical support to end users, including troubleshooting job submissions and access issues.
  • Monitor cluster health and performance using tools such as Nagios and Ganglia.
  • Conduct vulnerability scanning and remediation using Nessus.
  • Administer and support our HPC storage systems, including Pure Storage and NetApp.
  • Manage HPC networking infrastructure, including InfiniBand interconnect.


Requirements

  • Proven experience managing HPC systems in a production environment.
  • Strong expertise in Slurm, Linux (RHEL preferred), and node provisioning tools.
  • Familiarity with license management tools and integration with job schedulers.
  • Proficiency in scripting and automation (Bash, Python, etc.).
  • Experience with system monitoring and security tools.
  • Excellent problem-solving skills and a proactive approach to support and maintenance.
  • Strong communication skills and the ability to work with researchers, developers, and IT staff.
  • Sole UK national status, due to role and client requirements.
  • Ability to attain UKSV Security Clearance.


Our Culture

Our values stand on a foundation of safety, integrity, inclusion and diversity. We put people at the heart of our business and we truly believe that by supporting one another through our culture of caring, we all succeed. We value positive mental health and a sense of belonging for all employees. Find out more about life at our company.

We aim to embed inclusion and diversity in everything we do. We know that if we are inclusive, we’re more connected, and if we are diverse, we’re more creative. We accept people for who they are, regardless of age, disability, gender identity, gender expression, marital status, mental health, race, faith or belief, sexual orientation, socioeconomic background, and whether you’re pregnant or on family leave. This is reflected in our wide range of Global Employee Networks centred on inclusion and diversity - ACE, Careers, Enlace, Harambee, OneWorld, Prism, Vetnet, and Women’s - find out more about our employee networks here.

We partner with VERCIDA to help us attract and retain diverse talent. For greater online accessibility, please visit www.vercida.com to view and access our roles. As a Disability Confident employer, we will interview all disabled applicants who meet the minimum criteria for a vacancy. We welcome applications from candidates who are seeking flexible working and from those who may not meet all the listed requirements for a role.

If you require further support or reasonable adjustments with regard to the recruitment process (for example, you require the application form in a different format), please contact the team.

Your application experience is important to us and we’re keen to adapt to make every interaction even better. We welcome feedback on our recruitment process and if you need more from us before deciding to join us then please let us know.

Accessibility/Reasonable Accommodations

If you are an applicant with a disability that requires a reasonable accommodation to complete any part of the application process, or if you are limited in your ability to use, or unable to use, the online application system and need an alternative method for applying, you may contact our Reasonable Accommodation Helpline at 1-888-877-3181 or 301-944-3299 for assistance. In order to address your request, the following information is needed:

  • Name
  • The best method for contacting you
  • The position title
  • Requisition/Job Number

Upon receipt of this information, we will respond to you promptly to obtain more information about your request.