ParTec

HPC System Engineer

ParTec AG is a fast-growing, agile technology company and a leading provider of modular high-performance, AI, and quantum computing systems. Our ambition is to enable our customers to grow efficiently, cost-effectively and continuously in an ever faster changing world and to adapt to future challenges and technologies.

The success of our company depends almost entirely on the technical and leadership skills of our employees, their ability to fulfil customer expectations and deliver innovative solutions. For further strengthen our team, we are looking for an

HPC System Engineer

Job Summary

This position is part of ParTec’s Customer Solutions Team and reports to the Manager of Customer Service. Besides other systems, the HPC System Engineer will maintain and support the administration and operation of a Platform-as-a-Service (PaaS) for the Israeli Quantum Compute Center IQCC (https://i-qcc.com). IQCC is a first of its kind research compute centre that combines several on-premise quantum computers of different modalities with on-premise and cloud-based classical compute resources. The PaaS developed by ParTec is a virtual Slurm HPC cluster on AWS that uses the HPC-QC integration software QBridge by Quantum Machines and ParTec.

Duties and responsibilities

  • Providing 1st-level support for the setup, configuration, administration and operation of the PaaS for IQCC.
  • Analysis, classification and solution of problems in tight communication with the customer and hardware and software partners, as well as with ParTec’s software engineering team.
  • DevOps activities for maintaining the PaaS solution and supporting ParTec’s software engineering team in further developing the solution.

Expected education and experience

  • Bachelor’s degree in a related field (or equivalent experience) and a minimum of 5 years of directly related experience.

Essential skills

  • Experience with AWS and Parallel Cluster, including firewall, WAF, rights management, image management, VPN, peering
  • Solid experience with Linux administration, including user management, network configuration, package management, parallel file systems, and high- availability setups.
  • Linux security and change management
  • Thorough knowledge of Bash scripting/Bashly, and at least one other scripting language, like Python or Perl.
  • Strong knowledge of the following technologies
    • Ansible and Jinja2 templates
    • SLURM
    • HA setup of databases such as LDAP, MySQL, PostgreSQL
    • System monitoring using Prometheus (Mimir, Loki)
    • Virtualisation and containers (Kubernetes)
  • Customer- and solution-oriented attitude.
  • Attention to detail.
  • Capability to work under strong time pressure and under difficult circumstances.
  • Fluency (written & spoken) in English.

Desirable skills

  • Experience with NVIDIA DGX compute systems.

Job conditions

  • Location: home office in Germany or Switzerland. Hybrid work setup (home office / company office resp. co-working space) available in Munich and Zurich.
  • Working hours dependent on the service level agreement with the customer. Standard day working hours plus potentially on-call service during night and weekends.
  • Limited travel requirements, mostly in Europe

This is an exciting opportunity in some of the most advanced and dynamically developing fields of computing. If you are interested in joining our team, please send your application and CV to career.customer-solutions@par-tec.com.