Job Reference: 583_24_OP_US_AISE
Position: AI Support Engineer - High-Performance Computing (HPC) Projects - AI4S
Closing Date: Monday, 16 September, 2024
About BSC: The Barcelona Supercomputing Center - Centro Nacional de Supercomputación (BSC-CNS) is the leading supercomputing center in Spain. It houses MareNostrum, one of the most powerful supercomputers in Europe, and is now a hosting entity for the EuroHPC JU, the Joint Undertaking that leads large-scale investments and HPC provision in Europe. The mission of BSC is to research, develop and manage information technologies in order to facilitate scientific progress. BSC combines HPC service provision and R&D in both computer and computational science under one roof, and currently has over 1000 staff from 60 countries.
Context And Mission: Are you passionate about technology, Linux systems, artificial intelligence trends, and high-performance computing? Do you want to be part of a leading supercomputing center in Europe? Look no further! The Barcelona Supercomputing Center - Centro Nacional de Supercomputación (BSC-CNS) is seeking a talented individual to join our Operations Department as a User Support specialist on AI projects. BSC hosts one of the biggest supercomputers in Europe within the framework of the EuroHPC Joint Undertaking. The pre-exascale machine hosted by BSC is named MareNostrum 5 and delivers a total of 205 PFlops sustained, more than 180 of them accelerated with the latest NVIDIA technology, using H100 GPUs. The HPC support team plays a direct role in improving the usage of this machine and in helping the most relevant projects with their day-to-day usage.
Key Duties:
Improve the performance of existing AI environments, enhancing serial efficiency and scalability, modifying the code where necessary or assisting developers with the required changes. Provide consultancy to scientists on AI solutions that can advance their research.
Requirements:
Education: Bachelor's degree in Computer Science or a related discipline linked with Artificial Intelligence at a technical level.
Essential Knowledge and Professional Experience:
Experience working with AI models and running them in parallel on HPC systems. Experience using performance analysis tools and parallel debuggers. Experience supporting and collaborating with external partners. Good understanding of the Linux environment and shell scripting. Experience working with parallel programming codes (MPI and OpenMP) and batch systems like SLURM as a user. At least 1 year of experience in a similar position working with AI solutions.
Additional Knowledge and Professional Experience:
Experience in managing big and collaborative projects and experience with git and SVN. Experience porting codes to accelerators, specifically NVIDIA GPUs. A thorough understanding of high-performance computing architectures.
Competences:
Excellent communication and interpersonal skills to work within a team to complete tasks on schedule. Analytical problem-solving ability.
Conditions:
The position will be located at BSC within the Operations Department. We offer a full-time contract (37.5h/week), a good working environment, a highly stimulating environment with state-of-the-art infrastructure, flexible working hours, extensive training plan, restaurant tickets, private health insurance.
Duration: 4 years.
Holidays: 23 paid vacation days plus 24th and 31st of December per our collective agreement.
Salary: 40,000.00 €.
Additional Expenses Grant: Each fellowship will be associated with a grant for additional expenses, such as IT equipment, travel, training, stays, etc.
Starting date: ASAP - the incorporation for this vacancy must be before the 16th of December 2024.
Applications procedure and process:
A full CV in English, including contact details. A cover/motivation letter with a statement of interest in English, clearly specifying for which specific area and topics the applicant wishes to be considered.