Senior HPC AI Cluster Architect

hace 4 semanas


Madrid, Madrid, España Aitopics A tiempo completo
Job Title: Senior HPC AI Cluster Engineer

NVIDIA is seeking an experienced HPC Engineer to join the E2E software verification HPC/AI Infrastructure team. We are focused on building supercomputers and HPC clusters based on groundbreaking technologies. As a key player in the most exciting computing hardware and software, you will contribute to the latest breakthroughs in artificial intelligence and GPU computing.

Key Responsibilities:
  • Design, implement, and maintain large-scale HPC/AI clusters with monitoring, logging, and alerting.
  • Manage Linux job/workload schedules and orchestration tools.
  • Develop and maintain continuous integration and delivery pipelines.
  • Develop tooling to automate deployment and management of large-scale infrastructure environments.
  • Deploy monitoring solutions for servers, network, and storage.
  • Perform troubleshooting from bare metal to application level.
  • Develop, redefine, and document standard methodologies to share with internal teams.
  • Support Research & Development activities and engage in POCs/POVs for future improvements.
Requirements:
  • A degree in Computer Science, Engineering, or a related field and 5+ years of experience.
  • Knowledge of HPC and AI solution technologies from CPUs and GPUs to high-speed interconnects and supporting software.
  • Experience with job scheduling workloads and orchestration tools such as Slurm, K8s.
  • Excellent knowledge of Windows and Linux networking and internals, ACLs, and OS-level security protection.
  • Experience with multiple storage solutions such as Lustre, GPFS, zfs, and xfs.
  • Familiarity with newer and emerging storage technologies.
  • Python programming and bash scripting experience.
  • Comfortable with automation and configuration management tools such as Jenkins, Ansible, Puppet/Chef.
  • Deep knowledge of Networking Protocols like InfiniBand and Ethernet.
  • Deep understanding and experience with virtual systems.
  • Familiarity with cloud computing platforms.
Preferred Qualifications:
  • Knowledge of CPU and/or GPU architecture.
  • Knowledge of Kubernetes, container-related microservice technologies.
  • Experience with GPU-focused hardware/software.

We are looking for a highly skilled and experienced HPC Engineer to join our team. If you have a passion for building high-performance computing systems and a strong background in AI and HPC infrastructure, we encourage you to apply.



  • Madrid, Madrid, España Nvidia A tiempo completo

    NVIDIA is seeking an experienced HPC Engineer to join the E2E software verification HPC/AI Infrastructure team.We are focused on building supercomputers and HPC clusters based on groundbreaking technologies. As a key player in the most exciting computing hardware and software, you will contribute to the latest breakthroughs in artificial intelligence and GPU...


  • Madrid, Madrid, España Aitopics A tiempo completo

    Job SummaryNVIDIA is seeking an experienced HPC Engineer to join the E2E software verification HPC/AI Infrastructure team. As a key player in building supercomputers and HPC clusters, you will contribute to the latest breakthroughs in artificial intelligence and GPU computing.Key ResponsibilitiesDesign, implement, and maintain large-scale HPC/AI clusters...


  • Madrid, Madrid, España Aitopics A tiempo completo

    We are seeking an experienced HPC engineer to join our E2E software verification HPC/AI Infrastructure team. As a key player in building supercomputers and HPC clusters based on groundbreaking technologies, you will contribute to the latest breakthroughs in artificial intelligence and GPU computing.You will provide insights on at-scale system design and...


  • Madrid, Madrid, España Nvidia A tiempo completo

    NVIDIA is seeking an experienced HPC Engineer to join the E2E software verification HPC/AI Infrastructure team. As a key player in building supercomputers and HPC clusters based on groundbreaking technologies, you will provide insights on at-scale system design and tuning mechanisms for large-scale compute runs. You will work with the latest Accelerated...


  • Madrid, Madrid, España Nvidia A tiempo completo

    NVIDIA Job OpportunityWe are seeking an experienced HPC Engineer to join our E2E software verification HPC/AI Infrastructure team. As a key player in building supercomputers and HPC clusters, you will contribute to the latest breakthroughs in artificial intelligence and GPU computing.Key Responsibilities:Design, implement, and maintain large-scale HPC/AI...


  • Madrid, Madrid, España Nvidia A tiempo completo

    NVIDIA Job OpportunityWe are seeking an experienced HPC Engineer to join our E2E software verification HPC/AI Infrastructure team. As a key player in building supercomputers and HPC clusters, you will contribute to the latest breakthroughs in artificial intelligence and GPU computing.Key Responsibilities:Design, implement, and maintain large-scale HPC/AI...


  • Madrid, Madrid, España Aitopics A tiempo completo

    About the RoleWe are seeking an experienced HPC Engineer to join our E2E software verification HPC/AI Infrastructure team. As a key player in building supercomputers and HPC clusters, you will contribute to the latest breakthroughs in artificial intelligence and GPU computing.Key ResponsibilitiesDesign, implement, and maintain large-scale HPC/AI clusters...


  • Madrid, Madrid, España Nvidia A tiempo completo

    Job SummaryNVIDIA is seeking an experienced HPC Engineer to join the E2E software verification HPC/AI Infrastructure team. As a key player in building supercomputers and HPC clusters, you will contribute to the latest breakthroughs in artificial intelligence and GPU computing.Key ResponsibilitiesDesign, implement, and maintain large-scale HPC/AI clusters...


  • Madrid, Madrid, España Nvidia A tiempo completo

    NVIDIA HPC/AI Infrastructure TeamWe are seeking an experienced HPC Engineer to join our E2E software verification HPC/AI Infrastructure team. As a key player in building supercomputers and HPC clusters, you will contribute to the latest breakthroughs in artificial intelligence and GPU computing.Key Responsibilities:Design, implement, and maintain large-scale...

  • Hpc/Ai Systems Engineer

    hace 4 semanas


    Madrid, Madrid, España Euraxess A tiempo completo

    Job Title: Hpc/Ai Systems EngineerWe are seeking a highly skilled Hpc/Ai Systems Engineer to join our team at the Barcelona Supercomputing Center. As a key member of our Operations Department, you will be responsible for the installation, maintenance, and update of IT services, including mail, web, databases, servers, and storage subsystems.Key...

  • Hpc/Ai Systems Engineer

    hace 4 semanas


    Madrid, Madrid, España Euraxess A tiempo completo

    Job Title: Hpc/Ai Systems EngineerWe are seeking a highly skilled Hpc/Ai Systems Engineer to join our team at the Barcelona Supercomputing Center. As a key member of our Operations Department, you will be responsible for the installation, maintenance, and update of IT services, including mail, web, databases, servers, and storage subsystems.Key...


  • Madrid, Madrid, España Arelance A tiempo completo

    Administración de Clústeres HPCEn Arelance estamos buscando a un profesional experimentado en la administración de clústeres de alto rendimiento para un proyecto de modalidad híbrida. El candidato ideal contará con al menos 4 años de experiencia en el rol y habilidades en el diseño y construcción de clústeres informáticos HPC basados en...


  • Madrid, Madrid, España Roche A tiempo completo

    Unlock Your Potential as a Solution Architect for Science and ResearchAre you a skilled technical professional looking to make a meaningful impact in the field of Health and Life Sciences? We have an exciting opportunity for a Solution Architect / Engineer to join our High Performance Computing team at Roche.About the RoleAs an HPC Solution...


  • Madrid, Madrid, España Roche A tiempo completo

    Unlock Your Potential as a Solution Architect for Science and ResearchAre you a skilled technical professional looking to make a meaningful impact in the field of Health and Life Sciences? We have an exciting opportunity for a Solution Architect / Engineer to join our High Performance Computing team at Roche.About the RoleAs an HPC Solution...


  • Madrid, Madrid, España Nvidia A tiempo completo

    Job Role OverviewWe are seeking an experienced HPC Engineer to join our E2E software verification HPC/AI Infrastructure team at NVIDIA. As a key player in building supercomputers and HPC clusters, you will contribute to groundbreaking technologies and the latest breakthroughs in artificial intelligence and GPU computing.Key ResponsibilitiesDesign, implement,...


  • Madrid, Madrid, España Mygwork A tiempo completo

    Senior AI Solution ArchitectJoin Pearson, a leading education company, in shaping the future of learning with cutting-edge AI solutions. As a Senior AI Solution Architect, you will play a key role in designing and implementing AI-driven applications that enhance the learning experience.About the RoleWe are seeking a talented and motivated professional to...


  • Madrid, Madrid, España Mygwork A tiempo completo

    Senior AI Solution ArchitectJoin Pearson, a leading education company, in shaping the future of learning with cutting-edge AI solutions. As a Senior AI Solution Architect, you will play a key role in designing and implementing AI-driven applications that enhance the learning experience.About the RoleWe are seeking a talented and motivated professional to...


  • Madrid, Madrid, España Arelance A tiempo completo

    Descripción del PuestoBuscamos un/a profesional experimentado/a en Administración HPC para un proyecto de modalidad híbrida en el que colaboramos en Madrid.El candidato ideal tendrá al menos 4 años de experiencia en el rol y será capaz de diseñar y construir clústeres informáticos HPC CPU/GPU basados en Linux.Entre las responsabilidades del puesto...


  • Madrid, Madrid, España Arelance A tiempo completo

    Descripción del PuestoBuscamos un/a profesional experimentado/a en Administración HPC para un proyecto de modalidad híbrida en el que colaboramos en Madrid.El candidato ideal tendrá al menos 4 años de experiencia en el rol y será capaz de diseñar y construir clústeres informáticos HPC CPU/GPU basados en Linux.Entre las responsabilidades del puesto...


  • Madrid, Madrid, España Arelance A tiempo completo

    Buscamos un/a profesional de alta calidad para un proyecto de Administración HPC en hibrido.Requisitos:Al menos 4 años de experiencia en el rol.Diseño y construcción de clústeres informáticos HPC CPU/GPU basados en Linux.Administración de sistemas y resolución de problemas para clústeres HPC/GPU.Experiencia previa en SO CentOS (versiones 5.X 6.X o...