Senior Hpc Ai Cluster Engineer

hace 3 meses


Madrid, España Nvidia A tiempo completo

.NVIDIA is looking for an experienced HPC Engineer to join the E2E software verification HPC/AI Infrastructure team. We are focused on building supercomputers and HPC clusters based on groundbreaking technologies. We are looking for an outstanding architect for a senior HPC role, to be a key player in the most exciting computing hardware and software to contribute to the latest breakthroughs in artificial intelligence and GPU computing. You will provide insights on at-scale system design and tuning mechanisms for large-scale compute runs. You will work with the latest Accelerated computing and Deep Learning software and hardware platforms, and with many scientific researchers, developers, and customers to craft improved workflows and develop new, leading differentiated solutions. You will interact with HPC, OS, GPU compute, and systems specialists to architect, develop, and bring up large-scale performance platforms.What you will be doing:Design, implement and maintain large scale HPC/AI clusters with monitoring, logging and alerting.Manage Linux job/workload schedules and orchestration tools.Develop and maintain continuous integration and delivery pipelines.Develop tooling to automate deployment and management of large-scale infrastructure environments, to automate operational monitoring and alerting, and to enable self-service consumption of resources.Deploy monitoring solutions for the servers, network, and storage.Perform troubleshooting from bare metal, operating system, software stack, and application level.As a technical resource, develop, redefine, and document standard methodologies to share with internal teams.Support Research & Development activities and engage in POCs/POVs for future improvements.What we need to see:A degree in Computer Science, Engineering, or a related field and 5+ years of experience.Knowledge of HPC and AI solution technologies from CPUs and GPUs to high-speed interconnects and supporting software.Experience with job scheduling workloads and orchestration tools such as Slurm, K8s.Excellent knowledge of Windows and Linux (Redhat/CentOS and Ubuntu) networking (sockets, firewalld, iptables, wireshark, etc.) and internals, ACLs and OS-level security protection, and common protocols e.G. TCP, DHCP, DNS, etc.Experience with multiple storage solutions such as Lustre, GPFS, zfs, and xfs. Familiarity with newer and emerging storage technologies.Python programming and bash scripting experience.Comfortable with automation and configuration management tools such as Jenkins, Ansible, Puppet/Chef.Deep knowledge of Networking Protocols like InfiniBand and Ethernet.Deep understanding and experience with virtual systems (for example VMware, Hyper-V, KVM, or Citrix).Familiarity with cloud computing platforms (e.G. AWS, Azure, Google Cloud).Ways to stand out from the crowd:Knowledge of CPU and/or GPU architecture.Knowledge of Kubernetes and container-related microservice technologies



  • Madrid, España MedOil Energy 2050 SLU A tiempo completo

    The job description for a junior AI and HPC engineer typically involves working with artificial intelligence and high-performance computing technologies. As a junior engineer, you would be responsible for assisting in the development and implementation of AI algorithms and models, as well as optimizing and maintaining high-performance computing systems. You...


  • Madrid, España Arelance A tiempo completo

    Administrador/a HPC en hibrido.En este momento buscamos un/a profesional de Administración HPC para un proyecto de modalidad híbrida en el que colaboramos en Madrid.¿Qué buscamos en ti?- Profesionales con al menos 4 años de experiência en el rol.- Diseño/construcción de clústeres informáticos HPC CPU/GPU basados en Linux.- Administración de...


  • Madrid, España arelance A tiempo completo

    Administrador/a HPC en hibrido. En este momento buscamos un/a profesional de Administración HPC para un proyecto de modalidad híbrida en el que colaboramos en Madrid. ¿Qué buscamos en ti? - Profesionales con al menos 4 años de experiência en el rol. - Diseño/construcción de clústeres informáticos HPC CPU/GPU basados en Linux. - Administración de...


  • Madrid, Madrid, España Arelance A tiempo completo

    Descripción del puestoBuscamos un profesional con experiencia en administración de sistemas y resolución de problemas para clústeres HPC/GPU.Las tareas principales incluyen:Diseño y construcción de clústeres informáticos HPC CPU/GPU basados en Linux.Administración de sistemas y resolución de problemas para clústeres HPC/GPU.Experiencia previa en...


  • Madrid, Madrid, España Nvidia A tiempo completo

    Job Role OverviewWe are seeking an experienced HPC Engineer to join our E2E software verification HPC/AI Infrastructure team at NVIDIA. As a key player in building supercomputers and HPC clusters, you will contribute to groundbreaking technologies and the latest breakthroughs in artificial intelligence and GPU computing.Key ResponsibilitiesDesign, implement,...

  • HPC Systems Engineer

    hace 3 días


    Madrid, Madrid, España Atos SE A tiempo completo

    Job OverviewEviden, a global leader in data-driven digital transformation, is seeking an experienced HPC Systems Engineer. As part of our team, you will be responsible for designing and building Linux-based HPC CPU/GPU computing clusters.Key Responsibilities:Designing and building high-performance computing clusters using Linux-based systems.System...

  • AI Engineer

    hace 7 meses


    Madrid, España WayOps A tiempo completo

    En WayOps buscamos un perfil AI Engineer que quiera desarrollar su carrera profesional formando parte de un equipo Data & AI de primer nivel y trabajando en proyectos cloud con las últimas tecnologías.CONTEXTO & RESPONSABILIDADESLa persona seleccionada se incorporará dentro de un equipo de nueva formación que tendrá como misión automatizar mediante...

  • Ai Institute Director

    hace 1 semana


    Madrid, España Barcelona Supercomputing Center A tiempo completo

    .Context And MissionThe Barcelona Supercomputing Center (BSC-CNS) is establishing a new Institute of Artificial Intelligence (AI Institute) and is seeking an experienced and dynamic AI Institute Director. Reporting to the BSC Executive Board, the selected candidate will lead the Institute's operations and strategic vision, driving research at the...


  • Madrid, España Amazon A tiempo completo

    Senior Software Development Engineer, Ring AI At Ring, we're seeking a driven and talented Senior Software Development Engineer to join our trailblazing AI Team. In this pivotal role, you'll have the opportunity to revolutionize the home security landscape by working on cutting-edge cloud services that power our innovative machine learning operation...


  • Madrid, España Amazon A tiempo completo

    Senior Software Development Engineer, Ring AI At Ring, we're seeking a driven and talented Senior Software Development Engineer to join our trailblazing AI Team. In this pivotal role, you'll have the opportunity to revolutionize the home security landscape by working on cutting-edge cloud services that power our innovative machine learning operation...

  • Senior AI Engineer

    hace 1 mes


    Madrid, Madrid, España Amazon A tiempo completo

    OverviewAmazon is at the forefront of Artificial General Intelligence (AGI) research, pushing the boundaries of what is possible with Large Language Models. Our team is seeking a highly skilled Senior AI Engineer to lead the development of industry-leading technology.


  • Madrid, España Amazon A tiempo completo

    Senior Software Development Engineer, Ring AIAt Ring, we're seeking a driven and talented Senior Software Development Engineer to join our trailblazing AI Team. In this pivotal role, you'll have the opportunity to revolutionize the home security landscape by working on cutting-edge cloud services that power our innovative machine learning operation...


  • Madrid, España Amazon A tiempo completo

    Senior Software Development Engineer, Ring AI At Ring, we're seeking a driven and talented Senior Software Development Engineer to join our trailblazing AI Team. In this pivotal role, you'll have the opportunity to revolutionize the home security landscape by working on cutting-edge cloud services that power our innovative machine learning operation...


  • Madrid, España Amazon A tiempo completo

    Senior Software Development Engineer, Ring AIAt Ring, we're seeking a driven and talented Senior Software Development Engineer to join our trailblazing AI Team. In this pivotal role, you'll have the opportunity to revolutionize the home security landscape by working on cutting-edge cloud services that power our innovative machine learning operation...


  • Madrid, España Amazon A tiempo completo

    Senior Software Development Engineer, Ring AIAt Ring, we're seeking a driven and talented Senior Software Development Engineer to join our trailblazing AI Team. In this pivotal role, you'll have the opportunity to revolutionize the home security landscape by working on cutting-edge cloud services that power our innovative machine learning operation...


  • Madrid, Madrid, España Genie Ai A tiempo completo

    About the RoleWe are seeking a highly skilled Senior Product Designer to join our team at Genie AI. As a key member of our design team, you will play a crucial role in shaping the user experience of our AI-powered legal document platform.Key ResponsibilitiesConduct user research to understand the needs and workflows of lawyers, founders, and business...

  • AI Engineer

    hace 3 días


    Madrid, Madrid, España Six Group Services Ltd. A tiempo completo

    SIX Group Services Ltd. is seeking a talented AI Engineer to join our team in Warsaw, Madrid or working from home up to 60%. As an AI Engineer, you will be responsible for designing, implementing and maintaining AI use cases in close collaboration with Business Analysts.Job DescriptionBachelor's or Master's degree in Computer Science, Artificial...


  • Madrid, Madrid, España Amazon A tiempo completo

    **Job Title:** AI Cloud Services EngineerAt Ring, we're seeking a driven and talented Senior Software Development Engineer to join our trailblazing AI Team. In this pivotal role, you'll have the opportunity to revolutionize the home security landscape by working on cutting-edge cloud services that power our innovative machine learning operation pipelines,...

  • AI Solutions Architect

    hace 2 semanas


    Madrid, Madrid, España Nielseniq A tiempo completo

    Compensation: $160,000 - $200,000 per yearAbout the Role:We are seeking an experienced Senior Machine Learning Engineer to join our team at NielsenIQ. As a Senior ML Engineer, you will play a crucial role in developing and implementing advanced AI solutions to solve complex business problems.Key Responsibilities:Develop and implement AI models and algorithms...


  • Madrid, Madrid, España Atos SE A tiempo completo

    We are seeking an experienced GPU Cluster Administrator to support our research initiatives. As a member of our team, you will design and build high-performance computing clusters, ensuring the smooth operation of our HPC infrastructure. Your responsibilities will include system administration, software maintenance, and deployment and management of...