Senior HPC AI Cluster Architect

hace 4 semanas


Madrid, Madrid, España Aitopics A tiempo completo

We are seeking an experienced HPC engineer to join our E2E software verification HPC/AI Infrastructure team. As a key player in building supercomputers and HPC clusters based on groundbreaking technologies, you will contribute to the latest breakthroughs in artificial intelligence and GPU computing.

You will provide insights on at-scale system design and tuning mechanisms for large-scale compute runs. You will work with the latest accelerated computing and deep learning software and hardware platforms, and with many scientific researchers, developers, and customers to craft improved workflows and develop new, leading differentiated solutions.

You will interact with HPC, OS, GPU compute, and systems specialists to architect, develop, and bring up large scale performance platforms.


Key Responsibilities:
  • Design, implement and maintain large scale HPC/AI clusters with monitoring, logging and alerting.
  • Manage Linux job/workload schedules and orchestration tools.
  • Develop and maintain continuous integration and delivery pipelines.
  • Develop tooling to automate deployment and management of large-scale infrastructure environments, to automate operational monitoring and alerting, and to enable self-service consumption of resources.
  • Deploy monitoring solutions for the servers, network and storage.
  • Perform troubleshooting from bare metal, operating system, software stack, and application level.
  • Being a technical resource, develop, re-define and document standard methodologies to share with internal teams.
  • Support Research & Development activities and engage in POCs/POVs for future improvements.

Requirements:
  • A degree in Computer Science, Engineering, or a related field and 5+ years of experience.
  • Knowledge of HPC and AI solution technologies from CPUs and GPUs to high speed interconnects and supporting software.
  • Experience with job scheduling workloads and orchestration tools such as Slurm, K8s.
  • Excellent knowledge of Windows and Linux (Redhat/CentOS and Ubuntu) networking and internals, ACLs and OS level security protection, and common protocols e.g. TCP, DHCP, DNS, etc.
  • Experience with multiple storage solutions such as Lustre, GPFS, zfs, and xfs. Familiarity with newer and emerging storage technologies.
  • Python programming and bash scripting experience.
  • Comfortable with automation and configuration management tools such as Jenkins, Ansible, Puppet/Chef.
  • Deep knowledge of Networking Protocols like InfiniBand and Ethernet.
  • Deep understanding and experience with virtual systems (for example VMware, Hyper-V, KVM, or Citrix). Familiarity with cloud computing platforms (e.g. AWS, Azure, Google Cloud).

Desirable Skills:
  • Knowledge of CPU and/or GPU architecture.
  • Knowledge of Kubernetes, container related microservice technologies.
  • Experience with GPU-focused hardware/software (DGX, Cuda).


  • Madrid, Madrid, España Nvidia A tiempo completo

    NVIDIA is seeking an experienced HPC Engineer to join the E2E software verification HPC/AI Infrastructure team.We are focused on building supercomputers and HPC clusters based on groundbreaking technologies. As a key player in the most exciting computing hardware and software, you will contribute to the latest breakthroughs in artificial intelligence and GPU...


  • Madrid, Madrid, España Arelance A tiempo completo

    Administración de Clústeres HPCEn Arelance estamos buscando a un profesional experimentado en la administración de clústeres de alto rendimiento para un proyecto de modalidad híbrida. El candidato ideal contará con al menos 4 años de experiencia en el rol y habilidades en el diseño y construcción de clústeres informáticos HPC basados en...

  • Hpc Cluster Engineer

    hace 3 días


    Madrid, Madrid, España Atos A tiempo completo

    Eviden, a leading global digital transformation company, is part of the Atos Group with an annual revenue of approximately €5 billion. As a next-generation digital business with top positions worldwide in digital, cloud, data, advanced computing, and security, it offers deep expertise for all industries across more than 47 countries.With unique high-end...


  • Madrid, Madrid, España Nvidia A tiempo completo

    Job Role OverviewWe are seeking an experienced HPC Engineer to join our E2E software verification HPC/AI Infrastructure team at NVIDIA. As a key player in building supercomputers and HPC clusters, you will contribute to groundbreaking technologies and the latest breakthroughs in artificial intelligence and GPU computing.Key ResponsibilitiesDesign, implement,...


  • Madrid, Madrid, España Inetum A tiempo completo

    Role Overview:Inetum, a leading technology company, seeks a skilled Senior AI Solutions Architect to lead our Generative AI Business Unit. As a seasoned professional, you will oversee the development of cutting-edge AI-driven applications and drive innovation in software solutions.Key Responsibilities:Design and implement AI-driven applications using Python,...


  • Madrid, Madrid, España Sartorius A tiempo completo

    Role Overview:We are seeking a Senior AI Software Architect to join our IT Software Development & Innovations department.As a key member of our team, you will be responsible for leading the development of our AI applications and ensuring that they meet the highest standards of quality and innovation.Key Responsibilities:Lead the development of AI...

  • AI Governance Architect

    hace 4 semanas


    Madrid, Madrid, España Zurich Insurance Group A tiempo completo

    Our OpportunityAt Zurich Insurance Group, we are seeking a highly skilled individual to join our Global AI Governance Team as an AI Governance Architect. This role plays a crucial part in driving cutting-edge responsible AI innovation. As an AI Governance Architect, you will be responsible for governing AI solutions along the full insurance value chain and...


  • Madrid, Madrid, España Pearson A tiempo completo

    About the Role: We are seeking a talented and motivated professional to join our growing team of architects as a Solution Architect. Reporting to the Chief Architect, you will be responsible for designing and implementing cutting-edge artificial intelligence solutions that enhance the learning experience for our customers.Key Responsibilities: Collaborate...


  • Madrid, Madrid, España Pearson A tiempo completo

    About the Role:We are seeking a talented and motivated professional to join our growing team of architects as a Solution Architect in the English Language Learning (ELL) division of Pearson.As a Solution Architect, you will be responsible for designing and implementing cutting-edge artificial intelligence solutions that enhance the learning...

  • AI Solutions Architect

    hace 3 semanas


    Madrid, Madrid, España Celonis A tiempo completo

    We're seeking a skilled AI Solutions Architect to join our team at Celonis.The ideal candidate will have a strong background in developing AI solutions for enterprises and a passion for innovative problem-solving.The Role:As an AI Solutions Architect, you will work closely with our customers to understand their business challenges and design tailored AI...


  • Madrid, Madrid, España Nvidia A tiempo completo

    Compensation: $150,000 - $200,000 per annum.About UsNVIDIA is a leader in the development of high-performance computing and artificial intelligence technologies. Our team is dedicated to building cutting-edge HPC/AI infrastructure that pushes the boundaries of what is possible.Job DescriptionWe are seeking an experienced High Performance Computing (HPC)...


  • Madrid, Madrid, España Mygwork A tiempo completo

    About the Role:We are seeking a talented and motivated professional to join our growing team of architects as a Solution Architect. Reporting to the Chief Architect, you will be responsible for designing and implementing cutting-edge artificial intelligence solutions that enhance the learning experience.Key Responsibilities:Collaborate with cross-functional...


  • Madrid, Madrid, España Mygwork A tiempo completo

    About the Role:We are seeking a talented Senior AI Solution Architect to join our team at Pearson, a leading education company. As a key member of our AI team, you will be responsible for designing and implementing cutting-edge artificial intelligence solutions that enhance the learning experience.Key Responsibilities:Collaborate with cross-functional teams...


  • Madrid, Madrid, España Idoven A tiempo completo

    Job SummaryWe are seeking an experienced AI DevOps Infrastructure Architect to join our Product Engineering Development & Research team at IDOVEN, a pioneering health tech startup using AI to prevent cardiac disease.

  • Senior AI Architect

    hace 1 semana


    Madrid, Madrid, España Nielseniq A tiempo completo

    Company OverviewNielsenIQ is a leading consumer intelligence company delivering comprehensive insights into consumer buying behavior and driving growth for businesses.Salary: €110,000 - €140,000 per annum (dependent on experience)Job DescriptionWe are seeking an experienced Senior Machine Learning Engineer specializing in Generative AI to join our...

  • AI Innovations Manager

    hace 2 semanas


    Madrid, Madrid, España Genie Ai A tiempo completo

    About the RoleWe are seeking a highly skilled Senior Product Designer to join our team at Genie AI. As a key member of our design team, you will play a crucial role in shaping the user experience of our AI-powered legal document platform.Key ResponsibilitiesConduct user research to understand the needs and workflows of lawyers, founders, and business...

  • Senior Marketing Manager

    hace 4 semanas


    Madrid, Madrid, España Mygwork A tiempo completo

    Job Title: Senior Marketing Manager - Southern Cluster EuropeJob Summary:We are seeking an experienced Senior Marketing Manager to drive marketing programs and commercial execution for the Southern Cluster Europe region. The successful candidate will be responsible for establishing and leading regional strategy, ensuring country alignment, and analyzing...

  • Data Architect

    hace 1 semana


    Madrid, Madrid, España Roche A tiempo completo

    Design and Implement AI-Driven SolutionsAt Roche, we believe that innovation is key to creating a healthier future. We are looking for a highly skilled Data Architect - AI Search Specialist to join our team and play a vital role in defining and communicating a shared technical and architectural vision for our Content Search & Knowledge Management product.The...


  • Madrid, Madrid, España Lumen Argentina A tiempo completo

    About LumenLumen connects the world, igniting business growth by linking people, data, and applications – quickly, securely, and effortlessly. Our team is building a culture and company from the people up, committed to teamwork, trust, and transparency. People power progress. We offer the flexibility you need to thrive and deliver lasting impact. The...

  • AI Solutions Architect

    hace 1 semana


    Madrid, Madrid, España Tether A tiempo completo

    At Tether, we're committed to pushing the boundaries of advanced AI technologies. With our investment in cutting-edge infrastructure, starting from Northern Data, we're poised to tackle ambitious AI projects and build the next generation of AI models.The role involves designing and implementing AI solutions across a spectrum of applications, from large-scale...