Ai Systems Engineer

hace 2 días

Pozuelo de Alarcón, España OpenNebula Systems A tiempo completo

For over a decade now, OpenNebula Systems has been leading the development of the European open source technology that helps organizations around the world to manage their corporate data centers and build their Enterprise Clouds.

If you want to join an established leader in the cloud infrastructure industry and the global open source community, keep reading, because you can now join a team of exceptionally passionate and talented colleagues whose mission is to help the world's leading enterprises to implement their next-generation edge and cloud strategies. We are hiring

Since 2019, and thanks to the support from the European Commission, OpenNebula Systems has been leading the edge computing innovation in Europe, investing heavily in research and open source development, and playing a key role in strategic EU initiatives such as the IPCEI-CIS and the “European Alliance for Industrial Data, Edge and Cloud”.

OpenNebula’s new AI Factory product line delivers sovereign, edge-to-cloud AI infrastructure—enabling enterprises and governments to deploy, orchestrate, and optimize next-generation AI workloads with full control. This role is key to building the execution layer powering that vision. We are currently looking for an AI Systems Engineer to come and join us in Europe as part of our new team developing the AI Factory product line.

We are looking for a highly skilled AI Systems Engineer with hands-on experience in executing, tuning, and scaling Large Language Models (LLMs) across multi-GPU infrastructures. This role is central to the development of our new AI Factory product line, which enables open, sovereign, and disaggregated AI infrastructure across cloud and edge environments.
You will help design and optimize LLM execution pipelines, working at the intersection of inference engines, orchestration platforms, and LLM model catalogs. Your responsibilities will include communicating with users, addressing their needs, troubleshooting, and providing step by step solutions.

**Responsibilities**
- Design, implement, and optimize LLM inference pipelines for multi-GPU and multi-node environments.
- Integrate with cutting-edge inference engines (e.g., vLLM, TensorRT-LLM, DeepSpeed, etc.).
- Tune execution parameters for latency, throughput, and memory efficiency across heterogeneous infrastructures.
- Work closely with orchestration frameworks such as Ray, NVIDIA NeMo/Dynamo, and others to coordinate LLM serving at scale.
- Integrate with LLM catalogs and registries such as HuggingFace, NVIDIA NIM, and internal repositories.
- Collaborate with product and platform teams to shape a modular, portable AI Factory execution layer.
- Interact with users and use cases, providing systems support, system architecture definition, making recommendations based on user needs, implementation, testing, user training, and deployment of open source solutions.
- Troubleshoot incidents, identify root causes, fix and document problems, and implement preventive measures.
- Deliver quality performance indicators within the scope of the assigned project, including project journals, status reports, and other standard documentation.
- Work with other companies in the cloud-edge ecosystem within international projects and open-source communities. Availability to occasional travel and participation in international events and meetings.
- Write and maintain software documentation and project reports.

**Experience required**
- Academic Background and Certifications_
- Bachelor’s or Master’s degree in Computer Science, Software Engineering, or a related field.
- Professional Experience_
- Strong hands-on experience deploying and optimizing LLMs in production environments
- Experience with inference frameworks such as vLLM, TensorRT, Triton Inference Server, DeepSpeed-Inference, etc.
- Hands-on experience with orchestration tools like Ray, NVIDIA NeMo/Dynamo, or KServe.
- Experience deploying LLM workloads on hybrid or sovereign cloud environments.
- Contributions to open-source LLM or inference projects.
- Technical Experience_
- Deep knowledge of multi-GPU systems and GPU memory management.
- Solid understanding of distributed systems and networking bottlenecks in model serving.
- Programming experience in Python, with knowledge of CUDA and model quantization a plus.
- Familiarity with LLM catalogs (e.g., HuggingFace, NGC, NIM).
- Familiarity with open-source MLOps or AI workload orchestration platforms.
- Language Skills_
- English fluency at a professional or native-equivalent level, with excellent clarity and expression in both writing and speech.
- Soft Skills & Collaboration_
- Strong customer service mindset, with a focus on responsiveness and user satisfaction.
- Clear communication and documentation with strong written and verbal English, async collaboration, and visibility of work.
- Excellent problem-solving skills and a proactive approach to identifying and resolving issues.
- Self-management and accountability with ability

Cloud Systems Engineer

hace 1 semana

Pozuelo de Alarcón, España OpenNebula Systems A tiempo completo

OpenNebula Systems is leading the development of European open source technology to manage corporate data centers and build Enterprise Clouds. We are hiring a Cloud Systems Engineer to join our team in Europe, developing the next generation management platform for the Cloud-Edge Computing Continuum.Job Description: Cloud Systems Engineers are responsible for...
Junior AI Engineer

hace 3 minutos

Pozuelo de Alarcón, Madrid, España Menhir AI A tiempo completo

En Menhir Technologies buscamos un Junior AI Engineer con ganas de aprender y de construir producto real. Trabajarás codo a codo con ingenieros senior para llevar casos de uso de IA y agentes conversacionales desde el prototipo hasta producción, con un plan claro de mentoría y crecimiento.TasksParticipar en el ciclo completo de una solución de IA:...
Junior AI Engineer

hace 2 minutos

Pozuelo de Alarcón, Madrid, España Menhir AI A tiempo completo

En Menhir Technologies buscamos un Junior AI Engineer con ganas de aprender y de construir producto real. Trabajarás codo a codo con ingenieros senior para llevar casos de uso de IA y agentes conversacionales desde el prototipo hasta producción, con un plan claro de mentoría y crecimiento.TasksParticipar en el ciclo completo de una solución de IA:...
Remote Cloud-Edge Systems Advisor

hace 1 semana

Pozuelo de Alarcón, España OpenNebula Systems A tiempo completo

A leading cloud infrastructure company is seeking experienced Cloud-Edge Systems Engineer Advisors to provide guidance and technical support for cloud-edge solutions. The role focuses on collaboration with engineers and architects to ensure security, resilience, and sustainability in designs. Ideal candidates should have at least 5 years of experience and...
Senior Research Engineer

hace 4 días

Pozuelo de Alarcón, España OpenNebula Systems A tiempo completo

For over a decade, OpenNebula Systems has been at the forefront of developing European open source technology that empowers organizations worldwide to manage their corporate data centers and build agile, secure Enterprise Clouds. If you’re looking to join an established leader in the cloud infrastructure industry and the global open source community, this...
Senior Infrastructure Engineer

hace 4 días

Pozuelo de Alarcón, España OpenNebula Systems A tiempo completo

Join to apply for the Senior Infrastructure Engineer role at OpenNebula Systems For over a decade now, OpenNebula Systems has been leading the development of European open source technology that helps organizations around the world to manage their corporate data centers and build their Enterprise Clouds. We are hiring a Senior Infrastructure Engineer to join...
Presales Engineer

hace 4 minutos

Pozuelo de Alarcón, Madrid, España OpenNebula Systems A tiempo completo

For over a decade now, OpenNebula Systems has been building the open source technology that helps organizations around the world to manage their corporate data centers and build Enterprise Clouds with unique, innovative features.If you want to join an established leader in the cloud infrastructure industry and the global open source community, keep reading,...
Founding AI Fullstack Engineer

hace 14 horas

Calle del Duque de Sevilla, España Orbio AI A tiempo completo

The compensation package includes stock options ✅ You Role and Purpose As a Founding AI Fullstack Engineer at Orbio, you will be the technical architect and backbone of our intelligent HR platform. Your primary focus will be backend development using Python/Django, designing scalable AI-native architectures to orchestrate complex, autonomous recruitment...
Prompt Engineer

hace 4 días

Pozuelo de Alarcón, España The Cigna Group A tiempo completo

**Prompt Engineer & AI Applications Lead**: - (Madrid, Spain or_ _Bengaluru,_ _India — Hybrid/Flexible)_ **About the Role** Cigna is accelerating responsible AI adoption across the enterprise. We’re seeking a **Prompt Engineer & AI Applications Lead** to partner with product, data, engineering, and business stakeholders to discover, evaluate, and...
Cloud Systems Engineer

hace 4 días

Pozuelo de Alarcón, España OpenNebula A tiempo completo

For over a decade now, OpenNebula Systems has been leading the development of the European open source technology that helps organizations around the world to manage their corporate data centers and build their Enterprise Clouds. If you want to join an established leader in the cloud infrastructure industry and the global open source community, keep reading,...

América

Europa

Asia / Oceanía

África

Ai Systems Engineer