Machine Learning Systems Engineer

hace 5 días

Galicia, España RelationalAI A tiempo completo

At RelationalAI , we’re solving one of the most important challenges in artificial intelligence: how to teach large language models the logic, semantics, and business context of the modern enterprise. Frontier models are trained almost entirely on public data — they can speak about the world, but they don’t understand your business. We fix that. RelationalAI has pioneered a breakthrough called Superalignment — technology that enables LLMs to learn natively from private, structured enterprise data inside the data cloud. By combining this with relational knowledge graphs and our proprietary neuro/symbolic-relational reasoners , we deliver trustworthy decision intelligence : systems that use semantic models to truly understand how a business operates and can reason across its data to drive better outcomes. We’re a globally distributed team of engineers, scientists, and builders redefining how AI learns from data. We believe that high-stakes decisions deserve frontier intelligence — intelligence that’s explainable, aligned, and grounded in reality. If you’re driven by curiosity, thrive in complexity, and want to help build the system that brings true understanding to enterprise AI, you’ll feel right at home here. Machine Learning Systems Engineer Experience Level: 3+ years of experience in machine learning engineering or research About ScalarLM ScalarLM unifies vLLM, Megatron-LM, and HuggingFace for fast LLM training, inference, and self-improving agents—all via an OpenAI-compatible interface. ScalarLM builds on top of the vLLM inference engine, the Megatron-LM training framework, and the HuggingFace model hub. It unifies the capabilities of these tools into a single platform, enabling users to easily perform LLM inference and training, and build higher lever applications such as Agents with a twist - they can teach themselves new abilities via back propagation. ScalarLM is inspired by the work of Seymour Roger Cray (September 28, 1925 – October 5, 1996), an American electrical engineer and supercomputer architect who designed a series of computers that were the fastest in the world for decades, and founded Cray Research, which built many of these machines. Called "the father of supercomputing", Cray has been credited with creating the supercomputer industry. It is a fully open source project (CC-0 Licensed) focused on democratizing access to cutting-edge LLM infrastructure that combines training and inference in a unified platform, enabling the development of self-improving AI agents similar to DeepSeek R1. ScalarLM is supported and maintained by TensorWave in addition to RelationalAI. The Role: As a Machine Learning Engineer, you will contribute directly to our machine learning infrastructure, to the ScalarLM open source codebase, and build large-scale language model applications on top of it. You’ll operate at the intersection of high-performance computing, distributed systems, and cutting‑edge machine learning research, developing the fundamental infrastructure that enables researchers and organizations worldwide to train and deploy large language models at scale. This is an opportunity to take on technically demanding projects, contribute to foundational systems, and help shape the next generation of intelligent computing. You Will: Contribute code and performance improvements to the open source project. Develop and optimize distributed training algorithms for large language models. Implement high-performance inference engines and optimization techniques. Work on integration between vLLM, Megatron-LM, and HuggingFace ecosystems. Build tools for seamless model training, fine‑tuning, and deployment. Optimize performance of advanced GPU architectures. Collaborate with the open source community on feature development and bug fixes. Research and implement new techniques for self‑improving AI agents. Who You Are Technical Skills: Programming Languages: Proficiency in both C/C++ and Python High Performance Computing: Deep understanding of HPC concepts, including: MPI (Message Passing Interface) programming and optimization Bulk Synchronous Parallel (BSP) computing models Multi‑GPU and multi‑node distributed computing CUDA/ROCm programming experience preferred Machine Learning Foundations: Solid understanding of gradient descent and backpropagation algorithms Experience with transformer architectures and the ability to explain their mechanics Knowledge of deep learning training and its applications Understanding of distributed training techniques (data parallelism, model parallelism, pipeline parallelism, large batch training, optimization) Research and Development Publications: Experience with machine learning research and publications preferred Research Skills: Ability to read, understand, and implement techniques from recent ML research papers Open Source: Demonstrated commitment to open source development and community collaboration Experience 3+ years of experience in machine learning engineering or research. Experience with large-scale distributed training frameworks (Megatron‑LM, DeepSpeed, FairScale, etc.). Familiarity with inference optimization frameworks (vLLM, TensorRT, etc.). Experience with containerization (Docker, Kubernetes) and cluster management. Background in systems programming and performance optimization. PhD or MS in Computer Science, Computer Engineering, Machine Learning, or related field. Experience with SLURM, Kubernetes, or other cluster orchestration systems. Knowledge of mixed precision training, data parallel training, and scaling laws. Experience with transformer architecture, pytorch, decoding algorithms. Familiarity with high performance GPU programming ecosystem. Previous contributions to major open source ML projects. Experience with MLOps and model deployment at scale. Understanding of modern attention mechanisms (multi-head attention, grouped query attention, etc.). Why RelationalAI At RelationalAI, you will: Work from anywhere in the world Earn competitive salary + equity Enjoy open PTO, flexible schedules, and recharge weeks Access global benefits, mental‑health support, and learning stipends Join a transparent, inclusive, and globally connected culture that values curiosity, excellence, and impact Regular team offsites and global events – Building strong connections while working remotely through team offsites and global events that bring everyone together. A culture of transparency & knowledge‑sharing – Open communication through team standups, fireside chats, and open meetings. Country Hiring Guidelines: RelationalAI hires people from around the world. All of our roles are remote; however, some locations might carry specific eligibility requirements. Because of this, understanding location & visa support helps us better prepare to onboard our colleagues. Our People Operations team can help answer any questions about location after starting the recruitment process. How to Apply If you’re driven by understanding, powered by curiosity, and ready to help shape the next era of enterprise intelligence — we’d love to hear from you. Join us and help build the reasoning layer for the modern enterprise. Privacy Policy: EU residents applying for positions at RelationalAI can see our Privacy Policy here . California residents applying for positions at RelationalAI can see our Privacy Policy here RelationalAI is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, color, gender identity or expression, marital status, national origin, disability, protected veteran status, race, religion, pregnancy, sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. #J-18808-Ljbffr

Machine Learning Engineer

hace 5 días

Galicia, España RelationalAI A tiempo completo

Frontier models are trained almost entirely on public data — they can speak about the world, but they don’t understand your business. RelationalAI has pioneered a breakthrough called Superalignment — technology that enables LLMs to learn natively from private, structured enterprise data inside the data cloud. By combining this with relational knowledge...
Machine Learning Engineer

hace 4 días

Galicia, España NTT DATA, Inc. A tiempo completo

NTT DATA es una consultora multinacional que ofrece soluciones tecnológicas, de negocio, estrategia, desarrollo y mantenimiento de aplicaciones, siendo referente en consultoría. Digital Technology es la unidad enfocada a acompañar a las grandes organizaciones iberoamericanas en su transformación digital, generando dividendos digitales a través de la...
Ingeniero/a de Machine Learning

hace 2 semanas

Galicia, España Tramas+ A tiempo completo

En TRAMAS , empresa líder en distribución y venta de artículos de textil hogar , contamos con más de 200 tiendas físicas en España, Portugal e Italia . Con más de 30 años de experiencia , una plantilla de más de 1.000 personas y una facturación de 100 millones de euros en el último ejercicio —con un crecimiento interanual del 25, seguimos en...
Data Engineer

hace 2 semanas

galicia, España DEUS: human(ity)-centered AI A tiempo completo

Are you obsessed with the possibilities of the emerging fields of artificial intelligence? Do you believe we can design & develop our own futures? Are you proactive and passionate? We’re looking for an experienced Data Engineer (medior/senior) who has a talent for problem solving, a keen interest in technology, and wants to help deliver real world...
Python Developer

hace 24 minutos

galicia, España DEUS: human(ity)-centered AI A tiempo completo

Join to apply for the Python Developer role at DEUS: human(ity)-centered AI 1 day ago Be among the first 25 applicants Join to apply for the Python Developer role at DEUS: human(ity)-centered AI Direct message the job poster from DEUS: human(ity)-centered AI People Development Specialist at DEUS: human(ity)-centered AI We are looking for an experienced...
Full Stack Engineer

hace 2 semanas

Galicia, España Talent-R A tiempo completo

Location: Flexible (Remote) Seniority: Senior ️ Position: SENIOR FULL-STACK ENGINEER About the job We are hiring for a fast-growing AI-native B2B SaaS company focused on building modern, cloud-native software that helps enterprises improve efficiency, transparency, and decision-making across complex business processes. As the company continues to scale...
Remote C# Engineer

hace 1 semana

Galicia, España AddanEx A tiempo completo

Title: AI Engineer Location: Remote Type: Contract Start Date : ASAP English We’re looking for an AI Engineer to design, implement, and maintain agentic systems for our clients. You’ll work across LLMs, orchestration frameworks, and data pipelines to deliver robust, observable, and secure automations. Every project you take on will target production and...
Backend Engineer

hace 2 semanas

Galicia, España Calyptus A tiempo completo

Want to put your job search on autopilot? Join our platform, complete a 6-minute AI screening interview, and get auto-applied to 100s of high-paying roles. Sign up now at and let the opportunities come to you. ____________________________________________________________ The Role We are seeking an experienced Backend Engineer who is excited to build the...
Mechanical Design Engineer

hace 2 días

galicia, España Reemoon China A tiempo completo

About The Company Founded in ****, Reemoon Technology Co., Ltd. is a high‑tech enterprise integrating R&D, manufacturing, sales, and service. The company is dedicated to providing advanced automation solutions for fruit and vegetable sorting, grading, and packaging. Headquartered in Jiangxi, China, Reemoon has established subsidiaries and service centers...
AI/ML Solutions Architect

hace 2 semanas

Galicia, España Xebia A tiempo completo

About us For more than 20 years, our global network of passionate technologists and pioneering craftspeople has delivered cutting-edge technology and game-changing consulting to companies on the brink of AI driven digital transformation. Since 2001, we have grown into a full service digital consulting company with 5500+ professionals working on a worldwide...

América

Europa

Asia / Oceanía

África

Machine Learning Systems Engineer