DevOps & ML Ops Engineer | Spain (BCN, Madrid, Malaga or Palma)
hace 2 semanas
DevOps & ML Ops Engineer would be responsible for developing and maintaining scalable, stable services that deliver machine learning models to end users with guaranteed uptime. The primary focus will be on the infrastructure, deployment, and continuous integration/continuous delivery (CI/CD) processes for our ML services.
RESPONSIBILITIES:
Manage resource allocation and workload scheduling for multiple ML services, ensuring efficient utilization of CPU/GPU resources and creating reliable queues based on service priorities.
Maintain VM environments and manage OS updates, keep up-to-date VM inventory
Work alongside the Dev and QA team to detect hot spots in our applications and set preventative measure before it becomes a live issue.
Troubleshooting and provide solutions for system configurations
Plan, execute and test disaster recovery
Monitor and examine all application, performance, event, and system logs to assist in troubleshooting
Responsible for filing all IT/Colocation tickets ensuring fulfilment of requests, escalating to the right person if necessary.
Design, develop, and maintain the infrastructure required for deploying and scaling machine learning services.
Implement and manage the CI/CD pipelines to ensure seamless and efficient deployment of ML models.
Collaborate with data scientists, ML researchers, and language experts to understand the requirements for deploying ML models and provide necessary infrastructure support.
Automate and streamline the build, test, and deployment processes to enhance efficiency and reduce time-to-market.
Monitor and optimize the performance, availability, and scalability of production ML systems.
Develop and maintain robust monitoring, logging, and alerting systems to proactively identify and address issues.
Implement security best practices to protect sensitive data and ensure compliance with relevant regulations.
Stay up-to-date with industry trends and emerging technologies related to ML Ops and DevOps, and propose innovative solutions to improve our ML service delivery.
REQUIRED SKILLS, EXPERIENCE AND QUALIFICATIONS:
Strong knowledge of cloud platforms (such as AWS, Azure, or GCP) and local cluster deployments, and experience in deploying and managing ML services on these platforms.
Knowledge of distributed computing frameworks (e.g., Spark) and big data technologies (e.g., Hadoop, Kafka).
Proficiency in Python, Shell, Ruby, Golang, or C++ and experience with infrastructure-as-code tools (e.g., Terraform, CloudFormation).
Hands-on experience with containerization technologies (e.g., Docker) and orchestration frameworks (e.g. Kubernetes).
Familiarity with CI/CD tools (e.g., Jenkins, GitLab CI/CD) and version control systems (e.g., Git).
Solid understanding of networking, security, and system administration concepts.
Strong problem-solving and troubleshooting skills, with the ability to quickly analyze and resolve issues in complex ML systems.
Excellent communication and collaboration skills, with the ability to work effectively in a team-oriented environment.
Bachelor's or higher degree in Computer Science, Engineering, or a related field.
Proven experience as an ML Ops Engineer, DevOps Engineer, or a similar role, with a focus on deploying and maintaining machine learning models in production environments.
DESIRED SKILLS AND EXPERIENCE:
Experience with machine learning frameworks and libraries, such as TensorFlow, PyTorch, or scikit-learn.
Familiarity with serverless computing and event-driven architectures.
Experience with logging and monitoring tools (e.g., ELK Stack, Prometheus, Grafana).
Understanding of software development methodologies and agile practices
-
DevOps Engineer
hace 1 semana
Barcelona, Barcelona, España Logicalis Spain A tiempo completoEn Logicalis Spain actualmente estamos buscando a una persona con 2 años de experiencia trabajando con entornos de plataformas de infraestructuras basadas en microservicios de Kubernetes y OpenShift. La persona incorporada pasará a formar parte de uno de los equipos en los que colaboramos en la administración de entornos private and public cloud como...
-
DevOps Engineer
hace 1 semana
Barcelona, Barcelona, España Logicalis Spain A tiempo completoEnLogicalis Spainactualmente estamos buscando a una persona con 2 años de experiencia trabajando con entornos de plataformas de infraestructuras basadas en microservicios deKubernetesyOpenShift. La persona incorporada pasará a formar parte de uno de los equipos en los que colaboramos en la administración de entornos private and public cloud como DevOps...
-
Principal AI/ML Engineer
hace 1 semana
Barcelona, Barcelona, España King A tiempo completoCraft:Data, Analytics & StrategyJob Description:Principal AI/ML Engineer – Operable Level EcosystemJob DescriptionDo you have a dynamic approach to solving Machine Learning problems at scale? Are you passionate about building machine learning (ML) tools for level designers? Do you believe machine learning (ML) can have a material impact on how we do...
-
DevOps Engineer
hace 2 días
Barcelona, Barcelona, España NEWCO COMMUNICATIONS A tiempo completoWe are searching for a DevOps Engineer within the Innovation Lab. As part of our technical backbone, the DevOps Engineer ensures that our applications, AI platforms, and internal tools run smoothly, securely, and efficiently across our company. The role combines infrastructure operations with hands-on full stack development, allowing you to contribute...
-
DevOps Engineer
hace 1 hora
Barcelona, Barcelona, España RemoteStar A tiempo completoAt RemoteStar, we're currently hiring for one of our client based inSpain.Visa Sponsorship will be provided.Work Location: Relocation to Barcelona / Madrid / ZaragozaRelocation package (if applicable) will be givenAbout client :Well-funded and fast-growing deep-tech company founded in 2019. We are the biggest Quantum Software company in the EU. They are also...
-
DevOps Engineer
hace 1 semana
Barcelona, Barcelona, España Artificial Solutions A tiempo completoHey there We are Artificial Solutions and we are looking for an DevOps Engineer with experience in Kubernetes to join our Cloud & Ops Team Is this you? You will be part of a team responsible for building, running and supporting our cloud infrastructure that supports our Teneo Platform (that allows developers to build Bots and Conversational AI...
-
DevOps Engineer
hace 2 días
Barcelona, Barcelona, España NewCo Communications A tiempo completoWe are for a DevOps Engineer within the Innovation Lab. As part of our technical backbone, the DevOps Engineer ensures that our applications, AI platforms, and internal tools run smoothly, securely, and efficiently across our company. The role combines infrastructure operations with hands-on full stack development, allowing you to contribute directly to the...
-
Senior Backend/Devops Engineer
hace 1 semana
Barcelona, Barcelona, España Startup Talents A tiempo completoThe employer offers crypto accounts for players in the gaming industry acting as a wallet -as -a -service solution supporting multi -chain accounts, gas sponsorship, pop -upless blockchain interactions, and flexible ownership. Theis solution enables games to interact with on -chain assets easily and programmably, allowing for faster launches, adaptability...
-
AI/ML Engineer Lead
hace 1 semana
Barcelona, Barcelona, España MIGx AG A tiempo completoJD ID JD0049Position Name AI/ML Engineer LeadAbout the profile About MIGxMIGx is a global consulting company with an exclusive focus on the healthcare and life science industries, with their particularly demanding requirements on quality and regulatory aspects. We have been managing challenges and solving problems for our clients in the areas of...
-
Senior Machine Learning Platform/Ops Engineer
hace 2 días
Barcelona, Barcelona, España Preply A tiempo completoWe power people's progress.At Preply, we're all about creating life-changing learning experiences. We help people discover the magic of the perfect tutor, craft a personalized learning journey, and stay motivated to keep growing. Our approach is human-led, tech-enabled - and it's creating real impact. So far, 90,000 tutors have delivered over 20 million...