DevOps & ML Ops Engineer

1 week ago


Madrid, Spain TransPerfect Full-time

The DevOps & ML Ops Engineer is responsible for developing and maintaining scalable, stable services that deliver machine learning models to end users with guaranteed uptime. The primary focus will be on the infrastructure, deployment, and continuous integration/continuous delivery (CI/CD) processes for our ML services.

RESPONSIBILITIES

• Manage resource allocation and workload scheduling for multiple ML services, ensuring efficient utilization of CPU/GPU resources and creating reliable queues based on service priorities.
• Maintain VM environments, manage OS updates, and keep an up-to-date VM inventory.
• Work alongside the Dev and QA teams to detect hot spots in our applications and put preventative measures in place before they become live issues.
• Troubleshoot and provide solutions for system configurations.
• Plan, execute, and test disaster recovery.
• Monitor and examine all application, performance, event, and system logs to assist in troubleshooting.
• File all IT/colocation tickets, ensuring requests are fulfilled and escalating to the right person if necessary.
• Design, develop, and maintain the infrastructure required for deploying and scaling machine learning services.
• Implement and manage the CI/CD pipelines to ensure seamless and efficient deployment of ML models.
• Collaborate with data scientists, ML researchers, and language experts to understand the requirements for deploying ML models and provide the necessary infrastructure support.
• Automate and streamline the build, test, and deployment processes to enhance efficiency and reduce time-to-market.
• Monitor and optimize the performance, availability, and scalability of production ML systems.
• Develop and maintain robust monitoring, logging, and alerting systems to proactively identify and address issues.
• Implement security best practices to protect sensitive data and ensure compliance with relevant regulations.
• Stay up to date with industry trends and emerging technologies related to ML Ops and DevOps, and propose innovative solutions to improve our ML service delivery.

REQUIRED SKILLS, EXPERIENCE AND QUALIFICATIONS

• Strong knowledge of cloud platforms (such as AWS, Azure, or GCP) and local cluster deployments, and experience in deploying and managing ML services on these platforms.
• Knowledge of distributed computing frameworks (e.g., Spark) and big data technologies (e.g., Hadoop, Kafka).
• Proficiency in Python, Shell, Ruby, Golang, or C++ and experience with infrastructure-as-code tools (e.g., Terraform, CloudFormation).
• Hands-on experience with containerization technologies (e.g., Docker) and orchestration frameworks (e.g., Kubernetes).
• Familiarity with CI/CD tools (e.g., Jenkins, GitLab CI/CD) and version control systems (e.g., Git).
• Solid understanding of networking, security, and system administration concepts.
• Strong problem-solving and troubleshooting skills, with the ability to quickly analyze and resolve issues in complex ML systems.
• Excellent communication and collaboration skills, with the ability to work effectively in a team-oriented environment.
• Bachelor's or higher degree in Computer Science, Engineering, or a related field.
• Proven experience as an ML Ops Engineer, DevOps Engineer, or a similar role, with a focus on deploying and maintaining machine learning models in production environments.

DESIRED SKILLS AND EXPERIENCE

• Experience with machine learning frameworks and libraries, such as TensorFlow, PyTorch, or scikit-learn.
• Familiarity with serverless computing and event-driven architectures.
• Experience with logging and monitoring tools (e.g., ELK Stack, Prometheus, Grafana).
• Understanding of software development methodologies and agile practices.

Seniority level: Mid-Senior level
Employment type: Full-time
Job function: Engineering and Information Technology
Industries: Translation and Localization, Software Development, and IT Services and IT Consulting



  • Madrid, Spain TransPerfect Full-time

    Job description: The DevOps & ML Ops Engineer would be responsible for developing and maintaining scalable, stable services that deliver machine learning models to end users with guaranteed uptime. The primary focus will be on the infrastructure, deployment, and continuous integration/continuous delivery (CI/CD) processes for our ML...


  • Madrid, Spain EPAM Systems Full-time

    Principal ML Ops Engineer at EPAM Systems. We are looking for an ML Ops Engineer to join our Enterprise AI Products and Technology Team. The ideal candidate will have industry-relevant experience delivering Machine Learning or Data Science projects at scale. You will be part of a collaborative team of multidisciplinary engineers and...


  • Madrid, Spain Workato Full-time

    Senior Infrastructure Engineer (ML/AI) – Workato, Madrid, Community of Madrid, Spain. Workato transforms technology complexity into business opportunity and is a leader in enterprise orchestration with an AI-powered platform that enables teams to connect data, processes,...


  • Madrid, Spain Talent Connect Full-time

    MLOps / DevOps Engineer (AI/ML & GenAI) – Location: Spain. Contract: Full-time. Language: English B2+ (required). We are looking for an MLOps / DevOps Engineer to design, implement, and operate the AI/ML and GenAI platform end to end: from ingestion and training to deployment, monitoring, and governance in the cloud. You will be key to...


  • Madrid, Spain Talent Connect Full-time

    MLOps / DevOps Engineer (AI/ML & GenAI). Location: Spain (remote or hybrid). Contract: Full-time. Language: English B2+ (required). About the role: We are looking for an MLOps / DevOps Engineer to design, implement, and operate the AI/ML and GenAI platform end to end: from ingestion and training to deployment, monitoring, and...


  • Madrid, Spain EPAM Systems Full-time

    We are looking for an ML Ops Engineer to join our Enterprise AI Products and Technology Team. The ideal candidate will have industry-relevant experience delivering Machine Learning or Data Science projects at scale. You will be part of a collaborative team of multidisciplinary engineers and, working closely with data science teams, have a...

  • DevOps Engineer

    3 weeks ago


    Madrid, Spain Oxigent Technologies Full-time

    Would you like to keep growing as a DevOps engineer at a leading multinational in audience measurement solutions, in a highly technical international environment where you can learn, grow, and collaborate with top-tier teams? At Oxigent Technologies we are recruiting a DEVOPS ENGINEER to take part in a global project, under a work arrangement of...

  • ML Engineer, Hybrid

    1 week ago


    Madrid, Spain CAS TRAINING Full-time

    ML Engineer, hybrid. Cas Training, a leading company with more than 20 years in technology consulting, outsourcing, and specialized training, is recruiting an ML Engineer for a major HYBRID project IN MADRID. EXPERIENCE: At least 2-3 years as an ML Engineer. Mandatory skills: Extensive experience working with Python and its libraries for...


  • Madrid, Spain EPAM Systems, Inc. Full-time

    You will be part of a collaborative team of multidisciplinary engineers and working closely with data science teams, have a chance to create tools, standards and automate commonly used tasks of the machine learning product lifecycle. A part of the role is also to increase the capabilities of the platforms team to better suit the data scientist’s ways of...