DevOps & ML Ops Engineer | Spain (BCN, Madrid, Malaga or Palma)

Posted 2 weeks ago


Madrid, Spain | TransPerfect | Full-time
Job description

The DevOps & ML Ops Engineer will be responsible for developing and maintaining scalable, stable services that deliver machine learning models to end users with guaranteed uptime. The primary focus will be on the infrastructure, deployment, and continuous integration/continuous delivery (CI/CD) processes for our ML services.

RESPONSIBILITIES:

  • Manage resource allocation and workload scheduling for multiple ML services, ensuring efficient utilization of CPU/GPU resources and creating reliable queues based on service priorities.

  • Maintain VM environments, manage OS updates, and keep the VM inventory up to date.

  • Work alongside the Dev and QA teams to detect hot spots in our applications and put preventative measures in place before they become live issues.

  • Troubleshoot system configurations and provide solutions.

  • Plan, execute, and test disaster recovery.

  • Monitor and examine all application, performance, event, and system logs to assist in troubleshooting.

  • File all IT/colocation tickets, ensure requests are fulfilled, and escalate to the right person when necessary.

  • Design, develop, and maintain the infrastructure required for deploying and scaling machine learning services.

  • Implement and manage the CI/CD pipelines to ensure seamless and efficient deployment of ML models.

  • Collaborate with data scientists, ML researchers, and language experts to understand the requirements for deploying ML models and provide necessary infrastructure support.

  • Automate and streamline the build, test, and deployment processes to enhance efficiency and reduce time-to-market.

  • Monitor and optimize the performance, availability, and scalability of production ML systems.

  • Develop and maintain robust monitoring, logging, and alerting systems to proactively identify and address issues.

  • Implement security best practices to protect sensitive data and ensure compliance with relevant regulations.

  • Stay up to date with industry trends and emerging technologies related to ML Ops and DevOps, and propose innovative solutions to improve our ML service delivery.

Job requirements

REQUIRED SKILLS, EXPERIENCE AND QUALIFICATIONS:

  • Strong knowledge of cloud platforms (such as AWS, Azure, or GCP) and local cluster deployments, and experience in deploying and managing ML services on these platforms.

  • Knowledge of distributed computing frameworks (e.g., Spark) and big data technologies (e.g., Hadoop, Kafka).

  • Proficiency in Python, Shell, Ruby, Golang, or C++ and experience with infrastructure-as-code tools (e.g., Terraform, CloudFormation).

  • Hands-on experience with containerization technologies (e.g., Docker) and orchestration frameworks (e.g., Kubernetes).

  • Familiarity with CI/CD tools (e.g., Jenkins, GitLab CI/CD) and version control systems (e.g., Git).

  • Solid understanding of networking, security, and system administration concepts.

  • Strong problem-solving and troubleshooting skills, with the ability to quickly analyze and resolve issues in complex ML systems.

  • Excellent communication and collaboration skills, with the ability to work effectively in a team-oriented environment.

  • Bachelor's or higher degree in Computer Science, Engineering, or a related field.

  • Proven experience as an ML Ops Engineer, DevOps Engineer, or a similar role, with a focus on deploying and maintaining machine learning models in production environments.

DESIRED SKILLS AND EXPERIENCE:

  • Experience with machine learning frameworks and libraries, such as TensorFlow, PyTorch, or scikit-learn.

  • Familiarity with serverless computing and event-driven architectures.

  • Experience with logging and monitoring tools (e.g., ELK Stack, Prometheus, Grafana).

  • Understanding of software development methodologies and agile practices.

Hybrid
  • Madrid, Comunidad de Madrid, Spain
  • Malaga, Andalucía, Spain
  • Palma de Mallorca, Illes Balears [Islas Baleares], Spain
  • Barcelona, Catalunya [Cataluña], Spain

Tech | Full-time, Permanent
