Reliability Engineering Expert

hace 3 semanas


Madrid, Madrid, España Ericsson A tiempo completo

About This Role

We are seeking an experienced and skilled Senior Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will play a crucial role in designing, building, and maintaining the robust infrastructure that powers our products and services.

Key Responsibilities

  • Design, develop, and maintain our platform infrastructure to ensure high availability, scalability, and reliability.
  • Collaborate with cross-functional teams to understand product requirements and provide technical guidance for infrastructure design and implementation.
  • Built and maintain automated systems for deployment, monitoring, alerting, and incident response.
  • Participate in incident handling, investigating, and resolving production incidents to minimize impact and ensure system stability.
  • Perform capacity planning and optimization to ensure the platform meets performance and scalability targets.
  • Conduct regular system and performance analysis, identify improvement areas, and implement solutions to enhance efficiency and stability.
  • Troubleshoot and resolve complex system issues, including performance bottlenecks, network connectivity problems, and infrastructure failures.
  • Implement and maintain security best methodologies throughout the platform to ensure compliance with industry standards and regulations.
  • Collaborate with software engineers to define and implement DevOps practices, CI/CD pipelines, and infrastructure-as-code (IaC) approaches.
  • Participate in on-call rotations to provide support for the production environment, responding to and resolving incidents in a timely manner.

Requirements

  • 8+ years of relevant work experience as a Platform Engineer, SRE, or similar role, implementing operational processes and tools, managing environments, and large-scale cloud application environments.
  • Strong knowledge of system architecture and networking concepts, specifically designing scalable and fault-tolerant systems.
  • Very good knowledge of Automated Build Systems, including Jenkins and Spinnaker for building and managing CI/CD pipelines, GitOps, and tools like ArgoCD and Flux.
  • Proven experience in Infrastructure as Code solutions, including Terraform, Azure Resource Manager, Google Cloud Deployment Manager, and AWS CloudFormation.
  • IT Automation tools, including Ansible, Puppet, and experience with Kubernetes CNCF distributions like SUSE RKE2, Red Hat Openshift, and VMware Tanzu.
  • Proficiency in at least one programming language, including Python, scripting languages like Bash and PowerShell, and other languages like Go and Javascript.
  • Experience with databases, preferably NewSQL, and NoSQL knowledge.
  • Solid understanding of Linux-based systems, including administration, troubleshooting, and performance tuning.
  • Deep knowledge of monitoring and experience with implementing observability practices, including Prometheus, ELK, OpenTelemetry, Grafana, Jaeger, and Dynatrace.
  • Excellent problem-solving and analytical skills, with the ability to troubleshoot complex issues and provide effective solutions.
  • Strong communication and collaboration skills, with the ability to work effectively in cross-functional teams.
  • Proficiency in English, both written and spoken.

Why Choose Ericsson?

At Ericsson, you'll have an outstanding opportunity to use your skills and imagination to push the boundaries of what's possible. You'll be challenged, but you won't be alone. You'll be joining a team of diverse innovators, all driven to go beyond the status quo to craft what comes next.



  • Madrid, Madrid, España Ebury A tiempo completo

    At Ebury, we're on a mission to revolutionize the FinTech sector with innovative solutions that empower businesses to thrive globally. As a Reliability Engineering Expert, you'll play a pivotal role in ensuring the high availability and reliability of our systems.About UsEbury is a leading FinTech company, recognized for its exceptional growth and...


  • Madrid, Madrid, España Thales A tiempo completo

    Thales Alenia Space is a leading manufacturer of high-tech space solutions, delivering cutting-edge products for telecommunications, navigation, Earth observation, environmental management, exploration, science, and orbital infrastructure development.We have customers with diverse needs in Space to Connect, Secure & Defend, Observe & Protect, Explore, Travel...


  • Madrid, Madrid, España Ericsson A tiempo completo

    Job SummaryWe are seeking a highly skilled Platform Reliability Expert to join our team at Ericsson. As a key member of our engineering team, you will be responsible for designing, building, and maintaining robust infrastructure that powers our products and services.About the RoleYou will participate in developing and implementing best methodologies in site...


  • Madrid, Madrid, España Ebury A tiempo completo

    Job Summary:We are seeking a seasoned Cloud Infrastructure Expert to lead our platform engineering team. In this role, you will oversee the design and implementation of highly available and scalable applications.About Ebury:Ebury is a FinTech firm that offers a range of products including FX risk management, trade finance, currency accounts, international...


  • Madrid, Madrid, España Amazon A tiempo completo

    About AmazonAmazon is one of the largest Fulfillment Centers (FC) in the Southern Hemisphere and first Amazon Robotics Site in Australia, looking for an experienced Reliability Maintenance Engineering leader to join our team in a highly automated and fast-paced Robotics Fulfillment Center (FC) in Kemps Creek.Job SummaryWe are seeking a highly skilled...


  • Madrid, Madrid, España Amazon A tiempo completo

    OverviewWe are seeking a seasoned Reliability Maintenance Engineering professional to lead our team in maintaining site operations in a safe, standard, and efficient manner.About the RoleThis is an exciting opportunity for an experienced Reliability Engineer to take on a leadership role within our fast-paced Robotics Fulfillment Centre. As a Reliability...


  • Madrid, Madrid, España Amazon A tiempo completo

    About the RoleWe are seeking a highly skilled Systems Development Engineer to join our Reliability and Automation Engineering Team at Amazon. This is an exciting opportunity to work on large-scale systems, gain top-notch experience in systems development, and contribute to the forward-looking vision of the team.Key ResponsibilitiesDesign, deploy, monitor,...


  • Madrid, Madrid, España Amazon A tiempo completo

    Job Title: Reliability Maintenance Engineering Team LeadAbout the Role:We are seeking an experienced Reliability Maintenance Engineering Team Lead to join our team in one of our highly automated and fast-paced Fulfilment Centres (FC).Key Responsibilities:Lead, support, and mentor a team of engineering technicians to maintain a safe, standard, and efficient...


  • Madrid, Madrid, España Alstom Gruppe A tiempo completo

    OverviewAlstom, a leading global player in the transport sector, is seeking a highly skilled Reliability Expert to join its team in Madrid. This exciting opportunity will see you take on a challenging role that combines reliability expertise with railway signalling systems.


  • Madrid, Madrid, España Amazon A tiempo completo

    Company OverviewAt Amazon, we're committed to delivering exceptional customer experiences. Our Reliability and Maintenance Engineering (RME) team plays a critical role in managing Amazon's fast growth and technology innovation.About the JobWe are seeking an experienced Systems Development Engineer to join our Global RME Central Team. As a key member of our...


  • Madrid, Madrid, España Contentsquare A tiempo completo

    About the Role:We are seeking a highly skilled Reliability Engineering Manager to join our Platform team at Contentsquare.As a key member of our Engineering organization, you will be responsible for leading our Platform Reliability efforts, ensuring the stability and performance of our SaaS platform.With a focus on building a high-performing team, you will...


  • Madrid, Madrid, España Amazon A tiempo completo

    About the RoleWe are seeking an experienced Reliability Maintenance Engineering leader to join our team in one of our highly automated and fast-paced Robotics Fulfillment Centers (FC) in Kemps Creek.Key ResponsibilitiesLead, support, and mentor a Reliability, Maintenance, and Engineering team, ensuring site operations are maintained in a safe, standard, and...


  • Madrid, Madrid, España Arup A tiempo completo

    Opportunity OverviewWe are seeking a highly motivated Ground Engineering Expert to join our dynamic team in Madrid. As a key member of our Geotechnics department, you will contribute to delivering high-quality ground engineering solutions for clients across various sectors.Job SummaryThe successful candidate will have the opportunity to work on a wide range...

  • Cloud Engineering Expert

    hace 3 semanas


    Madrid, Madrid, España 0014 DXC Technology Spain, S.A. A tiempo completo

    The role of Cloud Engineering Expert - AWS DevOps at 0014 DXC Technology Spain, S.A. is to design and implement monitoring solutions on the AWS platform using Kubernetes and Grafana/Prometheus.This position requires a deep understanding of cloud architecture, containerization, monitoring, and automation technologies.The team works closely with development,...


  • Madrid, Madrid, España Abylsen A tiempo completo

    Abylsen is seeking a highly motivated Senior Railway Reliability Engineer to join our railway division in Spain. As we expand our capabilities, you will play a crucial role in the analysis, design, and evaluation of system safety and reliability across various railway projects.Position OverviewThe Senior Railway Reliability Engineer will be responsible for...


  • Madrid, Madrid, España Alstom A tiempo completo

    Role OverviewWe are seeking a skilled Reliability Engineer to join our team in Madrid. As a key member of our organization, you will play a crucial role in ensuring the reliability and maintainability of our rail systems.The ideal candidate will have a strong background in engineering, with experience in RAMS (Reliability, Availability, Maintainability, and...


  • Madrid, Madrid, España Boeing A tiempo completo

    At Boeing, we are seeking a highly skilled Senior Digital Engineering Expert to join our team in Madrid, Spain or Munich, Germany. In this role, you will contribute to the mission of Boeing's Digital Engineering (DE) Global Center, a global initiative supporting the digital transformation of Boeing's engineering.About the RoleThis is a mid-level position...


  • Madrid, Madrid, España Ebury A tiempo completo

    About the Role:Ebury is a leading FinTech firm, seeking an experienced Platform Engineering Team Lead to join our team in Madrid. In this role, you will be responsible for providing leadership for our platform engineering teams, ensuring high availability and reliability of our systems.Responsibilities:Lead a team of SREs and collaborate with other teams to...


  • Madrid, Madrid, España Ebury A tiempo completo

    Ebury, a hyper-growth FinTech firm and one of the top 15 European Fintechs to work for by AltFi, offers a range of products including FX risk management, trade finance, currency accounts, international payments and API integration.We are seeking an experienced Platform Engineering Team Lead to join our team in Madrid. As a key member of our platform...


  • Madrid, Madrid, España Hitachi Vantara A tiempo completo

    Site Reliability Engineering RoleWe are seeking a skilled Senior Site Reliability Engineer to join our team at Hitachi Vantara. As a key member of our platform engineering group, you will be responsible for designing, building, and maintaining our site reliability infrastructure. Key Responsibilities:Develop and maintain system software performance,...