Reliability Engineering Expert
hace 3 semanas
About This Role
We are seeking an experienced and skilled Senior Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will play a crucial role in designing, building, and maintaining the robust infrastructure that powers our products and services.
Key Responsibilities
- Design, develop, and maintain our platform infrastructure to ensure high availability, scalability, and reliability.
- Collaborate with cross-functional teams to understand product requirements and provide technical guidance for infrastructure design and implementation.
- Built and maintain automated systems for deployment, monitoring, alerting, and incident response.
- Participate in incident handling, investigating, and resolving production incidents to minimize impact and ensure system stability.
- Perform capacity planning and optimization to ensure the platform meets performance and scalability targets.
- Conduct regular system and performance analysis, identify improvement areas, and implement solutions to enhance efficiency and stability.
- Troubleshoot and resolve complex system issues, including performance bottlenecks, network connectivity problems, and infrastructure failures.
- Implement and maintain security best methodologies throughout the platform to ensure compliance with industry standards and regulations.
- Collaborate with software engineers to define and implement DevOps practices, CI/CD pipelines, and infrastructure-as-code (IaC) approaches.
- Participate in on-call rotations to provide support for the production environment, responding to and resolving incidents in a timely manner.
Requirements
- 8+ years of relevant work experience as a Platform Engineer, SRE, or similar role, implementing operational processes and tools, managing environments, and large-scale cloud application environments.
- Strong knowledge of system architecture and networking concepts, specifically designing scalable and fault-tolerant systems.
- Very good knowledge of Automated Build Systems, including Jenkins and Spinnaker for building and managing CI/CD pipelines, GitOps, and tools like ArgoCD and Flux.
- Proven experience in Infrastructure as Code solutions, including Terraform, Azure Resource Manager, Google Cloud Deployment Manager, and AWS CloudFormation.
- IT Automation tools, including Ansible, Puppet, and experience with Kubernetes CNCF distributions like SUSE RKE2, Red Hat Openshift, and VMware Tanzu.
- Proficiency in at least one programming language, including Python, scripting languages like Bash and PowerShell, and other languages like Go and Javascript.
- Experience with databases, preferably NewSQL, and NoSQL knowledge.
- Solid understanding of Linux-based systems, including administration, troubleshooting, and performance tuning.
- Deep knowledge of monitoring and experience with implementing observability practices, including Prometheus, ELK, OpenTelemetry, Grafana, Jaeger, and Dynatrace.
- Excellent problem-solving and analytical skills, with the ability to troubleshoot complex issues and provide effective solutions.
- Strong communication and collaboration skills, with the ability to work effectively in cross-functional teams.
- Proficiency in English, both written and spoken.
Why Choose Ericsson?
At Ericsson, you'll have an outstanding opportunity to use your skills and imagination to push the boundaries of what's possible. You'll be challenged, but you won't be alone. You'll be joining a team of diverse innovators, all driven to go beyond the status quo to craft what comes next.
-
Reliability Engineering Expert
hace 3 semanas
Madrid, Madrid, España Ebury A tiempo completoAt Ebury, we're on a mission to revolutionize the FinTech sector with innovative solutions that empower businesses to thrive globally. As a Reliability Engineering Expert, you'll play a pivotal role in ensuring the high availability and reliability of our systems.About UsEbury is a leading FinTech company, recognized for its exceptional growth and...
-
Reliability and Safety Engineering Expert
hace 3 semanas
Madrid, Madrid, España Thales A tiempo completoThales Alenia Space is a leading manufacturer of high-tech space solutions, delivering cutting-edge products for telecommunications, navigation, Earth observation, environmental management, exploration, science, and orbital infrastructure development.We have customers with diverse needs in Space to Connect, Secure & Defend, Observe & Protect, Explore, Travel...
-
Platform Reliability Expert
hace 2 semanas
Madrid, Madrid, España Ericsson A tiempo completoJob SummaryWe are seeking a highly skilled Platform Reliability Expert to join our team at Ericsson. As a key member of our engineering team, you will be responsible for designing, building, and maintaining robust infrastructure that powers our products and services.About the RoleYou will participate in developing and implementing best methodologies in site...
-
Reliability Engineering Team Lead
hace 3 semanas
Madrid, Madrid, España Ebury A tiempo completoJob Summary:We are seeking a seasoned Cloud Infrastructure Expert to lead our platform engineering team. In this role, you will oversee the design and implementation of highly available and scalable applications.About Ebury:Ebury is a FinTech firm that offers a range of products including FX risk management, trade finance, currency accounts, international...
-
Reliability Engineering Operations Lead
hace 1 semana
Madrid, Madrid, España Amazon A tiempo completoAbout AmazonAmazon is one of the largest Fulfillment Centers (FC) in the Southern Hemisphere and first Amazon Robotics Site in Australia, looking for an experienced Reliability Maintenance Engineering leader to join our team in a highly automated and fast-paced Robotics Fulfillment Center (FC) in Kemps Creek.Job SummaryWe are seeking a highly skilled...
-
Reliability Engineering Team Lead
hace 3 semanas
Madrid, Madrid, España Amazon A tiempo completoOverviewWe are seeking a seasoned Reliability Maintenance Engineering professional to lead our team in maintaining site operations in a safe, standard, and efficient manner.About the RoleThis is an exciting opportunity for an experienced Reliability Engineer to take on a leadership role within our fast-paced Robotics Fulfillment Centre. As a Reliability...
-
Reliability and Automation Engineering Expert
hace 4 semanas
Madrid, Madrid, España Amazon A tiempo completoAbout the RoleWe are seeking a highly skilled Systems Development Engineer to join our Reliability and Automation Engineering Team at Amazon. This is an exciting opportunity to work on large-scale systems, gain top-notch experience in systems development, and contribute to the forward-looking vision of the team.Key ResponsibilitiesDesign, deploy, monitor,...
-
Reliability Maintenance Engineering Team Lead
hace 3 semanas
Madrid, Madrid, España Amazon A tiempo completoJob Title: Reliability Maintenance Engineering Team LeadAbout the Role:We are seeking an experienced Reliability Maintenance Engineering Team Lead to join our team in one of our highly automated and fast-paced Fulfilment Centres (FC).Key Responsibilities:Lead, support, and mentor a team of engineering technicians to maintain a safe, standard, and efficient...
-
Reliability Expert for Railway Signalling Systems
hace 4 semanas
Madrid, Madrid, España Alstom Gruppe A tiempo completoOverviewAlstom, a leading global player in the transport sector, is seeking a highly skilled Reliability Expert to join its team in Madrid. This exciting opportunity will see you take on a challenging role that combines reliability expertise with railway signalling systems.
-
Reliability and Automation Engineering Expert
hace 3 semanas
Madrid, Madrid, España Amazon A tiempo completoCompany OverviewAt Amazon, we're committed to delivering exceptional customer experiences. Our Reliability and Maintenance Engineering (RME) team plays a critical role in managing Amazon's fast growth and technology innovation.About the JobWe are seeking an experienced Systems Development Engineer to join our Global RME Central Team. As a key member of our...
-
Madrid, Madrid, España Contentsquare A tiempo completoAbout the Role:We are seeking a highly skilled Reliability Engineering Manager to join our Platform team at Contentsquare.As a key member of our Engineering organization, you will be responsible for leading our Platform Reliability efforts, ensuring the stability and performance of our SaaS platform.With a focus on building a high-performing team, you will...
-
Strategic Reliability Engineering Leadership Opportunity
hace 3 semanas
Madrid, Madrid, España Amazon A tiempo completoAbout the RoleWe are seeking an experienced Reliability Maintenance Engineering leader to join our team in one of our highly automated and fast-paced Robotics Fulfillment Centers (FC) in Kemps Creek.Key ResponsibilitiesLead, support, and mentor a Reliability, Maintenance, and Engineering team, ensuring site operations are maintained in a safe, standard, and...
-
Ground Engineering Expert
hace 5 días
Madrid, Madrid, España Arup A tiempo completoOpportunity OverviewWe are seeking a highly motivated Ground Engineering Expert to join our dynamic team in Madrid. As a key member of our Geotechnics department, you will contribute to delivering high-quality ground engineering solutions for clients across various sectors.Job SummaryThe successful candidate will have the opportunity to work on a wide range...
-
Cloud Engineering Expert
hace 3 semanas
Madrid, Madrid, España 0014 DXC Technology Spain, S.A. A tiempo completoThe role of Cloud Engineering Expert - AWS DevOps at 0014 DXC Technology Spain, S.A. is to design and implement monitoring solutions on the AWS platform using Kubernetes and Grafana/Prometheus.This position requires a deep understanding of cloud architecture, containerization, monitoring, and automation technologies.The team works closely with development,...
-
Senior Railway Reliability Engineer
hace 3 semanas
Madrid, Madrid, España Abylsen A tiempo completoAbylsen is seeking a highly motivated Senior Railway Reliability Engineer to join our railway division in Spain. As we expand our capabilities, you will play a crucial role in the analysis, design, and evaluation of system safety and reliability across various railway projects.Position OverviewThe Senior Railway Reliability Engineer will be responsible for...
-
Reliability Engineer in Madrid
hace 3 semanas
Madrid, Madrid, España Alstom A tiempo completoRole OverviewWe are seeking a skilled Reliability Engineer to join our team in Madrid. As a key member of our organization, you will play a crucial role in ensuring the reliability and maintainability of our rail systems.The ideal candidate will have a strong background in engineering, with experience in RAMS (Reliability, Availability, Maintainability, and...
-
Senior Digital Engineering Expert
hace 1 semana
Madrid, Madrid, España Boeing A tiempo completoAt Boeing, we are seeking a highly skilled Senior Digital Engineering Expert to join our team in Madrid, Spain or Munich, Germany. In this role, you will contribute to the mission of Boeing's Digital Engineering (DE) Global Center, a global initiative supporting the digital transformation of Boeing's engineering.About the RoleThis is a mid-level position...
-
Platform Engineering Team Lead
hace 1 mes
Madrid, Madrid, España Ebury A tiempo completoAbout the Role:Ebury is a leading FinTech firm, seeking an experienced Platform Engineering Team Lead to join our team in Madrid. In this role, you will be responsible for providing leadership for our platform engineering teams, ensuring high availability and reliability of our systems.Responsibilities:Lead a team of SREs and collaborate with other teams to...
-
Platform Engineering Team Lead
hace 3 semanas
Madrid, Madrid, España Ebury A tiempo completoEbury, a hyper-growth FinTech firm and one of the top 15 European Fintechs to work for by AltFi, offers a range of products including FX risk management, trade finance, currency accounts, international payments and API integration.We are seeking an experienced Platform Engineering Team Lead to join our team in Madrid. As a key member of our platform...
-
Senior Site Reliability Engineer
hace 3 semanas
Madrid, Madrid, España Hitachi Vantara A tiempo completoSite Reliability Engineering RoleWe are seeking a skilled Senior Site Reliability Engineer to join our team at Hitachi Vantara. As a key member of our platform engineering group, you will be responsible for designing, building, and maintaining our site reliability infrastructure. Key Responsibilities:Develop and maintain system software performance,...