Senior Site Reliability Engineer

hace 3 semanas


España F. Hoffmann-La Roche Gruppe A tiempo completo

Roche fosters diversity, equity and inclusion, representing the communities we serve. When dealing with healthcare on a global scale, diversity is an essential ingredient to success. We believe that inclusion is key to understanding people’s varied healthcare needs. Together, we embrace individuality and share a passion for exceptional care. Join Roche, where every voice matters.The PositionThe role requires the candidate to be available for on-call duty service, responding promptly to urgent issues and emergencies outside of regular working hours, ensuring that critical situations are addressed in a timely and effective manner.Your MissionDesign and maintain cutting-edge tools, scripts, and frameworks that automate repetitive tasks, streamline software deployment, and manage expansive systems with unparalleled efficiency.Partner closely with forward-thinking development teams to architect and implement high-performance solutions that elevate system efficiency, optimize resource utilization, and enhance deployment processes for superior uptime and user satisfaction.Your Core ResponsibilitiesReliability Mastery: Proactively monitor and maintain system reliability using advanced tools like DataDog, VictorOps, ELK, Grafana, and Prometheus. Become a key player in ensuring system stability and performance.Uptime Guardian: Ensure optimal uptime and performance by swiftly identifying issues and responding to alerts with precision.Technical Troubleshooter: Basic understanding of architecture and designs to deep dive into complex technical issues, troubleshoot, investigate, and resolve them. Collaborate seamlessly with engineering teams to enable timely and effective resolutions.Service Excellence: Maintain and consistently achieve defined SLAs, SLIs, and SLOs, ensuring service levels are consistently met or exceeded.Automation Innovator: Develop and deploy automation scripts (using Python or other scripting languages) to streamline operations, enhance system efficiencies, and reduce manual tasks.Cloud Steward: Manage and maintain robust infrastructure across AWS and Azure environments, implementing best practices to ensure peak performance and reliability of cloud-based applications.Cross-functional Collaborator: Work closely with engineering, DevOps, security, and operations teams to drive continuous improvement and foster a culture of reliability and inclusion.Incident Responder: Handle requests and incidents through JIRA and ServiceNow, documenting troubleshooting procedures, solutions, and lessons learned to fuel ongoing improvements.Flexible Scheduling: Work on-call outside of normal working hours and weekends as scheduled to ensure continuous support.Team Builder: Actively contribute to the growth and development of the SRE team's capabilities, nurturing a stronger, more inclusive, and resilient team.Who You Are:Educational Background: Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent professional experience.Certifications: Relevant industry certifications (AWS/Azure) to showcase your expertise.Experience: Approximately 5 years of experience in site reliability engineering, IT operations, DevOps, or related fields.Cloud Expertise: Solid experience with AWS and/or Azure, including setting up, monitoring, and maintaining cloud resources.Tool Proficiency: Proficiency with monitoring and logging tools such as DataDog, Splunk-Oncall, ELK stack, Grafana, and Prometheus.Hands-On Skills: Hands-on experience with JIRA and ServiceNow for tracking incidents, requests, and documentation.Scripting Knowledge: Proficiency in Python or similar scripting languages for automation purposes.Incident Response: Understanding of SRE Core principles and incident prioritization.Troubleshooting: Demonstrates proficient troubleshooting capabilities, especially in cloud and distributed system environments.Communication and Teamwork: Excellent communication, teamwork, and documentation skills.Why Join Us?By joining our team, you will be part of a dynamic environment where your contributions will directly impact the resilience and reliability of our services. You will have opportunities for professional growth and the ability to collaborate with industry leaders. Let’s drive the future of IT stability together, ensuring an exceptional experience for our customers.Ready to make a difference? Apply now to be our next SRE Incident Manager and help us build a more reliable futureWho we areAt Roche, more than 100,000 people across 100 countries are pushing back the frontiers of healthcare. Working together, we’ve become one of the world’s leading research-focused healthcare groups. Our success is built on innovation, curiosity and diversity.Roche is an Equal Opportunity Employer.
#J-18808-Ljbffr


  • Site Reliability Engineer

    hace 4 semanas


    España buscojobs España A tiempo completo

    Senior Site Reliability Engineer (SRE) - Fintech SectorLocation: Barcelona, Spain (Hybrid Model)Company Overview:Join a leading international fintech company at the forefront of innovation, revolutionizing financial services for millions worldwide. Our client is looking for a Senior Site Reliability Engineer (SRE) to play a pivotal role in ensuring the...


  • España Antal International A tiempo completo

    Job DescriptionSenior Site Reliability Engineer (SRE) - Fintech SectorLocation: Barcelona, Spain (Hybrid Model)Company Overview:Join a leading international fintech company at the forefront of innovation, revolutionizing financial services for millions worldwide. Our client is looking for a Senior Site Reliability Engineer (SRE) to play a pivotal role in...

  • Site Reliability Engineer

    hace 2 semanas


    España Antal International A tiempo completo

    Job DescriptionCompany Overview:Join a leading international fintech company at the forefront of innovation, revolutionizing financial services for millions worldwide. Our client is looking for a Senior Site Reliability Engineer (SRE) to play a pivotal role in ensuring the scalability, reliability, and sustainability of their services.Position Overview:As a...


  • España Blacklane A tiempo completo

    We are seeking an experienced Senior Site Reliability Engineer (SRE) to join our team and play a key role in driving the adoption of SRE best practices across our organization. If you are passionate about building reliable systems, enabling cultural transformation, and mentoring teams, this is the perfect opportunity for you. You'll work on mission-critical...


  • España Ebury A tiempo completo

    Senior Site Reliability Engineer - FintechLocation: MadridCompany: Ebury Madrid Office - Hybrid: 4 days in the office, 1 day working from homeEbury is a hyper-growth FinTech firm, named in 2021 as one of the top 15 European Fintechs to work for by AltFi. We offer a range of products including FX risk management, trade finance, currency accounts,...


  • España Zartis A tiempo completo

    The company and our mission: Zartis is a digital solutions provider working across technology strategy, software engineering and product development. We partner with firms across financial services, MedTech, media, logistics technology, renewable energy, EdTech, e-commerce, and more. Our engineering hubs in EMEA and LATAM are full of talented professionals...


  • España dynaTrace software GmbH A tiempo completo

    Our Business Insights team is looking for a DevOps to enhance our internal process and scale our delivery capabilities. The focus is to embrace the NoOps thinking and assist with knowledge in areas such as delivery, automation, and remediation. Suppose you have a passion for large-scale deployments and an interest in growing your skills around Site...


  • España Cabify A tiempo completo

    Do you want to change the world? At Cabify, that's what we're doing. We aim to make cities better places to live by improving mobility for the people living in them, connecting riders to drivers, providing mobility alternatives such as scooters and mopeds and many others to come, all at the touch of a button. Maybe one day cities will be places where nobody...

  • Site Reliability Engineer

    hace 2 semanas


    España Talent Recruit A tiempo completo

    Company Background: We are representing a renowned leader in digital and technology consulting. Specialising in services such as online and social media audits, digital analytics, web and mobile app development, as well as marketing and CRM automation, our client is at the forefront of innovation. If you're seeking an exciting new role as a Site Reliability...


  • España Hub71 Ltd A tiempo completo

    Únete a Bit2Me como Senior Site Reliability Engineer (SRE)¿Y si pudieras llevar los sistemas a gran escala al siguiente nivel? La respuesta es clara: únete a Bit2Me. Aquí encontrarás el entorno perfecto para innovar, optimizar y dejar tu huella en el rendimiento y la fiabilidad de sistemas que impactan a millones de usuarios.¿Cómo será tu trabajo...

  • Site Reliability Engineer

    hace 2 semanas


    España CDmon A tiempo completo

    Site Reliability Engineer (SRE) en Híbrido¡Únete a nuestro equipo en cdmon.com! Somos una destacada empresa española de dominios y servicios web dedicada a crear una Internet abierta y de calidad donde cualquiera pueda estar. Nos enorgullece desarrollar y ofrecer nuestros propios sistemas de hosting basados en Linux brindando una amplia gama de servicios...


  • España Roche A tiempo completo

    The Position Senior Site Reliability Engineer (Kubernetes Platform) - Digital Products and Enablement The 21st century needs a 21st century healthcare system. To help build this, Roche is not only developing highly personalized medicine and advanced diagnostics, but also heavily investing into software and digital solutions. To speed up medical processes,...


  • España Roche A tiempo completo

    Roche fosters diversity, equity and inclusion, representing the communities we serve. When dealing with healthcare on a global scale, diversity is an essential ingredient to success. We believe that inclusion is key to understanding people’s varied healthcare needs. Together, we embrace individuality and share a passion for exceptional care. Join Roche,...

  • Site Reliability Engineer

    hace 2 semanas


    España IDEMIA A tiempo completo

    You may not know our name, but you have surely used our innovations and solutions. Our mission is to unlock the world and make it safer through cutting-edge identity technologies. Every day, around the globe, we are enabling citizens and consumers alike to perform their daily critical activities (such as pay, connect and travel), in the physical as well as...


  • España Spectro Cloud A tiempo completo

    Who We Are Spectro Cloud aims to make infrastructure boundaryless for the enterprise, from data center to edge and every platform in between. We provide solutions that help enterprises run applications on Kubernetes, their way, anywhere. Established by a team of multi-cloud management experts and industry veterans with a track record of success, we're at the...

  • Site Reliability Engineer

    hace 4 semanas


    España buscojobs España A tiempo completo

    Intuition Machines uses AI/ML to build enterprise security products. We apply our research to systems that serve hundreds of millions of people, with a team distributed around the world. You are probably familiar with our best-known product, the hCaptcha security suite. Our approach is simple: low overhead, small teams, and rapid iteration.Role OverviewAs a...


  • España Intuition Machines, Inc. A tiempo completo

    Intuition Machines uses AI/ML to build enterprise security products. We apply our research to systems that serve hundreds of millions of people, with a team distributed around the world. You are probably familiar with our best-known product, the hCaptcha security suite. Our approach is simple: low overhead, small teams, and rapid iteration.Role OverviewAs a...


  • España ING A tiempo completo

    At ING we are looking for a Site Reliability Engineer Your role and work environment : We are looking for a talented and enthusiastic Site Reliability Engineer (SRE) to join our Team of SRE Expert Unit. The responsibility of this team is to ensure the reliability and scalability of the platform to provide the best customer experience to our clients and our...


  • España buscojobs Argentina A tiempo completo

    Intuition Machines uses AI/ML to build enterprise security products. We apply our research to systems that serve hundreds of millions of people, with a team distributed around the world. You are probably familiar with our best-known product, the hCaptcha security suite. Our approach is simple: low overhead, small teams, and rapid iteration.As a Site...


  • España Logicalis Spain A tiempo completo

    En Logicalis Spain actualmente estamos buscando a una persona con experiencia en entornos de operaciones en la nube, que aporte conocimientos en automatización, monitorización y resolución de incidencias. La persona incorporada pasará a formar parte de un equipo de especialistas como SRE (Site Reliability Engineer) encargados de garantizar la fiabilidad,...