Senior Site Reliability Engineer
hace 19 horas
Get to know Okta
Okta is The World's Identity Company. We free everyone to safely use any technology, anywhere, on any device or app. Our flexible and neutral products, Okta Platform and Auth0 Platform, provide secure access, authentication, and automation, placing identity at the core of business security and growth.
At Okta, we celebrate a variety of perspectives and experiences. We are not looking for someone who checks every single box - we're looking for lifelong learners and people who can make us better with their unique experiences.
Join our team We're building a world where Identity belongs to you.
Auth0 provides an unparalleled authentication experience for hundreds of millions of users worldwide. Our commitment to reliability is a key foundation of our product and our dedication to exceeding customer availability expectations is a core engineering focus. As a Senior Site Reliability Engineer, you'll join our SRE team based in Europe to ensure our production systems are not only operational but also resilient, scalable, and ready for exponential growth. This isn't just about keeping the lights on; it's about directly contributing to the platform's core resiliency and robustness. You'll be a hands-on builder, crafting solutions that make our system more reliable by design.
What you'll do:
- Design and build custom software in Go to enhance the platform's reliability, resiliency, and redundancy.
- Partner with engineering teams to embed reliability principles, improving the availability, performance, and observability of our services.
- Use your deep understanding of infrastructure and observability principles to identify opportunities for improvement within the product and implement solutions.
- Contribute to our on-call rotation, providing rapid, effective response to critical incidents and using your expertise to troubleshoot, mitigate or accurately escalate production issues.
- Develop and refine our SRE tooling and processes, focusing on automation and operational efficiency.
- Define, document, and champion reliability best practices across the organisation.
What you'll need to be successful:
This role requires a unique blend of a software engineer's mindset and operational expertise. You'll thrive in this role if you have:
- A proactive and systematic approach to problem-solving, with a high degree of ownership.
- Proven experience in a production environment supporting large-scale, mission-critical applications with a high degree of autonomy.
- Proficiency in at least one programming language, with a preference for Go. You should be comfortable writing custom applications, not just scripts.
- Experience with infrastructure as code (Terraform), container orchestration (Kubernetes, Docker) and GitOps (ArgoCD).
- Demonstrable expertise in a major cloud provider (Azure, AWS, or GCP).
- A strong grasp of microservices architecture, databases (SQL, NoSQL), and networking fundamentals, so you can understand how custom code can solve platform-level issues.
- An understanding of core SRE principles, including SLIs, SLOs, and error budgets.
- Experience in an on-call rotation for a 24/7 cloud-based environment.
- Exceptional communication and collaboration skills, with a proven ability to work effectively in a remote, distributed team, where tasks may be self-driven.
We're looking for someone who is not just looking for a job, but a career-defining opportunity to tackle complex challenges at a massive scale. If you're a curious and motivated engineer who's passionate about building reliability directly into the platform, we'd love to hear from you.
#LI-Remote
P17684_3303790
What you can look forward to as a Full-Time Okta employee
- Amazing Benefits
- Making Social Impact
- Developing Talent and Fostering Connection + Community at Okta
Okta cultivates a dynamic work environment, providing the best tools, technology and benefits to empower our employees to work productively in a setting that best and uniquely suits their needs. Each organization is unique in the degree of flexibility and mobility in which they work so that all employees are enabled to be their most creative and successful versions of themselves, regardless of where they live. Find your place at Okta today
Some roles may require travel to one of our office locations for in-person onboarding.
Okta is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, marital status, age, physical or mental disability, or status as a protected veteran. We also consider for employment qualified applicants with arrest and convictions records, consistent with applicable laws.
If reasonable accommodation is needed to complete any part of the job application, interview process, or onboarding please use this Form to request an accommodation.
Okta is committed to complying with applicable data privacy and security laws and regulations. For more information, please see our Personnel and Job Candidate Privacy Notice
-
Senior Site Reliability Engineer
hace 3 días
Barcelona, Barcelona, España Spendesk A tiempo completoAbout the TeamThe Infrastructure team at Spendesk builds the tools, systems, and internal products that empower every engineering team to move faster and more safely. We are transforming traditional infrastructure into a developer-facing platform focused on enablement, automation, and scalability. We own CI/CD platform (ArgoCD and Github Actions), secrets...
-
Site Reliability Engineer
hace 1 semana
Barcelona, Barcelona, España F. Hoffmann-La Roche Ltd A tiempo completoAt Roche you can show up as yourself, embraced for the unique qualities you bring. Our culture encourages personal expression, open dialogue, and genuine connections, where you are valued, accepted and respected for who you are, allowing you to thrive both personally and professionally. This is how we aim to prevent, stop and cure diseases and ensure...
-
Site Reliability Engineer
hace 2 semanas
Barcelona, Barcelona, España CrowdStrike A tiempo completoAs a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn't changed — we're here to stop breaches, and we've redefined modern security with the world's most advanced AI-native platform. We work on large scale distributed systems, processing almost 3...
-
Site Reliability Engineer
hace 2 semanas
Barcelona, Barcelona, España CrowdStrike A tiempo completoAs a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn't changed — we're here to stop breaches, and we've redefined modern security with the world's most advanced AI-native platform. We work on large scale distributed systems, processing almost 3...
-
Senior Site Reliability Engineer
hace 19 horas
Barcelona, Barcelona, España N26 A tiempo completoAbout the opportunityWe are seeking a Senior Site Reliability Engineer to join the Platform Engineering Domain in the AI Platform Team.The mission of Platform Engineering is to provide trusted, performant, self-service platforms that empower product teams to build "the bank the world loves to use." The AI Platform team contributes to this mission by creating...
-
Senior Site Reliability Engineer, Platforms Team
hace 2 semanas
Barcelona, Barcelona, España Trabajos en NETQUEST A tiempo completoAbout Your New Role Our Platforms Team is a diverse and dynamic group focused on crafting and maintaining high-performance platforms, primarily leveraging the power of Amazon Web Services and Kubernetes. We're all about growth here – from dedicated Friday training sessions and daily collaborative pair-programming to shadowing opportunities and access to...
-
Senior Site Reliability Engineer
hace 2 semanas
Barcelona, Barcelona, España N26 A tiempo completoAbout the opportunityWe are seeking a Senior Reliability Engineer to join the Platform Engineering Domain in the Scalability Team.The mission of Platform Engineering is to provide trusted, performant, self-service platforms that empower product teams to build "the bank the world loves to use." Scalability's part of this mission is to develop solutions for...
-
Site Reliability Engineer
hace 16 horas
Barcelona, Barcelona, España N26 A tiempo completoAbout the opportunityWe are seeking a Site Reliability Engineer to join the Observability group inside our Platform Engineering domain.Platform Engineering's goal is to provide easy to use, self-service platforms to enable other segments to easily build, deploy and monitor their business applications. And Observability's role in that part of the company is...
-
Site Reliability Engineer
hace 5 días
Barcelona, Barcelona, España Perk A tiempo completoAbout UsPerk (formerly TravelPerk) is the intelligent platform for travel and spend management. Built to tackle the time-consuming, manual work that gets in the way of real work, our tools automate everything from travel bookings to expenses, invoice processing, and more. By eliminating this shadow work that wastes hours, erodes morale, and saps innovation,...
-
Senior Site Reliability Engineering
hace 6 días
Barcelona, Barcelona, España N26 A tiempo completoAbout The OpportunityWe are seeking aSenior Site Reliability Engineerto join theDatabase Platform Teamwithin the Platform Engineering Domain.Platform Engineering'smission is to provide trusted, performant, and self-service platforms, enabling product teams to build 'the bank the world loves to use.' As part of this, the Database Platform team is responsible...