Site Reliability Engineer
2 weeks ago
Edpuzzle is a leading edtech company with offices in San Francisco and Barcelona and over 12 years of history helping teachers find and create exciting, interactive learning experiences. We're a software company built by teachers, for teachers, committed to empowering educators with intuitive software to engage students all in one place, from video learning and beyond.
Millions of teachers and students around the world are already using Edpuzzle to make education more equitable and engaging. If you're passionate about making an impact and find joy in learning, you'll feel right at home with us. Check out the job details below to see if Edpuzzle could be the right fit for you.
About working at Edpuzzle
Working at Edpuzzle means joining a global team dedicated to enhancing education for all. Picture a place where you can connect with your teammates, whether remotely or in person, whenever you need support. A place where one day you're helping shape one of the biggest edtech platforms in the world, and the next day you're doing a team-building activity with your coworkers. A place where everyone has been selected because they're the best at what they do, and where your manager and team trust your decisions fully.
We value work-life harmony, which is why we've embraced a "remote-first" approach that emphasizes flexibility and choice while fostering meaningful engagement. It's no surprise that in our latest employee satisfaction survey, Work-Life Balance (92%), Leadership (85%), and Employee Engagement (84%) were highlighted as our top drivers, because we genuinely care about creating an environment where people can thrive, feel supported, and do their best work. A place where you're encouraged to learn and grow, because education is the cornerstone of everything we do.
About the process
The goal of our interview process is to learn about each other. Each step is structured to help us understand your unique talents and contributions while offering you insight into our team and culture.
For a detailed breakdown of our recruitment process, please refer to our Selection Process Guide, which outlines every step of our candidate journey. A dedicated member of our team will support you through each step, and you'll have the opportunity to meet various Edpuzzlers along the way.
About the role
We're looking for a passionate Site Reliability Engineer to pioneer the SRE strategy of our Security and Infrastructure Team in Barcelona. The right person will help us create the best possible product for teachers and empower them to engage their students with videos. If you're a self-starter who's eager to contribute to the education sector, you'll feel right at home with us.
As the key reference point for all things SRE, you'll have the autonomy to shape our systems from the ground up. This role is perfect for someone ready to lead and innovate, making a significant impact on our cloud infrastructure and observability strategies using Datadog. You'll be responsible for ensuring our system's reliability, scalability, and maintainability, handling everything from our cloud infrastructure to in-depth observability and comprehensive monitoring. By working closely with our DevOps and Engineering teams, you'll drive the design and implementation of resilient systems, manage incidents effectively, and champion best practices for observability and incident response.
About our tech stack
Technically speaking, we are hosted on AWS and use CDK with JavaScript for Infrastructure as Code. Our product is written in Node and Express, applying DDD and hexagonal architecture in the backend. We use MongoDB and OpenSearch for our databases, and we have our own encoding and streaming system. We work with testing, trunk-based development, and CI/CD on GitHub Actions, and we follow best practices to make sure we never compromise on code quality and reliability. On the observability side, we monitor everything using Datadog and CloudWatch.
About our team
We are a product-focused team. Our methodologies foster close partnerships between Engineering, Product, Infrastructure, and other key areas (Design, Data, QA, Security, etc.). Everyone is encouraged to share their ideas and opinions, take initiative, and be resourceful when coming up with creative solutions that elevate the experience for our users.
At our core is an environment where every voice is heard and valued, and each team member plays a role in shaping project strategy and designing technical solutions. We embrace proactiveness and curiosity to understand the bigger picture. Engineers are not just encouraged but expected to think critically, propose solutions, and take ownership. We've built a culture where ideas are shared openly, challenges are tackled head-on, and assumptions are questioned to drive continuous improvement.
Curious to learn more about our Product team and Engineering culture? Don't miss these talks:
• Santi Herrero (Co-founder and CTO) at SCPNA 2024, SCBCN 2024 and Edpuzzle Tech Discovery 2025 gives insights on the deliberate choices behind our structure and growth.
• Santi Herrero (Co-founder and CTO) and Ferrán Martín (Engineering Manager) share how we migrated and implemented hexagonal architecture and DDD in this stream with midudev.
• Asier Zapata (Engineering Manager) shares how our team successfully brings AI to production in a scalable and reliable way in this Edpuzzle Tech Discovery 2025 talk.
About the job
- Work with the Product, Infrastructure and Engineering teams to find the best technical solutions by participating in discussions and sharing your opinions.
- Take ownership of the problems that are being worked on, understanding why they are needed by the users, carrying out your own research, making your own proposals and working on the implementation while relying on your teammates for help when needed.
- Communicate effectively in a team in order to maximize productivity, ownership, and focus to help projects reach the finish line with the best possible outcome and by the project deadline.
- Design a cloud infrastructure that is secure, scalable, and highly available on AWS.
- Engage in proactive monitoring and observability with comprehensive tools and practices that not only detect and warn, but also predict potential system issues before they affect our users.
- Lead the charge in root cause analysis for production and infrastructure issues, transforming challenges into learning opportunities.
- Provision, configure and maintain cloud infrastructure as code.
- Take part in the on-call rotation, ensuring reliability and uptime for our users.
- Write technical documentation, contributing to our technical knowledge base and empowering your peers.
- Perform other exciting duties as opportunities and needs arise.
- At least 3 years of experience in Site Reliability Engineering, DevOps Engineering, System Administration or Cloud Infrastructure Engineering for a web-based product with a focus on observability and reliability.
- Good knowledge of Amazon Web Services (AWS), CloudWatch and Datadog.
- Experience with software release management and deployment pipelines (Git, CI/CD).
- Experience with Infrastructure as Code using AWS CDK.
- Experience writing JavaScript or TypeScript code.
- Pragmatic with technologies: you understand tech is a tool to solve a product problem, tech is never the end goal.
- Excellent ability to communicate your ideas, regardless of the audience.
- Product-oriented: You make all your technology decisions with the final user in mind.
- You are naturally drawn towards understanding the bigger picture and recognize when there's a need for improvement, applying your intentional and rational thought process to address complex issues.
- You are able to work independently, plan and exercise conscious control of time spent on specific goals to reach deadlines effectively, and you don't hesitate to pursue a goal despite the difficulties, all while maintaining a flexible mindset.
- You are based in Barcelona and have a work permit to work in Spain.
- Experience with MongoDB or OpenSearch database administration.
- Experience deploying and maintaining complex cloud infrastructures serving high traffic web applications.
- Experience with complex backend architectures such as Hexagonal Architecture and Domain-Driven Design (DDD).
- Experience with other cloud providers such as Azure or Google Cloud Platform.
- ... or another amazing skill you bring to the table that we haven't thought of yet
- Salary between €45K – €59K based on your professional experience
- On-call compensation
- While we are a remote-first company, for this role we're seeking someone who appreciates the balance of working from home and spending time at our Barcelona office
- 24 days' paid holidays plus December 24th and 31st
- Flexible working hours and reduced working time on Fridays to support work-life balance
- €2000 annual allowance for meals with Cobee
- Private health insurance policy with AXA
- Access to Wellhub to support physical and emotional well-being
- Flexible remuneration for childcare
- Flexible remuneration for public transport
- Flexible remuneration for health insurance of immediate family members (spouse and/or children)
- Training and development (CodelyTV, Cloud Academy, etc.)
- Fully stocked pantry with a variety of snacks and drinks in the Barcelona office
- Team-building events during working hours to connect, learn, and create lasting bonds with passionate colleagues
Please be aware of potential scams involving fake job offers using Edpuzzle's name. Official communications will always originate from the domain, not external domains like Gmail. Edpuzzle will never request payments or skip formal interviews during the hiring process, nor request sensitive personal information without a valid reason. To verify any communication, please contact [email protected].
References from previous employers will be requested from candidates during the selection process. If you'd like to be considered for this position, please apply below. We look forward to hearing from you.
Edpuzzle may use limited artificial intelligence (AI) tools to assist in certain administrative parts of the hiring process, such as preparing interview notes or summaries. These tools are used only to support our hiring team; they do not replace human judgment, scoring, or decision making. Participation in any AI-assisted interview is entirely optional, and candidates who prefer not to participate may request a standard, non-AI interview. Declining to participate will not affect the recruitment process or the outcome of any application. AI-assisted interviews are not used for candidates located in, or applying for positions based in, New York City or the EU/EEA/UK. For more details about how your information is processed, please review our 'Job Applicant And Successful Candidate Privacy Notice' or contact us at [email protected].