Senior Platform Reliability Engineer
2 weeks ago
Full time.

While this is a remote position, we are currently only considering candidates in time zones between UTC-1 and UTC+2.

About Landbot

Operating in more than 150 countries, Landbot offers a platform that helps companies create exceptional chatbot and AI agent conversations across several channels, such as Web, WhatsApp, and Messenger.

At Landbot, we’re building a high-performance team that blends engineering excellence, product mindset, and customer obsession. We believe quality and speed go hand in hand, and we’re looking for a Senior Reliability Engineer to help us scale our platform and deliver real impact.

About The Team

You'll join our Platform Engineering team, a small, focused group responsible for building and maintaining Landbot's Engineering Platform, Data Platform, and Security.

Our mission is to empower Landbot teams to deliver value faster, more reliably, and at scale.

Our core team values:
- Platform-as-product mindset
- Autonomy and ownership
- Collaboration over gatekeeping

About the Position

The Role

While the title is SRE, this role aligns with the principles of Systems Engineering in a Platform team: treating infrastructure as a product, focusing on developer needs, reducing operational toil, and creating self-service capabilities that abstract complexity so teams can focus on building features.

As Senior Reliability Engineer you will:

Build and Maintain the Internal Developer Platform
- Design and implement core platform services (CI/CD pipelines, infrastructure provisioning, and observability systems).
- Design and implement developer-facing tools, APIs, and automation that enable application teams to deploy, scale, and operate services independently.

Define and Maintain Platform Operations
- Manage and optimize cloud resources, Kubernetes clusters, databases, and networking for reliability, scalability, and cost optimization.
- Establish SLIs, SLOs, and error budgets to balance reliability with feature velocity.
- Design and maintain observability solutions for real-time visibility and proactive issue detection.
- Implement alerting strategies that reduce noise and focus on actionable signals.
- Lead incident response, conduct blameless postmortems, and drive continuous improvement.

Enhance Developer Experience and Drive Platform Strategy
- Partner with application teams (platform customers) to understand their workflows and pain points, gather feedback, and prioritize improvements aligned with business objectives.
- Create and maintain documentation, runbooks, and knowledge bases that reduce knowledge silos and enable self-service.
- Drive decisions through written formats (RFCs, ADRs) that document architectural choices.
- Measure platform success through developer productivity metrics, adoption rates, and toil reduction.

Experience
- 3-5 years of experience in Site Reliability Engineering, Platform Engineering, Infrastructure Engineering, or DevOps roles, or as a full-time freelancer in similar roles.
- Experience reducing operational toil through automation and self-service tooling.
- Experience building internal platforms or developer tooling, or enabling platform capabilities from application teams, with a platform-as-product mindset focused on developer experience.
- Experience managing production infrastructure and establishing reliability practices (SLIs/SLOs, observability, incident response).

Technical Skills
- Strong working knowledge of Kubernetes and the container ecosystem.
- Experience with cloud platforms (GCP, AWS, Azure).
- Proficiency with Infrastructure as Code tools.
- Knowledge of Kubernetes manifest management tools and GitOps practices.
- Experience with observability platforms. Knowledge of OpenTelemetry is a plus.
- Good shell scripting skills. Experience with Python or Go is a plus.
- Experience in Linux, database management, networking, and distributed systems.
- Solid knowledge of CI/CD pipelines.
- Ability to work effectively in paired/mob programming and asynchronous work environments.

Nice to Have
- Experience with database performance tuning, query optimization, replication strategies, and database scaling in production environments.
- Familiarity with security best practices in cloud-native environments.
- Experience with data platforms, data pipelines, and data infrastructure (data warehouses, data lakes, ETL/ELT processes, streaming data platforms).
- Experience supporting AI workloads and infrastructure (LLM platforms, AI agents, vector databases, AI orchestration).
- Experience with cloud cost optimization and FinOps practices.

Personal Attributes
- Proactive and autonomous: able to identify opportunities for improvement and drive initiatives to completion.
- Strong problem-solving abilities with a focus on root cause analysis and long-term solutions.
- Empathy for developers and a commitment to improving their daily experience.
- Fluent in English and Spanish.
- Eligibility to work in Spain.

Hiring Process
1️⃣ HR interview (20 - 30 min) - We get to know each other, hear your concerns, validate details and experience, and tell you more about Landbot.
2️⃣ Interview with Engineering Director (60 min) - Initial interview to get to know each other. We will check your background and product mindset, and you will see if we are a good fit for you.
3️⃣ Interview with the team (120 min) - You’ll meet members of our team to go deeper into your technical experience, how you approach problem-solving, system design and architecture decisions, as well as how you collaborate with others to deliver solutions. It’s a two-way conversation, so feel free to ask about our challenges, tech stack, and ways of working.
4️⃣ Meet the Founders (30 min) - Final conversation to ensure alignment on vision and values.

Benefits
- Hybrid work model: flexibility to work remotely, from our Barcelona office, or a combination of both.
- Collaborative work environment.
- Flexible working hours.
- Paid time off and flexible holidays: 26 paid days per year (23 regular days + December 24th & 31st), plus one additional day off on your birthday.
- Annual budget for training and professional development.
- Transportation ticket.
-
Site Reliability Engineer
1 week ago
Remote, Spain | Landbot | Full time
**About Landbot** Operating in more than 40 countries, **Landbot** _(the most powerful No-Code Chatbot Builder)_ offers a platform that helps companies to create unbeatable chatbot conversations in different channels: Web, WhatsApp, and Messenger. With us, you will be working in a team of engineers, designers, PMs. A team with diverse and exciting...
-
Platform Engineer
1 week ago
Remote, Spain | Epos Now | Full time
**Platform Engineer** **100% Remote** **€36,000 - €45,000** We are a market-leading retail and hospitality software business with a growing international presence. We operate within the payments and POS space, enabling businesses in over 70 countries to grow and thrive. Due to our continued growth and investment, we are looking for talented Platform...
-
Senior SRE — Remote Cloud Reliability Leader
2 weeks ago
Remote, Spain | ICEO - Venture Builder | Full time
A leading tech company is seeking a Senior Site Reliability Engineer for a full-time, 100% remote position. You will shape the organization's reliability strategy, lead infrastructure development, and implement best practices. The ideal candidate has 5+ years of experience in a DevOps or SRE role, proficiency in programming languages like Python or Go, and...
-
Remote Senior SRE
2 weeks ago
Remote, Spain | Zartis | Full time
A leading digital solutions provider is seeking a Senior SRE with AI/ML platform experience to manage core infrastructure for a fitness software company. The role involves designing AWS infrastructure and improving reliability through observability practices. Ideal candidates have over 8 years in SRE or DevOps, strong AWS expertise, and familiarity with...
-
Senior Site Reliability Engineer
1 week ago
Remote, Spain | Grafana Labs | Full time
**Senior SRE - Databases**: **About the role**: We are looking for a Senior SRE to help us support our highest value Grafana Cloud customers by increasing the reliability of our Cloud databases that are based on Mimir, Loki, Tempo, and Pyroscope. We provide these databases as a SaaS product from AWS, GCP, and Azure across all regions. The High SLA SRE team...
-
Platform Engineer
2 weeks ago
% remote, Spain | PSS | Full time
Would you like to boost your career in the IT sector with a solid project and a team of exceptional professionals? If you value stability and professional growth, now is the time to join an environment where talent is constantly evolving. At PSS we want you on board. We are currently looking for a Platform Engineer with...
-
Site Reliability Engineer
1 week ago
Remote, Spain | Novatec Software Engineering España SL | Full time
About the job: We are currently looking for a **Site Reliability Engineer with experience in Databases** to join our team based in Andalucía, though we are also open to remote applicants from all over Spain. The Company: Novatec Software Engineering España is a branch of Novatec Consulting GmbH, headquartered in Stuttgart (Germany). We bring our passion...
-
Staff Platform Engineer
2 weeks ago
Remote, Spain | Contentsquare | Full time
Contentsquare is the all-in-one experience intelligence platform designed to be easily used by anyone who cares about digital journeys. With our flexible and scalable platform, organizations quickly get a deep understanding of their customers' whole online journey. We are a global leader in the experience analytics space, with a growing presence across 15...
-
Senior Backend Engineer – Contacts Platform
1 day ago
Remote, Spain | Sinch | Full time
Sinch is pioneering the way the world communicates. More than 150,000 businesses - including Google, Uber, PayPal, Visa, Tinder, and many others - rely on Sinch's Customer Communications Cloud to power engaging customer experiences through mobile messaging, voice, and email. Whether you need to verify users or craft omnichannel campaigns, Sinch makes it...
-
Senior Platform Engineer – Remote, Cloud
2 days ago
Remote, Spain | Trimble Inc. | Full time
Your Title: Software Engineer
Your Location: Europe, Remote
Our Department: Transportation Common Platform
Job Summary
We're seeking a Software Engineer to help design, build and scale the underlying systems, APIs, and tooling that power how our engineers deploy, operate and observe software. You’ll craft secure, reliable, and scalable software that...