Senior Site Reliability Engineer, Observability

2 days ago


Barcelona, Spain · MongoDB · Full-time

**Team and Role Overview**:
The SRE Observability team is part of the larger Platform Engineering organization and is dedicated to building and maintaining the observability stack (metrics, logging, tracing) that all engineering teams use to keep their services running smoothly. We also own related services, including our telemetry pipeline and our monitoring and alerting infrastructure. Our stack includes VictoriaMetrics, Splunk, Quickwit, Jaeger, Fluent Bit, and Vector. In addition to owning our observability infrastructure, as an engineer on the team you'll work closely with other SWE and SRE teams to promote and implement best practices for instrumenting and monitoring their services. This is a highly collaborative role, and you will get to own some of the most relied-upon internal infrastructure at MongoDB.

This role will be based remotely in Spain.

**Responsibilities**:

- Define standards and vision for the mission-critical observability platform leveraged by all parts of the engineering organization
- Design, architect, build and deliver core pieces of our observability services in collaboration with other stakeholders
- Design, implement, and troubleshoot monitoring for services that span the globe, including several cloud providers
- Build for reliability, making services and infrastructure available, resilient, fault tolerant and self-healing
- Identify and configure key metrics to detect incidents and quantify service health, availability and performance
- Participate in a week-long on-call rotation and blameless post-mortem process
- Improve our observability capabilities, optimizing for cost, ease of use, and maintainability

**Requirements**:

- Experience running mission-critical services at scale
- Experience with observability of large-scale distributed systems
- An understanding of information security issues
- Firm grasp of at least one modern programming language, beyond basic scripting
- Solid understanding of web and network protocols and standards (HTTP, TLS, DNS, etc.)
- Bachelor's degree in Computer Science or equivalent experience

**Nice to haves**:

- Experience with at least one of the major cloud providers (Amazon Web Services, Google Compute, Microsoft Azure)
- Experience working in a Kubernetes-based environment and operating Kubernetes clusters

**What's in it for you**:

- Generous compensation package
- Opportunities to learn on the job (time to upskill in new technologies)
- High level of independence in your day-to-day work

To drive the personal growth and business impact of our employees, we're committed to developing a supportive and enriching culture for everyone. From employee affinity groups to fertility assistance and a generous parental leave policy, we value our employees' wellbeing and want to support them along every step of their professional and personal journeys. Learn more about what it's like to work at MongoDB, and help us make an impact on the world.

MongoDB is an equal opportunities employer.

Req ID: 1263097733


