Senior Site Reliability Engineer, Observability
hace 2 días
**Team and Role Overview**:
The SRE Observability team is part of the larger Platform Engineering organization, and is dedicated to building and maintaining the observability stack (metrics, logging, tracing) used by all engineering teams to ensure the smooth functioning of their service. We also own related services, including our telemetry pipeline, and our monitoring and alerting infrastructure. Our stack includes VictoriaMetrics, Splunk, QuickWit, Jaeger, Fluentbit, and Vector. In addition to owning our observability infrastructure, as an Engineer on the team, you'll also work closely with other SWE and SRE teams to promote and implement best practices in instrumenting and monitoring their services. This is a highly collaborative role, and you will get to own some of the most relied upon internal infrastructure at Mongo.
This role will be based remotely in Spain.
**Responsibilities**:
- Define standards and vision for the mission-critical observability platform leveraged by all parts of the engineering organization
- Design, architect, build and deliver core pieces of our observability services in collaboration with other vested parties
- Design, implement, and troubleshoot the monitoring of services that seamlessly spans the globe - including several cloud providers
- Build for reliability, making services and infrastructure available, resilient, fault tolerant and self-healing
- Identify and configure key metrics to detect incidents and quantify service health, availability and performance.
- Participate in a week-long on-call rotation and blameless post-mortem process
- Improve our observability capabilities, optimizing for cost, ease of use, and maintainability
**Requirements**:
- Experience running mission critical services at scale
- Experience with observability of large scale distributed systems
- An understanding of information security issues
- Firm grasp of at least one modern programming language, beyond basic scripting
- Solid understanding of web and network protocols and standards (HTTP, TLS, DNS, etc)
- Bachelor's degree in Computer Science or equivalent experience
**Nice to haves**:
- Experience with at least one of the major cloud providers (Amazon Web Services, Google Compute, Microsoft Azure)
- Experience working in a kubernetes-based environment kubernetes clusters
**What's in it for you**:
- Generous compensation package
- Opportunities to learn on the job (time to up skill in new technologies)
- High level of independence in your day to day work
To drive the personal growth and business impact of our employees, we're committed to developing a supportive and enriching culture for everyone. From employee affinity groups, to fertility assistance and a generous parental leave policy, we value our employees' wellbeing and want to support them along every step of their professional and personal journeys. Learn more about what it's like to work at MongoDB, and help us make an impact on the world
MongoDB is an equal opportunities employer.
Req ID: 1263097733
-
Senior Site Reliability Engineer
hace 4 días
Barcelona, España Trust In SODA A tiempo completoSenior Site Reliability Engineer | Spain (Hybrid)An opportunity to join a high growth, late stage technology company operating at significant scale. The business supports thousands of customers globally and is investing heavily in reliability, platform maturity and engineering quality as it continues to grow.This is a true senior SRE role for someone who has...
-
Senior Site Reliability Engineer
hace 4 semanas
Barcelona, España Trust In Soda A tiempo completoSenior Site Reliability Engineer | Spain (Hybrid) ¿Interesado en saber más sobre este trabajo? Desplácese hacia abajo y descubra qué habilidades, experiencia y cualificaciones académicas se necesitan.An opportunity to join a high growth, late stage technology company operating at significant scale. The business supports thousands of customers globally...
-
Senior Site Reliability Engineer
hace 4 semanas
Barcelona, España Trust In Soda A tiempo completoSenior Site Reliability Engineer | Spain (Hybrid)Por favor, lea detenidamente la siguiente descripción del puesto para asegurarse de que encaja con el perfil antes de enviar su solicitud. An opportunity to join a high growth, late stage technology company operating at significant scale. The business supports thousands of customers globally and is investing...
-
Senior Site Reliability Engineer
hace 1 semana
Barcelona, España Trust In Soda A tiempo completoSenior Site Reliability Engineer | Spain (Hybrid)Por favor, lea detenidamente la siguiente descripción del puesto para asegurarse de que encaja con el perfil antes de enviar su solicitud. An opportunity to join a high growth, late stage technology company operating at significant scale. The business supports thousands of customers globally and is investing...
-
Senior Site Reliability Engineer
hace 4 días
barcelona, España Trust In SODA A tiempo completoSenior Site Reliability Engineer | Spain (Hybrid) An opportunity to join a high growth, late stage technology company operating at significant scale. The business supports thousands of customers globally and is investing heavily in reliability, platform maturity and engineering quality as it continues to grow. This is a true senior SRE role for someone who...
-
Senior Site Reliability Engineer
hace 3 días
Barcelona, España Trust In Soda A tiempo completoSenior Site Reliability Engineer | Spain (Hybrid)Por favor, lea detenidamente la siguiente descripción del puesto para asegurarse de que encaja con el perfil antes de enviar su solicitud. An opportunity to join a high growth, late stage technology company operating at significant scale. The business supports thousands of customers globally and is investing...
-
Senior Site Reliability Engineer
hace 3 días
Barcelona, España Trust In SODA A tiempo completoSenior Site Reliability Engineer | Spain (Hybrid) An opportunity to join a high growth, late stage technology company operating at significant scale. The business supports thousands of customers globally and is investing heavily in reliability, platform maturity and engineering quality as it continues to grow. This is a true senior SRE role for someone...
-
Senior Site Reliability Engineer
hace 18 horas
Barcelona, España Trust In Soda A tiempo completoSenior Site Reliability Engineer | Spain (Hybrid)¿Es este el puesto que está buscando? Si es así, siga leyendo para obtener más detalles y no olvide enviar su solicitud hoy mismo.An opportunity to join a high growth, late stage technology company operating at significant scale. The business supports thousands of customers globally and is investing...
-
Senior Site Reliability Engineer
hace 2 días
Barcelona, España Trust In SODA A tiempo completoCloud & DevOps Specialist across the DACH region Senior Site Reliability Engineer | Spain (Hybrid) An opportunity to join a high growth, late stage technology company operating at significant scale. The business supports thousands of customers globally and is investing heavily in reliability, platform maturity and engineering quality as it continues to grow....
-
Senior Site Reliability Engineer
hace 4 semanas
Barcelona, España Searchability® A tiempo completoSenior Site Reliability Engineer (SRE) – Barcelona (Hybrid)Si sus habilidades, experiencia y cualificaciones coinciden con las de esta descripción del puesto, no demore su solicitud. KEY POINTS • Barcelona-based hybrid role with a respected global organisation • Azure-first SRE work across cloud, edge and on-premise platforms • Terraform, GitHub...