Site Reliability Engineer
hace 6 meses
Job Description
Hi there
We are Semrush, a global IT company developing our own product - a platform for digital marketers. New stars are born here, so don’t miss your chance.
This is our role
Site Reliability Engineer for those who want to turn ideas into reality using code, algorithms, and maybe a bit of magic
Tasks in the role
- Collaborate with cross-functional teams to define observability requirements and develop robust solutions.
- Configure and maintain Prometheus and VictoriaMetrics for monitoring and alerting.
- Utilize Grafana to create customized dashboards and visualizations for performance and system health monitoring.
- Implement Grafana Tempo for distributed tracing and enhanced observability.
- Develop and maintain log management and analysis solutions using Splunk.
- Collaborate closely with product teams to ensure seamless deployment of observability tools and practices.
- Configure and maintain Sentry for error tracking and real-time error monitoring.
- Investigate and troubleshoot complex issues related to observability.
- Automate and streamline observability system setup and configuration.
- Stay updated with industry best practices and emerging observability technologies.
- Participate in on-call rotation to address critical incidents and outages of Observability services.
**Requirements**:
Who we are looking for
- Proficiency in Golang for custom observability solution development.
- Strong experience working with Kubernetes (K8s) and Helm for container orchestration and deployment.
- Proven expertise in Prometheus and Grafana for monitoring and visualization.
- Familiarity with distributed tracing and tracing instrumentation.
- Experience with Splunk or similar log analysis and management tools.
- Effective team collaboration and communication skills.
- Excellent problem-solving and troubleshooting abilities.
- Prior experience in a DevOps, SRE, or observability-related role is advantageous.
**You share our common values**: Trust, because we prefer to speak up and be our true selves; Sense of Ownership, because it’s not worth wasting time on something you don’t believe in; and enthusiasm for Constant Changes, because we are always looking to make things better
A bit about the team
Metal Team focuses on selecting, implementing, and optimizing observability tools and platforms to efficiently collect and analyze data from various systems. Their primary goal is to design and maintain a robust and scalable observability infrastructure that empowers other teams to meet their monitoring and observability needs, ensuring the company's IT ecosystem runs smoothly and reliably.
We will try to create all the right conditions for you to work and rest comfortably
- It’s up to you to decide what work format works best for you. You can #wfo, #wfh, or mix both.
- Flexible working day start.
- Health insurance coverage.
- Working from a modern coworking space (or working from home).
- Internet coverage (up to 30 eur/month).
- Corporate events.
- Unlimited PTO.
- Hobby benefit.
- Training, courses, conferences.
- English and Spanish courses.
- Gifts for employees.
Finally, a little more about our company
We’ve been developing our product for 15 years and have been awarded G2's Top 100 Software Products, Global and US Search Awards 2021, Great Place to Work Certification, Deloitte Technology Fast 500 and many more. In March 2021 Semrush went public and started trading on the NYSE with the SEMR ticker.
10,000,000+ users in America, Europe, Asia, and Australia have already tried Semrush, and over 1,000 people around the world are working on its development. The Semrush team is constantly growing.
Semrush is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. We do not discriminate based upon race, religion, creed, color, national origin, sex, pregnancy, sexual orientation, gender identity, gender expression, age, ancestry, physical or mental disability, or medical condition including medical characteristics, genetic identity, marital status, military service, or any other classification protected by applicable local, state or federal laws. All employment decisions are based on business needs, job requirements, merit, and individual qualifications.
-
Senior Site Reliability Engineer
hace 6 meses
En remoto, España Novatec Software Engineering España SL A tiempo completoAbout the job We are currently looking for a **Senior Site Reliability Engineer** to join our team based in Andalucia but not only, since we are open to remote applicants all over Spain. The Company Novatec Software Engineering España is a branch of Novatec Consulting GmbH, with headquarter in Stuttgart (Germany). We bring our passion for IT, agile...
-
Site Reliability Engineer
hace 6 meses
En remoto, España Novatec Software Engineering España SL A tiempo completoAbout the job We are currently looking for a** Site Reliability Engineer** to join our team based in Andalucia but not only, since we are open to remote applicants all over Spain. The Company Novatec Software Engineering España is a branch of Novatec Consulting GmbH, with headquarter in Stuttgart (Germany). We bring our passion for IT, agile software...
-
Site Reliability Engineer
hace 6 meses
En remoto, España Fortexpro A tiempo completoWe are looking for SRE to work on a major international project. 100% remote work. Offer addressed to workers from any EEC country. Tasks - Implements Site Reliability Engineering and/or DevOPS practices. - Manages technology, infrastructure and software development projects in accordance with SRE and/or DevOPS principles. - Empowers development teams...
-
Senior Site Reliability Engineer
hace 6 meses
En remoto, España Grafana Labs A tiempo completo**Senior SRE - Databases**: **About the role**: We are looking for a Senior SRE to help us support our highest value Grafana Cloud customers by increasing the reliability of our Cloud databases that are based on Mimir, Loki, Tempo, and Pyroscope. We provide these databases as a SaaS product from AWS, GCP, and Azure across all regions. The High SLA SRE team...
-
Senior Site Reliability Engineer
hace 6 meses
En remoto, España Grafana Labs A tiempo completo**Senior SRE - Databases**: **About the role**: We are looking for a Senior SRE to help us support our highest value Grafana Cloud customers by increasing the reliability of our Cloud databases that are based on Mimir, Loki, Tempo, and Pyroscope. We provide these databases as a SaaS product from AWS, GCP, and Azure across all regions. The High SLA SRE team...
-
Site Reliability Engineer
hace 6 meses
En remoto, España Baxter Planning A tiempo completo**Company Overview** Founded in 1993, Baxter Planning has 30+ years of industry expertise setting the standard for SaaS in the service supply chain planning space. With a strong and growing customer base, we are developing new products and solutions as well as finding new ways to extend and enhance our established products building on our success in the...
-
Site Reliability Engineer
hace 6 meses
En remoto, España Semrush A tiempo completoHi there! We are Semrush, a global IT company developing our own product - a platform for digital marketers. New stars are born here, so don’t miss your chance. This is our role **Site Reliability Engineer** for those who want to turn ideas into reality using code, algorithms, and maybe a bit of magic. **Tasks in the role**: - Read and write code in...
-
Site Reliability Engineer
hace 6 meses
En remoto, España Red Hat, Inc. A tiempo completoThe Red Hat - Site Reliability Engineering (SRE) team is looking for Software Engineer to join us. In this role, you will develop, scale, and operate our - OpenShift managed cloud services - OpenShift is Red Hat’s enterprise Kubernetes distribution. As an SRE you will contribute to running OpenShift at scale by enabling customer self-service, making our...
-
Site Reliability Engineer
hace 6 meses
En remoto, España audiense A tiempo completo**Engineering culture**: Ship early - and often Only one project - at a time Testing is a first - class problem - ️ Always be recruiting Communicate openly and frequently Audiense is an equal opportunity employer, and we know it's our differences that makes us great, so we want to welcome people from all backgrounds to our family. We encourage black,...
-
Site Reliability Engineer
hace 6 meses
En remoto, España audiense A tiempo completo**Engineering culture**: Ship early - and often Only one project - at a time Testing is a first - class problem - ️ Always be recruiting Communicate openly and frequently Audiense is an equal opportunity employer, and we know it's our differences that makes us great, so we want to welcome people from all backgrounds to our family. We encourage black,...
-
Site Reliability Engineer
hace 6 meses
En remoto, España audiense A tiempo completo**Engineering culture**: Ship early - and often Only one project - at a time Testing is a first - class problem - ️ Always be recruiting Communicate openly and frequently Audiense is an equal opportunity employer, and we know it's our differences that makes us great, so we want to welcome people from all backgrounds to our family. We encourage black,...
-
Senior Site Reliability Engineer
hace 6 meses
En remoto, España Semrush A tiempo completoJob Description Hi there! We are Semrush, a global IT company developing our own product - a platform for digital marketers. New stars are born here, so don’t miss your chance. This is our role Backend Developer for those who want to turn ideas into reality using code, algorithms, and maybe a bit of magic. Tasks in the role - Leverage Golang expertise to...
-
Site Reliability Engineer
hace 5 meses
En remoto, España WNTD A tiempo completo**What are we looking for?** For our team specialised we are looking for a SRE - Wintel to be part of a team working close to one of our main clients. This position can be performed 100% remote from any location in Spain. **Wintel Technologies** - Microsoft Windows Operating Systems - IIS - File Systems - DNS / DHCP / DFS - Identity & Security -...
-
Jr. Sre Engineer
hace 6 meses
En remoto, España Dabster Systems UK Limited A tiempo completoJob Description: **What are we looking for?** We are looking for a **SRE Engineer** working close to one of our main clients. **Main Tasks And Accountabilities Will Be** - You will be a key member of a team that leverages software and system engineering practices to build and run distributed, highly reliable systems at scale within AWS. - Working with...
-
Senior Site Reliability Engineer
hace 7 meses
En remoto, España Raisin A tiempo completoTeam Our SRE team builds and operates a reliable cloud infrastructure and empowers the product teams with tools and processes to deliver features as fast as possible. - Infrastructure - Process, tooling, automation - Observability of business critical systems - Security and compliance (in a highly regulated sector) Your Responsibilities - Design, build and...
-
Test Engineer
hace 6 meses
En remoto, España Xpert Direct A tiempo completoFor one of our clients, we are looking for a **Test Engineer** to join their Bluetooth R&D Team on a fully remote basis. The role can be developed from anywhere in Spain (international applicants available to relocate to Spain will be considered). Fluency in English and availability to travel within Europe for client site visits, specialization courses and...
-
Data Engineer
hace 6 meses
En remoto, España HomeBuddy A tiempo completo**Are you passionate about developing robust data models and building scalable data pipelines, and got a deep interest in technology? Are you willing to be a part of a quickly growing, product-oriented company while working remotely from home? Then welcome to HomeBuddy!** **This role is full-time and offers home working flexibility** **HomeBuddy** is an...
-
Site Reliability Engineering
hace 6 meses
En remoto, España Cibervoluntarios A tiempo completoTareas 1. Administración de sistemas Linux para asegurar un rendimiento óptimo y la disponibilidad de los servicios. 2. Gestión de contenedores utilizando Docker y el orquestador Kubernetes para optimizar la implementación y escalabilidad de aplicaciones. 3. Experiência en Microsoft Entra e Intune para la gestión eficiente de la infraestructura. 4....
-
Site Reliability Engineering
hace 6 meses
En remoto, España Fundación Cibervoluntarios A tiempo completo**Funciones**: - Administración de sistemas Linux para asegurar un rendimiento óptimo y la disponibilidad de los servicios. - Gestión de contenedores utilizando Docker y el orquestador Kubernetes para optimizar la implementación y escalabilidad de aplicaciones. - Experiência en Microsoft Entra e Intune para la gestión eficiente de la...
-
Cloud Lead Engineer
hace 6 meses
En remoto, España Enersys A tiempo completoEnerSys® is an industrial technology leader serving the global community with mission critical stored energy solutions that meet the growing demand for energy efficiency, reliability and sustainability. We are driven by a passion to provide people everywhere with accessible power to help them work and live better. Our people are our strength, an endless...