Senior Site Reliability Engineer

6 days ago


Remote, Spain Knack Full-time
Senior Site Reliability Engineer - Spain Remote

Hi, thanks for reading about our Senior Site Reliability Engineer opportunity! We're glad you're here.

  • We're Knack, a code-free platform used by thousands of customers, from nonprofits to the world's biggest companies, to easily build custom apps, workflows, and databases.
  • We're looking for someone to help improve our reliability and performance through deep analysis and remediation of our AWS infrastructure, monitors, alerts, and code.

_Please note: this is a remote role based in Spain._

Key Responsibilities

  • Perform deep analysis of logs, existing systems, and codebases to find opportunities to improve performance and reliability, driving execution of suggested changes
  • Refactor our existing monitors and alerts to be actionable and reliable, recommending and implementing diagnostic techniques and monitoring tools
  • Help discover correlations between customer experience and performance indicators to determine what customers actually notice, then suggest and implement improvements based on the findings
  • Help us develop SLIs, SLOs, and SLAs that are impactful as they relate to our customers' experience
  • Help triage outages and issues across multiple teams, services, and codebases as they arise, leading root cause analysis and creating sustainable solutions to prevent and/or auto-remediate those issues in the future
  • Work with our QA teams to help implement automated performance and scalability testing within our CI/CD pipelines
  • Assist in creating reusable pipeline code, working with cloud, dev, and QA teams to help reduce complexity and deployment times
  • Introduce chaos engineering, promoting experimentation in production to discover and remediate systemic weaknesses and improve performance and reliability

Skills, Knowledge and Expertise

  • Expertise in AWS
  • Expertise with RDS, preferably the Aurora PostgreSQL engine
  • Expertise with containerization
  • Expertise in monitoring, alerting, and logging solutions and in how to use them to enable the organization to achieve reliability and performance goals
  • Experience implementing, maintaining, and troubleshooting continuous integration/continuous delivery (CI/CD) tooling
  • Experience with implementing improvements in areas such as maintainability, scalability, availability, extensibility and security
  • Ability to work with many teams across disciplines (cloud, platform, development, QA, and security) to resolve issues as they arise and implement improvements

Our Stack

  • Our stack is evolving over the next year and we'd love you to be a part of that

Currently we're using:

  • Back-end: JavaScript/TypeScript, ES6, GoLang
  • Data: Aurora PostgreSQL, Redis, Elasticsearch
  • DevOps & Deployment: All things AWS, Terraform (and Terraform Cloud), Jenkins, GitHub, Grafana, Graylog
  • Testing: Playwright, Mocha, Jest
  • Front-end: Webpack, SCSS

Benefits

  • The biggest benefit of Knack is getting to work alongside our awesome team of Knackleheads. We're a funny, humble, talented team of delightful human beings that, above all, enjoy working with each other, growing with each other, and supporting each other.
  • These benefits aren't that bad either, though:

  • Define your work: find the location, environment, and schedule that is best for your life and work. It's not about separation; it's about optimization.
  • Paid Corporate Retreats: we get together once a year at amazing locations to do normal human being things in person. We pay for your flight, lodging, and meals.
  • Tech: we provide a top-of-the-line MacBook.
  • Referral Bonus: we think you're great, which means you know awesome people. We offer a referral bonus for anyone you refer for an open position once they are hired as an official Knackster.

  • We are also passionate about learning and professional development.

We provide multiple learning opportunities and encourage each other to continuously learn and grow:

  • Long-term growth and learning plans, with regular check-ins to help you level up on what's important to you.
  • Have executive-level visibility into how the company is run and performing, including revenue.
  • Use an annual allowance to stay on top of your game with training, classes, books, and workshops.
  • Attend industry conferences that are meaningful to you.

About Knack

  • Hi! We're Knack.
  • We launched in 2012 with one simple goal: to enable everyone to do amazing things with their data.
  • We've been growing steadily since as we've built our team, perfected our product, and nailed our product-market fit.
  • So how are we different?
  • We're 100% remote and have been from the beginning. Every decision we've made has been based on optimizing our remote operations.
  • We take culture seriously: We're not one of those companies that just slaps some cultural adjectives down in a handbook article then calls it a day. We use our cultur

  • Remote, Spain Novatec Software Engineering España SL Full-time

    About the job: We are currently looking for a **Senior Site Reliability Engineer** to join our team based in Andalucia, though we are open to remote applicants from all over Spain. The Company: Novatec Software Engineering España is a branch of Novatec Consulting GmbH, headquartered in Stuttgart (Germany). We bring our passion for IT, agile...


  • Site Reliability Engineer

    3 weeks ago


    Remote, Spain Novatec Software Engineering España SL Full-time

    About the job: We are currently looking for a **Site Reliability Engineer** to join our team based in Andalucia, though we are open to remote applicants from all over Spain. The Company: Novatec Software Engineering España is a branch of Novatec Consulting GmbH, headquartered in Stuttgart (Germany). We bring our passion for IT, agile software...


  • Remote, Spain Eventbrite Full-time

    THE CHALLENGE Eventbrite's business continues to grow and scale rapidly, powering millions of events. Event creators and event goers need new tools and technologies that empower them to have the most meaningful live experiences. As a Senior Site Reliability Engineer, you will be part of a team that ensures that the Eventbrite platform runs efficiently,...


  • Remote, Spain Landbot Full-time

    **About Landbot** Operating in more than 40 countries, **Landbot** _(the most powerful No-Code Chatbot Builder)_ offers a platform that helps companies to create unbeatable chatbot conversations in different channels: Web, WhatsApp, and Messenger. With us, you will be working in a team of engineers, designers, PMs. A team with diverse and exciting...


  • Remote, Spain Business Insights Full-time

    **Description**: At Business Insights, we are looking for two AWS Site Reliability Engineers to take part in an interesting project. Work mode: hybrid or 100% remote. Location: Aragón, preferably Zaragoza. **Requirements**: **_Skills:_** - _>2 years of experience in SRE Engineering roles in AWS_ - _Experience in AWS public cloud..._


  • Remote, Spain Grafana Labs Full-time

    **Senior SRE - Databases**: **About the role**: We are looking for a Senior SRE to help us support our highest value Grafana Cloud customers by increasing the reliability of our Cloud databases that are based on Mimir, Loki, Tempo, and Pyroscope. We provide these databases as a SaaS product from AWS, GCP, and Azure across all regions. The High SLA SRE team...


  • Remote, Spain Booming Games Full-time

    About the role: Join our team at Booming Games as a Site Reliability Engineer and ensure the peak performance and reliability of our systems across multiple geographical locations. As a key player in troubleshooting and resolving complex issues, you will collaborate with engineers to drive automation, standardization, and optimization efforts. Your expertise in...


  • Remote, Spain Novatec Software Engineering España SL Full-time

    About the job: We are currently looking for a **Site Reliability Engineer with experience in Databases** to join our team based in Andalucia, though we are open to remote applicants from all over Spain. The Company: Novatec Software Engineering España is a branch of Novatec Consulting GmbH, headquartered in Stuttgart (Germany). We bring our...


  • Remote, Spain Akamai Full-time

    **Do you enjoy collaborating with teams to solve complex challenges?** **Do you have a passion for cutting edge technologies and tackling system problems?** **Join our highly skilled Storage team** **Partner with the best** You'll collaborate with operations and development teams to build and manage our scalable storage platforms. You'll create tooling...

  • Site Reliability Engineer

    2 weeks ago


    Remote, Spain Fortexpro Full-time

    We are looking for an SRE to work on a major international project. 100% remote work. Offer addressed to workers from any EEC country. Tasks - Implements Site Reliability Engineering and/or DevOps practices. - Manages technology, infrastructure and software development projects in accordance with SRE and/or DevOps principles. - Empowers development teams...


  • Remote, Spain Semrush Full-time

    Job Description Hi there! We are Semrush, a global IT company developing our own product - a platform for digital marketers. New stars are born here, so don’t miss your chance. This is our role Backend Developer for those who want to turn ideas into reality using code, algorithms, and maybe a bit of magic. Tasks in the role - Leverage Golang expertise to...