Senior Site Reliability Engineer

hace 3 semanas


En remoto, España Booming Games A tiempo completo

About the role

Join our team at Booming Games as a Site Reliability Engineer and ensure the peak performance and reliability of our systems across multiple geographical locations As a key player in troubleshooting and resolving complex issues, you will collaborate with engineers to drive automation, standardization, and optimization efforts. Your expertise in operating systems, networking, and distributed systems, combined with your passion for problem-solving, will make you an invaluable asset. If you are ready to revolutionize the reliability and scalability of our services while working with cutting-edge technologies, this role is perfect for you.

**Responsibilities**:

- Perform deep dives into both systemic and latent reliability issues; partner with software and systems engineers across the organization to produce and roll out fixes.
- Drive standardization efforts across multiple disciplines and services in conjunction with SREs throughout the organization.
- Identify and drive opportunities to improve automation for the company; scope and create automation for deployment, management and visibility of our services.
- Represent the SRE organization in design reviews and operational readiness exercises for new and existing services.
- Work with software engineers to improve upon deployment processes.
- Participate in the on-call rotation for production systems.
- 3rd line support in the networks and infrastructure team being the last line of defense in Engineering Support Escalation
- Manage the server and network infrastructure, assist in the development of security strategies and their implementation and participate in global network infrastructure upgrades with upstream providers
- Work with both SRE & Development teams on new projects and technologies such as: New Infrastracture Setup, Kubernetes Migrations, New Geographic
- Locations, Monitoring & Upgrades and more
- Perform deep dives into both systemic and latent reliability issues; partner with software and systems engineers across the organization to produce and roll out fixes.
- Promote openness, diversity of opinions and inclusive discussions at all times to evaluate a wide variety of ideas and perspectives in solving challenging problems
- Demonstrate clear decision making and good trade-offs in complex situations comprising multiple opinions, needs, teams, technologies, cloud providers, and architectural settings
- Communicate effectively with stakeholders ranging from executives to junior engineers across the breadth and depth of the engineering organization
- Enable the engineering organization to innovate and deliver with greater speed and safety
- Any other tasks or responsibilities that may be given in the due course of role.

**Requirements**:

- Sound fundamentals in operating systems, networking, and distributed systems.
- Exemplify high accountability, integrity, and resilience to maintain focus on both big-picture goals and milestones to get there
- Strong familiarity with Linux systems administration and management best practices.
- Familiarity with container technologies: Kubernetes, CRI, Docker, namespaces, cgroups.
- Strong understanding of: Ethernet, VLANs, IPv4/IPv6, ARP, DHCP, DNS, and TCP.
- Familiarity with distributed system problems: leader election, Raft consensus, etc.
- Expert level understanding with at least one public or private cloud technology such as Amazon AWS, Google GKE, or OpenStack.
- Practical knowledge of various aspects of service design, including messaging protocols and behavior, caching strategies and software design practices.
- Practical intermediate knowledge of shell scripting, some Ruby is a plus.
- Excellent knowledge of Linux/UNIX systems administration and performance tuning.
- Comfortable configuring DNS, DHCP, and LAN/WAN technologies.
- Minimum 5 years of managing services in an internet scale \*nix environment.
- Must be able to communicate well with technical as well as non-technical colleagues to achieve business goals.
- Must be adaptable and able to focus on the simplest, most efficient and reliable solutions.
- Track record of successful practical problem solving, excellent written and interpersonal communication in English, and documentation skills.
- Curiosity and an interest in networking, systems software, and distributed systems.
- Experience as a systems administrator or operations engineer.
- Experience with a 24/7 production environment.
- Experience with managed deployments providing software, platforms, or infrastructure as a service.
- Experience with SuperMicro server and storage gear is a plus.

Good to know
- We kindly ask for your understanding that we can only consider applicants within the the Central European Timezone +/-2
- To be considered for the role, we kindly ask that you submit your resume/CV in English
- This full-time position can be a permanent employment in Malta or on a freelance basis for contractors in the other countries

Why Work for Booming Game



  • En remoto, España Novatec Software Engineering España SL A tiempo completo

    About the job We are currently looking for a **Senior Site Reliability Engineer** to join our team based in Andalucia but not only, since we are open to remote applicants all over Spain. The Company Novatec Software Engineering España is a branch of Novatec Consulting GmbH, with headquarter in Stuttgart (Germany). We bring our passion for IT, agile...


  • En remoto, España Business Insights A tiempo completo

    **Descripción**: Desde Business Insights, buscamos dos perfiles AWS** **Site Reliability Engineer para participar en un proyecto interesante. Modalidad: híbrida o 100% teletrabajo Ubicación: Aragón, preferentemente Zaragoza **Requisitos**: **_Skills:_** - _ >2 years of experience in SRE Engineering roles in AWS_ - _ Experience in AWS public cloud...


  • En remoto, España Landbot A tiempo completo

    **About Landbot** Operating in more than 40 countries, **Landbot** _(the most powerful No-Code Chatbot Builder)_ offers a platform that helps companies to create unbeatable chatbot conversations in different channels: Web, WhatsApp, and Messenger. With us, you will be working in a team of engineers, designers, PMs. A team with diverse and exciting...


  • En remoto, España Grafana Labs A tiempo completo

    **Senior SRE - Databases**: **About the role**: We are looking for a Senior SRE to help us support our highest value Grafana Cloud customers by increasing the reliability of our Cloud databases that are based on Mimir, Loki, Tempo, and Pyroscope. We provide these databases as a SaaS product from AWS, GCP, and Azure across all regions. The High SLA SRE team...


  • En remoto, España Fortexpro A tiempo completo

    We are looking for SRE to work on a major international project. 100% remote work. Offer addressed to workers from any EEC country. Tasks - Implements Site Reliability Engineering and/or DevOPS practices. - Manages technology, infrastructure and software development projects in accordance with SRE and/or DevOPS principles. - Empowers development teams...


  • En remoto, España Baxter Planning A tiempo completo

    **Company Overview** Founded in 1993, Baxter Planning has 30+ years of industry expertise setting the standard for SaaS in the service supply chain planning space. With a strong and growing customer base, we are developing new products and solutions as well as finding new ways to extend and enhance our established products building on our success in the...


  • En remoto, España audiense A tiempo completo

    **Engineering culture**: Ship early - and often Only one project - at a time Testing is a first - class problem - ️ Always be recruiting Communicate openly and frequently Audiense is an equal opportunity employer, and we know it's our differences that makes us great, so we want to welcome people from all backgrounds to our family. We encourage black,...


  • En remoto, España audiense A tiempo completo

    **Engineering culture**: Ship early - and often Only one project - at a time Testing is a first - class problem - ️ Always be recruiting Communicate openly and frequently Audiense is an equal opportunity employer, and we know it's our differences that makes us great, so we want to welcome people from all backgrounds to our family. We encourage black,...

  • Webcenter Site

    hace 4 semanas


    En remoto, España Krell Consulting A tiempo completo

    Sistemas- ADMINISTRADORES- hace 3 días**Descripción**: - KRELL Consulting, selecciona para contratar, un P. senior WEBCENTER SITE, para trabajar con uno de nuestros importantes clientes y contratación inicial con Krell. (indefinida)UBICACION: se trabajara en Remoto inicialmente (MADIRD )- SBA: entorno a 40K- Profesión : Ingeniero de Sistemas,...

  • Site Reliability Engineer

    hace 4 semanas


    En remoto, España LanguageWire A tiempo completo

    Job Description We are looking for an engineer, who is keen on Infrastructure as a code and cannot leave a day without improving her/his zone of responsibility. We are looking for someone, who sees Microsoft cloud solutions and services as tiles of one complex, but beautiful puzzle. The role you’ll play As a key player in our team, you'll be responsible...


  • En remoto, España Raisin A tiempo completo

    Team Our SRE team builds and operates a reliable cloud infrastructure and empowers the product teams with tools and processes to deliver features as fast as possible. - Infrastructure - Process, tooling, automation - Observability of business critical systems - Security and compliance (in a highly regulated sector) Your Responsibilities - Design, build and...


  • En remoto, España Warman O'Brien A tiempo completo

    Within this role, you will be responsible for the planning, initiation, coordination and management of all monitoring and monitoring-related activities as well as supervision of all site-related activities to ensure compliance with SOPs, GCP and regulatory requirements. You will also engage with clinical sites to develop, build, and maintain strong...


  • En remoto, España IQVIA A tiempo completo

    Job Overview Under general supervision, perform tasks at a country/region level associated with site activation activities in accordance with applicable local and/or international regulations, standard operating procedures (SOPs), project requirements and contractual/budgetary guidelines. May also include feasibility or maintenance activities. Essential...

  • Senior Data Engineer

    hace 4 semanas


    En remoto, España Ciklum A tiempo completo

    **Description**: **Ciklum **is looking for a **Senior Data Engineer** to join our team full-time in Spain. We are a leading global product engineering and digital services company that unites 4000+ seasoned professionals globally on various projects in healthcare, fintech, travel, sportswear, entertainment, and security. Ciklum delivers high-impact...


  • En remoto, España CAS TRAINING A tiempo completo

    Senior Data Engineer / Data Scientist en 100% En remoto. Seleccionamos para proyecto que se desarrolla en remoto 2 profesionales Data Engineer y 2 profesionales Data Scientist. Modelos predictivos de venta upselling downselling cross-selling y fuga de diferentes productos bancarios con Spark y R aplicando ineligencia artificial. Modelos de clasificación...

  • Site Manager

    hace 2 días


    En remoto, España FieldCore A tiempo completo

    **Job Summary**: Site Manager - Outage is responsible for preparation, planning, leading execution, and close out of complex project/outages events for gas turbine, steam turbine and generator power plants while supporting the development of the business strategy for field fulfilment excellence in FieldCore. The Site Manager - Outage is a focal role, with...

  • Senior Data Engineer

    hace 4 semanas


    En remoto, España Polar Analytics A tiempo completo

    **About Polar Analytics**: Polar Analytics is a Full-Stack Business Intelligence Solution for Consumer Brands. A powerful, yet simple solution for business users to get the insights they need to succeed and make the right decisions. Our mission is to empower indie DTC brands worldwide to grow faster and more profitably! **What we’re looking for**: We're...

  • Cloud Services Engineer

    hace 4 semanas


    En remoto, España Red Hat Software A tiempo completo

    About the job: Red Hat is seeking a Cloud Services Engineer with a strong background in OpenShift and/or Kubernetes cluster deployment and configuration to join our Cloud Services Customer Experience team in Spain You will consistently rely on your knowledge of production support and your background in dev-ops processes, including incident/change/problem...


  • En remoto, España Krell Consulting A tiempo completo

    Sistemas- CLOUD( AZURE, AWS, Google)- hace 2 horas**Descripción**: - Krell consulting busca incorporar a su equipo Senior Architect AWS- Arquitecto senior con amplia experiência en servicios AWS orientados a data (Glue, S3, EMR). Muy valorable también conocimiento IaC y CDK.- Rol: cross entre diferentes proyectos/iniciativas, referencia técnica AWS,...


  • En remoto, España NexGen Cloud A tiempo completo

    NexGen Cloud is a rapidly growing IaaS company focused on providing innovative cloud solutions and infrastructure services. Our GPU cloud infrastructure solutions accelerate development in industries such as Artificial Intelligence & Machine Learning, VFX & Rendering, Data Science & IoT, and Computer Aided Engineering & MDO. We are dedicated to helping our...