Site Reliability Engineer, Technical Referent
1 week ago
Why should you join dLocal?
dLocal enables the biggest companies in the world to collect payments in 40 countries in emerging markets. Global brands rely on us to increase conversion rates and simplify payment expansion effortlessly. As both a payments processor and a merchant of record where we operate, we make it possible for our merchants to make inroads into the world’s fastest-growing emerging markets.
By joining us you will be part of an amazing global team that makes it all happen, in a flexible, remote-first, dynamic culture with travel, health, and learning benefits, among others. Being part of dLocal means working with 1000+ teammates from 30+ different nationalities and developing an international career that impacts millions of people’s daily lives. We are builders, we never run from a challenge, and we are customer-centric. If this sounds like you, we know you will thrive in our team.

What's the opportunity?
We are looking for a Site Reliability Engineer (SRE) to join our team. As our SRE, you will focus on the design, implementation, and continuous maintenance of our centralized observability platform, using OpenTelemetry (OTEL) as its backend. You will be part of a talented team that works on mission-critical applications with big customers like Netflix, Amazon, Nike, Facebook, and more.
As a Site Reliability Engineer, you are always expected to ask the necessary questions:
What data do we need to understand how our systems are performing?
How do we collect this data?
What patterns are we looking for in the data, and what do they mean?
Who should be notified when a certain system is not working properly?
Do we have any systems that we need more data for?
An SRE designs systems and processes to answer the questions above and to provide automated support and response where possible.

What will you do?
Own OpenTelemetry Pipelines: Design, implement, and maintain observability pipelines across the three main signals (logs, metrics, and traces), ensuring standardized, scalable, and efficient data ingestion. Optimize ingestion strategies to balance cost, performance, and usability.
Empower Engineering Teams: Build self-service automation and tooling that enables development teams to instrument and leverage observability without requiring manual intervention from the SRE team. Drive adoption of best practices while ensuring teams own their telemetry.
Support Incident Management: Be the Engineering side of our Incident Management Team, designing the processes, playbooks, checklists, and automations for them and other engineers to follow during an incident.
Collaborate Across Teams: Interact with members of almost all teams across the business to understand their monitoring, alerting, and SLO/SLA requirements, and design systems and processes that ensure we meet or exceed these requirements. Influence architectural decisions during initial design stages to ensure resiliency and scale at the outset of software development.
Automate Observability Infrastructure: Leverage Infrastructure-as-Code (IaC) to provision and manage monitoring tools, alerting rules, and our observability configurations across OTEL pipelines.
Define Baseline Observability Standards: Design base-level requirements for new and existing services to ensure that all dLocal infrastructure and code are monitored consistently and accurately at a basic level.
Own Technical and Security Health: Take full ownership of dLocal’s infrastructure reliability, ensuring adherence to key availability and security KPIs.
Optimize Alerting Systems: Continuously refine alerting signals to minimize noise and ensure they are always actionable, reducing fatigue and improving response efficiency.

Which skills do you need?
Over 4 years of experience as an SRE, or in a very similar role with a stronger focus on observability
Expertise in Kubernetes, including its core components, deployment methodologies, and monitoring best practices
Some understanding of OpenTelemetry, including setting up OTEL collectors, instrumentation, and pipeline optimization
Proficiency with monitoring and logging tools such as Grafana, Prometheus, Loki, New Relic, or Datadog
Hands-on experience with IaC tools (Terraform) and GitOps CI/CD solutions (ArgoCD, GitHub Actions, or similar)
Experience integrating incident management platforms (PagerDuty, Jira) with automated alerting workflows
Strong scripting abilities (Python, Go, or similar) for automating observability tasks
A problem-solving mindset, with the ability to collaborate across multi-functional teams to drive reliability improvements

You will stand out if you have:
Cloud experience, especially AWS and ECS-based workloads
Experience managing observability pipelines at scale in high-throughput environments
Familiarity with Configuration-as-Code (Ansible, Chef, or SaltStack) for managing configurations across legacy instances
Database performance monitoring experience, particularly in large-scale distributed environments

What do we offer?
Besides the tailored benefits we have for each country, dLocal will help you thrive and go that extra mile by offering you:
Remote work: work from anywhere or from one of our offices around the globe*
Flexibility: we have flexible schedules and we are driven by performance
Fintech industry: work in a dynamic and ever-evolving environment, with plenty to build and boost your creativity
Referral bonus program: our internal talents are the best recruiters, so refer someone ideal for a role and get rewarded
Learning & development: get access to a Premium Coursera subscription
Language classes: we provide free English, Spanish, or Portuguese classes
Social budget: you'll get a monthly budget to chill out with your team (in person or remotely) and deepen your connections
dLocal Houses: want to rent a house to spend one week anywhere in the world coworking with your team? We’ve got your back
* For people based in Montevideo (Uruguay) applying to non-IT roles, 55% monthly attendance at the office is required.

What happens after you apply?
Our Talent Acquisition team is invested in creating the best candidate experience possible, so don’t worry, you will definitely hear from us. We will review your CV and keep you posted by email at every step of the process. You can also check out our webpage, LinkedIn, Instagram, and YouTube for more about dLocal.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Seniority level: Not Applicable
Employment type: Full-time
Job function: Engineering and Information Technology
-
Site Reliability Engineer
1 week ago
Rome, Spain | Immobiliare.it | Full-time
Immobiliare.it S.p.A. is an Italian group of companies specializing in Digital Tech services for buying, selling, and renting real estate, aimed at private individuals, real estate professionals, banks, and financial-sector operators. Founded in 2005, Immobiliare.it, the No. 1 real estate portal in Italy, has expanded its offering...
-
Service Reliability Engineer
1 week ago
Rome, Spain | BNL BNP Paribas | Full-time
We are looking for a Service Reliability Engineer who, within the Application Production “Center of Expertise”, will ensure the continuity, stability, and performance of IT services by monitoring SLOs, anticipating problems, and collaborating with project teams to build in reliability criteria from the earliest design stages. Overseeing...
-
Remote SRE: Build Ultra-Reliable Web Infra
1 week ago
Rome, Spain | Immobiliare.it | Full-time
A leading company in the Italian real estate sector is looking for a Site Reliability Engineer, based in Rome or fully remote. The ideal candidate will have experience managing GNU/Linux systems and will be responsible for project efficiency and infrastructure reliability. We offer growth opportunities and participation in national and...
-
RAMS Engineer
1 week ago
Rome, Spain | Tinexta Defence | Full-time
Tinexta Defence is looking for an enthusiastic and motivated RAMS System Engineer to take part, as a consultant, in RAMS systems engineering activities for a major European company operating in the Defence sector. Responsibilities: Producing Reliability, Availability, Maintainability, Safety, and Testability Analysis...
-
Internship: Proposal
1 week ago
Rome, Spain | Gruppo Sapio | Full-time
A company in the semiconductor sector is looking for a Proposal & Technical Sales Engineer intern for its technical sales team. The role involves technical analysis and drafting proposals. An engineering degree with a strong technical background and knowledge of English (at least B1) are required. Good communication skills,...
-
Aerospace Systems Reliability Engineer
1 week ago
Rome, Spain | Immobiliare.it | Full-time
Unfortunately, applications from abroad cannot be considered for this position. The profile is rounded out by an analytical approach and a problem-solving orientation, the ability to work in multidisciplinary teams, proactivity and initiative, and a capacity for effective management… TXT e-Tech Srl, a TXT Group company, is...
-
Proposal & Technical Sales Engineer
1 week ago
Rome, Spain | Gruppo Sapio | Full-time
Gruppo Sapio is looking for an intern to join its technical sales team as a Proposal & Technical Sales Engineer in the Semiconductor sector. Responsibilities: Technical analysis of specifications, RFQs, and tender packages. Defining the response strategy and drafting technical and commercial proposals. Coordinating the proposal team...
-
German speaking Customer Service Specialist – Lisbon
1 week ago
Rome, Spain | Lingo-nova | Full-time
Start Date: 8 January 2026 | Location: Lisbon, Portugal | Work Model: On-site | Contract Type: Fixed-term contract (CDD – 6 months)
Work Schedule: Monday to Friday, 06:00 – 22:00 (rotational...
-
RAMS Systems Engineer – Difesa Elettronica
1 week ago
Rome, Spain | Tinexta Defence | Full-time
A major company in the Italian Defence sector is looking for a motivated RAMS System Engineer. The ideal candidate will hold a Master's degree in Engineering or a technical-scientific discipline and will be involved in producing reliability, availability, and maintainability analyses for military electronic systems. It is necessary...
-
Technical Asset Manager – PV
1 week ago
Rome, Spain | Greentalent | Full-time
Greentalent is a recruiter specialized in searching for and selecting professionals and managers in the Energy, Engineering and Environment market. On behalf of an investment and asset management platform dedicated to the renewable energy generation sector, it is looking for a: Key...