Senior Site Reliability Engineer
hace 5 días
Job Description
Location:
Fully remote EU timezone (CET ±2h)
Start date:
ASAP
Languages:
Fluent English is mandatory
Industry:
Cloud Computing
We are hiring at Pragmatike to expand our team and drive the growth of our internal projects.
Our focus is on developing cutting-edge solutions in Cloud Computing, while fostering a culture of collaboration and innovation. Joining us means being part of a passionate team where your ideas and skills directly contribute to shaping tomorrows technologies.
If you're excited about working on ambitious projects in a dynamic and flexible environment, we'd love to hear from you
Responsabilities:
- Operate and maintain Linux-based infrastructure (Debian/Ubuntu).
- Deploy, manage, and scale Kubernetes clusters across bare-metal, virtualized, and on-prem environments.
- Oversee full cluster lifecycle: upgrades, node pools, networking, storage, and security hardening.
- Implement automation for provisioning and operations using Ansible, Bash/Python, and GitOps workflows.
- Design and maintain networking architecture including VLANs, L2/L3 routing, VPNs, and multi-site connectivity.
- Build automated deployment workflows (PXE boot, Preseed, cloud-init).
- Deploy and maintain observability stacks (Prometheus/Grafana, Loki, ELK, Graylog).
- Lead incident response activities, define SLOs/SLIs, and optimize alerting and monitoring pipelines.
- Manage virtualization and orchestration layers (OpenStack, Proxmox, VMware).
Requirements:
Expert-level, hands-on experience operating Kubernetes in production environments.
- Strong proficiency with Linux systems administration (Debian/Ubuntu).
- Solid understanding of networking fundamentals (VLANs, routing, VPNs).
- Experience building and maintaining automation workflows (Ansible, Bash/Python, Git-based).
- Experience with observability stacks such as Prometheus, Grafana, ELK, Loki, or Graylog.
- Background with virtualization technologies (OpenStack, Proxmox, VMware).
- Strong understanding of distributed systems and container orchestration.
- Ability to work autonomously in a fast-paced, engineering-driven environment.
Nice To Have:
- Experience with service mesh (Istio, Linkerd) or advanced CNI implementations.
- Knowledge of Cloudflare APIs, DNS automation, or tunnel configurations.
- Experience with GPU infrastructure, node preparation, or resource scheduling.
- Familiarity with security best practices (RBAC, firewalls, network policies).
- Exposure to IT asset management or license tracking workflows.
Why Join Us:
- 100% remote work with flexible hours
- High-impact role with autonomy and ownership
- Collaborative and international engineering team
- Cutting-edge tech stack with strong focus on reliability and automation.
-
Site Reliability Engineer
hace 2 semanas
Madrid, Madrid, España SIX A tiempo completoBME - Bolsas y Mercados Españoles - drives the transformation of financial markets and belongs to SIX, the third largest exchange group in Europe.What sets us apart drives us ahead: between local roots and global relevance, we are a unique blend of tradition and future, of foundation and growth. We value bright minds and inspire them to grow with their...
-
Senior Site Reliability Engineer
hace 2 semanas
Madrid, Madrid, España Colliers A tiempo completoCompany DescriptionColliers is a leading diversified professional services and investment management company. With operations in 68 countries, our 22,000 enterprising people work collaboratively to provide expert advice to maximize the potential of property and real assets to accelerate the success of our clients, our investors and our people.We are at the...
-
Senior Site Reliability Engineer
hace 2 semanas
Madrid, Madrid, España Colliers A tiempo completoColliers is a leading diversified professional services and investment management company. With operations in 68 countries, our 22,000 enterprising people work collaboratively to provide expert advice to maximize the potential of property and real assets to accelerate the success of our clients, our investors and our people.We are at the forefront of the...
-
Senior Site Reliability Engineer
hace 2 semanas
Madrid, Madrid, España Nexthink A tiempo completoCompany DescriptionNexthink is the leader in digital employee experience management software. The company provides IT leaders with unprecedented insight allowing them to see, diagnose and fix issues at scale impacting employees anywhere, with any applicationor network, before employees notice the issue. As the first solutionto allow IT to progress from...
-
Senior Site Reliability Engineer
hace 1 semana
Madrid, Madrid, España Nexthink A tiempo completoCompany Description Nexthink is the leader in digital employee experience management software. The company provides IT leaders with unprecedented insight allowing them to see, diagnose and fix issues at scale impacting employees anywhere, with any application or network, before employees notice the issue. As the first solution to allow IT to progress from...
-
Site Reliability Engineer
hace 2 semanas
Madrid, Madrid, España Electronic Arts (EA) A tiempo completoDescription & RequirementsElectronic Arts creates next-level entertainment experiences that inspire players and fans around the world. Here, everyone is part of the story. Part of a community that connects across the globe. A place where creativity thrives, new perspectives are invited, and ideas matter. A team where everyone makes play happen.*Description &...
-
Site Reliability Engineer
hace 5 días
Madrid, Madrid, España Merlin Digital Partner A tiempo completoWe are Merlin Digital PartnerA leading IT and Digital headhunting company who stands out from the crowd, boasting over a decade of experience. We've successfully collaborated and played a pivotal role in the growth of industry heavyweights such as Wallapop, Glovo, Banc Sabadell, and Factorial, among others.Our emphasis lies in people-centric approaches and...
-
Site Reliability Engineer
hace 4 días
Madrid, Madrid, España CrowdStrike A tiempo completoAs a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn't changed — we're here to stop breaches, and we've redefined modern security with the world's most advanced AI-native platform. We work on large scale distributed systems, processing almost 3...
-
Site Reliability Engineer
hace 5 días
Madrid, Madrid, España Exoticca A tiempo completoWhat is Exoticca?Exoticca is a pioneering online travel agency that has revolutionized the conception, production, and e-commerce of long-distance dream trips. At the core of Exoticca's brand equity is the commitment to "creating life milestones." We believe in delivering best-value trips, exploring unique destinations, curating extraordinary travel...
-
Site Reliability Engineer
hace 2 semanas
Madrid, Madrid, España Happyrobot A tiempo completoAbout HappyrobotHappyRobot is the AI-native operating system for the real economy—a system that closes the circuit between intelligence and action. By combining real-time truth, specialized AI workers, and an orchestrating intelligence, we help enterprises run complex, mission-critical operations with true autonomy.Our AI OS compounds knowledge, optimizes...