Senior Site Reliability Engineer

7 hours ago


Madrid, Madrid, Spain Colliers Full-time

Company Description
Colliers is a leading diversified professional services and investment management company. With operations in 68 countries, our 22,000 enterprising people work collaboratively to provide expert advice to maximize the potential of property and real assets to accelerate the success of our clients, our investors and our people.

We are at the forefront of the real estate industry, leading the way and backed by an exceptional record of success. We are building for our future, and yours.
We strive to build our business at a competitive pace by augmenting internal growth with smart strategic acquisitions that increase market share, expand service offerings and extend our geographic reach for the benefit of our clients and shareholders.

For more than 29 years, Colliers has created value for shareholders that has resulted in superior returns and industry growth. Our people also own significant equity in our business, which brings pride of ownership to everything we do. We are passionate, take personal responsibility and always do what's right for our clients, people and communities.

Job Description
We are looking for an experienced and passionate Senior Site Reliability Engineer for our newly established Technology Hub in Madrid. This is a unique opportunity to become part of the founding team that helps shape the culture, practices, and technical direction. Working in a hybrid model (2 days on-site), you will enjoy significant freedom to innovate and influence the future of one of the world's largest commercial real estate companies. With the leadership of the Global Technology Hub coming from a background of technology startups, you will benefit from a fast-paced learning environment, high visibility of your contributions and opportunities to shape processes, culture and technology strategies. At the same time, you will take advantage of the stability, resources, and reach of a successful global company.

Guided by our global digital strategy, the Madrid Hub collaborates closely with international teams to deliver world-class technology solutions that power the future of commercial real estate.

As Senior Site Reliability Engineer, you are focused on ensuring the reliability, performance and availability of our applications and platforms across GCP and Azure, while enabling development teams to ship faster with confidence.

As a senior member of the DevOps team, you will help design and implement observability systems, reliability practices, and incident response processes in collaboration with Software Engineering and Infrastructure teams. Your mission is to bring an engineering-first mindset to operations, applying automation, data, and feedback loops to continuously improve the resilience of our systems and platforms. You will contribute to global products and platforms that serve both internal and external customers in Commercial Real Estate across multiple regions. Working closely with international Product, Engineering, DevOps, Data, QA and Architecture teams, you will ensure delivery excellence, engineering quality and a great consumer experience.

You are a hands-on problem-solver with strong design principles who thrives in a collaborative and agile environment. As a senior member of the DevOps function, you will set technical direction, drive best practices, and mentor junior engineers while building solutions with real business impact.

Reliability Engineering and Operational Excellence

  • Define and maintain Service Level Indicators, Service Level Objectives and Service Level Agreements across critical services in partnership with Product Owners, Engineering and Infrastructure teams.
  • Identify resilience gaps and lead initiatives such as redundancy improvements and scaling strategies to address these.
  • Automate incident response, recovery and scaling where possible.
  • Build tooling for self-healing infrastructure and applications, reducing manual intervention.
  • Contribute to runbooks, playbooks and knowledge sharing for operations best practices.
  • Build mechanisms to ensure error budgets are respected and used to drive prioritization decisions.

Observability, Monitoring and Incident Management

  • Design, implement and evolve monitoring, logging, and tracing systems (Azure Monitor, GCP Operations Suite, Prometheus, Grafana, Datadog).
  • Develop dashboards and alerting systems that provide actionable insights for engineers and stakeholders.
  • Design and implement a comprehensive ChatOps strategy and ensure close integration with Teams.
  • Collaborate with QA teams and Engineering teams to integrate performance and availability testing into CI/CD pipelines.
  • Lead incident response and postmortems, ensuring learnings are captured and acted upon.
  • Partner with Engineering Ops to ensure metrics and trends are tracked, reported, and tied into continuous improvement initiatives.
  • Drive a blameless culture of reliability, focused on learning, prevention and continuous improvement.

Leadership and Collaboration

  • Work closely with Software Engineering teams and Engineering Ops to embed reliability best practices in the development lifecycle.
  • Partner with CloudOps Engineers to ensure resilient cloud architectures.
  • Collaborate with Platform Engineers to optimize container and Kubernetes workloads for reliability.
  • Support Product Owners with visibility into availability, reliability trade-offs and error budgets.
  • Mentor engineers across the organization in incident best practices.
  • Provide input into the global operating model for cloud management, balancing standardization with regional needs.

Qualifications
Required Skills/Experience:

  • 5-8+ years of professional experience in SRE, reliability engineering or production operations.
  • Strong knowledge of GCP and Azure services, with focus on reliability, high availability, and scalability.
  • Hands-on experience with monitoring, logging and observability stacks (Azure Monitor, GCP Operations Suite, Prometheus, Grafana, Datadog).
  • Deep experience with incident response, postmortems and reliability reporting.
  • Proficiency in automation and scripting (Bash, PowerShell, Python).
  • Familiarity with Kubernetes, containers and service mesh technologies.
  • Agile Development experience (Scrum/Kanban), including story estimation, code reviews and pair programming.
  • Experience working in distributed or remote teams and using tools such as Jira, GitHub, GitLab, Miro and Azure DevOps.
  • Excellent interpersonal and communication skills, fluent in English and Spanish.
  • Understanding of secure development practices and compliance frameworks (e.g. ISO 27001, GDPR, SOC 2).

Preferred Skills/Experience:

  • Experience with reliability-focused testing (load, performance, failover).
  • Contributions to internal development platforms or developer experience initiatives.
  • Ability to work in a fast-paced, growing tech environment and to foster change in larger organizations.