Site Reliability Engineer
hace 7 días
Responsibilities & Qualifications Provide day-to-day operational support for production environments, ensuring high availability and reliability of critical services. Develop, maintain and enhance automation scripts and tools using Bash, Python and Ansible to streamline operational tasks and incident response. Monitor system performance, proactively identify issues and implement solutions to prevent service disruptions. Collaborate with development, QA and infrastructure teams to implement practices for deployment, monitoring and incident management. Participate in on-call rotation and respond to production incidents, performing root cause analysis and driving resolution. Maintain and improve configuration management, CI/CD pipelines and infrastructure as code practices. Document operational processes, troubleshooting steps and automation workflows. Proven experience in a production support or SRE role within a complex, high-availability environment. Strong automation skills with proficiency in Bash, Python and Ansible. Experience with monitoring and alerting tools (e.g. Prometheus, Grafana, Elastic stack, Datadog). Solid understanding of Linux/Unix systems administration and troubleshooting. Familiarity with cloud platforms (e.g. AWS) and containerisation technologies (e.g. Docker, Kubernetes). Experience with configuration management and infrastructure as code tools (e.g. Terraform, CloudFormation). Knowledge of networking fundamentals, security practices and incident management processes. Excellent problem-solving skills, attention to detail and ability to work under pressure. Strong communication and collaboration skills. Experience with version control systems (e.g. Git). Familiarity with Agile methodologies and DevOps culture. Exposure to database administration and troubleshooting (e.g. MySQL, PostgreSQL, Oracle). Scripting or automation experience with other languages (e.g. Go, Ruby). One day of working from home per week. Engaging company culture with a focus on innovation. Seniority level Entry level Employment type Full-time Job function Engineering and Information Technology Industries Manufacturing #J-18808-Ljbffr
-
Site Reliability Engineer — Kubernetes
hace 7 días
Milano, España Moltiply Group A tiempo completoUna società di tecnologia con sede a Milano cerca un Site Reliability Engineer esperto per gestire e automatizzare l'infrastruttura IT. I candidati devono avere esperienza in strumenti di automazione come Ansible e container orchestration come Kubernetes. La posizione prevede modalità di lavoro ibrida, combinando smart working e presenza in ufficio. Si...
-
Remote Site Reliability
hace 7 días
Milano, España Canonical A tiempo completoA pioneering tech firm is looking for a Site Reliability / Gitops Engineer to enhance automation and cloud operations. This role requires an enthusiast for Linux who can develop infrastructure as code and maintain core services across global teams. Ideal candidates will have a strong engineering background, experience in software development and Linux...
-
Site Reliability Engineer
hace 7 días
Milano, España Moltiply Group A tiempo completoIn Moltiply affrontiamo e trasformiamo i processi più complessi dei nostri clienti -dal customer care alla digitalizzazione -unendo tecnologie avanzate e il talento di oltre 3.500 professionisti in Italia e nel mondo. La nostra missione è aiutare le aziende a moltiplicare il proprio valore, ridisegnando e semplificando modelli operativi con l’obiettivo...
-
Lead Site Reliability Engineer
hace 7 días
Milano, España PRAGMATIKE A tiempo completoJob Description Location : Fully remote EU timezone (CET ±2h) Start date : ASAP Languages : Fluent English is mandatory Industry : Cloud Computing We are hiring at Pragmatike to expand our team and drive the growth of our internal projects. Our focus is on developing cutting‑edge solutions in Cloud Computing, while fostering a culture of collaboration and...
-
Site Reliability
hace 7 días
Milano, España Canonical A tiempo completoCanonical is a leading provider of open‑source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation and IoT. Our customers include the world’s leading public cloud and silicon providers, and...
-
Senior Cloud Reliability Engineer
hace 7 días
Milano, España Generali Italia A tiempo completoA major global insurance player is seeking a Service Reliability Engineer & Application Maintenance Specialist to optimize their cloud platforms' reliability, scalability, and cost. This role involves working closely with the devOPS team to define recovery automations and ensure compliance with service SLAs. Candidates must have a degree in Computer Science,...
-
Milano, España ALDEBARAN Group A tiempo completoA prominent energy and infrastructure company is looking for a Site Field Engineer to coordinate activities for a major decarbonization project in the Netherlands. The ideal candidate will have an engineering degree and 3 to 5 years of relevant experience. Responsibilities include overseeing field engineering, ensuring compliance with local regulations, and...
-
Ingénieur Projet Site Junior
hace 7 días
Milano, España ALDEBARAN Group A tiempo completoSite Field Engineer – Ref. JOB-1505 Do you want to actively contribute to a major industrial project and gain solid on-site experience on a strategic decarbonization initiative? We are looking for a Junior Site Project Engineer / Junior Field Engineer to support engineering, construction, and permitting activities on a large-scale industrial project in the...
-
Ingénieur Projet Site Junior
hace 7 días
Milano, España ALDEBARAN Group A tiempo completoSite Field Engineer – Ref. JOB-1505 Do you want to actively contribute to a major industrial project and gain solid on‑site experience on a strategic decarbonization initiative? We are looking for a Junior Site Project Engineer / Junior Field Engineer to support engineering, construction, and permitting activities on a large‑scale industrial project in...
-
Service Reliability Engineer
hace 7 días
Milano, España Generali Italia A tiempo completoJob Description We are looking for a Service Reliability Engineer & Application Maintenance Specialist to ensure the reliability, scalability, and cost optimization of our cloud platforms. The ideal candidate will have strong experience in automation, proactive monitoring, and performance management, with a mindset focused on continuous improvement. Key...