.SRE Tooling & Observability Platform ExpertJob title: SRE Tooling & Observability Platform Expert- Spain / BarcelonaAbout the jobAt Sanofi CHC, we're committed to providing the next-gen healthcare that patients and customers need. Join our team as SRE Tooling & Observability Platform Expert and you can help make it happen. Your job? The SRE Tooling & Observability Platform Expert at CHC is a specialized role designed to enhance the reliability, scalability, and efficiency of our platforms through expert implementation and management of SRE tooling. This role focuses on logging and analyzing, alerting, monitoring, configuration and Infrastructure as Code (IaC), and incident management to ensure high availability and performance across all systems. The SRE Tooling Expert will work closely with the platform engineering and site reliability teams to develop and maintain a robust tooling ecosystem that supports CHC's operational and business goals.Main responsibilities:- Develop and maintain a comprehensive suite of SRE tools for logging, analyzing, alerting, and monitoring to ensure system reliability and performance.- Implement and manage configuration and Infrastructure as Code (IaC) solutions to automate and streamline infrastructure provisioning and management processes.- Design and implement effective incident management strategies and tools to quickly identify, respond to, and resolve system issues.- Collaborate with engineering teams to integrate SRE tooling into the development and operational lifecycle, enhancing system observability and reliability.- Continuously evaluate and introduce improvements to the SRE tooling ecosystem, staying ahead of industry trends and best practices.- Participate in the planning and execution of system scalability and reliability initiatives, ensuring the infrastructure can support growing workloads and traffic.- Partner with Operations team, ensuring they have all the tools needed to be best in class.About youExperience:The ideal candidate for the SRE Tooling Expert position at CHC is someone who possesses a deep technical proficiency across a broad spectrum of SRE tooling and demonstrates a proven track record of applying these skills in a dynamic environment. This individual will have extensive experience in logging, analyzing, alerting, monitoring, and incident management, showcasing their ability to ensure system reliability and performance.Soft skills:- Strong analytical and critical thinking skills, with the ability to develop creative solutions to complex problems.- Excellent communication skills, ensuring clear and effective technical information exchange among various stakeholders.Technical skills:- Proficiency with logging tools such as ELK (Elasticsearch, Logstash, Kibana), Splunk, Dynatrace or Datadog.- Experience with alerting tools like Prometheus Alertmanager, Grafana, or PagerDuty