Roche fosters diversity, equity and inclusion, representing the communities we serve.
When dealing with healthcare on a global scale, diversity is an essential ingredient to success.
We believe that inclusion is key to understanding people's varied healthcare needs.
Together, we embrace individuality and share a passion for exceptional care.
Join Roche, where every voice matters.The PositionThe role requires the candidate to be available for on-call duty service, responding promptly to urgent issues and emergencies outside of regular working hours, ensuring that critical situations are addressed in a timely and effective manner.Step into the Future of IT Infrastructure with Roche!
As a seasoned Site Reliability Engineer (SRE) at Roche, you'll leverage your deep software engineering expertise to propel our IT infrastructure to new heights of robustness, scalability, and reliability.
This isn't just a role—it's an invitation to shape the backbone of critical infrastructures and drive our technological innovations forward.Your MissionDesign and maintain cutting-edge tools, scripts, and frameworks that automate repetitive tasks, streamline software deployment, and manage expansive systems with unparalleled efficiency.Your Core ResponsibilitiesReliability Mastery: Proactively monitor and maintain system reliability using advanced tools like DataDog, VictorOps, ELK, Grafana, and Prometheus.Uptime Guardian: Ensure optimal uptime and performance by swiftly identifying issues and responding to alerts with precision.Technical Troubleshooter: Basic understanding of Architecture and designs to deep dive into complex technical issues, troubleshoot, investigate, and resolve them.Service Excellence: Maintain and consistently achieve defined SLAs, SLIs, and SLOs, ensuring service levels are consistently met or exceeded.Automation Innovator: Develop and deploy automation scripts (using Python or other scripting languages) to streamline operations, enhance system efficiencies, and reduce manual tasks.Cloud Steward: Manage and maintain robust infrastructure across AWS and Azure environments.Cross-functional Collaborator: Work closely with engineering, DevOps, security and operations teams to drive continuous improvement.Incident Responder: Handle requests and incidents through JIRA and ServiceNow, documenting troubleshooting procedures and solutions.Flexible Scheduling: Work on-call outside of normal working hours and weekends as scheduled.Team Builder: Actively contribute to the growth and development of the SRE team's capabilities.Who You AreEducational Background: Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent professional experience.Certifications: Relevant industry certifications (AWS/Azure).Experience: Approximately 5 years of experience in site reliability engineering, IT operations, DevOps, or related fields.Cloud Expertise: Solid experience with AWS and/or Azure.
#J-18808-Ljbffr