Job Description:
The Online Tribe at Deutsche Bank Private Bank Germany is responsible for the digital customer access channels and services offered by Deutsche Bank and Postbank.
The Tribe's application portfolio consists of modern cloud-driven Online Banking & Brokerage customer services, administration frontends used by internal employees, as well as further customer-oriented innovative solutions.
We work in an agile environment, focusing on customer centricity and outstanding user experience, with high reusability and flexibility of technical solutions in mind.
Our fundamental principle is that the highest standards of application availability, scalability, technology, and security are a must.
Tasks:
As a Site Reliability Engineer, you contribute to the overarching implementation and operation of Deutsche Bank's Online Banking platform on Google Cloud. You provide hands-on support to the Investments feature squads, following the paradigm "you build it, you run it", with a focus on various digital investment processes.
Utilize your experience, technical knowledge, and DevOps mindset in driving platform optimization and preventive measures.
Ensure the scalability, availability, and performance of the application through automation, CI/CD, and application monitoring.
Prevent problems proactively through continuous monitoring of overall system health, and identify potential weak points of the architecture and infrastructure.
Actively support incident analysis for any production irregularities by investigating possible root causes and suggesting solutions.
Through collaboration with the squad's Product Owner and engineers, help to identify priority measures, define SLIs & SLOs, and create effective strategies for maintaining and improving system performance and availability.
Support release deployments and iteratively enhance current deployment procedures and pipelines.
Act as a multiplier for Site Reliability Engineering and Cloud knowledge in the SRE chapter, setting best practices.
Skills & Experience:
Expert knowledge and hands-on experience with applications running in cloud ecosystems such as Google Cloud, as well as with Docker/Kubernetes in combination with GKE or similar technology, automation and CI/CD, Terraform/Terragrunt, Terraform Enterprise, Ansible, and GitHub.
Advanced experience in monitoring with tools such as OpsGenie, New Relic, DataDog, Splunk, or Google Operations Suite.
Very good knowledge of security standards (e.g., TLS, OAuth2, KMS, Vault, Admission Controllers, Let's Encrypt), microservice architectures, and experience with API Management with Apigee or WSO2.