Overview: As Senior Systems Site Reliability Engineer (Kubernetes), you will be involved in exciting technical challenges by analyzing, troubleshooting, and designing vital services, platforms, and infrastructure while always thinking about reliability, scalability, resilience, security, and performance. Reporting to the Vice President, Customer Service & Operations SITA FOR AIRCRAFT, you will be a part of the team responsible for helping to support 24x7 uptime and availability of production mission-critical customer-facing cloud services distributed across multiple regions. You'll help to create more consistent, automated push-button environments across all tiers, proactively test and tune all aspects of the infrastructure, streamline CI/CD processes, monitor and respond to system notifications and alerts, and continually work to optimize and improve the performance, security, and reliability of our systems. Are you ready to be part of the future?
What you will do: Help build a Site Reliability Engineering culture across the organization by sharing your best practices, approaches, documentation, and code with other engineering teams.Conduct system analysis, configuration management and develop improvements for system software performance, availability, and reliability.Design, write, ship, and motivate the creation of software and systems to increase observability, product reliability, and organizational efficiency.Work closely with software engineers and testers to ensure the system is responding properly to non-functional requirements such as performance, security, and availability.Document your system knowledge as you acquire it over time, create runbooks, and ensure critical system information is readily available to those who need it.Maintain and monitor deployment, orchestration of the servers, docker containers, databases, and general backend infrastructure.Keep up-to-date with security and proactively identify, diagnose, and solve complex security issues. Qualifications: Who You Are: 4+ years' experience as SRE/DevOps Engineer - Mandatory.Demonstrable experience in Containerization - Docker and orchestration (Kubernetes) - Mandatory.Demonstrable experience in CI/CD tools such as Bitbucket, Bamboo, Nexus, and Helm - Mandatory.Experience with Infrastructure as Code (Terraform, Cloud Formation, Ansible).Knowledge and proven hands-on experience in large-scale databases and distributed technologies, such as Kafka and Confluent Platform Kafka.Basic programming and scripting skills (preferably Golang, Bash, Shell, etc.).Ability to provide advice, best practices, and recommendations for the operation and deployment of Microsoft Azure.Experience in monitoring and analyzing infrastructure performance using standard performance monitoring tools - Nagios, New Relic, Perfmon, PerfView, ProcDump, DebugDiag.Familiarity with Linux and UNIX systems (e.g., CentOS, RedHat) and command line system administration such as Bash, VIM, SSH.Hands-on experience in configuration management of server farms (using tools such as Puppet, Chef, Ansible, etc.).Network routing, Load balancing and Networking protocols, a base knowledge of TCP/IP, with an understanding of HTTP and DNS.SRE & Agile methodologies with B. Tech./B.E. degree in Electronics & Telecomm or Computer Science.Demonstrated understanding of ITIL methodologies, ITIL v3 or v4 certification.Kubernetes CKA or CKAD certification nice to have. What we offer: SITA's workplace is all about diversity: many different countries and cultures are represented in our workforce, and colleagues who've been working here for decades collaborate with those just out of college and early in their careers. SITA is a place of change and constant improvement, where we're always pushing ourselves to find better ways of doing things: smarter, quicker, easier, for us and our customers and for their customers too. And we offer all the good stuff you'd expect like holidays, bonus, flexible benefits, medical policy, pension plan and access to world-class learning.
Welcome to SITA. We design, build, and support technology solutions all with one vision to create easy air travel every step of the way. As an organization, we cover 95% of all international air travel destinations and work with over 2,800 air transport and government customers in every corner of the globe. Are you ready to explore the opportunities?
Keywords: Senior SRE, SRE, Kubernetes, Linux, Docker, DevOps, Cloud
#J-18808-Ljbffr