Ebury is a hyper-growth FinTech firm, named in 2021 as one of the top 15 European Fintechs to work for by AltFi.
We offer a range of products including FX risk management, trade finance, currency accounts, international payments, and API integration.
Senior Site Reliability Engineer - Fintech
Ebury Madrid Office - Hybrid: 4 days in the office, 1 day working from home
Ebury is a Global FinTech: we apply new technologies to enhance and automate financial services and processes.
This allows small and medium-sized businesses to trade and transact internationally by eliminating boundaries related to more traditional methods.
Are you ready to be an Eburian?
We are looking for an experienced Site Reliability Engineer to join our team.
In this role, you will be working within one of our platform engineering teams to ensure that our platform meets the needs of our customers and business objectives.
What we offer:
- Variety of meaningful and competitive benefits to meet your needs
- Competitive salary
- Continuous professional growth thanks to our career progression framework with regular reviews
- Equity process through a performance bonus
- Annual paid time off, plus local public holidays
- Continued personal development through training and certification
- Being part of a diverse technology team that cares deeply about culture and best practices, and believes in agile principles
- Contribute to our technical design through our open and collaborative Request For Comments (RFC) process
- We are Open Source friendly, following Open Source principles in our internal projects and encouraging contributions to external projects

What you will do:
- Work within a team of SREs to ensure high availability and reliability of our systems
- Develop and maintain monitoring, incident management, and troubleshooting of infrastructure and applications
- Utilize observability tools to gain insights into system performance and health, and make decisions for improvements
- Design and implement automation tools and processes to improve efficiency and reduce downtime
- Perform on-call on a rotating basis to address high-severity incidents and ensure system availability
- Work closely with development teams to ensure that their applications are designed for scalability and reliability
- Participate in the design and implementation of new systems and services to ensure they meet our reliability and scalability requirements
- Keep up-to-date with emerging technologies, tools, and practices related to SRE and infrastructure

What we expect from you:
- Several years of relevant industry experience building large scale distributed systems
- Solid understanding of cloud architecture and application deployment patterns on GCP or AWS
- Experience operating web-scale deployments of containerised systems on Kubernetes and Amazon Container Services
- A deep understanding of programming languages and the systems you've worked on
- A passion for architecting large systems with elegant interfaces that can scale easily