Ebury is a hyper-growth FinTech firm, named as one of the top 15 European Fintechs to work for by AltFi. We offer a range of products including FX risk management, trade finance, currency accounts, international payments, and API integration.

Senior Site Reliability Engineer - GCP - Fintech
Ebury Madrid Office - Hybrid (4 days in the office, 1 day working from home).

We apply new technologies to enhance and automate financial services and processes, allowing small and medium-sized businesses to trade and transact internationally by eliminating boundaries related to more traditional methods.

Are you ready to be an Eburian? We are looking for an experienced Site Reliability Engineer to join our team.

What we offer:
- A variety of meaningful and competitive benefits to meet your needs.
- Competitive salary.
- Continuous professional growth through our career progression framework with regular reviews.
- Equity process through a performance bonus.
- Annual paid time off, in addition to local public holidays.
- Continued personal development through training and certification.
- Being part of a diverse technology team that cares deeply about culture and best practices and believes in agile principles.
- Contributing to our technical design through our open and collaborative Request For Comments (RFC) process.
- An Open Source friendly culture: we follow Open Source principles in our internal projects and encourage contributions to external projects.

Why should I join Ebury?
- Work in a high-growth environment.
- Build a better world with a focus on inclusion.

We stand against discrimination in all forms and have no tolerance for intolerance. At Ebury, you will find an internal group dedicated to discussing how we can build a more diverse and inclusive workplace. If you're excited about this job opportunity but your background doesn't match exactly the requirements, we strongly encourage you to apply anyway. You may be just the right candidate for this or other positions we have.

What you will do:
- Work within a team of SREs to ensure high availability and reliability of our systems.
- Develop and maintain monitoring, incident management, and troubleshooting of infrastructure and applications.
- Utilize observability tools to gain insights into system performance and health, and make decisions for improvements.
- Design and implement automation tools and processes to improve efficiency and reduce downtime.
- Perform on-call duties on a rotating basis to address high-severity incidents and ensure system availability.
- Work closely with development teams to ensure that their applications are designed for scalability and reliability.
- Participate in the design and implementation of new systems and services to ensure they meet our reliability and scalability requirements.
- Keep up-to-date with emerging technologies, tools, and practices related to SRE and infrastructure.

What we expect from you:
- Several years of relevant industry experience building large-scale distributed systems