Cloud Infrastructure and Reliability Engineer
Full time. Barcelona, Spain (relocation support possible)
We are Zoundream, a dynamic, growing startup, developing the world's most advanced technology for infant cry analysis, using AI and sound recognition. Our team has a strong background in machine learning and embedded systems, a mature core technology that we are continually improving upon, and an award-winning product based on it. We are now in the process of developing the next iteration of this product, which includes a cloud-based service.
We are looking to expand our team with a cloud engineer specializing in the AWS platform, with a proven track record in implementing and monitoring services that scale to match varying levels of traffic while maintaining a high level of reliability at all times. Together with the rest of our team, you will help grow our cloud offering into a highly scalable and reliable service, ready to cater to a global audience. In this role you will have ample autonomy in choosing the best tools and technologies to achieve this goal. You will also have the opportunity to shape the growth of our cloud engineering team as we expand it further, as well as that of the company as a whole.
What you will do:
Improve and expand our monitoring and observability system to:
  Alert on anomalies before they become problems.
  Help the engineering team identify and resolve the root cause of any detected anomalies.
  Provide our business people with a clear and constantly up-to-date view of operational costs and traffic patterns, so they can make well-informed decisions on future offerings and partnerships.
Contribute to and improve our CI/CD infrastructure, and establish practices that ensure the quality of our code as well as smooth and safe deployments.
Contribute to ensuring that our organization's security practices meet GDPR and ISO 27001 requirements.
Work with the rest of the engineering team to identify ways to reduce costs and improve performance while maintaining or increasing reliability and security.
Constantly document your work and engage in knowledge transfer with the rest of the team.
Help foster a work culture aligned with the most current DevOps and IT security principles and practices.
Help write job descriptions and participate in interviews for hiring new staff. As our cloud engineering team grows, your help in shaping it will be invaluable.
What you will bring:
A strong and demonstrable track record in designing, implementing, deploying and monitoring highly scalable and reliable cloud services.
An honest and direct approach to the review of existing systems and infrastructure.
Confidence in your experience and expertise.
An interest in, and willingness to help shape, the direction of the company as it grows.
Tech Stack:
The technology stack driving our products includes Python 3 and Node.js/TypeScript. We mainly use AWS, from Lambda to API Gateway, to ECS, and more. However, we are also interested in Kubernetes (K8s) skills, whether on GCP's GKE or on Azure. We also use Terraform, Packer, and Ansible to codify and manage our infrastructure, Wazuh and Sensu for monitoring, and GitHub Actions for CI (though this is open to change). Finally, Linux systems administration and information security skills and experience are also very welcome.
What type of person you are:
Proactive.
Independent.
Entrepreneurial spirit.
Curious, always willing to learn new things.
Structured. Able to read a project plan and stick to agreed milestones and timelines.
Fluent in English.
What we offer:
The chance to become a key person in the core business of an ambitious high-tech company.