..Centric Pricing (formerly StyleSage) is an AI-driven competitive assortment benchmarking and market trend insights solution for fashion, beauty, and home goods brands and retailers. We are a key innovation partner for iconic and emerging brands across the world. Our platform analyzes the info of more than 1,000 retailers, processing data from over 600,000 brands, tracking millions of products!The Data Science team is responsible for enriching the data that our crawlers collect massively from fashion-related websites with our own machine learning models. Our models add information to the existing products such as categories (clothing, footwear, beauty...), genders, attributes, colors, bounding boxes, etc.The database already contains more than 500 million products (growing daily) and we process 1-2M new products every week. To do that, you will use the latest and best open-source technologies available. We code in Python (and we love it; you may want to come to the PyCon Spain conference with us!), using Keras as our main Deep Learning framework (although we are starting to use PyTorch for certain projects) along with other machine learning and computer vision libraries like scikit-learn or OpenCV. On the engineering side, we use Django as our main framework for accessing the data. We are a cloud-native company, so our code runs in AWS. Our massive amount of data lives in PostgreSQL databases, and we monitor all this using observability tools like Grafana, InfluxDB, and Telegraf. If you do not know a lot about some of those technologies, worry not; our engineers will be happy to support you while you become an expert in them.Your Job:As a Data Scientist, you will be responsible for ensuring that our current data science pipelines run smoothly over time with the best performance, as well as developing new machine learning pipelines and algorithms by:Creating datasets from our vast data lake of products and social media data, selecting the most relevant items for your use case, and ensuring data quality.Hands-on training, deploying, productionizing, and operating Machine Learning models and pipelines at scale, including both batch and real-time use cases.Contributing to expanding and improving the infrastructure to support all stages of the machine learning model lifecycle, including feature engineering, feature store, model training, testing, monitoring, and deployment in a production environment.Proactively identifying and implementing internal process improvements, including automating manual work, optimizing data delivery, and redesigning infrastructure for greater scalability.Staying up-to-date with the latest industry trends and technologies to ensure our ML capabilities remain competitive and cutting-edge.Onboarding and enabling Data Scientists with different levels of engineering expertise.Your Skills:3+ years of experience working as a software engineer.Bachelor's degree in Computer Science, Engineering, or a related field.
#J-18808-Ljbffr