Centric Pricing (formerly StyleSage) is an AI-driven competitive assortment benchmarking and market trend insights solution for fashion, beauty, and home goods brands and retailers. We are a key innovation partner for iconic and emerging brands across the world. Our platform analyzes data from more than 1,000 retailers, covering over 600,000 brands and tracking millions of products!

The Data Science team is responsible for enriching the data that our crawlers collect at scale from fashion-related websites, using our own machine learning models. Our models add information to existing products, such as categories (clothing, footwear, beauty…), genders, attributes, colors, bounding boxes, etc.

The database already contains more than 500 million products (growing daily), and we process 1-2M new products every week. To do that, you will use the latest and best open-source technologies. We code in Python (and we love it, you may want to come to the PyCon Spain conference with us!), using Keras as our main deep learning framework (although we are starting to use PyTorch for certain projects), along with other machine learning and computer vision libraries like scikit-learn or OpenCV. On the engineering side, we use Django as our main framework for accessing the data. We are a cloud-native company, so our code runs in AWS. Our massive amount of data lives in PostgreSQL databases, and we monitor everything using observability tools like Grafana, InfluxDB, and Telegraf. If you do not know much about some of these technologies, worry not: our engineers will be happy to support you on your journey to becoming an expert in them.

Your Job

We are seeking a highly skilled and innovative Data Scientist to join our team and contribute to the research and implementation of cutting-edge Large Language Models for analyzing our raw data and reports and generating insights from them in the fashion retail industry.
The ideal candidate will have a strong background in machine learning, deep learning, and NLP, with a particular focus on generative models.

Your Skills

- 3+ years of experience working as a software engineer.
- Bachelor's degree in Computer Science, Engineering, or a related field.
- 3+ years of experience as a production-level Python developer, working with deep learning frameworks: TensorFlow, Keras, or PyTorch.
- Experience with machine learning and Python data libraries like scikit-learn, pandas, or NumPy.
- Experience with NLP (Natural Language Processing).
- Experience with LLMs: RAG (Retrieval-Augmented Generation), LangChain, prompt engineering, etc.

Bonus Points

Additionally, it would be nice if you are familiar with:

- Django ORM.
- PostgreSQL or other relational databases.
- Transformers and other SOTA deep learning architectures.
- HuggingFace and out-of-the-box LLMs.
- Image processing libraries like OpenCV or Pillow.
- NLP libraries such as spaCy or NLTK.
- Asynchronous processes with RabbitMQ and Celery.