Centric Pricing (formerly StyleSage) is an AI-driven competitive assortment benchmarking and market trend insights solution for fashion, beauty, and home goods brands and retailers. We are a key innovation partner for iconic and emerging brands across the world. Our platform is able to analyze the info of more than 1,000 retailers, processing data from more than 600,000 brands, tracking millions of products.
The Challenge: You will be part of the Data Collection Team, formed by a group of motivated individuals that focus on crawling services. This team is the origin and fuel of our pipeline, thus needing to guarantee data is extracted in a reliable, sustainable, and homologated way.
As Data Collection Engineer, your main mission is to deliver software systems focused on fast high-level web crawling by using Python web scraping frameworks, designing, developing, automating, and evolving tools for crawling at scale.
Responsibilities:Collaborate with the rest of the technical team to ensure the Data-Collection solutions align with the organization's goals, as well as customer needs.Build internal solutions used to crawl websites and extract structured data from their pages.Work around bot protections, analyzing patterns, state of the art and generating cutting-edge alternatives.Review software code written by other team members to identify bugs and improve the code quality.Remain current on technology trends to keep our software as innovative as possible.Desired Technical Skills:5+ years of experience working as a software engineer.Relevant experience implementing software in Python. Django knowledge is a plus.Experience using scraping frameworks, particularly Scrapy.Knowledge and agility working on low-level TCP/IP protocols (TLS, HTTP(S), SSL, etc).High knowledge of the web environment (model, standards, DOM, Request-Response, Cookies, Javascript, Browsers, Headers, XHR, etc.).Building well-documented and organized systems, following common coding conventions.Strong troubleshooting and debugging skills.Experience in Continuous Integration/Continuous Deployment (CI/CD) and related tooling.Familiarity with cloud platforms like AWS, GCP, or Azure. Experience with Docker is a plus.Experience with UNIX systems and scripting.SQL and some database administration knowledge.Git, as it is the version control system our whole company uses and it's deeply integrated with our development process.Soft Skills:Your job will require written and spoken communications in English.Collaborative skills and teamwork mindset; we work with people from different countries and time zones.Ability to work autonomously; we will be there to unblock you and help you with all your tasks at any time, but we expect you to do the heavy lifting by yourself.Analytic orientation, able to decompose complex problems and projects into manageable pieces; comfortably suggesting and presenting solutions.Centric Software provides equal employment opportunities to all qualified applicants without regard to race, sex, sexual orientation, gender identity, national origin, color, age, religion, protected veteran or disability status, or genetic information.
#J-18808-Ljbffr