Data Collection Engineer
hace 2 días
**Engineering - Remote, Madrid, Spain - Full Time**:
Centric Pricing (formerly StyleSage), is an AI driven competitive assortment benchmarking and market trend insights solution for fashion, beauty and home goods brands and retailers.
We are a key innovation partner for iconic and emerging brands across the world.
Our Platform is able to analyze the info of more than 1.000 retailers, processing data from more than 600.000 brands, tracking millions of products
**The Challenge**:
You will be part of the Data Collection Team, formed by a group of motivated individuals that focus on crawling services. This team is the origin and fuel of our pipeline, thus needing to guarantee data is extracted in a reliable, sustainable and homologated way.
As Data Collection Engineer, your main mission is to deliver software systems focused on fast highlevel web crawling by using Python web scraping frameworks, designing, developing, automating and evolving tools for crawling at scale.
**Responsibilities**:
- Collaborate with the rest of the technical team to ensure the Data-Collection solutions align with the organization’s goals, as well as customer needs.
- Build internal solutions used to crawl websites and extract structured data from their pages.
- Work around bot protections, analyzing patterns, state of the art and generating cutting edge alternatives.
- Review software code written by other team members to identify bugs and improve the code quality.
- Remain current on technology trends to keep our software as innovative as possible.
**Desired Technical Skills**:
- 5+ years of experience working as a software engineer
- Relevant experience implementing software in Python. Django knowledge is a plus
- Experience using scraping frameworks. It would be great if you have knowledge with Scrapy.
- Knowledge and agility working on low level TCP/IP protocols (TLS, HTTP(S), SSL, etc)
- High knowledge of the Web environment (model, standards, DOM, Request-Response, Cookies, Javascript, Browsers, Headers, XHR, etc.).
- Building well documented and organized systems, following common coding conventions.
- Strong troubleshooting and debugging skills.
- Experience in Continuous Integration/Continuous Deployment (CI/CD) and related tooling.
- Familiarity with cloud platforms like AWS, GCP, or Azure. Experience with Docker is a plus.
- Experience with UNIX systems and scripting.
- SQL and some database administration knowledge.
- Git, as it is the version control system our whole company uses and it’s deeply integrated with our development process.
**Soft Skills**:
- Your job will require written and spoken communications in English.
- Collaborative skills and teamwork mindset. We work with people from different countries and time zones.
- Ability to work autonomously. We will be there to unblock you and help you with all your tasks at any time, but we expect you to do the heavy lifting by yourself.
- Analytic orientation, able to decompose complex problems and projects into manageable pieces; comfortably suggesting and presenting solutions.
Centric Software provides equal employment opportunities to all qualified applicants without regard to race, sex, sexual orientation, gender identity, national origin, color, age, religion, protected veteran or disability status or genetic information.
-
Data Collection Engineer Senior
hace 2 semanas
Madrid, España BMIND A tiempo completo**¡En JAKALA IBERIA seguimos creciendo!**: **MODELO HÍBRIDO Y HORARIO FLEXIBLE** En **JAKALA Iberia** ***no paramos de crecer. Cada semana contamos con nuevas oportunidades y, en esta ocasión, buscamos un perfil de **Data Collection** **Engineer Senior** ***para incorporarse en el equipo de **Data Collection **y trabajar en proyectos a nível transversal...
-
Data Collection Engineer
hace 2 semanas
Madrid, España Appen A tiempo completo**About Appen** Appen is a leader in AI enablement for critical tasks such as model improvement, supervision, and evaluation. To do this we leverage our global crowd of over one million skilled contractors, speaking over 180 languages and dialects, representing 130 countries. In addition, we utilize the industry's most advanced AI-assisted data annotation...
-
Data Collection Engineer
hace 1 semana
Madrid, España Incode A tiempo completoPOWER A WORLD OF TRUST Incode is the leading provider of world-class identity solutions that is reinventing the way humans authenticate and verify their identities online to power a world of digital trust. Through our revolutionary identity solutions, we are unleashing the business potential of universal industries including finance, government, retail,...
-
Field Data Collection Specialist\Surveyor
hace 2 semanas
madrid, España TSMG A tiempo completoCompany description TSMG is a field data collection company founded in 2017 in Europe. We collect data where automation is not possible. We count features, take pictures, make videos, record speech, and scan areas for every detail you need to make more informed decisions. Our field data collection teams are spread across Europe and North America, ready to...
-
Language Data Engineer for Multimodal AI
hace 1 semana
Madrid, España Amazon A tiempo completoA leading technology company is looking for a Language Engineer in Madrid to develop and evaluate AI data models. This role involves designing data collection processes and analyzing data across multiple languages. The ideal candidate has a Master’s degree in a relevant field and extensive experience in language data projects. A collaborative mindset and...
-
Data collection
hace 2 semanas
Madrid, España Grupo IskayPet A tiempo completoGrupo Iskay Pet es el líder en Iberia en el cuidado de los animales de compañía. Iskay, cuyo significado en quechua es “la unión de dos”, surgió en 2020 con la fusión de Tiendanimal y Kiwoko. Con nuestras tiendas físicas, clínicas veterinarias, hospital veterinario y plataforma online, nos consolidamos como la mejor opción para quienes aman a...
-
DATA COLLECTION
hace 4 días
Madrid, España Grupo IskayPet (Tiendanimal, Kiwoko, Kivet, Clinicanimal) A tiempo completoGrupo IskayPet es el líder en Iberia en el cuidado de los animales de compañía. Iskay, cuyo significado en quechua es "la unión de dos", surgió en 2020 con la fusión de Tiendanimal y Kiwoko. Con nuestras tiendas físicas, clínicas veterinarias, hospital veterinario y plataforma online, nos consolidamos como la mejor opción para quienes aman a los...
-
Language Data Engineer for Multimodal AI
hace 1 semana
Madrid, España Amazon A tiempo completoA leading technology company is looking for a Language Engineer in Madrid to develop and evaluate AI data models. This role involves designing data collection processes and analyzing data across multiple languages. The ideal candidate has a Master’s degree in a relevant field and extensive experience in language data projects. A collaborative mindset and...
-
Data collection
hace 2 días
Madrid, España Grupo IskayPet A tiempo completoGrupo Iskay Pet es el líder en Iberia en el cuidado de los animales de compañía. Iskay, cuyo significado en quechua es “la unión de dos”, surgió en 2020 con la fusión de Tiendanimal y Kiwoko. Con nuestras tiendas físicas, clínicas veterinarias, hospital veterinario y plataforma online, nos consolidamos como la mejor opción para quienes aman a...
-
Data collection
hace 2 días
madrid, España Grupo IskayPet A tiempo completoGrupo Iskay Pet es el líder en Iberia en el cuidado de los animales de compañía. Iskay, cuyo significado en quechua es “la unión de dos”, surgió en 2020 con la fusión de Tiendanimal y Kiwoko. Con nuestras tiendas físicas, clínicas veterinarias, hospital veterinario y plataforma online, nos consolidamos como la mejor opción para quienes aman a...