Data Collection Engineer

7 months ago


Madrid, Spain CENTRIC SOFTWARE INC Full time

Centric Pricing (formerly StyleSage) is an AI-driven competitive assortment benchmarking and market trend insights solution for fashion, beauty and home goods brands and retailers.

We are a key innovation partner for iconic and emerging brands across the world.

Our platform analyzes information from more than 1,000 retailers, processing data from more than 600,000 brands and tracking millions of products.

**The Challenge**:
You will be part of the Data Collection Team, a group of motivated individuals focused on crawling services. This team is the origin and fuel of our pipeline, so it must guarantee that data is extracted in a reliable, sustainable and standardized way.

As a Data Collection Engineer, your main mission is to deliver software systems for fast, high-level web crawling using Python web scraping frameworks: designing, developing, automating and evolving tools for crawling at scale.

**Responsibilities**:

- Collaborate with the rest of the technical team to ensure the data collection solutions align with the organization’s goals and with customer needs.
- Build internal solutions used to crawl websites and extract structured data from their pages.
- Work around bot protections by analyzing patterns, studying the state of the art and devising cutting-edge alternatives.
- Review software code written by other team members to identify bugs and improve the code quality.
- Remain current on technology trends to keep our software as innovative as possible.

**Desired Technical Skills**:

- 5+ years of experience working as a software engineer.
- Relevant experience implementing software in Python. Django knowledge is a plus.
- Experience using scraping frameworks. Knowledge of Scrapy would be a great advantage.
- Knowledge of and agility with low-level TCP/IP protocols (TLS, HTTP(S), SSL, etc.).
- Strong knowledge of the web environment (model, standards, DOM, request-response, cookies, JavaScript, browsers, headers, XHR, etc.).
- Ability to build well-documented and organized systems, following common coding conventions.
- Strong troubleshooting and debugging skills.
- Experience in Continuous Integration/Continuous Deployment (CI/CD) and related tooling.
- Familiarity with cloud platforms like AWS, GCP, or Azure. Experience with Docker is a plus.
- Experience with UNIX systems and scripting.
- SQL and some database administration knowledge.
- Git, as it is the version control system our whole company uses and it’s deeply integrated with our development process.

**Soft Skills**:

- Your job will require written and spoken communication in English.
- Collaborative skills and teamwork mindset. We work with people from different countries and time zones.
- Ability to work autonomously. We will be there to unblock you and help you with all your tasks at any time, but we expect you to do the heavy lifting by yourself.
- Analytical mindset: able to decompose complex problems and projects into manageable pieces, and comfortable suggesting and presenting solutions.

Centric Software provides equal employment opportunities to all qualified applicants without regard to race, sex, sexual orientation, gender identity, national origin, color, age, religion, protected veteran or disability status or genetic information.



