Vector Data Engineer
hace 2 semanas
At Johnson & Johnson, we believe health is everything. Our strength in healthcare innovation empowers us to build a world where complex diseases are prevented, treated, and cured, where treatments are smarter and less invasive, and solutions are personal. Through our expertise in Innovative Medicine and MedTech, we are uniquely positioned to innovate across the full spectrum of healthcare solutions today to deliver the breakthroughs of tomorrow, and profoundly impact health for humanity. Learn more at
Job Function
Data Analytics & Computational Sciences
Job Sub Function
Data Science
Job Category
Scientific/Technology
All Job Posting Locations:
Cornellà de Llobregat, Barcelona, Spain, Madrid, Spain
Job Description
Johnson and Johnson Innovative Medicine (J&J IM), a pharmaceutical company of Johnson & Johnson is recruiting for a Vector Data Engineer. This position has a primary location of Barcelona, Spain. The secondary location is Madrid. This is a hybrid role.
Our expertise in Innovative Medicine is informed and inspired by patients, whose insights fuel our science-based advancements. Visionaries like you work in teams that save lives by developing the medicines of tomorrow.
Join us in developing treatments, finding cures, and pioneering the path from lab to life while championing patients every step of the way. Learn more at
Position Summary:
The Vector Data Engineer designs and implements the embedding and semantic-search infrastructure that connects discovery, translational, and clinical data into AI-ready knowledge representations.
This role bridges multi-omics data engineering and machine-learning infrastructure, enabling scientists and agentic tools to discover biological insights through vector-based search and reasoning.
Key Responsibilities:
- Develop scalable pipelines that convert multi-omics and clinical data (e.g., proteomics, transcriptomics, spatial omics, biomarkers) into vectorized embeddings for AI and semantic retrieval.
- Build and maintain vector databases and hybrid data stores using technologies such as TileDB, Weaviate, or Snowflake Cortex.
- Collaborate with the Data Transformation Engineers to design standardized data formats suitable for embedding generation and cross-modality mapping.
- Integrate metadata, ontology terms, and provenance into vector representations to ensure traceability and governance compliance.
- Partner with AI/ML Team to deploy embeddings supporting agentic reasoning, semantic similarity, and cross-dataset query.
- Optimize indexing, retrieval, and inference performance across large-scale multi-omics data collections.
- Evaluate and incorporate emerging representation-learning and knowledge-graph techniques to improve data discoverability and model interoperability.
Qualifications
- MS/PhD in Computer Science, Computational Biology, Data Science, or related field.
- 3+ years of experience building or maintaining vector or semantic-retrieval infrastructure.
- Hands-on experience with multi-omics or biomedical data integration (e.g., RNA-seq, proteomics, clinical endpoints).
- Proficiency in Python and frameworks such as LangChain, Transformers, or sentence-embedding models.
- Familiarity with TileDB, Snowflake, Weaviate, FAISS, or other vector/array database systems.
- Understanding of metadata modeling, ontologies (e.g., OBO, UMLS), and FAIR data practices.
- Strong ability to collaborate across solution architecture, data science, and AI/ML teams.
Strategic Impact:
- Multi-omics and clinical data assets transformed into interoperable, vectorized embeddings supporting scientific AI applications.
- AI can perform semantic queries and reasoning over governed datasets.
- Vector database infrastructure scales efficiently and complies with governance and lineage standards.
- Accelerated insight generation across discovery, translational, and clinical domains.
#JRDDS
Required Skills
Preferred Skills:
Advanced Analytics, Business Intelligence (BI), Coaching, Collaborating, Critical Thinking, Data Analysis, Database Management, Data Privacy Standards, Data Reporting, Data Savvy, Data Science, Data Visualization, Econometric Models, Process Improvements, Technical Credibility, Technologically Savvy, Workflow Analysis
-
Vector Data Engineer
hace 2 semanas
Madrid, Madrid, España Johnson & Johnson Innovative Medicine A tiempo completo 80.000 € - 120.000 € al añoAt Johnson & Johnson, we believe health is everything. Our strength in healthcare innovation empowers us to build a world where complex diseases are prevented, treated, and cured, where treatments are smarter and less invasive, and solutions are personal. Through our expertise in Innovative Medicine and MedTech, we are uniquely positioned to...
-
Data Engineer
hace 2 semanas
Madrid, Madrid, España RockStar Data A tiempo completo 60.000 € - 80.000 € al año¿Eres un apasionado de los datos y te encanta transformar información en oportunidades?En Rockstar Data estamos construyendo la primera plataforma de Business Intelligence y AI Predictiva diseñada específicamente para el ocio nocturno y la restauración.Donde antes había intuición, ahora hay datos. Y donde antes había hojas de Excel, ahora hay...
-
Data Engineer – Azure Databricks
hace 4 días
Madrid, Madrid, España EDB - EXPORTADORA DATA BASE, S.A. A tiempo completo 40.000 € - 80.000 € al añoEn Exportadora Data Base buscamos un Data Engineer con experiencia en el diseño de arquitecturas de datos en Azure Databricks, para colaborar en un proyecto dentro del área de ingeniería de datos.Se trata de un proyecto temporal de 9 meses.Responsabilidades principalesDiseño de arquitecturas de datos en Azure Databricks, incluyendo modelado de datos,...
-
Data Engineer
hace 4 días
Madrid, Madrid, España Quanteam UK A tiempo completo 60.000 € - 120.000 € al añoRole:Data Engineer / AI SpecialistLocation:Madrid, SpainOn-site workingFull time workingOverview:We are seeking aData Engineer / AI Specialistto support the development, implementation, and monitoring of AI-driven and automated solutions. The role involves optimising data pipelines, ensuring data integrity and compliance, and contributing to the integration...
-
Data Engineer
hace 2 semanas
Madrid, Madrid, España ADEREN A tiempo completo 45.000 € - 55.000 € al añoBUSCAMOS: Data Engineer (Java/Spark) Para importante empresa sector TIC, buscamos un profesional con experiencia contrastada desempeñando el role de Data EngineerFunciones & Tareas: ■ Diseñar y desarrollar pipelines de datos eficientes y escalables.■ Gestionar y optimizar la infraestructura de datos (seguridad, rendimiento, escalabilidad).■...
-
Senior Data Engineer
hace 4 días
Madrid, Madrid, España Awin A tiempo completo 70.000 € - 110.000 € al añoPurpose of positionAs a Senior Data Engineer, you will play a pivotal role in our AI/ML workstream, you'll work closely with business teams and data scientists to design, maintain, and improve machine learning applications. Your main responsibilities will include managing existing ML workloads and building new batch and on-demand pipelines to support...
-
Senior GenAI Engineer
hace 4 días
Madrid, Madrid, España Ultra Tendency A tiempo completo 60.000 € - 120.000 € al añoOur Engineering community is growing, and we're now looking for a Senior GenAI Engineer - Databricks (m/f/*) to join our team in Spain, supporting our global growth. As a Senior GenAI Engineer (m/f/*), you will lead the design and implementation of advanced data and AI solutions on the Databricks Lakehouse Platform. Your focus will lie in building robust...
-
Data Engineer Snowflake
hace 1 semana
Madrid, Madrid, España Logicalis Group A tiempo completo 40.000 € - 60.000 € al añoEn Logicalis Spain estamos buscando un perfil Data Engineer especializado en Snowflake para incorporarse en nuestra área interna de Delivery en un equipo multidisciplinar de consultores que trabajan por squads dentro de nuestra unidad de negocio de Data & Analytics ubicada en las oficinas de Madrid o Barcelona. El equipo de Data & Analytics de Logicalis...
-
Data Scientist
hace 4 días
Madrid, Madrid, España Trust In SODA A tiempo completoAI Data Engineer – Quantum & AI Deep Tech Leader | Spain (Hybrid)Join one of Europe's most excitingdeep-tech pioneers— a company at the intersection ofquantum and artificial intelligence, backed by major global investors and strong EU support. Our breakthrough technology compresses large language models by up to95% without accuracy loss, cutting...
-
Data Engineer
hace 2 semanas
Madrid, Madrid, España IT Partner A tiempo completo 40.000 € - 60.000 € al añoUbicaciones: Madrid, Barcelona, Jaén, Granada, Córdoba, Málaga, Sevilla o Almería (posibilidad de perfiles deslocalizados) Modalidad híbrida:Andalucía: 4 días en oficina + 1 de teletrabajo (viernes)Madrid y Barcelona: 3 días en oficina + 2 de teletrabajo Descripción del puestoBuscamos un Data Engineer con entre 3 y 5 años de experiencia (aunque...