Data Engineer for Language Technologies
hace 2 días
**Job Reference**:
- 522_25_LS_LT_RE2
**Position**:
- Data Engineer for Language Technologies (RE2)
**Closing Date**:
- Sunday, 24 August, 2025
**Reference**: 522_25_LS_LT_RE2
**Job title**: Data Engineer for Language Technologies (RE2)
**About BSC**
- The Barcelona Supercomputing Center - Centro Nacional de Supercomputación (BSC-CNS) is the leading supercomputing center in Spain. It houses MareNostrum, one of the most powerful supercomputers in Europe, was a founding and hosting member of the former European HPC infrastructure PRACE (Partnership for Advanced Computing in Europe), and is now hosting entity for EuroHPC JU, the Joint Undertaking that leads large-scale investments and HPC provision in Europe. The mission of BSC is to research, develop and manage information technologies in order to facilitate scientific progress. BSC combines HPC service provision and R&D into both computer and computational science (life, earth and engineering sciences) under one roof, and currently has over 1000 staff from 60 countries.
Look at the BSC experience:
BSC-CNS YouTube Channel
Let's stay connected with BSC Folks
We promote Equity, Diversity and Inclusion, fostering an environment where each and every one of us is appreciated for who we are, regardless of our differences.
**Context And Mission**
- The Language Technologies Laboratory at BSC has consolidated experience in several NLP areas, such as massive language model building, biomedical text mining, machine translation and unsupervised learning for under-resourced languages and domains. It has been entrusted by the Spanish and the Catalan governments with the mission to develop fundamental open
- source resources and technologies for Spanish and Catalan. In connection with this, the LT Laboratory is currently in charge of two flagship projects at the national and regional level: the ALIA project, funded by the Spanish Secretariat of Digitalisation and Artificial Intelligence, and the AINA project, aimed at developing AI resources for Catalan, funded by the Catalan Digitalisation Department. In addition, the Laboratory participates in various EU funded international projects.
The researcher will implement innovative techniques for language modelling and evaluation in the HPC environment.
**Key Duties**
- Work, in collaboration with the group members, on the design and development of the solutions needed to achieve the goals of the group’s research projects.
- Interact with relevant stakeholders of the group’s research projects to understand their problems and the available data to formulate valuable solutions.
- Ensure the long-term acquisition, management and accessibility of language data through the design and implementation of scalable storage solutions and structured data systems, and processing tools.
- Collaborate with the members of the group in the generation and evaluation of language models using Deep Learning techniques (Transformers, Recurrent Neural Networks, and other neural network architectures).
**Requirements**:
- Education
- Degree in Applied Linguistics, Computer Science or related disciplines with a very strong linguistic background.
- Essential Knowledge and Professional Experience
- Native speaker of Spanish.
- Good knowledge of Python.
- Good knowledge of Linux.
- Knowledge of Deep Learning.
- Experience in Machine Learning techniques applied to NLP.
- Experience/ knowledge in corpus annotation and generation of linguistic resources.
- Understanding of data administration and management functions (transfer, storage, analysis, distribution, exploration, etc.).
- Research experience, with some publications related to language modeling and resources in a multilingual context.
- Additional Knowledge and Professional Experience
- Theoretical broad knowledge of AI techniques.
- Knowledge of HPC workload managers such as Slurm.
- Knowledge of Continuous Integration/Delivery/Deployment, including tools such as (or similar to) GitLab CI, Github, Docker and/or Ansible.
- Experience in machine learning and data mining including knowledge of PyTorch, Tensorflow, OpenCV, Pandas, Scikit-learn and/or Numpy.
- Basic Knowledge of GPU-based computing.
- Fluency in spoken and written English.
- Experience in web/data scraping.
- Expertise in building and maintaining data-curation pipelines.
- Competences
- Capacity to explore new research lines.
- Ability to work independently and collaboratively within multidisciplinary teams.
- Proactive, detail-oriented mindset, capable of problem-solving in complex data contexts.
- Good communication and presentation skills.
- Commitment to deadlines and quality research output
**Conditions**
- The position will be located at BSC within the Life Sciences Department
- We offer a full-time contract (37.5h/week), a good working environment, a highly stimulating environment with state-of-the-art infrastructure, flexible working hours, extensive training plan, restaurant tickets, private health insurance, su
-
Data Engineer for Language Technologies
hace 2 días
Barcelona, España Barcelona Supercomputing Center (BSC) A tiempo completo**Job Reference**: - 606_25_LS_LT_RE1 **Position**: - Data Engineer for Language Technologies (RE1) **Closing Date**: - Saturday, 18 October, 2025 **Reference**: 606_25_LS_LT_RE1 **Job title**: Data Engineer for Language Technologies (RE1) **About BSC** - The Barcelona Supercomputing Center - Centro Nacional de Supercomputación (BSC-CNS) is the...
-
Data Engineer for Language Technologies
hace 5 horas
Barcelona, España Somm Excellence Alliance A tiempo completoContext And Mission The Language Technologies (LT) Unit at BSC has a consolidated experience in several NLP areas, such as massive language model building, biomedical text mining, machine translation and unsupervised learning. It has been entrusted by the Spanish and the Catalan government with the mission to develop essential open-source resources and...
-
Deep Learning Engineer For Language Technologies
hace 2 semanas
Barcelona, Barcelona, España Barcelona Supercomputing Center A tiempo completoDeep Learning Engineer for Language Technologies (RE3)Apply for the Deep Learning Engineer for Language Technologies (RE3) role at Barcelona Supercomputing Center.Job Reference: 677_25_LS_LT_RE3Closing Date: Thursday, 27 November ****Location: Barcelona Supercomputing Center, Life Sciences Department.About BSCThe Barcelona Supercomputing Center (BSC-CNS) is...
-
Data Manager for Language Technologies
hace 5 horas
Barcelona, España Somm Excellence Alliance A tiempo completoContext And Mission The Language Technologies (LT) Unit at BSC has a consolidated experience in several NLP areas, such as massive language model building, biomedical text mining, machine translation and unsupervised learning. It has been entrusted by the Spanish and the Catalan government with the mission to develop essential open-source resources and...
-
Data Linguist for Language Technologies
hace 1 semana
Barcelona, España Somma A tiempo completo**Reference**: 281_25_LS_LT_RE1 **Job title**: Data Linguist for Language Technologies (RE1) **About BSC** The Barcelona Supercomputing Center - Centro Nacional de Supercomputación (BSC-CNS) is the leading supercomputing center in Spain. It houses MareNostrum, one of the most powerful supercomputers in Europe, and is now hosting entity for EuroHPC JU, the...
-
NLP ML Engineer for Language Technologies
hace 4 días
Barcelona, España EURAXESS Ireland A tiempo completoA top research institution in Barcelona seeks a Full-stack Machine Learning Engineer for Language Technologies. The successful candidate will design NLP applications using LLMs, with a focus on improving resource access for Iberian languages. Candidates should have significant experience in deep learning and NLP, and strong programming capabilities. This...
-
NLP ML Engineer for Language Technologies
hace 18 horas
Barcelona, España EURAXESS Ireland A tiempo completoA top research institution in Barcelona seeks a Full-stack Machine Learning Engineer for Language Technologies. The successful candidate will design NLP applications using LLMs, with a focus on improving resource access for Iberian languages. Candidates should have significant experience in deep learning and NLP, and strong programming capabilities. This...
-
Deep Learning Engineer for Language Technologies
hace 5 días
Barcelona, España Barcelona Supercomputing Center - Centro Nacional de Supercomputación A tiempo completo**Context And Mission** The Language Technologies Unit at BSC has a consolidated experience in several NLP areas, such as massive language model building, biomedical text mining, machine translation and unsupervised learning for under-resourced languages and domains. It has been entrusted by the Spanish and the Catalan government with the mission to develop...
-
Deep Learning Engineer for Language Technologies
hace 2 semanas
Barcelona, Barcelona, España Barcelona Supercomputing Center (BSC) A tiempo completoJob Reference442_25_LS_LT_RE2PositionDeep Learning Engineer for Language Technologies (RE2)Closing DateSunday, 30 November, 2025Reference: 442_25_LS_LT_RE2Job title: Deep Learning Engineer for Language Technologies (RE2)About BSCThe Barcelona Supercomputing Center - Centro Nacional de Supercomputación (BSC-CNS) is the leading supercomputing center in Spain....
-
Deep Learning Engineer for Language Technologies
hace 2 semanas
Barcelona, Barcelona, España Barcelona Supercomputing Center A tiempo completoJob Reference442_25_LS_LT_RE2PositionDeep Learning Engineer for Language Technologies (RE2)Closing DateSunday, 30 November, 2025Reference:442_25_LS_LT_RE2Job title:Deep Learning Engineer for Language Technologies (RE2)About BSCThe Barcelona Supercomputing Center - Centro Nacional de Supercomputación (BSC-CNS) is the leading supercomputing center in Spain....