![Barcelona Supercomputing Center (BSC)](https://media.trabajo.org/img/noimg.jpg)
Data Engineer for Language Technologies
hace 1 semana
Job Reference:
- 215_24_LS_LT_RE2
Position: - Data Engineer for Language Technologies (RE2)
Closing Date: - Friday, 31 May, 2024
Reference: 215_24_LS_LT_RE2
Job title: Data Engineer for Language Technologies (RE2)
About BSC - The Barcelona Supercomputing Center
- Centro Nacional de Supercomputación (BSC-CNS) is the leading supercomputing center in Spain. It houses MareNostrum, one of the most powerful supercomputers in Europe, was a founding and hosting member of the former European HPC infrastructure PRACE (Partnership for Advanced Computing in Europe), and is now hosting entity for EuroHPC JU, the Joint Undertaking that leads largescale investments and HPC provision in Europe. The mission of BSC is to research, develop and manage information technologies in order to facilitate scientific progress. BSC combines HPC service provision and R&D into both computer and computational science (life, earth and engineering sciences) under one roof, and currently has over 900 staff from 55 countries.
- Look at the BSC experience:
- BSC-CNS YouTube Channel
- Let's stay connected with BSC Folks
Context And Mission - The Language Technologies (LT) Unit at BSC has a consolidated experience in several NLP areas, such as massive language model building, biomedical text mining, machine translation and unsupervised learning. It has been entrusted by the Spanish and the Catalan government with the mission to develop essential opensource resources and technologies for Spanish and Catalan.
the Spanish National Plan for the Advancement of Language Technology, funded by the Spanish Secretariat of Digitalisation and Artificial Intelligence, and the AINA project, aimed at developing AI resources for Catalan, funded by the Catalan Digitalisation Department.
In addition, the Unit participates in various EU-funded international projects.- The Language Technologies Unit at BSC is seeking a Data Manager with experience in language technologies to lead the development of the largest curated Spanish language corpus. This corpus will be used to train reference foundational LLMs.
Key Duties - Identification of open/public data sources: Proactively identify and evaluate open and public data sources for the creation of extensive corpora in Spanish and coofficial languages. This includes scouting for datasets that are relevant to the group's research focus on language models, including translation, audio processing, and large language models (LLMs).
- Engagement with data providers: Act as the primary contact point for negotiations and communications with external data providers, including public entities, companies, and other research institutions. Establish and maintain relationships to secure access to valuable data resources.
- Data acquisition strategy design: Develop and implement strategies for the efficient acquisition of external data. This includes outlining procedures for data requests, licensing negotiations, and ensuring compliance with data privacy regulations.
- Data management and governance: Collaborate in data management protocols to ensure the integrity, confidentiality, and availability of data.
- Dissemination and engagement activities: Lead the dissemination of findings and datasets within the scientific community and beyond. This includes publishing data reports, contributing to academic papers, and presenting at conferences. Also, engage with the broader research community to foster collaborations and share best practices in data management.
- Manage corpora and language data according to the requirements specified in the Unit's data managemt.
- Control the quality of collected data and metadata.
- Compliance and ethics oversight: Ensure all data management activities comply with relevant laws, ethical standards, and best practices in data handling. This includes overseeing the ethical review of data sources and uses, as well as managing any data protection implications.
Requirements:
- Education
- Bachelor's Degree.
- Essential Knowledge and Professional Experience
- Proficiency in data management principles and techniques.
- Strong understanding of data acquisition strategies, including licensing negotiations and compliance with data privacy regulations.
- Knowledge of open/public data sources relevant to language models, translation, audio processing, and large language models (LLMs).
- Familiarity with data governance principles, including data integrity, confidentiality, and availability.
- Excellent communication and negotiation skills for engaging with external data providers and stakeholders.
- Experience in disseminating findings and datasets within the scientific community through reports, academic papers, and conference presentations.
- Strong attention to detail and ability to control the quality of collected data and metadata.
- Knowledge
-
Plaça De Data Engineer For Language Technologies
hace 1 semana
Barcelona, Barcelona, España Barcelona Supercomputing Center-Centro Nacional De Supercomputación (Bsc-Cns) A tiempo completoBarcelona Supercomputing Center-Centro Nacional de Supercomputación (BSC-CNS). 1 plaça de Data Engineer for Language Technologies (RE2). Concurs o valoració de mèrits. Laboral temporal Termini obert. A1 - Grau universitari (correspondència amb llicenciatures). Llicenciatura. Fluïdesa en català escrit i parlatVeure convocatòria- Contracte laboral...
-
Data Engineer For Language Technologies
hace 1 semana
Barcelona, Barcelona, España Barcelona Supercomputing Center A tiempo completoContext And MissionThe Language Technologies (LT) Unit at BSC has a consolidated experience in several NLP areas, such as massive language model building, biomedical text mining, machine translation and unsupervised learning. It has been entrusted by the Spanish and the Catalan government with the mission to develop essential open-source resources and...
-
Data Manager for Language Technologies
hace 1 semana
Barcelona, Barcelona, España Barcelona Supercomputing Center (BSC) A tiempo completoJob Reference: 216_24_LS_LT_RE3Position: Data Manager for Language Technologies (RE3)Closing Date:Friday, 31 May, 2024Reference: 216_24_LS_LT_RE3Job title: Data Manager for Language Technologies (RE3)About BSC The Barcelona Supercomputing Center Centro Nacional de Supercomputación (BSC-CNS) is the leading supercomputing center in Spain. It houses...
-
Plaça de Data Engineer for Language and
hace 1 semana
Barcelona, Barcelona, España Barcelona Supercomputing Center-Centro Nacional de Supercomputación (BSC-CNS) A tiempo completoBarcelona Supercomputing Center-Centro Nacional de Supercomputación (BSC-CNS). 1 plaça de Data Engineer for Language and Translation Technologies (RE2). Concurs o valoració de mèrits. Laboral temporal Termini obert. A - Grau universitari. Grau en lingüística aplicada, informàtica o disciplines afins. Domini de l'anglès, el castellà i el català...
-
Engineer for Language Technologies and Social Media
hace 1 semana
Barcelona, Barcelona, España Barcelona Supercomputing Center - Centro Nacional de Supercomputación A tiempo completoThe Language Technologies Unit at BSC has a consolidated experience in several NLP areas, such as massive language model building, biomedical text mining, Machine translation and unsupervised learning for under-resourced languages and domains. It has been entrusted by the Spanish and the Catalan government with the mission to develop fundamental open-source...
-
Data Engineer for Language and Translation
hace 1 semana
Barcelona, Barcelona, España Barcelona Supercomputing Center (BSC) A tiempo completoJob Reference: 9_23_LS_TM_RE1Position: Data Engineer for Language and Translation Technologies (RE1)Closing Date:Tuesday, 28 February, 2023Reference: 9_23_LS_TM_RE1Job title: Data Engineer for Language and Translation Technologies (RE1)About BSC The Barcelona Supercomputing Center Centro Nacional de Supercomputación (BSC-CNS) is the leading supercomputing...
-
Data Engineer for Language and Translation
hace 1 semana
Barcelona, Barcelona, España Barcelona Supercomputing Center (BSC) A tiempo completoJob Reference: 234_24_LS_LT_RE2Position: Data Engineer for Language and Translation Technologies (RE2)Closing Date:Friday, 17 May, 2024Reference: 234_24_LS_LT_RE2Job title: Data Engineer for Language and Translation Technologies (RE2)About BSC The Barcelona Supercomputing Center Centro Nacional de Supercomputación (BSC-CNS) is the leading supercomputing...
-
Plaça De Data Manager For Language Technologies
hace 1 semana
Barcelona, Barcelona, España Barcelona Supercomputing Center-Centro Nacional De Supercomputación (Bsc-Cns) A tiempo completoBarcelona Supercomputing Center-Centro Nacional de Supercomputación (BSC-CNS). 1 plaça de Data Manager for Language Technologies (RE3). Concurs o valoració de mèrits. Laboral temporal Termini obert. A1 - Grau universitari (correspondència amb llicenciatures). Llicenciatura en Informàtica, Sistemes d'Informació, Lingüística amb enfocament...
-
Deep Learning Engineer for Language Technologies
hace 1 semana
Barcelona, Barcelona, España Barcelona Supercomputing Center (BSC) A tiempo completoJob Reference: 14_23_LS_TM_RE2Position: Deep Learning Engineer for Language Technologies (RE2)Closing Date:Tuesday, 28 February, 2023Reference: 14_23_LS_TM_RE2Job title: Deep Learning Engineer for Language Technologies (RE2)About BSC The Barcelona Supercomputing Center Centro Nacional de Supercomputación (BSC-CNS) is the leading supercomputing center in...
-
Deep Learning Engineer for Language Technologies
hace 1 semana
Barcelona, Barcelona, España Barcelona Supercomputing Center (BSC) A tiempo completoJob Reference: 230_24_LS_LT_RE2Position: Deep Learning Engineer for Language Technologies (RE2)Closing Date:Friday, 17 May, 2024Reference: 230_24_LS_LT_RE2Job title: Deep Learning Engineer for Language Technologies (RE2)About BSC The Barcelona Supercomputing Center Centro Nacional de Supercomputación (BSC-CNS) is the leading supercomputing center in Spain....
-
Data Engineer for Language and Translation
hace 1 semana
Barcelona, Barcelona, España Barcelona Supercomputing Center - Centro Nacional de Supercomputación A tiempo completo**Context And MissionThe Language Technologies (LT) Unit at BSC has a consolidated experience in several NLP areas, such as massive language model building, biomedical text mining, machine translation and unsupervised learning. It has been entrusted by the Spanish and the Catalan government with the mission to develop essential open-source resources and...
-
Deep Learning Engineer for Language Technologies
hace 1 semana
Barcelona, Barcelona, España Barcelona Supercomputing Center - Centro Nacional de Supercomputación A tiempo completoContext And MissionThe Language Technologies Unit at BSC has a consolidated experience in several NLP areas, such as massive language model building, biomedical text mining, machine translation and unsupervised learning for under-resourced languages and domains. It has been entrusted by the Spanish and the Catalan government with the mission to develop...
-
Deep Learning Engineer for Language Technologies
hace 1 semana
Barcelona, Barcelona, España Barcelona Supercomputing Center A tiempo completoContext And Mission The Language Technologies Unit at BSC has a consolidated experience in several NLP areas, such as massive language model building, biomedical text mining, machine translation and unsupervised learning for under-resourced languages and domains. It has been entrusted by the Spanish and the Catalan government to develop fundamental...
-
ML developer for Language Technologies
hace 1 semana
Barcelona, Barcelona, España Barcelona Supercomputing Center A tiempo completoContext And Mission The Language Technologies Unit at BSC has a consolidated experience in several NLP areas, such as massive language model building, biomedical text mining, machine translation and unsupervised learning for under-resourced languages and domains. It has been entrusted by the Spanish and the Catalan government to develop fundamental...
-
Senior Data Engineer, Hibrido
hace 1 semana
Barcelona, Barcelona, España OXIGENT Technologies A tiempo completoSenior Data Engineer en hibrido.Would you be interested in working as a Senior Data Engineer in a leading company in the retail sector using cutting-edge technology tools?From Oxigent Technologies we are looking for a SENIOR DATA ENGINEER to participate in Data-related projects located in Barcelona.What would you be doing? Interact with Big Data Architects...
-
Plaça de Deep Learning Engineer for Language
hace 1 semana
Barcelona, Barcelona, España Barcelona Supercomputing Center-Centro Nacional de Supercomputación (BSC-CNS) A tiempo completoBarcelona Supercomputing Center-Centro Nacional de Supercomputación (BSC-CNS). 1 plaça de Deep Learning Engineer for Language Technologies (RE2). Concurs o valoració de mèrits. Laboral temporal Termini obert. A1 - Grau universitari (correspondència amb llicenciatures). Llicenciat en Informàtica, Telecomunicacions, Lingüística Aplicada o disciplines...
-
Data Engineer
hace 1 semana
Barcelona, Barcelona, España OXIGENT Technologies A tiempo completoData Engineer (80% Remoto) en hibrido.¿Te interesaría seguir desarrollándote como Ingeniero/a de Data en una empresa líder del sector transportes y turismo en un entorno colaborativo con jerarquía horizontal y con proyección a futuro ubicada en el Baix Llobregat?Desde Oxigent Technologies seleccionamos un/a DATA ENGINEER para formar parte de un equipo...
-
Data Engineer
hace 3 semanas
Barcelona, Barcelona, España OXIGENT Technologies A tiempo completo¿Te interesaría seguir desarrollándote como Ingeniero/a de Data en una empresa líder del sector transportes y turismo en un entorno colaborativo con jerarquía horizontal y con proyección a futuro, ubicada en el Baix Llobregat?Desde Oxigent Technologies seleccionamos un/a DATA ENGINEER para formar parte de un equipo de profesionales cuya misión será...
-
Data Engineer
hace 2 meses
Barcelona, Barcelona, España OXIGENT Technologies A tiempo completo¿Te interesaría seguir desarrollándote como Ingeniero/a de Data en una empresa líder del sector transportes y turismo en un entorno colaborativo con jerarquía horizontal y con proyección a futuro, ubicada en el Baix Llobregat?Desde Oxigent Technologies seleccionamos un/a DATA ENGINEER para formar parte de un equipo de profesionales cuya misión será...
-
Deep Learning Engineer for Speech Technologies
hace 1 semana
Barcelona, Barcelona, España Barcelona Supercomputing Center (BSC) A tiempo completoJob Reference: 30_23_LS_TM_RE1Position: Deep Learning Engineer for Speech Technologies (RE1)Closing Date:Thursday, 16 March, 2023Reference: 30_23_LS_TM_RE1Job title: Deep Learning Engineer for Speech Technologies (RE1)About BSC The Barcelona Supercomputing Center Centro Nacional de Supercomputación (BSC-CNS) is the leading supercomputing center in Spain. It...