Data Engineer for Language and Translation

2 weeks ago


Barcelona, Spain · Barcelona Supercomputing Center (BSC) · Full-time

**Job Reference**: 12_23_LS_TM_RE2

**Position**: Data Engineer for Language and Translation Technologies (RE2)

**Closing Date**: Tuesday, 28 February, 2023

**About BSC**
- The Barcelona Supercomputing Center - Centro Nacional de Supercomputación (BSC-CNS) is the leading supercomputing center in Spain. It houses MareNostrum, one of the most powerful supercomputers in Europe, and is a hosting member of the PRACE European distributed supercomputing infrastructure. The mission of BSC is to research, develop and manage information technologies in order to facilitate scientific progress. BSC combines HPC service provision and R&D into both computer and computational science (life, earth and engineering sciences) under one roof, and currently has over 770 staff from 55 countries.
- For this role, we are particularly interested in the strengths and lived experiences of women and underrepresented groups, to help us avoid perpetuating biases and oversights in science and IT research.

**Context And Mission**
- The Language Technologies (LT) Unit at BSC has consolidated experience in several NLP areas, such as massive language model building, biomedical text mining, machine translation and unsupervised learning. It has been entrusted by the Spanish and Catalan governments with the mission to develop essential open-source resources and technologies for Spanish and Catalan. In connection with this, the LT Unit is currently in charge of two flagship projects at the national and regional levels: the Spanish National Plan for the Advancement of Language Technology, funded by the Spanish Secretariat of Digitalisation and Artificial Intelligence, and the AINA project, aimed at developing AI resources for Catalan, funded by the Catalan Digitalisation Department. In addition, the Unit participates in various EU-funded international projects.
- The LT Unit at BSC is looking for a Data Engineer with experience in Natural Language Processing and/or Machine Translation.

**Key Duties**
- Collect language data as required by the projects carried out in the Unit.
- Prepare language data processing scripts to clean and prepare data to be ingested by the neural architectures.
- Automatically annotate data using state-of-the-art language processing tools.
- Manage corpora and language data according to the requirements specified in the Unit’s data management plan.
- Control the quality of collected data and metadata.
- Coordinate with machine learning engineers to determine data requirements.
- Write technical reports and project documentation in English, Spanish and Catalan.
- Prepare research proposals and write scientific papers.
- Coordinate external teams for data collection and data annotation.
- Ensure the applicability of open licenses to data sets, and resolve queries.

**Requirements**:

- Education
  - Degree in Applied Linguistics, Computer Science or related disciplines.
- Essential Knowledge and Professional Experience
  - Demonstrated experience of at least 3 years in the NLP, MT or speech processing fields.
  - Excellent understanding of data administration and management functions (transfer, storage, analysis, distribution, exploration, etc.).
  - Proven experience in working with large datasets and distributed file systems: SQL, databases and metadata management.
  - Proven experience in UNIX/Linux environments, scripting languages and Python.
  - Fluent in written and spoken English and Spanish.
- Additional Knowledge and Professional Experience
  - Demonstrated experience in developing open-source software and resources.
  - Fluent in written and spoken Catalan.
  - Strong understanding of linguistic concepts.
- Competences
  - Ability to work independently and in a team to complete tasks on schedule.
  - Ability to work under set deadlines.

**Conditions**
- The position will be located at BSC within the Life Sciences Department.
- We offer a full-time contract, a good working environment, a highly stimulating environment with state-of-the-art infrastructure, flexible working hours, an extensive training plan, restaurant tickets, private health insurance, and full support with relocation procedures.
- Duration: Open-ended contract due to technical and scientific activities linked to the project and budget duration.
- Starting date: ASAP

**Applications procedure and process**
- A full CV in English, including contact details.
- A cover letter with a statement of interest in English, including two contacts for further references. Applications without this document will not be considered.

At BSC we are seeking continuous improvement in our recruitment processes. For any suggestions, feedback or complaints about our recruitment processes, please contact recruitment [at] bsc [dot] es.



