AI Safety Engineer

hace 16 horas


Barcelona, España PepsiCo A tiempo completo

We are seeking a Senior Data Scientist/AI Engineer specializing in AI Safety to lead adversarial testing, risk assessment, and safety evaluations for LLM- and agent-powered chatbot systems. This role focuses on ensuring that AI technologies are safe, reliable, and aligned with business and user needs across high impact use cases. You will join a collaborative interdisciplinary team to design, evaluate, and harden AI/ML systems against misuse, failures, and emerging risks. You will work closely with product owners, engineering teams, and business stakeholders to identify safety requirements, conduct adversarial assessments, and develop robust mitigation strategies. This role is highly technical and safety critical, with broad visibility and influence across the organization. RESPONSABILITIES AI Safety, Robustness & Risk Assessment Lead adversarial testing, including jailbreak attempts, prompt injection, harmful content generation, system prompt extraction, and agent tool misuse. Conduct end to end risk assessments for AI driven chatbots and autonomous agent systems, identifying hazards, evaluating exposure, and defining mitigation strategies. Build and maintain AI safety evaluation pipelines, including red team test suites, scenario-based evaluations, and automated stress testing. Define and monitor safety KPIs such as harmful output rates, robustness scores, and model resilience metrics. Analyze failure modes (e.g., hallucinations, deceptive reasoning, unsafe tool execution) and design guardrails to minimize risks. Technical Development & Collaboration Develop reproducible experiments for LLM behavior analysis, including prompt engineering, control mechanisms, and guardrail testing. Partner with data engineers and MLOps teams to integrate safety evaluations into CI/CD pipelines. Work with product teams to translate safety requirements into actionable technical specifications. Support model governance, including documentation, safety reports, and compliance with internal and external standards. Contribute to innovation and research around emerging safety methodologies for LLMs and agent architectures. Knowledge Sharing & Leadership Serve as an internal expert on AI safety best practices, adversarial testing methodologies, and robust system design. Provide guidance and mentorship to data scientists, engineers, and product partners on safe AI development. Create high-quality documentation, playbooks, and reusable tools for safety evaluations. QUALIFICATIONS Master's degree in Computer Science, Data Science, Machine Learning, or related quantitative field. 4+ years of experience developing or evaluating machine learning systems, including LLM- or NLP-based applications. Strong knowledge of Generative AI and Transformer-based models. Experience with at least one deep learning framework (PyTorch, TensorFlow). Proficiency with Python and common data/ML libraries. Experience conducting model evaluations, experimentation, or reliability testing. Clear communication skills and the ability to translate technical findings into business relevant insights. Preferred Qualifications Experience with adversarial ML, red teaming, or AI safety research. Familiarity with safety testing frameworks such as automated red-teamers, harmful content classifiers, or jailbreak detection systems. Hands-on experience with LLM agents, tool-use orchestration, or autonomous systems. Knowledge of risk management frameworks (e.g., NIST AI RMF, ISO 42001) and Responsible AI principles. Experience designing safety guardrails, moderation layers, or policy enforcement mechanisms. Background in reinforcement learning or agent evaluation. Experience with cloud platforms (AWS, Azure, GCP) and MLOps workflows.


  • AI Safety Engineer

    hace 1 día


    Barcelona, España PepsiCo A tiempo completo

    We are seeking aSenior Data Scientist/AI Engineer specializing in AI Safetyto lead adversarial testing, risk assessment, and safety evaluations for LLM- and agent-powered chatbot systems. This role focuses on ensuring that AI technologies are safe, reliable, and aligned with business and user needs across high impact use cases.You will join a collaborative...

  • AI Safety Engineer

    hace 24 horas


    Barcelona, España Pepsico A tiempo completo

    We are seeking a Senior Data Scientist/AI Engineer specializing in AI Safety to lead adversarial testing, risk assessment, and safety evaluations for LLM- and agent-powered chatbot systems. This role focuses on ensuring that AI technologies are safe, reliable, and aligned with business and user needs across high impact use cases.You will join a collaborative...


  • Barcelona, España Pepsico A tiempo completo

    A multinational food and beverage company in Spain is seeking a Senior Data Scientist/AI Engineer specializing in AI Safety. This role involves leading adversarial testing and risk assessment for AI technologies, ensuring safety and reliability across high-impact use cases. The ideal candidate has a Master's degree in a related field, over four years of...


  • barcelona, España Alinia AI, Inc. A tiempo completo

    A cutting-edge AI technology firm in Barcelona seeks a Back-End Engineer to shape infrastructure for generative AI. You will maintain and secure a Python FastAPI backend, design scalable APIs, and implement best practices for security and compliance. Ideal candidates have 3+ years of experience and strong skills in Python, FastAPI, and cloud platforms. The...


  • barcelona, España Galtea A tiempo completo

    A growing AI technology company in Barcelona is looking for a Staff AI Engineer to lead the technical vision for AI Safety features. The role involves architecting core AI services, mentoring engineers, and collaborating on cutting-edge research. The ideal candidate has 6+ years of experience in software development, strong knowledge in Machine Learning, and...

  • Director AI Safety

    hace 2 semanas


    Barcelona, España Openchip & Software Technologies A tiempo completo

    The Director of AI Safety at Openchip will be responsible for ensuring that our AI systems are secure, ethical, and aligned with industry safety standards. This role demands a vertically integrated approach, addressing AI security from high-level applications down to low-level implementation. The ideal candidate will lead the development of safety...

  • Director AI Safety

    hace 1 día


    Barcelona, España OPENCHIP & SOFTWARE TECHNOLOGIES A tiempo completo

    The Director of AI Safety at Openchip will be responsible for ensuring that our AI systems are secure, ethical, and aligned with industry safety standards. This role demands a vertically integrated approach, addressing AI security from high-level applications down to low-level implementation. The ideal candidate will lead the development of safety...

  • AI Engineer

    hace 2 semanas


    barcelona, España Quadrivia AI A tiempo completo

    Join to apply for the AI Engineer role at Quadrivia AI Own and evolve the core “brain” service that powers Qu. Design, build, and operate multi-agent LLM systems that communicate in real time over text and voice. Ship fast Python services with FastAPI, keep latency low, quality high, and evaluation continuous. What You’ll Do Own Qu’s brain service...

  • AI Engineer

    hace 3 semanas


    Barcelona, España PepsiCo A tiempo completo

    We are seeking a Senior Data Scientist/AI Engineer specializing in AI Safety to lead adversarial testing, risk assessment, and safety evaluations for LLM- and agent-powered chatbot systems. This role focuses on ensuring that AI technologies are safe, reliable, and aligned with business and user needs across high impact use cases.You will join a collaborative...

  • AI Engineer

    hace 4 días


    Barcelona, España The BIG Jobsite A tiempo completo

    We are seeking a Senior Data Scientist/AI Engineer specializing in AI Safety to lead adversarial testing, risk assessment, and safety evaluations for LLM- and agent-powered chatbot systems. This role focuses on ensuring that AI technologies are safe, reliable, and aligned with business and user needs across high impact use cases.You will join a collaborative...