AI Safety Engineer

hace 16 horas

Barcelona, España PepsiCo A tiempo completo

We are seeking a Senior Data Scientist/AI Engineer specializing in AI Safety to lead adversarial testing, risk assessment, and safety evaluations for LLM- and agent-powered chatbot systems. This role focuses on ensuring that AI technologies are safe, reliable, and aligned with business and user needs across high impact use cases. You will join a collaborative interdisciplinary team to design, evaluate, and harden AI/ML systems against misuse, failures, and emerging risks. You will work closely with product owners, engineering teams, and business stakeholders to identify safety requirements, conduct adversarial assessments, and develop robust mitigation strategies. This role is highly technical and safety critical, with broad visibility and influence across the organization. RESPONSABILITIES AI Safety, Robustness & Risk Assessment Lead adversarial testing, including jailbreak attempts, prompt injection, harmful content generation, system prompt extraction, and agent tool misuse. Conduct end to end risk assessments for AI driven chatbots and autonomous agent systems, identifying hazards, evaluating exposure, and defining mitigation strategies. Build and maintain AI safety evaluation pipelines, including red team test suites, scenario-based evaluations, and automated stress testing. Define and monitor safety KPIs such as harmful output rates, robustness scores, and model resilience metrics. Analyze failure modes (e.g., hallucinations, deceptive reasoning, unsafe tool execution) and design guardrails to minimize risks. Technical Development & Collaboration Develop reproducible experiments for LLM behavior analysis, including prompt engineering, control mechanisms, and guardrail testing. Partner with data engineers and MLOps teams to integrate safety evaluations into CI/CD pipelines. Work with product teams to translate safety requirements into actionable technical specifications. Support model governance, including documentation, safety reports, and compliance with internal and external standards. Contribute to innovation and research around emerging safety methodologies for LLMs and agent architectures. Knowledge Sharing & Leadership Serve as an internal expert on AI safety best practices, adversarial testing methodologies, and robust system design. Provide guidance and mentorship to data scientists, engineers, and product partners on safe AI development. Create high-quality documentation, playbooks, and reusable tools for safety evaluations. QUALIFICATIONS Master's degree in Computer Science, Data Science, Machine Learning, or related quantitative field. 4+ years of experience developing or evaluating machine learning systems, including LLM- or NLP-based applications. Strong knowledge of Generative AI and Transformer-based models. Experience with at least one deep learning framework (PyTorch, TensorFlow). Proficiency with Python and common data/ML libraries. Experience conducting model evaluations, experimentation, or reliability testing. Clear communication skills and the ability to translate technical findings into business relevant insights. Preferred Qualifications Experience with adversarial ML, red teaming, or AI safety research. Familiarity with safety testing frameworks such as automated red-teamers, harmful content classifiers, or jailbreak detection systems. Hands-on experience with LLM agents, tool-use orchestration, or autonomous systems. Knowledge of risk management frameworks (e.g., NIST AI RMF, ISO 42001) and Responsible AI principles. Experience designing safety guardrails, moderation layers, or policy enforcement mechanisms. Background in reinforcement learning or agent evaluation. Experience with cloud platforms (AWS, Azure, GCP) and MLOps workflows.

AI Safety Engineer

hace 1 día

Barcelona, España PepsiCo A tiempo completo

We are seeking aSenior Data Scientist/AI Engineer specializing in AI Safetyto lead adversarial testing, risk assessment, and safety evaluations for LLM- and agent-powered chatbot systems. This role focuses on ensuring that AI technologies are safe, reliable, and aligned with business and user needs across high impact use cases.You will join a collaborative...
AI Safety Engineer

hace 24 horas

Barcelona, España Pepsico A tiempo completo

We are seeking a Senior Data Scientist/AI Engineer specializing in AI Safety to lead adversarial testing, risk assessment, and safety evaluations for LLM- and agent-powered chatbot systems. This role focuses on ensuring that AI technologies are safe, reliable, and aligned with business and user needs across high impact use cases.You will join a collaborative...
Senior AI Safety Engineer

hace 1 semana

Barcelona, España Pepsico A tiempo completo

A multinational food and beverage company in Spain is seeking a Senior Data Scientist/AI Engineer specializing in AI Safety. This role involves leading adversarial testing and risk assessment for AI technologies, ensuring safety and reliability across high-impact use cases. The ideal candidate has a Master's degree in a related field, over four years of...
Remote Backend Engineer – AI Safety

hace 2 semanas

barcelona, España Alinia AI, Inc. A tiempo completo

A cutting-edge AI technology firm in Barcelona seeks a Back-End Engineer to shape infrastructure for generative AI. You will maintain and secure a Python FastAPI backend, design scalable APIs, and implement best practices for security and compliance. Ideal candidates have 3+ years of experience and strong skills in Python, FastAPI, and cloud platforms. The...
Remote Staff AI Engineer: Lead AI Safety

hace 1 semana

barcelona, España Galtea A tiempo completo

A growing AI technology company in Barcelona is looking for a Staff AI Engineer to lead the technical vision for AI Safety features. The role involves architecting core AI services, mentoring engineers, and collaborating on cutting-edge research. The ideal candidate has 6+ years of experience in software development, strong knowledge in Machine Learning, and...
Director AI Safety

hace 2 semanas

Barcelona, España Openchip & Software Technologies A tiempo completo

The Director of AI Safety at Openchip will be responsible for ensuring that our AI systems are secure, ethical, and aligned with industry safety standards. This role demands a vertically integrated approach, addressing AI security from high-level applications down to low-level implementation. The ideal candidate will lead the development of safety...
Director AI Safety

hace 1 día

Barcelona, España OPENCHIP & SOFTWARE TECHNOLOGIES A tiempo completo

The Director of AI Safety at Openchip will be responsible for ensuring that our AI systems are secure, ethical, and aligned with industry safety standards. This role demands a vertically integrated approach, addressing AI security from high-level applications down to low-level implementation. The ideal candidate will lead the development of safety...
AI Engineer

hace 2 semanas

barcelona, España Quadrivia AI A tiempo completo

Join to apply for the AI Engineer role at Quadrivia AI Own and evolve the core “brain” service that powers Qu. Design, build, and operate multi-agent LLM systems that communicate in real time over text and voice. Ship fast Python services with FastAPI, keep latency low, quality high, and evaluation continuous. What You’ll Do Own Qu’s brain service...
AI Engineer

hace 3 semanas

Barcelona, España PepsiCo A tiempo completo

We are seeking a Senior Data Scientist/AI Engineer specializing in AI Safety to lead adversarial testing, risk assessment, and safety evaluations for LLM- and agent-powered chatbot systems. This role focuses on ensuring that AI technologies are safe, reliable, and aligned with business and user needs across high impact use cases.You will join a collaborative...
AI Engineer

hace 4 días

Barcelona, España The BIG Jobsite A tiempo completo

We are seeking a Senior Data Scientist/AI Engineer specializing in AI Safety to lead adversarial testing, risk assessment, and safety evaluations for LLM- and agent-powered chatbot systems. This role focuses on ensuring that AI technologies are safe, reliable, and aligned with business and user needs across high impact use cases.You will join a collaborative...

América

Europa

Asia / Oceanía

África

AI Safety Engineer