AI Benchmarking Spec.

hace 1 semana


Madrid, España Amazon A tiempo completo

The Seller AI team within International Seller Services organization focuses on helping sellers with Gen-AI/LLM powered tools and agentic solutions that can enable them to accelerate business growth on Amazon. Our primary focus lies in handling annotations for training, measuring, and improving Artificial Intelligence (AI) and Large Language Models (LLMs), enabling Amazon to deliver a superior seller experience to our sellers worldwide. The AI Benchmarking Associate supports the evaluation of AI systems by designing and executing benchmarking and audit activities to assess model quality, compliance, robustness, and fairness. The role combines elements of AI auditing, quality assurance, and traditional audit-style documentation and stakeholder communication. By joining us, you will play a pivotal role in shaping the future of selling on Amazon for sellers worldwide.Location: Bengaluru, Karnataka, INDKey job responsibilitiesAssist in planning and executing benchmarking exercises for AI models, including defining test plans, metrics, and acceptance criteria across accuracy, robustness, bias, and reliability.Support content accuracy, relevancy, and privacy checks by reviewing datasets, model outputs, and data handling practices, escalating potential regulatory risks.Validate data based on specific annotation guidelines, ensuring the accuracy and quality of the collected information.Prepare clear audit and benchmarking reports, including error ratings, root‑cause analysis, and recommendations, and contribute to presentations for senior stakeholders.Maintain organized audit documentation, evidence, and benchmarking datasets to support internal review.Work closely with team members and managers to drive process efficiencies and explore opportunities for automation.Strive to enhance the productivity and effectiveness of the data generation by contributing to the development and continuous improvement of AI audit methodologies, checklists, and test frameworks as regulations and best practices evolve.Basic QualificationsSpeak, write, and read fluently in Spanish.Min. B2.2 or C1 certified in Spanish Language.Bachelor's degree or equivalent with 3+ years of equivalent experience.Preferred QualificationsMasters Degree or C1 certified in Spanish Language.Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.#J-18808-Ljbffr


  • AI Benchmarking

    hace 1 semana


    Madrid, España Amazon A tiempo completo

    A leading global e-commerce company is seeking an experienced AI Benchmarking Associate to support the evaluation of AI systems. The role involves planning benchmarking exercises, validating data accuracy, and preparing audit reports. Candidates must be fluent in Spanish and have a Bachelor's degree with at least 3 years of relevant experience. This position...

  • Machine Learning Engineer

    hace 2 semanas


    Madrid, España Code Talent A tiempo completo

    At Byte & Code, we are not just another consultancy.Maximice sus posibilidades de que su candidatura sea seleccionada asegurándose de que su CV y sus habilidades se ajustan al perfil.We are your career partner — connecting you to exceptional opportunities with no fuss and a premium experience.If you are passionate about Artificial Intelligence and Machine...


  • Madrid, España Code Talent A tiempo completo

    At Byte & Code, we are not just another consultancy.We are your career partner — connecting you to exceptional opportunities with no fuss and a premium experience. If you are passionate about Artificial Intelligence and Machine Learning, and want to work on challenging, high-impact projects that push the boundaries of research and applied AI, this...

  • Gen ai senior engineer

    hace 10 horas


    Madrid, España Knowmad Mood A tiempo completo

    En knowmad mood, nuestra comunidad Data está creciendo y buscamos un perfil innovador: un Senior Engineer con visión creativa y dominio en Inteligencia Artificial Generativa, listo para impulsar proyectos que marcan la diferencia. ¿Qué ofrecemos? Horario flexible : Adapta tu jornada a tu ritmo personal, coordinando con tu equipo para disfrutar de un...

  • Lead AI Engineer

    hace 2 semanas


    Madrid, España Lyfegen A tiempo completo

    Get AI-powered advice on this job and more exclusive features.Direct message the job poster from LyfegenDo you want to be part of a company that’s transforming healthcare?Lyfegen helps bridge the gap to faster, smarter access with pricing, access, and rebate management solutions. Our platform is used by health insurers, governments, hospital payers, and...

  • Lead AI Engineer

    hace 2 semanas


    Madrid, España Lyfegen A tiempo completo

    Get AI-powered advice on this job and more exclusive features.¿Tiene las cualificaciones y habilidades adecuadas para este trabajo? Descúbralo a continuación y pulse en "solicitar" para ser considerado.Direct message the job poster from LyfegenDo you want to be part of a company that’s transforming healthcare?Lyfegen helps bridge the gap to faster,...


  • Madrid, España DocuSketch A tiempo completo

    DocuSketch is a leading software solution provider for the restoration, property insurance, and real estate industries in North America. Our cutting-edge digital platforms enable restoration companies, adjusters, and insurers to efficiently document, manage, and process property claims. Driven by a loyal customer base, DocuSketch has become one of the...


  • Madrid, España DocuSketch A tiempo completo

    DocuSketch is a leading software solution provider for the restoration, property insurance, and real estate industries in North America. Our cutting-edge digital platforms enable restoration companies, adjusters, and insurers to efficiently document, manage, and process property claims. Driven by a loyal customer base, DocuSketch has become one of the...


  • Madrid, España Elastic A tiempo completo

    Elasticsearch - Senior Software Engineer (Performance Team)1 day ago Be among the first 25 applicants Elastic, the Search AI Company, enables everyone to find the answers they need in real time, using all their data, at scale — unleashing the potential of businesses and people. The Elastic Search AI Platform, used by more than 50% of the Fortune 500,...


  • Madrid, España DocuSketch A tiempo completo

    Docu Sketch is a leading software solution provider for the restoration, property insurance, and real estate industries in North America. Our cutting-edge digital platforms enable restoration companies, adjusters, and insurers to efficiently document, manage, and process property claims. Driven by a loyal customer base, Docu Sketch has become one of the...