MCP & Tools Python Developer - Agent Evaluation Infrastructure

2 weeks ago


Madrid, Madrid, Spain · Mindrift · Full-time

This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.

At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. 

What we do

The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe.

Who we're looking for

Calling all security researchers, engineers, and penetration testers with a strong foundation in problem-solving, offensive security, and AI-related risk assessment.

If you thrive on digging into complex systems, uncovering hidden vulnerabilities, and thinking creatively under constraints, join us.

We're looking for someone who can bring a hands-on approach to technical challenges, whether breaking into systems to expose weaknesses or building secure tools and processes. We value contributors with a passion for continuous learning, experimentation, and adaptability. 

About the project

We're on the hunt for hands-on Python engineers for a new project focused on developing Model Context Protocol (MCP) servers and internal tools for running and evaluating agent behavior. You'll implement base methods for agent action verification, integrate with internal and client infrastructures, and help fill tooling gaps across the team.

What you'll be doing:

  • Developing and maintaining MCP-compatible evaluation servers
  • Implementing logic to check agent actions against scenario definitions
  • Creating or extending tools that writers and QAs use to test agents
  • Working closely with infrastructure engineers to ensure compatibility
  • Occasionally helping with test writing or debug sessions when needed
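
To give a feel for the second task above, here is a minimal sketch of checking agent actions against a scenario definition. It is illustrative only: the `Action` and `Scenario` classes, their fields, and the `verify_actions` function are hypothetical names chosen for this example, not the project's actual schema or API.

```python
from dataclasses import dataclass, field


@dataclass
class Action:
    """One tool call made by the agent (hypothetical shape)."""
    tool: str
    args: dict = field(default_factory=dict)


@dataclass
class Scenario:
    """Hypothetical scenario definition: tools that must be called in this
    order, plus tools that must never be called."""
    expected_tools: list
    forbidden_tools: set = field(default_factory=set)


def verify_actions(scenario, actions):
    """Return (passed, problems) for a recorded list of agent actions."""
    problems = []
    called = [action.tool for action in actions]

    # Any call to a forbidden tool is a failure.
    for tool in called:
        if tool in scenario.forbidden_tools:
            problems.append(f"forbidden tool called: {tool}")

    # Expected tools must appear in order; other calls may sit in between.
    remaining = iter(called)
    for expected in scenario.expected_tools:
        if not any(tool == expected for tool in remaining):
            problems.append(f"missing expected tool: {expected}")
            break

    return (not problems, problems)


scenario = Scenario(expected_tools=["search", "book"], forbidden_tools={"delete"})
good_run = [Action("search"), Action("summarize"), Action("book")]
bad_run = [Action("book"), Action("delete")]

print(verify_actions(scenario, good_run))
print(verify_actions(scenario, bad_run))
```

A real evaluation server would likely also compare call arguments and resulting state; this sketch only looks at tool names and their order.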

Although we're only looking for experts for this current project, contributors who consistently deliver high-quality submissions may be invited to collaborate on future projects.

How to get started:

Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.

Requirements

The ideal contributor will have:

  • 4+ years of Python development experience, ideally in backend or tools
  • Solid experience building APIs, testing frameworks, or protocol-based interfaces
  • Understanding of Docker, Linux CLI, and HTTP-based communication
  • Ability to integrate new tools into existing infrastructures
  • Familiarity with how LLM agents are prompted, executed, and evaluated
  • Clear documentation and communication skills - you'll work with QA and writers

We also value applicants who have:

  • Experience with Model Context Protocol (MCP) or similar structured agent-server interfaces
  • Knowledge of FastAPI or similar async web frameworks
  • Experience working with LLM logs, scoring functions, or sandbox environments
  • Ability to support dev environments (devcontainers, CI configs, linters)
  • JavaScript experience

Benefits

  • Get paid for your expertise, with rates that can go up to $30/hour depending on your skills, experience, and project needs.
  • Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.
  • Participate in an advanced AI project and gain valuable experience to enhance your portfolio.
  • Influence how future AI models understand and communicate in your field of expertise.
