MCP & Tools Python Developer - Agent Evaluation Infrastructure

Up to $80 per hour

Mindrift

This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.

At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI.

What We Do

The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting‑edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real‑world expertise from across the globe.

Who We're Looking For

Calling all security researchers, engineers, and penetration testers with a strong foundation in problem‑solving, offensive security, and AI‑related risk assessment. If you thrive on digging into complex systems, uncovering hidden vulnerabilities, and thinking creatively under constraints, join us! We’re looking for someone who can bring a hands‑on approach to technical challenges, whether breaking into systems to expose weaknesses or building secure tools and processes. We value contributors with a passion for continuous learning, experimentation, and adaptability.

About The Project

We’re on the hunt for hands‑on Python engineers for a new project focused on developing Model Context Protocol (MCP) servers and internal tools for running and evaluating agent behavior. You'll implement base methods for agent action verification, integrate with internal and client infrastructures, and help fill tooling gaps across the team.

What You’ll Be Doing

  • Developing and maintaining MCP‑compatible evaluation servers
  • Implementing logic to check agent actions against scenario definitions
  • Creating or extending tools that writers and QAs use to test agents
  • Working closely with infrastructure engineers to ensure compatibility
  • Occasionally helping with test writing or debug sessions when needed

How to Get Started

Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.

Requirements

The ideal contributor will have:

  • 4+ years of Python development experience, ideally in backend or tools
  • Solid experience building APIs, testing frameworks, or protocol‑based interfaces
  • Understanding of Docker, Linux CLI, and communication
  • Ability to integrate new tools into existing infrastructures
  • Familiarity with how LLM agents are prompted, executed, and evaluated
  • Clear documentation and communication skills – you'll work with QA and writers

We also value applicants who have:

  • Experience with Model Context Protocol (MCP) or similar structured agent‑server interfaces
  • Knowledge of FastAPI or similar async web frameworks
  • Experience working with LLM logs, scoring functions, or sandbox environments
  • Ability to support dev environments (devcontainers, CI configs, linters)
  • JS experience

Benefits

  • Get paid for your expertise, with rates that can go up to $80/hour depending on your skills, experience, and project needs
  • Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments
  • Participate in an advanced AI project and gain valuable experience to enhance your portfolio
  • Influence how future AI models understand and communicate in your field of expertise

Seniority Level: Mid‑Senior level
Employment Type: Part‑time
Job Function: Other
Industries: IT Services and IT Consulting

Referrals increase your chances of interviewing at Mindrift by 2x.

Vacancy posted 2 days ago