Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

MCP & Tools Python Developer - Agent Evaluation Infrastructure

$80 per hour

Mind Rift

This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.

At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI.

What We Do
The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting‑edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real‑world expertise from across the globe.

Who Were Looking For
Calling all security researchers, engineers, and penetration testers with a strong foundation in problem‑solving, offensive security, and AI‑related risk assessment. If you thrive on digging into complex systems, uncovering hidden vulnerabilities, and thinking creatively under constraints, join us! We’re looking for someone who can bring a hands‑on approach to technical challenges, whether breaking into systems to expose weaknesses or building secure tools and processes. We value contributors with a passion for continuous learning, experimentation, and adaptability.

About the Project
We’re on the hunt for hands‑on Python engineers for a new project focused on developing Model Context Protocol (MCP) servers and internal tools for running and evaluating agent behavior. You’ll implement base methods for agent action verification, integrate with internal and client infrastructures, and help fill tooling gaps across the team.

What You’ll Be Doing

Developing and maintaining MCP‑compatible evaluation servers

Implementing logic to check agent actions against scenario definitions

Creating or extending tools that writers and QAs use to test agents

Working closely with infrastructure engineers to ensure compatibility

Occasionally helping with test writing or debug sessions when needed

Although we’re only looking for experts for this current project, contributors with consistent high‑quality submissions may receive an invitation for ongoing collaboration across future projects.

How to Get Started
Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.

Requirements
The ideal contributor will have:

4+ years of Python development experience, ideally in backend or tools

Solid experience building APIs, testing frameworks, or protocol‑based interfaces

Understanding of Docker, Linux CLI, and communication

Ability to integrate new tools into existing infrastructures

Familiarity with how LLM agents are prompted, executed, and evaluated

Clear documentation and communication skills – you’ll work with QA and writers

We also value applicants who have:

Experience with Model Context Protocol (MCP) or similar structured agent‑server interfaces

Knowledge of FastAPI or similar async web frameworks

Experience working with LLM logs, scoring functions, or sandbox environments

Ability to support dev environments (devcontainers, CI configs, linters)

JS experience

Benefits

Get paid for your expertise, with rates that can go up to $80/hour depending on your skills, experience, and project needs

Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments

Participate in an advanced AI project and gain valuable experience to enhance your portfolio

Influence how future AI models understand and communicate in your field of expertise

Other Information
Seniority level: Mid‑Senior level

Employment type: Part‑time

Job function: Other

Industries: IT Services and IT Consulting

Referrals increase your chances of interviewing at Mindrift by 2x.

Location: Missouri, United States (remote)

Pay range: $20.00‑$40.00 per hour (freelance project)

Note: We’re open to applicants from selected countries; please check eligibility.

#J-18808-Ljbffr
Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the MCP & Tools Python Developer - Agent Evaluation Infrastructure in Richmond, VA vacancy
  • $96.8k - $251.6k

     ...Senior Principal AI Agent / ML Software...  ...Oracle Cloud Infrastructure (OCI). This...  ...orchestration of LLMs, tools, APIs, memory, retrieval, evaluation, guardrails,...  ...Protocol (MCP) integration, vector...  ...models, AI developer tooling, and agentic...  ...skills in Python and ability to... 
    Suggested
    Temporary work
    Flexible hours

    Oracle

    Richmond, VA
    3 days ago
  •  ...and LLM fine-tuning/evaluation . You will be responsible...  ...assurance of our "Agent Mesh" infrastructure, ensuring that the MCP servers built in C# and Python correctly translate...  ..."hallucinating" tool calls. Key Responsibilities...  ...Regression Testing: Develop automated test... 
    Suggested

    HRB

    Richmond, VA
    3 days ago
  •  ...industry pioneers who develop premium crypto...  ...the financial infrastructure that underpins...  ...Finance operations Evaluate how financial...  ...using modern AI tooling including Claude...  ...Anthropic APIs, Python‑based...  ...workflow engines, and MCP or similar agent coordination layers... 
    Suggested
    Local area
    Remote work

    Kraken

    Richmond, VA
    2 days ago
  •  ...Turing Enterprise Inc. is seeking a Software Engineering Evaluator to create datasets and evaluate AI-generated code. This role focuses on Python, alongside JavaScript and other programming languages, collaborating with researchers. Candidates must have 3+ years of software... 
    Suggested
    10 hours per week
    Flexible hours

    Turing Enterprise Inc.

    Richmond, VA
    3 days ago
  • $200.2k - $357.5k

     ...Internet of Things (IoT) data to develop actionable insights and...  ...Senior Staff Machine Learning Infrastructure Engineer to lead the design...  ...) and standardize tracking, evaluation, and regression testing for...  ...and production observability tooling. Proven experience... 
    Suggested
    Full time
    Work at office
    Remote work
    Flexible hours

    Samsara

    Richmond, VA
    2 days ago
  • $40 per hour

     ...AI Trainer - Advanced Python Developers - Paris Remote About Prolific Prolific is not...  ...for Python Developers to help train and evaluate cutting‑edge AI models. If you have the...  .... By providing this crucial data infrastructure, Prolific is positioning itself at the... 
    Self employment
    Remote work
    Work from home
    Flexible hours

    Prolific Academic Ltd

    Richmond, VA
    22 hours ago
  • $111.3k - $174.9k

     ...experienced Full Stack Developer with expertise in React, TypeScript and Python to join our team. The...  ...Proficiency with AWS infrastructure and deployment practices...  ...CI/CD pipelines and tools like Jenkins, GitHub Actions...  ...treated fairly and evaluated based on their... 
    Full time
    Contract work
    Part time
    Work experience placement
    Internship
    Local area

    Paychex Deutschland GmbH

    Richmond, VA
    22 hours ago
  •  ...Job Title Python Developer Education N/A Location Remote (Virtual)...  ...Seamless system. As DSD migrates its infrastructure into the cloud, a CI/CD tool is required. Therefore, DSD is...  .... The database research will evaluate the feasibility of replacing datasets... 
    Full time
    Contract work
    For contractors
    Work experience placement
    Work at office
    Remote work

    Leisnoi Inc

    Richmond, VA
    2 days ago
  •  ...Senior ML Engineer to design and build scalable AI/ML platform infrastructure. In this role, you will collaborate with machine learning...  ...technologies. Ideal candidates have strong programming skills in Python, experience with AI/ML infrastructure, and the ability to work... 

    General Motors

    Richmond, VA
    3 days ago
  • $99.6k - $223.4k

     ...is seeking a Senior AI Agent Engineer to build...  ...balancing speed with safety, evaluation, and maintainability....  ...work across prompts, tools, code, APIs,...  ...engineering workflows. Develop structured tool-calling...  ...brings together the data, infrastructure, applications, and expertise... 
    Temporary work
    Flexible hours

    Oracle

    Richmond, VA
    3 days ago
  • $187k - $220k

     ...Things (IoT) data to develop actionable insights and...  ...industries are the infrastructure of our planet, including...  ...intelligent agents to automate complex tasks...  ...modalities, internal tooling, evaluations, and observability....  ...systems. Proficiency in Python. Strong system... 
    Work at office
    Remote work
    Relocation package
    Flexible hours

    Samsara

    Richmond, VA
    22 hours ago
  • $145k - $160k

     ...mission to make high-performance cloud infrastructure easy to use, affordable, and locally accessible...  ...provisioning of GPU infrastructure Develop automated testing systems for GPU-based...  ...building infrastructure automation using Python and Bash Familiarity with GPU drivers,... 
    Work at office
    Immediate start
    Remote work
    Flexible hours

    Vultr

    Richmond, VA
    3 days ago
  •  ...A leading technology firm in the United Kingdom is seeking a machine learning engineer to design evaluation suites and assess AI-generated solutions. The role requires strong experience in machine learning engineering and the ability to work independently. Ideal candidates... 

    Crossing Hurdles

    Richmond, VA
    3 days ago
  • $60 - $70 per hour

     ...Senior Technical Recruiter - AI Infrastructure and Engineering Remote, US $60 - $70 Job Title Senior Technical Recruiter - AI Infrastructure...  ...Run world-class interview processes: calibration, structured evaluation, closing, and offer management. Drive pipeline creation via... 
    Remote work

    The Leadership Agency Inc.

    Richmond, VA
    2 days ago
  •  ...WHO YOU ARE An experienced AWS Infrastructure Engineer to join our cloud...  ...Infrastructure as Code (IaC) using tools such as Terraform or AWS...  ...AWS services and features and evaluate their potential benefits to...  ...Competence in scripting languages (Python, Bash, or PowerShell). AWS... 

    Framework Ventures

    Richmond, VA
    3 days ago
  •  ...Headquarters: Idaho, USA URL: Lincoln Freedom Group, LLC is hiring a Python Developer on behalf of a growing client expanding into multiple new...  ...REST/SOAP API clients Kafka PostgreSQL and other standard infrastructure The Environment: The team is collaborative but self-... 
    Full time
    Contract work
    Remote work

    CloudDevs

    Richmond, VA
    5 days ago
  • $140k - $180k

     ...showcase your technical skillset and take ownership of a data infrastructure that powers large-scale product catalogs, analytics, and AI-driven...  ...and insight generation Qualifications Strong experience with Python and SQL in production data environments Experience with cloud... 
    Remote work

    Rise Technical

    Richmond, VA
    3 days ago
  • $122.43k - $183.64k

     ...available for this position. The Senior Data Infrastructure Engineer designs, builds, and scales...  ...and scripting or programming languages (Python, Shell, etc.). Hands-on experience using...  ...analytics workflow using orchestration tools (Airflow, dbt). Experience supporting database... 
    Full time
    Visa sponsorship

    Lyric

    Richmond, VA
    3 days ago
  • $190k - $220k

     ...Data Engineer, Data Lakehouse Infrastructure United States Build to...  ...management using cutting-edge tools and frameworks like Apache Spark...  ...compatible catalogs. Develop and orchestrate robust ETL/ELT...  ...Exceptional programming skills in Python, as well as adeptness in SQL... 
    Remote work
    Shift work

    TRM

    Richmond, VA
    22 hours ago
  • $55 per hour

     ...Freelance AI Trainer - Civil Engineering & Python 3 days ago Be among the first 25...  ...Engineers with Python skills to train and evaluate AI models on realistic civil...  ...Construction Engineering & Management Infrastructure, Coastal, Earthquake, Sustainable Engineering... 
    Part time
    Freelance
    Remote work
    Flexible hours

    Mind Rift

    Richmond, VA
    2 days ago
  •  ...business combined with the quality and infrastructure of a larger firm. Recognized as an...  ...Description Concept Plus is seeking a Junior Python Developer with at least 3 years of hands‑on...  ...using Pyodide and related tooling. Follow secure coding standards aligned... 
    Local area
    Flexible hours

    Concept Plus

    Richmond, VA
    11 hours ago
  • $200k - $240k

     ...years of backend engineering experience, expert proficiency in Python for AI/ML, and proven experience with distributed systems....  ...include designing APIs, owning the technical roadmap for backend infrastructure, and mentoring fellow engineers. The position offers a competitive... 

    Framework Ventures

    Richmond, VA
    3 days ago
  •  ...Senior Software Engineer to lead the design and architecture of analytics infrastructure supporting research reporting and analysis. The ideal candidate will have over 6 years of experience, strong Python and SQL skills, and a background in data infrastructure. The position... 

    The Voleon Group

    Richmond, VA
    3 days ago
  • $180k - $220k

     ...confidently with Pinecones developer-friendly...  ...era – knowledge infrastructure that powers high-...  ...quality through evaluation and observability...  ...Rust, C++, Java, or Python....  ...modern infrastructure tools, such as Kubernetes...  ...developers and autonomous agents will love.... 
    Full time
    Local area
    Remote work
    Work from home
    Flexible hours

    Menlo Ventures

    Richmond, VA
    2 days ago
  •  ...Python Developer With AWS Infrastructure Experience Visa status: U.S. Citizens and those authorized to work in the U.S. are encouraged to apply. Tax Terms: W2, 1099 Corp-Corp or 3rd Parties: Yes Location: Richmond VA, Chicago IL, Plano TX. Long Term 5+ years of... 

    Keylent Inc

    Richmond, VA
    26 days ago
  • £75k - £90k per year

     ...Description for the Infrastructure Manager role This is a fantastic opportunity with a technology division of a global company. The role will...  ...Solid networking/Security understanding/experience Scripting (Python or similar) Salary For the Infrastructure Manager role will be... 
    Remote work

    Bentley Whitaker Search and Selection

    Richmond, VA
    3 days ago
  •  ...platform teams. You will develop solutions powered...  ...with the AI Infrastructure team to architect robust...  ...performance using evaluation frameworks, observability tools, and iterative fine...  ...of experience in Python programming Proficient...  ...and deploying AI agents in production environments... 
    Permanent employment
    Contract work

    Genesis10

    Richmond, VA
    1 day ago
  • $145k - $170k

     ...this capacity you will develop an understanding of...  ...across model performance, infrastructure, reliability, and cost...  ...infrastructure. Evaluate and optimize model performance...  ...and recommend tools or frameworks that improve...  ...learning models with Python and AWS. Hands‑on... 
    Remote work
    Monday to Friday
    Flexible hours

    Sandbox Industries

    Richmond, VA
    22 hours ago
  • $170k - $210k

     ...more compute capacity from existing energy infrastructure. For over a decade, we have applied AI...  ..., including build vs. buy tradeoffs, tooling selection, and platform evolution as the...  ...training, and why it matters Proficiency in Python; C++, CUDA, Go, Rust a plus Excellent... 
    Local area
    Remote work
    Flexible hours

    Utilidata

    Richmond, VA
    22 hours ago
  •  ...players in compute, cloud, data, and AI infrastructure have converged on TetraScience as...  ...serve data for ML and analytics. Develop infrastructure‑as‑code using tools like Cloudformation, AWS CDK to...  ...coding skills in TypeScript and Python building robust APIs and backend services... 
    Immediate start
    Remote work
    Flexible hours

    TetraScience

    Richmond, VA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to MCP & Tools Python Developer - Agent Evaluation Infrastructure. Be the first to apply!