MCP & Tools Python Developer - Agent Evaluation Infrastructure
$80 per hourMind Rift
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.
At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. What We DoThe Mindrift platform, launched and powered by Toloka, connects domain experts with cutting‑edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real‑world expertise from across the globe. Who Were Looking For
Calling all security researchers, engineers, and penetration testers with a strong foundation in problem‑solving, offensive security, and AI‑related risk assessment. If you thrive on digging into complex systems, uncovering hidden vulnerabilities, and thinking creatively under constraints, join us! We’re looking for someone who can bring a hands‑on approach to technical challenges, whether breaking into systems to expose weaknesses or building secure tools and processes. We value contributors with a passion for continuous learning, experimentation, and adaptability. About the Project
We’re on the hunt for hands‑on Python engineers for a new project focused on developing Model Context Protocol (MCP) servers and internal tools for running and evaluating agent behavior. You’ll implement base methods for agent action verification, integrate with internal and client infrastructures, and help fill tooling gaps across the team. What You’ll Be Doing Developing and maintaining MCP‑compatible evaluation servers Implementing logic to check agent actions against scenario definitions Creating or extending tools that writers and QAs use to test agents Working closely with infrastructure engineers to ensure compatibility Occasionally helping with test writing or debug sessions when needed Although we’re only looking for experts for this current project, contributors with consistent high‑quality submissions may receive an invitation for ongoing collaboration across future projects. How to Get Started
Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone. Requirements
The ideal contributor will have: 4+ years of Python development experience, ideally in backend or tools Solid experience building APIs, testing frameworks, or protocol‑based interfaces Understanding of Docker, Linux CLI, and communication Ability to integrate new tools into existing infrastructures Familiarity with how LLM agents are prompted, executed, and evaluated Clear documentation and communication skills – you’ll work with QA and writers We also value applicants who have: Experience with Model Context Protocol (MCP) or similar structured agent‑server interfaces Knowledge of FastAPI or similar async web frameworks Experience working with LLM logs, scoring functions, or sandbox environments Ability to support dev environments (devcontainers, CI configs, linters) JS experience Benefits Get paid for your expertise, with rates that can go up to $80/hour depending on your skills, experience, and project needs Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments Participate in an advanced AI project and gain valuable experience to enhance your portfolio Influence how future AI models understand and communicate in your field of expertise Other Information
Seniority level: Mid‑Senior level Employment type: Part‑time Job function: Other Industries: IT Services and IT Consulting Referrals increase your chances of interviewing at Mindrift by 2x. Location: Missouri, United States (remote) Pay range: $20.00‑$40.00 per hour (freelance project) Note: We’re open to applicants from selected countries; please check eligibility. #J-18808-Ljbffr
Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the MCP & Tools Python Developer - Agent Evaluation Infrastructure in Richmond, VA vacancy
$96.8k - $251.6k
...Senior Principal AI Agent / ML Software... ...Oracle Cloud Infrastructure (OCI). This... ...orchestration of LLMs, tools, APIs, memory, retrieval, evaluation, guardrails,... ...Protocol (MCP) integration, vector... ...models, AI developer tooling, and agentic... ...skills in Python and ability to...SuggestedTemporary workFlexible hours- ...and LLM fine-tuning/evaluation . You will be responsible... ...assurance of our "Agent Mesh" infrastructure, ensuring that the MCP servers built in C# and Python correctly translate... ..."hallucinating" tool calls. Key Responsibilities... ...Regression Testing: Develop automated test...Suggested
- ...industry pioneers who develop premium crypto... ...the financial infrastructure that underpins... ...Finance operations Evaluate how financial... ...using modern AI tooling including Claude... ...Anthropic APIs, Python‑based... ...workflow engines, and MCP or similar agent coordination layers...SuggestedLocal areaRemote work
- ...Turing Enterprise Inc. is seeking a Software Engineering Evaluator to create datasets and evaluate AI-generated code. This role focuses on Python, alongside JavaScript and other programming languages, collaborating with researchers. Candidates must have 3+ years of software...Suggested10 hours per weekFlexible hours
$200.2k - $357.5k
...Internet of Things (IoT) data to develop actionable insights and... ...Senior Staff Machine Learning Infrastructure Engineer to lead the design... ...) and standardize tracking, evaluation, and regression testing for... ...and production observability tooling. Proven experience...SuggestedFull timeWork at officeRemote workFlexible hours$40 per hour
...AI Trainer - Advanced Python Developers - Paris Remote About Prolific Prolific is not... ...for Python Developers to help train and evaluate cutting‑edge AI models. If you have the... .... By providing this crucial data infrastructure, Prolific is positioning itself at the...Self employmentRemote workWork from homeFlexible hours$111.3k - $174.9k
...experienced Full Stack Developer with expertise in React, TypeScript and Python to join our team. The... ...Proficiency with AWS infrastructure and deployment practices... ...CI/CD pipelines and tools like Jenkins, GitHub Actions... ...treated fairly and evaluated based on their...Full timeContract workPart timeWork experience placementInternshipLocal area- ...Job Title Python Developer Education N/A Location Remote (Virtual)... ...Seamless system. As DSD migrates its infrastructure into the cloud, a CI/CD tool is required. Therefore, DSD is... .... The database research will evaluate the feasibility of replacing datasets...Full timeContract workFor contractorsWork experience placementWork at officeRemote work
- ...Senior ML Engineer to design and build scalable AI/ML platform infrastructure. In this role, you will collaborate with machine learning... ...technologies. Ideal candidates have strong programming skills in Python, experience with AI/ML infrastructure, and the ability to work...
$99.6k - $223.4k
...is seeking a Senior AI Agent Engineer to build... ...balancing speed with safety, evaluation, and maintainability.... ...work across prompts, tools, code, APIs,... ...engineering workflows. Develop structured tool-calling... ...brings together the data, infrastructure, applications, and expertise...Temporary workFlexible hours$187k - $220k
...Things (IoT) data to develop actionable insights and... ...industries are the infrastructure of our planet, including... ...intelligent agents to automate complex tasks... ...modalities, internal tooling, evaluations, and observability.... ...systems. Proficiency in Python. Strong system...Work at officeRemote workRelocation packageFlexible hours$145k - $160k
...mission to make high-performance cloud infrastructure easy to use, affordable, and locally accessible... ...provisioning of GPU infrastructure Develop automated testing systems for GPU-based... ...building infrastructure automation using Python and Bash Familiarity with GPU drivers,...Work at officeImmediate startRemote workFlexible hours- ...A leading technology firm in the United Kingdom is seeking a machine learning engineer to design evaluation suites and assess AI-generated solutions. The role requires strong experience in machine learning engineering and the ability to work independently. Ideal candidates...
$60 - $70 per hour
...Senior Technical Recruiter - AI Infrastructure and Engineering Remote, US $60 - $70 Job Title Senior Technical Recruiter - AI Infrastructure... ...Run world-class interview processes: calibration, structured evaluation, closing, and offer management. Drive pipeline creation via...Remote work- ...WHO YOU ARE An experienced AWS Infrastructure Engineer to join our cloud... ...Infrastructure as Code (IaC) using tools such as Terraform or AWS... ...AWS services and features and evaluate their potential benefits to... ...Competence in scripting languages (Python, Bash, or PowerShell). AWS...
- ...Headquarters: Idaho, USA URL: Lincoln Freedom Group, LLC is hiring a Python Developer on behalf of a growing client expanding into multiple new... ...REST/SOAP API clients Kafka PostgreSQL and other standard infrastructure The Environment: The team is collaborative but self-...Full timeContract workRemote work
$140k - $180k
...showcase your technical skillset and take ownership of a data infrastructure that powers large-scale product catalogs, analytics, and AI-driven... ...and insight generation Qualifications Strong experience with Python and SQL in production data environments Experience with cloud...Remote work$122.43k - $183.64k
...available for this position. The Senior Data Infrastructure Engineer designs, builds, and scales... ...and scripting or programming languages (Python, Shell, etc.). Hands-on experience using... ...analytics workflow using orchestration tools (Airflow, dbt). Experience supporting database...Full timeVisa sponsorship$190k - $220k
...Data Engineer, Data Lakehouse Infrastructure United States Build to... ...management using cutting-edge tools and frameworks like Apache Spark... ...compatible catalogs. Develop and orchestrate robust ETL/ELT... ...Exceptional programming skills in Python, as well as adeptness in SQL...Remote workShift work$55 per hour
...Freelance AI Trainer - Civil Engineering & Python 3 days ago Be among the first 25... ...Engineers with Python skills to train and evaluate AI models on realistic civil... ...Construction Engineering & Management Infrastructure, Coastal, Earthquake, Sustainable Engineering...Part timeFreelanceRemote workFlexible hours- ...business combined with the quality and infrastructure of a larger firm. Recognized as an... ...Description Concept Plus is seeking a Junior Python Developer with at least 3 years of hands‑on... ...using Pyodide and related tooling. Follow secure coding standards aligned...Local areaFlexible hours
$200k - $240k
...years of backend engineering experience, expert proficiency in Python for AI/ML, and proven experience with distributed systems.... ...include designing APIs, owning the technical roadmap for backend infrastructure, and mentoring fellow engineers. The position offers a competitive...- ...Senior Software Engineer to lead the design and architecture of analytics infrastructure supporting research reporting and analysis. The ideal candidate will have over 6 years of experience, strong Python and SQL skills, and a background in data infrastructure. The position...
$180k - $220k
...confidently with Pinecones developer-friendly... ...era – knowledge infrastructure that powers high-... ...quality through evaluation and observability... ...Rust, C++, Java, or Python.... ...modern infrastructure tools, such as Kubernetes... ...developers and autonomous agents will love....Full timeLocal areaRemote workWork from homeFlexible hours- ...Python Developer With AWS Infrastructure Experience Visa status: U.S. Citizens and those authorized to work in the U.S. are encouraged to apply. Tax Terms: W2, 1099 Corp-Corp or 3rd Parties: Yes Location: Richmond VA, Chicago IL, Plano TX. Long Term 5+ years of...
£75k - £90k per year
...Description for the Infrastructure Manager role This is a fantastic opportunity with a technology division of a global company. The role will... ...Solid networking/Security understanding/experience Scripting (Python or similar) Salary For the Infrastructure Manager role will be...Remote work- ...platform teams. You will develop solutions powered... ...with the AI Infrastructure team to architect robust... ...performance using evaluation frameworks, observability tools, and iterative fine... ...of experience in Python programming Proficient... ...and deploying AI agents in production environments...Permanent employmentContract work
$145k - $170k
...this capacity you will develop an understanding of... ...across model performance, infrastructure, reliability, and cost... ...infrastructure. Evaluate and optimize model performance... ...and recommend tools or frameworks that improve... ...learning models with Python and AWS. Hands‑on...Remote workMonday to FridayFlexible hours$170k - $210k
...more compute capacity from existing energy infrastructure. For over a decade, we have applied AI... ..., including build vs. buy tradeoffs, tooling selection, and platform evolution as the... ...training, and why it matters Proficiency in Python; C++, CUDA, Go, Rust a plus Excellent...Local areaRemote workFlexible hours- ...players in compute, cloud, data, and AI infrastructure have converged on TetraScience as... ...serve data for ML and analytics. Develop infrastructure‑as‑code using tools like Cloudformation, AWS CDK to... ...coding skills in TypeScript and Python building robust APIs and backend services...Immediate startRemote workFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to MCP & Tools Python Developer - Agent Evaluation Infrastructure. Be the first to apply!
Related searches
- backend python developer Richmond, VA
- work from home python developer Richmond, VA
- senior python developer Richmond, VA
- python engineer Richmond, VA
- full stack / python developer (remote) Richmond, VA
- python developer data analytics Richmond, VA
- python programmer Richmond, VA
- python developer part time Richmond, VA
- python developer Richmond, VA
- python developer remote Richmond, VA

