MCP & Tools Python Developer - Agent Evaluation Infrastructure
$80 per hourMindrift
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency. At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. What We Do The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting‑edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real‑world expertise from across the globe. Who We're Looking For Calling all security researchers, engineers, and penetration testers with a strong foundation in problem‑solving, offensive security, and AI‑related risk assessment. If you thrive on digging into complex systems, uncovering hidden vulnerabilities, and thinking creatively under constraints, join us! We’re looking for someone who can bring a hands‑on approach to technical challenges, whether breaking into systems to expose weaknesses or building secure tools and processes. We value contributors with a passion for continuous learning, experimentation, and adaptability. About The Project We’re on the hunt for hands‑on Python engineers for a new project focused on developing Model Context Protocol (MCP) servers and internal tools for running and evaluating agent behavior. You'll implement base methods for agent action verification, integrate with internal and client infrastructures, and help fill tooling gaps across the team. What You’ll Be Doing Developing and maintaining MCP‑compatible evaluation servers Implementing logic to check agent actions against scenario definitions Creating or extending tools that writers and QAs use to test agents Working closely with infrastructure engineers to ensure compatibility Occasionally helping with test writing or debug sessions when needed How to Get Started Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone. Requirements The ideal contributor will have: 4+ years of Python development experience, ideally in backend or tools Solid experience building APIs, testing frameworks, or protocol‑based interfaces Understanding of Docker, Linux CLI, and communication Ability to integrate new tools into existing infrastructures Familiarity with how LLM agents are prompted, executed, and evaluated Clear documentation and communication skills – you'll work with QA and writers We also value applicants who have: Experience with Model Context Protocol (MCP) or similar structured agent‑server interfaces Knowledge of FastAPI or similar async web frameworks Experience working with LLM logs, scoring functions, or sandbox environments Ability to support dev environments (devcontainers, CI configs, linters) JS experience Benefits Get paid for your expertise, with rates that can go up to $80/hour depending on your skills, experience, and project needs Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments Participate in an advanced AI project and gain valuable experience to enhance your portfolio Influence how future AI models understand and communicate in your field of expertise Seniority Level Mid‑Senior level Employment Type Part‑time Job Function Other Industries IT Services and IT Consulting Referrals increase your chances of interviewing at Mindrift by 2x. #J-18808-Ljbffr Mindrift
$99.6k - $234.6k
...The Principal AI Agent / ML Software... ...on Oracle Cloud Infrastructure (OCI). This... ...orchestration of LLMs, tools, APIs, memory, retrieval, evaluation, guardrails,... ...Protocol (MCP) integration, vector... ...models, AI developer tooling, and agentic... ...skills in Python and ability to...SuggestedTemporary workFlexible hours$50 per hour
...Description Job Description Python - Senior Developers - AI Training - USA About... ...Network to help train and evaluate cutting-edge AI models. If... ..., or Flask. Testing & Tools: proficiency with pytest,... ...this crucial human data infrastructure, Prolific is positioning...SuggestedWork from homeFlexible hours$80 per hour
...modern AI systems are tested and evaluated? Analysts, researchers or... ...for QAs for autonomous AI agents for a new project focused on... ...closely with QA, writers, or developers to suggest refinements or edge... ...future of AI while building tools that benefit everyone. Requirements...SuggestedPermanent employmentPart timeFreelanceRemote workFlexible hours- ...consulting firm is seeking hands-on Python engineers for a part-time role focused on developing Model Context Protocol servers. The role involves evaluating agent behavior and requires 4+ years of... ..., particularly in backend or tools development. Ideal candidates should...SuggestedRemote jobPart timeFlexible hours
$30 per hour
...Oracle's customer and developer communities; and 4... ...motivated AI Agent Intern to join Oracle... ...Analysis Evaluate logistics AI use... ...Familiarity with Python, SQL, APIs, or... ...Experience with analytics tools. Coursework... ...the data, infrastructure, applications, and...SuggestedHourly payTemporary workInternshipFlexible hours$99.6k - $223.4k
...is seeking a Senior AI Agent Engineer to build... ...balancing speed with safety, evaluation, and maintainability.... ...work across prompts, tools, code, APIs,... ...engineering workflows. Develop structured tool-calling... ...brings together the data, infrastructure, applications, and expertise...Temporary workFlexible hours- ...Cloud & AI-Driven Infrastructure) About the Role... ...it through new tools and approaches.... ...software delivery Develop and maintain Terraform... ...and the Claude Agent SDK, is strongly preferred... ...skills in Python, Bash, or similar... ...Claude Agent SDK and MCP for DevOps...
- ...enterprise identities, AI agents, machine identities,... ...including on-premises infrastructure and public cloud... ...gap analyses, control evaluations, and target-state Agentic... ...Terraform, CI/CD, DevSecOps tooling RESPONSIBILITIES... ...environments. Develop automation for access...
- ...Senior Healthcare Commissioning Agent LJC is seeking a Senior... ...for critical hospital infrastructure, coordinate multidisciplinary... ...design through occupancy. Developing and executing commissioning... ...testing of building systems. Evaluating system performance and...For contractorsFlexible hours
$70 - $85 per hour
...Python Developer With Ai Join a dynamic team focused on developing innovative solutions in the banking sector. This contract position requires... ...architectural design of distributed, scalable systems. Evaluate and monitor AI solutions to ensure high performance and...Contract work- ...Cyber is looking for security focused Python programmers to help develop custom plugins and 3rd party... ...common standards using log aggregation tools (e.g., Elasticsearch and Splunk); enriching... ...tools Experience with cloud infrastructure and networking in AWS and/or Azure...Remote workWork from home
$90.4k - $173.3k
...SME, Primary Engineering, Energy and Infrastructure Projects (Phoenix, AZ) LOCATION: Phoenix... ...will apply your technical mastery to develop innovative Substation Layout Designs spanning... ...and/or MicroStation, and use these tools to produce accurate, high-quality design...Full timeWork at office$127.5k - $224k
...Platform team. The role involves building the data infrastructure for AI use cases, enhancing platform usage with AI tools, and mentoring engineers. The ideal candidate... ...in data engineering and strong skills in Python, SQL, and cloud platforms like Databricks. Benefits...Flexible hours$218.03k - $256.5k
...Partner with the Coinbase Infrastructure team to support and... ...to build surveillance tooling into deployment... ...monitoring/observability ~ Developing and maintaining... ...scripting languages (Bash, python, Ruby, Go, etc) ~ Proficiency... ...you to carefully evaluate how your skills and...Local area- ...Aerospace is looking for a Senior AI Engineer to focus on Agentic Evaluation and V&V. The role involves building evaluation frameworks and... ...6+ years of experience in software or ML engineering, strong Python skills, and a Bachelor’s in a relevant field. This is a full-time...Full timeRemote work
$109.32k - $180.69k
...groundwater, water supply, treatment, infrastructure, recycled water, stormwater, construction... ...practice. West Yost has developed the AWWA cybersecurity resources since... ...Performing system analysis using SKM Power Tools power systems analysis software. At a minimum...Full timeTemporary workWork experience placementWork at officeLocal areaRemote workFlexible hoursNight shiftAfternoon shift$166.83k - $245.34k
...dedicated to solving the world's critical infrastructure, environmental, energy, and resource... ...accelerate the energy transition. We evaluate options and identify opportunities and... ...environment along with the opportunity to develop your career prospects within Canada and...Full timeWork at officeLocal areaRemote workOverseasFlexible hours3 days per week$185.1k - $284.1k
...the Organization The Evaluation team builds and... ...driving technology. We develop metrics, automated workflows... ..., and analysis tooling for simulation and on-... ...metrics, tooling, and infrastructure. Raise the quality... ...proficiency developing Python in production team environments...Local areaWork from homeFlexible hours- ...expertise, particularly in large-scale infrastructure projects. If you are looking for a lifelong... ...team of engineers and designers that develop projects through the project life-cycle... ...equipment/components, technical bid evaluation from vendors, and vendor selection....Full timeContract workWork at officeRemote workFlexible hoursWeekend work
$157.49k - $174.71k
...Intelligent Data Management: Use AI tools to analyze, map, and automate the data... ...Automated Deployment & Self-Healing Infrastructure: Build fully automated CI/CD pipelines... ..., Grafana, ELK stack, or similar) Python scripting experience for automation and...Flexible hours- ...Company GitKraken is the developer experience (DevEx)... ..., issue tracking tools, and AI solutions make... ...and operational data Evaluate ambiguous ideas quickly... ...ecosystem: Languages: Python (for data/ML execution... .... Data & Infrastructure: Snowflake for data warehousing...Flexible hours
$90.4k - $173.3k
...SME, Secondary Engineering, Energy and Infrastructure Projects (Phoenix, AZ) Location Phoenix,... ...reliable, high‑performing substation systems. Develop specifications, bills of materials (BOMs... ...Qualifications Exposure to ETAP and SEL tools. Strong skills in CAD software,...Full timeWork at office$80 per hour
...world. Over 35,000 AI developers, researchers, and... ...Network to help train and evaluate the next generation of... ...& Data: advanced Python (NumPy, Pandas, Scikit... ...Vertex AI), or specialized tools like Weights & Biases... ...crucial human data infrastructure, Prolific is...Hourly payWork from homeFlexible hours- ...Learning Engineer to design, develop, and deploy a multi-modal action... ...machine learning and multi-agent decision making: you will... ...and curation through training, evaluation, and field deployment Integrate... ...environments Write robust Python and C++ code Required...Local area
- ...Engineer in Phoenix, Arizona with expertise in Agentic AI and Python. The role involves developing AI-driven applications, such as chatbots and automation... ...AI. Familiarity with cloud platforms and CI/CD tools is also required. Join Omega Hires to impact AI technology...
- ...Phoenix, AZ seeks a Deployed Engineer to develop production AI agents that real teams depend on. The role... ...customer engineering teams, guiding evaluations, and advising on architecture and... ...relevant technical roles, strong skills in Python and JavaScript, and experience...
- ...impact workflows where AI agents can amplify value by 100 times Develop and implement future-... ...performance indicators, conduct evaluations, and continuously... ...expertise to collaborate with MCP, command-line interfaces... ...to educate staff on AI tools and best practices....
- ...organization where AI agents operate... ...AI ready data infrastructure that power trusted... ...and manage the MCP based data... ...automation Evaluate emerging AI, semantic... ...and analytics tooling while making principled... ...scalability, developer productivity,... ...proficiency in Python for pipeline...
- ...Infrastructure Engineer We are seeking an Infrastructure Engineer (24... ...solutions using Ansible, Terraform, Python, and Linux in an Agile... ...Key Responsibilities: Develop automation on ServiceNow Ansible... ...Have: Agile experience, additional scripting/automation tools...
$91.48k
...solutions for the environment and infrastructure, and an unmatched experience... ...plumbing design while also developing detailed energy models and... ...assess energy consumption, evaluate HVAC performance, identify... ...your very best and given the tools and resources to do so....Full timeH1bLocal areaRelocation packageFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to MCP & Tools Python Developer - Agent Evaluation Infrastructure. Be the first to apply!
- tool design engineer Phoenix, AZ
- python programmer Phoenix, AZ
- python developer data analytics Phoenix, AZ
- python engineer Phoenix, AZ
- python developer Phoenix, AZ
- senior python developer Phoenix, AZ
- python developer remote Phoenix, AZ
- full stack / python developer (remote) Phoenix, AZ
- commissioning agent Phoenix, AZ
- cruise agent Phoenix, AZ


