MCP & Tools Python Developer - Agent Evaluation Infrastructure
$80 per hourMind Rift
THIS OPPORTUNITY IS ONLY FOR CANDIDATES CURRENTLY RESIDING IN THE SPECIFIED COUNTRY. YOUR LOCATION MAY AFFECT ELIGIBILITY AND RATES. PLEASE SUBMIT YOUR RESUME IN ENGLISH AND INDICATE YOUR LEVEL OF ENGLISH PROFICIENCY. What We Do The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting‑edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real‑world expertise from across the globe. Who We’re Looking For Calling all security researchers, engineers, and penetration testers with a strong foundation in problem‑solving, offensive security, and AI‑related risk assessment. If you thrive on digging into complex systems, uncovering hidden vulnerabilities, and thinking creatively under constraints, join us! We’re looking for someone who can bring a hands‑on approach to technical challenges, whether breaking into systems to expose weaknesses or building secure tools and processes. We value contributors with a passion for continuous learning, experimentation, and adaptability. About The Project We’re on the hunt for hands‑on Python engineers for a new project focused on developing Model Context Protocol (MCP) servers and internal tools for running and evaluating agent behavior. You’ll implement base methods for agent action verification, integrate with internal and client infrastructures, and help fill tooling gaps across the team. What You’ll Be Doing Developing and maintaining MCP‑compatible evaluation servers Implementing logic to check agent actions against scenario definitions Creating or extending tools that writers and QAs use to test agents Working closely with infrastructure engineers to ensure compatibility Occasionally helping with test writing or debug sessions when needed Although we’re only looking for experts for this current project, contributors with consistent high‑quality submissions may receive an invitation for ongoing collaboration across future projects. Requirements 4+ years of Python development experience, ideally in backend or tools Solid experience building APIs, testing frameworks, or protocol‑based interfaces Understanding of Docker, Linux CLI, and communication Ability to integrate new tools into existing infrastructures Familiarity with how LLM agents are prompted, executed, and evaluated Clear documentation and communication skills – you’ll work with QA and writers We also value applicants who have: Experience with Model Context Protocol (MCP) or similar structured agent‑server interfaces Knowledge of FastAPI or similar async web frameworks Experience working with LLM logs, scoring functions, or sandbox environments Ability to support dev environments (devcontainers, CI configs, linters) JS experience Benefits Get paid for your expertise, with rates that can go up to $80/hour depending on your skills, experience, and project needs Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments Participate in an advanced AI project and gain valuable experience to enhance your portfolio Influence how future AI models understand and communicate in your field of expertise #J-18808-Ljbffr
$80 per hour
...tech-focused company is seeking hands-on Python engineers for a project involving Model Context Protocol (MCP) servers and internal tools. Candidates should have 4+ years of... ...commitments. You will work closely with QA and infrastructure teams to enhance tool quality and...SuggestedRemote work$96.8k - $251.6k
...Senior Principal AI Agent / ML Software... ...Oracle Cloud Infrastructure (OCI). This... ...orchestration of LLMs, tools, APIs, memory, retrieval, evaluation, guardrails,... ...Protocol (MCP) integration, vector... ...models, AI developer tooling, and agentic... ...skills in Python and ability to...SuggestedTemporary workFlexible hours$30 per hour
...Oracle's customer and developer communities; and 4... ...motivated AI Agent Intern to join Oracle... ...Analysis Evaluate logistics AI use... ...Familiarity with Python, SQL, APIs, or... ...Experience with analytics tools. Coursework... ...the data, infrastructure, applications, and...SuggestedHourly payTemporary workInternshipFlexible hours$80 per hour
...companies, focused on testing, evaluating, and improving AI systems... ...is suited for a Senior Python developer with deep functional... ...tasks, and confidence using tools like Roo Code or Claude... ...+ Prior experience with agent evaluation platforms and MCP CLI Tools and Technologies...SuggestedPermanent employmentTemporary workFreelanceRemote workFlexible hours$99.6k - $223.4k
...is seeking a Senior AI Agent Engineer to build... ...balancing speed with safety, evaluation, and maintainability.... ...work across prompts, tools, code, APIs,... ...engineering workflows. Develop structured tool-calling... ...brings together the data, infrastructure, applications, and expertise...SuggestedTemporary workFlexible hours$55 per hour
...Freelance AI Trainer - Civil Engineering & Python 3 days ago Be among the first 25... ...Engineers with Python skills to train and evaluate AI models on realistic civil engineering... ...Construction Engineering & Management Infrastructure, Coastal, Earthquake, Sustainable Engineering...Part timeFreelanceRemote workFlexible hours$68k - $107k
...Tooling Engineer - Holt Area Tooling Engineer | Holt, MI $68... ...Responsibilities Design, develop, and test tooling for... ...and end-of-life planning. Evaluate new tooling technologies and... ...automation or data analysis (e.g., Python, VBA). Demonstrated ability...Immediate start- ...advancement Job Title: Network Infrastructure Engineer Location: Colorado (... ...system administration tasks Develop automation scripts to... ...automation experience (Bash, Python, or similar) Preferred Qualifications... ...with network monitoring tools Experience supporting high‑availability...Full time
$76.75k - $142.2k
...Marine Interdiction Agent Air and Marine Operations (AMO), a component of U.S. Customs and Border Protection (CBP) offers those... ...high-tech surveillance equipment to gather evidence, collect, evaluate, and then preserve that physical and documentary evidence....Full time$64.4k - $85k
...Utilities Right of Way Agent - Michigan - (1697) Job Type Full-time Salary UI Exemption Type Exempt Location MI... ...The Right People. The Right Culture. Right of Way is where infrastructure meets innovation—shaping roads, utilities, and communities for...Full timeWork experience placementWork from homeHome office$25.85 - $47.75 per hour
...Content Specialist - Promotional Agent 9-12/Specialist 13 Print (... ...responsibility for the evaluation, allocation, and fiscal oversight... ...This position will plan and develop the agency digital content... ...; rebuild our state’s infrastructure; or support other state agencies...Hourly payPermanent employmentFull timeTraineeshipWork at officeImmediate startRemote workFlexible hours$286.2k - $326.7k
...investments in technology infrastructure and world-class talent... ...at scale and delivers developer joy, for all Capital... ..., guardrails, model evaluation, deployment, and both... ...programming in C, C++, Python, or Scala At least... ...TensorFlow) and orchestration tools (Databricks, Airflow,...Full timePart timeLocal areaRemote workFlexible hours- ...Integrate performance testing and monitoring tools within CI/CD pipelines using Jenkins,... ...planning analysis, determining optimal infrastructure requirements to handle projected... ...systems performance testing, monitoring and evaluation and five years' experience in...Contract workWork at officeRemote workRelocation packageFlexible hours3 days per week
$71k - $110k
...Job Description: Tooling Engineer | Lansing, MI Salary Range: $71,000 - $110,000... ...Key Responsibilities Designs and develops tooling, fixtures, and jigs for manufacturing... ...software for tooling design and simulation. Evaluates new tooling technologies and materials...Immediate start$286.2k - $326.7k
...investments in technology infrastructure and world-class talent... ...at scale and delivers developer joy, for all Capital... ...testing, guardrails, evaluation, deployment, and both... ...messaging ~ Proficiency in Python, Java, C++, or Golang;... ...) and orchestration tools (Databricks, Airflow,...Full timePart timeLocal areaRemote work$170k - $215k
...organization (Citeline, Evaluate, MMIT, Panalgo,... ...maintain multi-agent systems and agentic... ...and extend MCP (Model Context Protocol... ...years of experience developing AI/ML applications... ...knowledge of Python and core data science... ...Experience with Big Data tools like Apache Spark,...Contract workTemporary workLocal areaRemote workFlexible hours$50 - $60 per hour
...for advancement Network Infrastructure Engineer Fort Collins, CO... ..., and Linux-based automation tooling across multiple lab... ...infrastructure platforms across labs Develop custom scripts and automation... ...and automation experience (Python, Bash preferred) Nice to Have...Hourly payFull timeContract workWork from homeFlexible hours- ...based in EST or CST only We are looking for a confident IT Infrastructure Engineer who has demonstrated experience engineering and maintaining... ...and Ansible code ~ Scripting experience (preferably Python and Shell) ~ Knowledge of CI/CD concepts ~ Experience with...Remote work
$125.5k - $230.2k
...and the opportunity to develop your own career path... ...you will monitor and evaluate learning processes to... ...teams. Strong skills in Python. Ability lead, collaborate... ...experience in DevOps tools (GIT, Azure DevOps),... ...., utilities, energy, infrastructure, manufacturing, or transportation...Full timeWork experience placementSummer holidayFlexible hours$10k
...Border Patrol Agent (BPA) in the Federal Security and Public Safety Sector (Entry Level... ...information rapidly and make prompt decisions; or develop and maintain contact with a network of... ..., etc.) to submit. You will be evaluated based on your resume, supporting documents...Full timeWork experience placementImmediate startRelocationNight shift$10k
...Border Patrol Agent (BPA) – in the Federal Security and Public Safety Sector Experienced... ...rapidly and make prompt decisions; or develop and maintain contact with a network of informants... ..., etc.) to submit. You will be evaluated based on your resume, supporting documents...Full timeLocal areaImmediate startRelocationNight shift- ...Optics Engineer responsible for designing camera products, including automotive camera lenses. This role involves developing design specifications, evaluating CMOS chips, and ensuring proper integration into ECU designs. The ideal candidate should possess a Master’s...
$140k - $160k
...the efforts to design, develop, and maintain... ...party vendors to define, evaluate, and align business requirements... ...experience with ETL tools - SSIS Four (4) or... ...engineering, such as Python or Java Our Company... ...the necessary infrastructure to ensure stability and...Work experience placementRemote workWork from home$97.5k - $199.5k
...facilities, and complex electrical infrastructure. This role is responsible... ...quality studies, as well as developing validated simulation models... ...system performance. ~ Evaluate conventional and inverter-based... ...Experience with additional tools such as PSS/E, EMTP,...Temporary workFlexible hours$10k
...Update GL-5/7 grade levels Border Patrol Agent (BPA) – in the Federal Security and... ...information rapidly and make prompt decisions; or develop and maintain contact with a network of... ..., etc.) to submit. You will be evaluated based on your resume, supporting documents...Full timeWork experience placementImmediate startRelocationNight shift$55 per hour
A leading AI consulting firm is seeking a Freelance AI Trainer with expertise in Civil Engineering and Python to develop and evaluate AI models. This remote position offers flexibility, allowing candidates to design rigorous engineering problems and validate AI outputs...FreelanceRemote work- ...Manufacturing Engineer, you will be responsible for developing manufacturing processes and tooling design for composite parts fabrication using approved... ...six month build schedules. You may be tasked with evaluating any damage from normal operations by our customers and...For contractors
$138.8k - $232.7k
...downstream AI consumption. Develop and operate robust... ...with observability tools (e.g., Splunk, Looker/... ...insight tools. Evaluate, prototype, and productionize... ...Gen AI capabilities (agents, function calling,... ...). Experience with infrastructure-as-code and CI/CD (Terraform...Immediate startRemote workFlexible hours$56.7k - $114.85k
...Blue Cross Blue Shield of Michigan is seeking an Infrastructure Engineer I to manage and enhance computing platforms in Lansing, MI. Responsibilities include overseeing IT operations, deploying new technologies, and ensuring secure systems. The ideal candidate will hold...$139.4k - $291.8k
...documentation practices, and developing enterprise-wide best... ...capability. -Develops internal tools to manage compliance documentation... ...across the organization. Evaluates effectiveness of updated... ...brings together the data, infrastructure, applications, and expertise...Temporary workFlexible hoursShift work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to MCP & Tools Python Developer - Agent Evaluation Infrastructure. Be the first to apply!
- full stack / python developer (remote) Lansing, MI
- commissioning agent Lansing, MI
- cruise agent Lansing, MI
- state farm agent Lansing, MI
- airport agent Lansing, MI
- executive protection agent Lansing, MI
- work from home chat agent Lansing, MI
- import export agent Lansing, MI
- remote chat agent Lansing, MI
- agent Lansing, MI



