MCP & Tools Python Developer - Agent Evaluation Infrastructure

$80 per hour

Mind Rift

5 days ago
Be among the first 25 applicants

This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.

At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI.

What We Do

The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting‑edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real‑world expertise from across the globe.

Who We're Looking For

Calling all security researchers, engineers, and penetration testers with a strong foundation in problem‑solving, offensive security, and AI‑related risk assessment. If you thrive on digging into complex systems, uncovering hidden vulnerabilities, and thinking creatively under constraints, join us! We're looking for someone who can bring a hands‑on approach to technical challenges, whether breaking into systems to expose weaknesses or building secure tools and processes. We value contributors with a passion for continuous learning, experimentation, and adaptability.

About The Project

We're on the hunt for hands‑on Python engineers for a new project focused on developing Model Context Protocol (MCP) servers and internal tools for running and evaluating agent behavior. You'll implement base methods for agent action verification, integrate with internal and client infrastructures, and help fill tooling gaps across the team.
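To make "agent action verification" more concrete, here is a minimal sketch of what checking an agent's recorded actions against a scenario definition might look like. It is illustrative only and not taken from the project: the `Action` and `Scenario` classes, the `verify` function, and the matching rule (expected actions must appear in order, extra actions are tolerated) are all assumptions, and no MCP library is used.

```python
from dataclasses import dataclass, field


@dataclass
class Action:
    """One tool call made by an agent (hypothetical shape)."""
    tool: str
    args: dict = field(default_factory=dict)


@dataclass
class Scenario:
    """A scenario definition: the actions we expect, in order (hypothetical)."""
    name: str
    expected: list


def matches(expected: Action, actual: Action) -> bool:
    """True if the tool names agree and every expected arg is present with the same value."""
    if expected.tool != actual.tool:
        return False
    return all(k in actual.args and actual.args[k] == v
               for k, v in expected.args.items())


def verify(scenario: Scenario, actions: list) -> list:
    """Return the expected actions that were NOT found in order.

    An empty list means the run satisfied the scenario. Extra actions
    between the expected ones are allowed under this assumed rule.
    """
    missing = []
    pos = 0  # index in `actions` from which the next search starts
    for exp in scenario.expected:
        for i in range(pos, len(actions)):
            if matches(exp, actions[i]):
                pos = i + 1
                break
        else:
            missing.append(exp)
    return missing


scenario = Scenario(
    name="search-then-apply",
    expected=[Action("search", {"query": "python"}), Action("apply", {"job_id": 1})],
)
good_run = [
    Action("search", {"query": "python", "page": 1}),
    Action("open", {"job_id": 1}),
    Action("apply", {"job_id": 1}),
]
bad_run = [Action("apply", {"job_id": 1}), Action("search", {"query": "python"})]

print(verify(scenario, good_run))  # empty list: scenario satisfied
print(verify(scenario, bad_run))   # the out-of-order "apply" is reported as missing
```

A real evaluation server would likely need stricter or configurable rules (forbidden actions, exact ordering, argument schemas); the ordered-subsequence check here is just one simple choice.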
What you'll be doing

  • Developing and maintaining MCP‑compatible evaluation servers
  • Implementing logic to check agent actions against scenario definitions
  • Creating or extending tools that writers and QAs use to test agents
  • Working closely with infrastructure engineers to ensure compatibility
  • Occasionally helping with test writing or debug sessions when needed

Although we're only looking for experts for this current project, contributors with consistent high‑quality submissions may receive an invitation for ongoing collaboration across future projects.

How to get started

Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.

Requirements

The ideal contributor will have:

  • 4+ years of Python development experience, ideally in backend or tools
  • Solid experience building APIs, testing frameworks, or protocol‑based interfaces
  • Understanding of Docker, Linux CLI, and communication
  • Ability to integrate new tools into existing infrastructures
  • Familiarity with how LLM agents are prompted, executed, and evaluated
  • Clear documentation and communication skills – you'll work with QA and writers

We also value applicants who have:

  • Experience with Model Context Protocol (MCP) or similar structured agent‑server interfaces
  • Knowledge of FastAPI or similar async web frameworks
  • Experience working with LLM logs, scoring functions, or sandbox environments
  • Ability to support dev environments (devcontainers, CI configs, linters)
  • JS experience

Benefits

  • Get paid for your expertise, with rates that can go up to $80/hour depending on your skills, experience, and project needs
  • Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments
  • Participate in an advanced AI project and gain valuable experience to enhance your portfolio
  • Influence how future AI models understand and communicate in your field of expertise

  • Seniority level: Mid‑Senior level
  • Employment type: Part‑time
  • Job function: Other
  • Industries: IT Services and IT Consulting

Vacancy posted 14 hours ago
Similar jobs that could be interesting for you
Based on the MCP & Tools Python Developer - Agent Evaluation Infrastructure in Florida, NY vacancy
  • $80 per hour

     ...opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not...  ...or related fields 5+ years in software development, primarily Python (pytest, async/await, subprocess, file operations) Background in... 
    Suggested
    Permanent employment
    Temporary work
    Freelance

    Mind Rift

    Florida, NY
    14 hours ago
  • $80 per hour

    A tech consulting firm in New York is seeking experienced Python engineers for a part-time, remote role. You will develop Model Context Protocol servers and tools while collaborating with infrastructure engineers. The ideal candidate has over 4 years of Python development... 
    Suggested
    Part time
    Remote work
    Flexible hours

    Mind Rift

    Florida, NY
    14 hours ago
  • $55 per hour

     ...opportunity platform in New York seeks experienced energy engineers for project-based AI evaluations. Contributors will design engineering problems, validate AI solutions, and require strong Python and English skills. Ideal candidates have a degree in Energy Engineering or... 
    Suggested
    Hourly pay
    Part time
    10 hours per week

    Mind Rift

    Florida, NY
    14 hours ago
  • $80 per hour

     ...companies, focused on testing, evaluating, and improving AI systems...  ...is suited for a Senior Python developer with deep functional...  ...tasks, and confidence using tools like Roo Code or Claude...  ...+ Prior experience with agent evaluation platforms and MCP CLI Tools and Technologies... 
    Suggested
    Hourly pay
    Permanent employment
    Temporary work
    Freelance
    Remote work
    Flexible hours

    Mind Rift

    Florida, NY
    14 hours ago
  • $55 per hour

     ...Freelance AI Trainer - Civil Engineering & Python 1 week ago Be among the first 25...  ...Engineers with Python skills to train and evaluate AI models on realistic civil engineering...  ...Construction Engineering & Management Infrastructure, Coastal, Earthquake, Sustainable Engineering... 
    Suggested
    Part time
    Freelance
    Remote work
    Flexible hours

    Mind Rift

    Florida, NY
    14 hours ago
  • AI Infrastructure Engineer At BNY, our culture allows us to run our company better and enables...  ...Kubernetes, Docker, and GPU orchestration tools like Run:AI and NVIDIA BCM. Own the...  ...infrastructure operations and provisioning using Python, Bash, and tools like Terraform or... 
    Work experience placement
    Worldwide
    Flexible hours

    BNY Mellon

    Florida, NY
    4 days ago
  • $40 per hour

     ...pay starting at $40. You will design coding problems for AI training, write clear code, and evaluate AI outputs. Proficiency in programming languages such as JavaScript, Python, or C++ and strong English skills are required. Join a dynamic team and help shape the future... 
    Remote job
    Hourly pay
    Contract work

    The Voluntary Protection Programs Participants' Association,...

    Florida, NY
    1 day ago
  • Job Summary As an AI Agent Engineer at IntelePeer, you will build, configure, test, and...  ...the intersection of applied AI, voice infrastructure, and healthcare workflow automation. You...  ...independently. • Leverage AI-assisted development tools to accelerate build velocity, generate... 
    Work at office
    Work visa
    Flexible hours

    IntelePeer

    Florida, NY
    14 hours ago
  •  ...services. Our contact centers are powered by both on-site and remote agents, leveraging advanced technologies to enhance customer journeys,...  ...to maintain regular attendance and punctuality The ability to evaluate, troubleshoot, and follow up on customer issues An aptitude for... 
    Full time
    Contract work
    Temporary work
    Casual work
    Work at office
    Local area
    Remote work
    Monday to Friday
    Shift work
    Weekend work

    MCI Careers

    Florida, NY
    1 day ago
  •  ...participation in pipeline infrastructure development,...  ...productivity improvements Evaluate operational practices...  ...networks and security tools, establishing necessary...  ...operational networks Develop and improve processes...  ...such as Aspen HYSYS, Python, Ignition Proficiency... 
    Full time
    Relocation

    NextEra Energy Resources

    Florida, NY
    2 days ago
  •  ...understanding. Develop and optimize ML models...  ...data. Build agents capable of operating...  ...stack, improve tooling, and drive...  ...improvement of the ML infrastructure and processes for...  ...programming skills in Python (TypeScript...  ...model training and evaluation. Familiarity with... 
    Full time

    Catalyst Labs

    Florida, NY
    1 day ago
  •  ...the benefit of Earth. We’re working to develop reusable, safe, and low‑cost space vehicles...  ..., subsystems, and ground support infrastructure. Design and implement workflow models,...  ...bottlenecks and improve throughput. Support tooling and fixture development to improve... 
    Temporary work
    Local area

    Blue Origin LLC

    Florida, NY
    14 hours ago
  •  ...implementation of novel manufacturing processes and tooling concepts for launch vehicle structures,...  ...) principles are incorporated early. Develop tooling concepts and design...  ...both hardware and operations. Support infrastructure planning for launch pads, integration facilities... 
    Temporary work
    Local area

    Blue Origin LLC

    Florida, NY
    14 hours ago
  • A logistics company is seeking highly motivated Independent Freight Brokers/Agents to leverage industry-leading infrastructure and maximize earning potential. This full-time role involves managing your own book of business, from covering loads to sourcing leads. Candidates... 
    Full time

    Patterson Companies

    Florida, NY
    1 day ago
  • $35k

     ...highly motivated Independent Freight Broker/Agents (1099) to join our team at Patterson...  ...opportunity to leverage our industry‑leading infrastructure, extensive carrier network, and...  ...all aspects of freight transportation. Develop and maintain relationships with clients... 
    Full time

    Patterson Companies

    Florida, NY
    1 day ago
  • $55 per hour

     ...Freelance Mechanical Engineering Expert with Python Expertise - AI Trainer 2 days ago Be among the first 25 applicants This opportunity...  ...that challenge AI. Define comprehensive scoring criteria to evaluate the accuracy of the AI's answers. Correct the model's... 
    Part time
    Freelance
    Remote work

    Mind Rift

    Florida, NY
    14 hours ago
  • $55 per hour

     ...Mechanical Engineer with Python Experience – Freelance AI Trainer 2 days ago Be among the first 25 applicants This opportunity is only...  ...Mechanical Engineers with strong Python skills to train and evaluate AI models on complex, real‑world mechanical engineering problems... 
    Part time
    Freelance
    Remote work
    Flexible hours

    Mind Rift

    Florida, NY
    14 hours ago
  • $55 per hour

     ...driven AI company is looking for experienced Civil Engineers with Python skills to train AI models on realistic problems. This part-time...  ...and have advanced Python proficiency. You will design and evaluate engineering problems, ensuring AI outputs meet professional standards... 
    Part time
    Freelance
    Remote work
    Flexible hours

    Mind Rift

    Florida, NY
    14 hours ago
  • $300 per month

     ...Virtual Utility® technology within critical infrastructure. The company was also awarded the 2025...  ...grid‑interactive power architectures. Evaluate system components and recommend...  ...using MATLAB or equivalent analytical tools. Develop and maintain engineering software tools... 
    Full time
    Work at office
    Remote work
    Relocation
    Relocation package

    e2Companies

    Florida, NY
    4 days ago
  • $55 per hour

     ...and at least 3 years of experience in the field. Responsibilities include designing engineering problems, evaluating AI solutions, and validating results using Python. This part-time role offers flexible working hours, with compensation up to $55 per hour, depending on... 
    Hourly pay
    Contract work
    Part time
    Flexible hours

    Mindrift

    Florida, NY
    4 days ago
  •  ...10 years of experience, including at least 6 years of Test & Evaluation (T&E) experience. Active Top Secret/SCI Clearance. Must be Special...  ...with instrumentation, range, aircraft, aircrew, data, infrastructure, and safety constraints associated with developmental flight... 
    For contractors

    Inteletech Global Inc

    Florida, NY
    3 days ago
  • $55 per hour

     ...opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not...  ...assumptions, and constraints; Validate calculations or simulations using Python (NumPy, Pandas, SciPy); Improve AI reasoning to align with... 
    Hourly pay
    Permanent employment
    Temporary work
    Part time
    Freelance
    10 hours per week

    Mind Rift

    Florida, NY
    14 hours ago
  •  ...oriented Senior Virtual Desktop Infrastructure Engineer to join our team in...  ...user experience optimization tools. Troubleshoot complex VDI...  ...and Agreement process areas. Develop and maintain engineering documentation...  .../automation (PowerShell, Python), endpoint management, or... 
    Work at office

    hackajob

    Florida, NY
    3 days ago
  •  ...in agentic systems, developer tooling, and AI platform integration...  ...— including agent frameworks, tool ecosystems...  ...agentic workflows, MCP server infrastructure, and multi-model...  ..., and rigorous AI evaluation frameworks to ensure...  ...programming language (Python, TypeScript/Node.js,... 
    Local area
    Remote work
    Home office

    E W Scripps CO

    Florida, NY
    14 hours ago
  • $125k - $135k

     ...design scalable data infrastructure that powers business intelligence...  ...operations — design, develop, and maintain robust...  ...modern orchestration tools such as Apache Airflow...  ...; run and update Python and/or JavaScript code...  ...planning sessions. Evaluate and recommend new tools... 

    SteerBridge

    Florida, NY
    14 hours ago
  • $130k - $178k

     ...that haven’t been solved before, helping develop a rocket, a factory, and a business...  ...requests for proposals, bid packages, bid evaluations, subcontractor selection, and...  ...principles to projects in industrial or infrastructure environments Ability to work effectively... 
    For subcontractor

    Relativity Space

    Florida, NY
    2 days ago
  • An innovative engineering firm in the Town of Florida is seeking an Entry Level Engineer to support various engineering assignments under the guidance of experienced mentors. Responsibilities include drafting engineering documents, assisting with client meetings, and managing...

    Commonwealth Associates, Inc.

    Florida, NY
    4 days ago
  • Job Role - Python Developer Location - Florida Position - Contract Role - Onsite Job Description: We are seeking a Python Developer with strong...  ...The ideal professional is comfortable working with AI coding tools such as Codex, Claude Code, Cursor, and Spec Kit to... 
    Contract work

    GCS Recruitment

    Florida, NY
    14 hours ago
  •  ...the development of deep learning models (Python, PyTorch) for computer vision and...  ...maintain end-to-end ML pipelines (training → evaluation → deployment); Own experiment tracking,...  ...; Experience with ML pipelines, cloud infrastructure, or MLOps tooling; You care about what survives... 
    Work at office
    Flexible hours

    sewts.VELUM

    Florida, NY
    3 days ago
  •  ...technical‑first role, with a strong emphasis on backend engineering, APIs, cloud infrastructure, and deployment workflows. Responsibilities Design, develop, and maintain backend services using Python and FastAPI, exposing clean REST APIs Collaborate with frontend, backend,... 

    IT Labs, LLC

    Florida, NY
    4 days ago
