MCP & Tools Python Developer - Agent Evaluation Infrastructure

$80 per hour

Mindrift

6 days ago

This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.

At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI.

What We Do

The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting‑edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real‑world expertise from across the globe.

Who We're Looking For

Calling all security researchers, engineers, and penetration testers with a strong foundation in problem‑solving, offensive security, and AI‑related risk assessment. If you thrive on digging into complex systems, uncovering hidden vulnerabilities, and thinking creatively under constraints, join us! We value contributors with a passion for continuous learning, experimentation, and adaptability.

About The Project

We’re on the hunt for hands‑on Python engineers for a new project focused on developing Model Context Protocol (MCP) servers and internal tools for running and evaluating agent behavior. You'll implement base methods for agent action verification, integrate with internal and client infrastructures, and help fill tooling gaps across the team.

What you'll be doing

  • Developing and maintaining MCP‑compatible evaluation servers
  • Implementing logic to check agent actions against scenario definitions
  • Creating or extending tools that writers and QAs use to test agents
  • Working closely with infrastructure engineers to ensure compatibility
  • Occasionally helping with test writing or debug sessions when needed

Although we’re only looking for experts for this current project, contributors with consistent high‑quality submissions may receive an invitation for ongoing collaboration across future projects.

How to get started

Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.

Requirements

The ideal contributor will have:

  • 4+ years of Python development experience, ideally in backend or tools
  • Solid experience building APIs, testing frameworks, or protocol‑based interfaces
  • Understanding of Docker, Linux CLI, and communication
  • Ability to integrate new tools into existing infrastructures
  • Familiarity with how LLM agents are prompted, executed, and evaluated
  • Clear documentation and communication skills – you’ll work with QA and writers

We also value applicants who have:

  • Experience with Model Context Protocol (MCP) or similar structured agent‑server interfaces
  • Knowledge of FastAPI or similar async web frameworks
  • Experience working with LLM logs, scoring functions, or sandbox environments
  • Ability to support dev environments (devcontainers, CI configs, linters)
  • JS experience

Benefits

  • Get paid for your expertise, with rates that can go up to $80/hour depending on your skills, experience, and project needs
  • Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments
  • Participate in an advanced AI project and gain valuable experience to enhance your portfolio
  • Influence how future AI models understand and communicate in your field of expertise

Seniority level: Mid‑Senior level
Employment type: Part‑time
Job function: Other
Industries: IT Services and IT Consulting

Vacancy posted 3 days ago
Similar jobs that could be interesting for you
Based on the MCP & Tools Python Developer - Agent Evaluation Infrastructure in Providence, RI vacancy
  • $80 per hour

    A technology firm is seeking QAs for autonomous AI agents to validate and improve task structures within a new project. Candidates will...  ...thinkers with strong attention to detail and experience in evaluating scenarios. This flexible, project-based role offers competitive... 
    Suggested
    Remote work
    Flexible hours

    Mindrift

    Providence, RI
    3 days ago
  • $20 - $40 per hour

    A leading AI consulting firm is looking for a hands-on Python engineer to join their project in Rhode Island. The role involves developing evaluation servers and tools for agent behavior, with an emphasis on collaboration and creativity. Ideal candidates should have at... 
    Suggested
    Hourly pay
    Freelance
    Remote work
    Flexible hours

    Mindrift

    Providence, RI
    3 days ago
  • $80 per hour

     ...modern AI systems are tested and evaluated? This is a flexible, project...  ...for QAs for autonomous AI agents for a new project focused on...  ...closely with QA, writers, or developers to suggest refinements or edge...  ...future of AI while building tools that benefit everyone. #J-18... 
    Suggested
    Permanent employment
    Part time
    Freelance
    Remote work
    Flexible hours

    Mindrift

    Providence, RI
    3 days ago
  • $80 per hour

     ...consultancy in the United States is seeking Quality Assurance professionals to validate and improve autonomous AI agents. You will analyze complex systems, evaluate task structures, and ensure logical consistency through detailed reviews. Ideal candidates possess strong... 
    Suggested
    Remote job
    Hourly pay
    Flexible hours

    Mindrift

    Providence, RI
    3 days ago
  • $99.6k - $223.4k

     ...is seeking a Senior AI Agent Engineer to build...  ...balancing speed with safety, evaluation, and maintainability....  ...work across prompts, tools, code, APIs,...  ...engineering workflows. Develop structured tool-calling...  ...brings together the data, infrastructure, applications, and expertise... 
    Suggested
    Temporary work
    Flexible hours

    Oracle

    Providence, RI
    6 days ago
  • $40 per hour

    A technology solutions company is seeking a Web Application Developer to improve AI models by evaluating coding outputs and performance. Candidates should be proficient in Python or JavaScript and have experience with algorithms and debugging. This remote position allows... 
    Hourly pay
    Remote work
    Flexible hours

    DataAnnotation

    Providence, RI
    3 days ago
  • $55 per hour

     ...Freelance AI Trainer - Civil Engineering & Python 3 days ago Be among the first 25...  ...Engineers with Python skills to train and evaluate AI models on realistic civil...  ...Construction Engineering & Management ~ Infrastructure, Coastal, Earthquake, Sustainable Engineering... 
    Part time
    Freelance
    Remote work
    Flexible hours

    Mindrift

    Providence, RI
    3 days ago
  •  ...including design of the instruments, controls, and sequences of operations of commercial HVAC systems and unit processes. Develop and evaluate system approaches, concepts, and options to determine the best design and engineering solution for each project based upon... 
    For contractors
    Local area
    Flexible hours

    Arden Engineering Constructors

    Pawtucket, RI
    8 days ago
  • $60 per hour

    A leading data analysis company is seeking quantitative professionals to evaluate AI-generated work, design solutions for AI training, and provide impactful feedback. This role offers the flexibility to work remotely from various countries including the US and Canada,... 
    Hourly pay
    Remote work

    DataAnnotation

    Providence, RI
    3 days ago
  • $286.2k - $326.7k

     ...investments in technology infrastructure and world-class talent...  ...at scale and delivers developer joy, for all Capital...  ..., guardrails, model evaluation, deployment, and both...  ...programming in C, C++, Python, or Scala At least...  ...TensorFlow) and orchestration tools (Databricks, Airflow,... 
    Full time
    Part time
    Local area
    Remote work
    Flexible hours

    Capital One

    Providence, RI
    1 day ago
  • $10k

     ...Border Patrol Agent (BPA) in the Federal Security and Public Safety Sector Experienced...  ...information rapidly and make prompt decisions; or develop and maintain contact with a network of...  ..., etc.) to submit. You will be evaluated based on your resume, supporting documents... 
    Full time
    Local area
    Immediate start
    Relocation
    Night shift

    US Customs and Border Protection

    Cranston, RI
    6 days ago
  •  ...pharmacy-specific ML projects. Design and develop ML models for pharmacy growth and...  ...baseline model development. Conduct model evaluation and selection through systematic experimentation...  ...testing environments on Azure cloud infrastructure. Support potential migration from Azure... 
    Remote work

    Hispanic Alliance for Career Enhancement

    Providence, RI
    3 days ago
  • $286.2k - $326.7k

     ...investments in technology infrastructure and world-class talent...  ...at scale and delivers developer joy, for all Capital...  ...testing, guardrails, evaluation, deployment, and both...  ...messaging ~ Proficiency in Python, Java, C++, or Golang;...  ...) and orchestration tools (Databricks, Airflow,... 
    Full time
    Part time
    Local area
    Remote work

    Capital One

    Providence, RI
    3 days ago
  • $138k - $208k

     ...team, you will work on developing and deploying ML and...  ...LLMOps reliability and infrastructure. Responsibilities...  ...prompt development and evaluation, training ML models, building...  ...Familiarity with agent orchestration frameworks, LLM observability tools, and prompt... 
    Work experience placement
    Local area

    Indeed

    Providence, RI
    2 days ago
  • $140k - $200k

    Software Engineer, Data Infrastructure & Acquisition — Providence, RI, USA Location: Providence, RI, USA. Full‑time. Salary range: $140k-$2...  ...industry experience in software development. Proficiency with bash/Python scripting in Linux environments. Experience with Docker,... 
    Full time

    TryApplyNow

    Providence, RI
    1 day ago
  • $114.6k - $234.6k

     ...Description At Oracle Cloud Infrastructure (OCI), we build the...  ...Responsibilities: Design and develop AI software in Java, Python, and other languages.?...  ...-tuning, model serving, evaluation/benchmarking and human...  ...Attention mechanism, Prompt tooling Disclaimer: Certain... 
    Temporary work
    Flexible hours

    Oracle

    Providence, RI
    3 days ago
  • $10k

    " Border Patrol Agent (BPA) – in the Federal Security and Public Safety Sector Experienced...  ...rapidly and make prompt decisions; or develop and maintain contact with a network of informants...  ..., etc.) to submit. You will be evaluated based on your resume, supporting documents... 
    Full time
    Local area
    Immediate start
    Relocation
    Night shift

    US Customs and Border Protection

    East Providence, RI
    4 hours ago
  • $80k - $90k

     ...strategy, development, and delivery of sales tools & processes that enable an efficient,...  ...with cross‑functional teams - including developers, designers, product managers/owners, and...  ...Languages and Frameworks JavaScript/Node.js Python REST APIs OAuth2 and other... 
    Work experience placement
    Internship
    Work at office
    Worldwide
    Relocation

    Motorola Solutions

    Providence, RI
    1 day ago
  • $102.3k - $209.5k

     ...Job Description Job Description Oracle Cloud Infrastructure (OCI) is seeking a Senior ICT Project Engineer - Data Center Infrastructure (IC4) to support hyperscale cloud and AI data center deployments within OCI’s Data Center Engineering and Infrastructure organization... 
    Temporary work
    For contractors
    Relocation
    Flexible hours

    Oracle

    Providence, RI
    4 days ago
  • $40 per hour

    A cybersecurity technology company is seeking experienced professionals to evaluate AI-generated security content and solve technical cybersecurity problems. This remote role offers flexibility in project selection and scheduling, with compensation starting at $40+ per... 
    Remote job
    Hourly pay

    DataAnnotation

    Providence, RI
    1 day ago
  • $40 per hour

    A data-driven technology firm is seeking an Aerospace Engineer to help train and evaluate AI models. This role involves assessing AI chatbot performance and addressing complex problems. Candidates should possess a strong understanding of aerospace engineering or related... 
    Remote job
    Hourly pay
    For contractors
    Flexible hours

    DataAnnotation

    Providence, RI
    1 day ago
  • $125.5k - $230.2k

     ...and the opportunity to develop your own career path...  ...you will monitor and evaluate learning processes to...  ...teams. Strong skills in Python. Ability lead, collaborate...  ...experience in DevOps tools (GIT, Azure DevOps),...  ...., utilities, energy, infrastructure, manufacturing, or transportation... 
    Full time
    Work experience placement
    Summer holiday
    Flexible hours

    Ernst & Young Oman

    Providence, RI
    3 days ago
  • $109.2k - $223.4k

     ...organizational pain points while proactively developing recommendations and action plans....  ...operations programs for Engineering & Infrastructure organizations in partnership with CIO-office...  ...practices, forecasting mechanisms, tooling strategies, reporting frameworks, and organizational... 
    Temporary work
    Work at office
    Flexible hours

    Oracle

    Providence, RI
    2 days ago
  • $60 per hour

    A leading AI development company is seeking quantitative professionals to evaluate AI-generated analyses and develop solutions in various quantitative fields. This fully remote position allows for flexible scheduling and competitive pay up to $60/hour. Candidates should... 
    Remote job
    Flexible hours

    DataAnnotation

    Providence, RI
    3 days ago
  • $60 per hour

    A leading AI consultancy is seeking experienced quantitative professionals to evaluate and improve AI models. This fully remote role allows you to set your own schedule while focusing on statistical analysis, predictive modeling, and data-driven insights. Candidates should... 
    Remote job

    DataAnnotation

    Providence, RI
    1 day ago
  • $170k - $215k

     ...organization (Citeline, Evaluate, MMIT, Panalgo, The...  ...internal teams on these developments All other duties as...  ...model context protocol (MCP) servers, retrieval augmented...  ...Excellent knowledge of Python and core data science...  ...usage of AI coding tools and workflows Benefits... 
    Full time
    Temporary work
    Local area
    Remote work
    Flexible hours

    Norstella

    Providence, RI
    4 days ago
  • A cybersecurity company is seeking experienced professionals to evaluate AI-generated security content. The responsibilities include solving technical cybersecurity problems, writing code, and providing feedback to improve AI systems. Candidates should have over 2 years... 
    Remote work
    Flexible hours

    DataAnnotation

    Providence, RI
    1 day ago
  • $40 per hour

    A cybersecurity company is seeking experienced professionals to assist in evaluating AI-generated security content and solving technical cybersecurity problems. This role requires at least 2 years in cybersecurity with a focus on tasks like penetration testing and incident... 
    Hourly pay
    Remote work
    Flexible hours

    DataAnnotation

    Providence, RI
    1 day ago
  •  ...products. Functions: This position utilizes tool design skill and experience to plan,...  ...with blueprints, verbal instructions in developing appropriate tool designs. Works to close...  ...basic and advanced equipment. Capable of evaluating and repairing all mold problems.... 
    Remote work

    Silgan Dispensing

    Providence, RI
    1 day ago
  • $165k - $190k

     ...health in the future. The AI Agent & ML Engineer will design,...  ...Responsibilities Architect and develop multi-agent systems for...  ...Implement reasoning, planning, and tool-use capabilities using frameworks...  ...Strong programming skills in Python; experience with ML frameworks... 
    Temporary work
    Work visa

    Bausch + Lomb

    Providence, RI
    3 days ago
