Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

MCP & Tools Python Developer - Agent Evaluation Infrastructure

$80 per hour

Mind Rift

Get AI‑powered advice on this job and more exclusive features. This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency. What We Do At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting‑edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real‑world expertise from across the globe. Who We’re Looking For Calling all security researchers, engineers, and penetration testers with a strong foundation in problem‑solving, offensive security, and AI‑related risk assessment. If you thrive on digging into complex systems, uncovering hidden vulnerabilities, and thinking creatively under constraints, join us! We’re looking for someone who can bring a hands‑on approach to technical challenges, whether breaking into systems to expose weaknesses or building secure tools and processes. We value contributors with a passion for continuous learning, experimentation, and adaptability. About The Project We’re on the hunt for hands‑on Python engineers for a new project focused on developing Model Context Protocol (MCP) servers and internal tools for running and evaluating agent behavior. You’ll implement base methods for agent action verification, integrate with internal and client infrastructures, and help fill tooling gaps across the team. What You’ll Be Doing Developing and maintaining MCP‑compatible evaluation servers Implementing logic to check agent actions against scenario definitions Creating or extending tools that writers and QAs use to test agents Working closely with infrastructure engineers to ensure compatibility Occasionally helping with test writing or debug sessions when needed Although we’re only looking for experts for this current project, contributors with consistent high‑quality submissions may receive an invitation for ongoing collaboration across future projects. Requirements 4+ years of Python development experience, ideally in backend or tools Solid experience building APIs, testing frameworks, or protocol‑based interfaces Understanding of Docker, Linux CLI, and communication Ability to integrate new tools into existing infrastructures Familiarity with how LLM agents are prompted, executed, and evaluated Clear documentation and communication skills – you’ll work with QA and writers We also value applicants who have: Experience with Model Context Protocol (MCP) or similar structured agent‑server interfaces Knowledge of FastAPI or similar async web frameworks Experience working with LLM logs, scoring functions, or sandbox environments Ability to support dev environments (devcontainers, CI configs, linters) JS experience Benefits Get paid for your expertise, with rates that can go up to $80/hour depending on your skills, experience, and project needs Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments Participate in an advanced AI project and gain valuable experience to enhance your portfolio Influence how future AI models understand and communicate in your field of expertise #J-18808-Ljbffr Mind Rift

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the MCP & Tools Python Developer - Agent Evaluation Infrastructure in Dallas, TX vacancy
  •  ...Context Engineer – Fullstack Python (AI Agents, LangGraph, Context...  ...Python background to design, develop, and integrate intelligent...  ...Engineering: Design, optimize, and evaluate prompts for LLMs to achieve...  ...Familiarity with cloud infrastructure, containerization (Docker),... 
    Suggested
    Remote work

    Diversity Nexus

    Dallas, TX
    3 days ago
  • $74 - $78 per hour

     ...Sr Python Developer Opportunity Global Financial Firm located in Irving, TX has an immediate...  ...problems/projects through in-depth evaluation of complex business processes and...  ...of business, such as architecture and infrastructure, integrate to accomplish business goals... 
    Suggested
    Contract work
    Immediate start

    Axelon

    Irving, TX
    1 day ago
  • $55 per hour

     ...modern AI systems are tested and evaluated? This is a flexible, project‑...  ...for QAs for autonomous AI agents for a new project focused on...  ...closely with QA, writers, or developers to suggest refinements or edge...  ...future of AI while building tools that benefit everyone. Requirements... 
    Suggested
    Permanent employment
    Part time
    Freelance
    Remote work
    Flexible hours

    Mindrift

    Dallas, TX
    1 day ago
  • $170k - $190k

    A leading AI solutions company is seeking a Solutions Architect to improve production-grade AI infrastructure and agent systems in Dallas, TX. You will collaborate with enterprise customers, design scalable infrastructure, and optimize AI applications. The ideal candidate... 
    Suggested
    Flexible hours

    LangChain

    Dallas, TX
    2 days ago
  • NTT DATA is seeking an experienced AI Evaluation & Test Engineer in Dallas, Texas. This role requires a minimum of 10 years of experience...  ...The ideal candidate will have strong programming proficiency in Python, familiarity with AI evaluation frameworks, and a degree in a... 
    Suggested

    NTT DATA

    Dallas, TX
    2 days ago
  • $130k

     ...proficient in Python and has practical experience developing AI-powered systems...  ...worked with AI agents, Model Context Protocol (MCP), modern data...  ...agents (e.g., tool using agents, planners...  ...Pipeline Infrastructure as Code (IaC) -...  ...will be evaluated solely on the basis... 
    Full time
    For contractors
    Remote work

    Diverse Lynx

    Dallas, TX
    11 hours ago
  •  ...Senior Healthcare Commissioning Agent LJC is seeking a Senior...  ...activities for critical hospital infrastructure, coordinate multidisciplinary...  ...design through occupancy. Develop and execute commissioning...  ...testing of building systems. Evaluate system performance and... 
    For contractors
    Flexible hours

    CRG

    Dallas, TX
    3 days ago
  • $85k - $145k

     ...are seeking an experienced Public Key Infrastructure (PKI) Engineer responsible for the design...  ..., and revocation using industry tools. Evaluate and maintain cryptographic standards and...  ...Proficiency with scripting languages such as Python, PowerShell, Bash, or KornShell (KSH).... 

    Amatriot Group, LLC

    Dallas, TX
    3 days ago
  • $55 per hour

     ...AI innovation firm in Dallas is seeking QAs for autonomous AI agents to ensure the quality of complex systems and scenarios. This flexible...  ...skills and attention to detail. Candidates should be adept at evaluating scenarios and documenting findings. The role offers competitive... 
    Remote job
    Flexible hours

    Mindrift

    Dallas, TX
    11 hours ago
  • About the Position The AI Infrastructure Engineer - Emerging Technologies will support the Office of the VP of Technology Engineering & Innovation in evaluating, designing, and developing next‑generation AI‑ready data center infrastructure strategies. This role serves... 
    Work at office
    Work visa

    Medium

    Dallas, TX
    2 days ago
  •  ...means building agents that do real things...  ...as primary AI infrastructure. You will build...  ...top of those tools and extend them...  ..., and maintain MCP (Model Context...  ...providers • Develop skills, plugins...  ...AWS services • Evaluate and implement...  ...with production Python and SQL; additional... 

    Braviant Holdings

    Addison, TX
    11 hours ago
  •  ...rate trends. AI‑ready data infrastructure. Ingestion, semantic...  ...Transformation: DBT Language: Python, SQL Cloud: Azure Adjacent: Hex, MCP Requirements 4+ years in...  ..., embedding stores, evaluation frameworks (e.g....  ...overhead. Curious about new tooling. You’ve been watching the... 
    Shift work

    Billee Technologies, Inc.

    Dallas, TX
    2 days ago
  •  ...skilled professional who is passionate about infrastructure automation and eager to drive efficiency...  ..., 2-4 years of experience required Python, 2-4 years of experience required...  ...tasks and expectations for the role) Develop and implement automation solutions using... 
    Work at office

    Artech

    Dallas, TX
    11 hours ago
  •  ...innovative solutions for facility and infrastructure challenges. With locations...  ...acoustic analyses and develop design recommendations for healthcare...  ...-standard software. Evaluate and design for vibration and...  ...acoustic modeling and analysis tools such as EASE, Odeon, or... 
    Temporary work
    For contractors
    Work at office
    Local area
    Immediate start
    Flexible hours

    Smith Seckman Reid

    Dallas, TX
    2 days ago
  •  ...Addison, TX 75001 Job Overview We are seeking a Senior Python Software Engineer to design, develop, and maintain backend automation systems and data...  ...communication platforms such as Twilio or RingCentral Infrastructure & DevOps Docker (containerization) Linux... 

    Oms Medical Billing

    Addison, TX
    1 day ago
  • $80 per hour

    A technology company specializing in AI is seeking an experienced Python engineer to develop Model Context Protocol servers and internal tools. The ideal candidate will have over 4 years of Python experience, with a strong foundation in building APIs and understanding... 
    Remote job
    Freelance
    Flexible hours

    Mindrift

    Dallas, TX
    1 day ago
  • $55 per hour

     ...Freelance AI Trainer – Civil Engineering & Python 4 days ago – Be among the first 25...  ...with Python skills to train and evaluate AI models on realistic civil engineering...  ...Construction Engineering & Management ~ Infrastructure, Coastal, Earthquake, Sustainable Engineering... 
    Part time
    Freelance
    Remote work
    Flexible hours

    Mind Rift

    Dallas, TX
    3 days ago
  •  ...discussions, and document requirements, ensuring high-quality software solutions. The ideal candidate should possess strong Python skills, experience with AI agent frameworks, and a collaborative spirit. This role offers competitive benefits, including a 401(k) plan and health... 

    Infosys

    Addison, TX
    2 days ago
  • A tech company in Dallas, Texas, is seeking an experienced backend developer with over 8 years in Python to develop and maintain essential tools for AI systems. Responsibilities include managing database systems, ensuring safe AI operations, and implementing complex reasoning... 

    Ethereum Technologies LLC

    Dallas, TX
    2 days ago
  • $105k - $150k

     ...runtime challenges by developing agentic AI...  ...Design and implement tool-calling agents that combine...  ...) following MCP protocol. Engineer...  ...Productionize LLMs: Build evaluation framework for...  ...more languages (Python, C/C++, Go, Java...  ...on cloud infrastructure (ideally AWS), including... 
    Full time

    eClercx

    Dallas, TX
    11 hours ago
  •  ...sector agencies develop purpose-built...  ...compliant, and infrastructure-flexible deployments...  ..., used, and evaluated by real...  ...systems and multi-agent frameworks that...  ...function calling, and tool use....  ...AI landscape - MCP, LLM orchestration...  ...similar. ~ Strong Python engineering skills... 
    Remote work
    Worldwide
    Flexible hours

    H2O.ai

    Dallas, TX
    11 hours ago
  •  ...company in Dallas is looking for a skilled backend developer to optimize AI agents through maintenance of backend tools and implementation of complex reasoning...  ...candidate will have over 8 years of experience with Python, familiarity with AI frameworks like LangChain,... 

    Robotics Prcocess Automation, LLC

    Dallas, TX
    3 days ago
  • Job Title: Python Developer with GCP experience Location: 100% REMOTE Duration: 9+ Months to start (Extension Possible) Skills...  ...version control and CI/CD pipelines. Experience with Infrastructure as Code (IaC) tools, particularly Terraform. Experience with Generative... 
    Remote job

    Dexian

    Dallas, TX
    4 days ago
  • Senior Network Infrastructure Operations Lead Senior Network...  ..., technology evaluations, and standardization efforts...  ...to‑site connectivity. Develop scalable network strategies...  ...and inspection tools Experience supporting...  ...tools such as Terraform, Python, or Ansible.... 
    Work at office
    Remote work

    Buchanan Technologies, Inc.

    Dallas, TX
    11 hours ago
  • A technology integration company based in Dallas is seeking an experienced Python Developer with a focus on AI agent integration. You will be responsible for designing and maintaining Python applications, developing autonomous AI agents, and integrating machine learning... 

    Robotics Prcocess Automation, LLC

    Dallas, TX
    3 days ago
  •  ...focused Software Engineer to enhance our AI model training operations. This role involves sourcing audio data, extending cloud infrastructure on GCP, and collaborating closely with scientists. Ideal candidates have a BS/MS/PhD in Computer Science and 5+ years of software... 
    Remote work

    Clutch Canada

    Dallas, TX
    2 days ago
  •  ...one or more languages (Python, C/C++, Go, Java);...  ...deployment/serving, evaluation and monitoring, data...  ...applications using RAG and tool-using agents (vector retrieval,...  ...and operating on cloud infrastructure (ideally AWS),...  ...enforcement) following MCP protocol. Engineer robust... 

    JOHNSON SERVICE GROUP

    Dallas, TX
    3 days ago
  •  ...working alongside AI agents as part of their...  ...frontends to Python and Node.js APIs,...  ...stores, and cloud infrastructure. You will also use...  ...well-documented code Evaluate and review agent-produced...  ...with CI/CD tooling such as GitHub Actions...  ...Context Protocol (MCP) Hands-on... 
    Permanent employment

    Computer Visionaries AI

    Dallas, TX
    2 days ago
  • Senior Infrastructure Automation Engineer Job Description JOB SUMMARY We are seeking a Senior Infrastructure Automation...  ...deploy scalable, secure AWS cloud infrastructure. Develop automation tools and scripts with Python, Bash, PowerShell, and Ansible. Build CI/CD... 

    Compunnel, Inc.

    Dallas, TX
    4 days ago
  •  ...Engineer for the Electrical System Operations team. This role requires over 5 years of experience managing and designing electrical infrastructure within a healthcare environment. The candidate should have a Bachelor's degree in Mechanical or Electrical Engineering and a... 

    The University of Texas Southwestern Medical Center

    Dallas, TX
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to MCP & Tools Python Developer - Agent Evaluation Infrastructure. Be the first to apply!