Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

MCP & Tools Python Developer - Agent Evaluation Infrastructure

$80 per hour

Mindrift

Get AI‑powered advice on this job and more exclusive features. This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency. What We Do At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting‑edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real‑world expertise from across the globe. Who We’re Looking For Calling all security researchers, engineers, and penetration testers with a strong foundation in problem‑solving, offensive security, and AI‑related risk assessment. If you thrive on digging into complex systems, uncovering hidden vulnerabilities, and thinking creatively under constraints, join us! We’re looking for someone who can bring a hands‑on approach to technical challenges, whether breaking into systems to expose weaknesses or building secure tools and processes. We value contributors with a passion for continuous learning, experimentation, and adaptability. About The Project We’re on the hunt for hands‑on Python engineers for a new project focused on developing Model Context Protocol (MCP) servers and internal tools for running and evaluating agent behavior. You’ll implement base methods for agent action verification, integrate with internal and client infrastructures, and help fill tooling gaps across the team. What You’ll Be Doing Developing and maintaining MCP‑compatible evaluation servers Implementing logic to check agent actions against scenario definitions Creating or extending tools that writers and QAs use to test agents Working closely with infrastructure engineers to ensure compatibility Occasionally helping with test writing or debug sessions when needed Although we’re only looking for experts for this current project, contributors with consistent high‑quality submissions may receive an invitation for ongoing collaboration across future projects. Requirements 4+ years of Python development experience, ideally in backend or tools Solid experience building APIs, testing frameworks, or protocol‑based interfaces Understanding of Docker, Linux CLI, and communication Ability to integrate new tools into existing infrastructures Familiarity with how LLM agents are prompted, executed, and evaluated Clear documentation and communication skills – you’ll work with QA and writers We also value applicants who have: Experience with Model Context Protocol (MCP) or similar structured agent‑server interfaces Knowledge of FastAPI or similar async web frameworks Experience working with LLM logs, scoring functions, or sandbox environments Ability to support dev environments (devcontainers, CI configs, linters) JS experience Benefits Get paid for your expertise, with rates that can go up to $80/hour depending on your skills, experience, and project needs Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments Participate in an advanced AI project and gain valuable experience to enhance your portfolio Influence how future AI models understand and communicate in your field of expertise #J-18808-Ljbffr Mindrift

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the MCP & Tools Python Developer - Agent Evaluation Infrastructure in Dallas, TX vacancy
  •  ...Backend/Agent Engineer – Python, AI Agents, LangGraph, MCP We are seeking an experienced Backend...  ...in designing, developing, and maintaining robust...  ...workflow orchestration tools such as LangGraph. Experience...  ...pipelines, and cloud infrastructure (AWS, GCP, Azure). Knowledge... 
    Suggested

    Diversity Nexus

    Dallas, TX
    4 days ago
  • $74 - $78 per hour

     ...problems/projects through in-depth evaluation of complex business...  ...architecture blueprint. Develop standards for coding, testing...  ...areas such as architecture and infrastructure to accomplish business goals...  ...Experience: 8 years in Python and AI, with 2 years focused... 
    Suggested

    Axelon

    Irving, TX
    2 days ago
  •  ...Python Developer Python Developer 3 to 4 Years (Framework Development & Agentic AI nice to have...  ...integrations while enabling AI-driven agents to perform complex workflows securely and...  ...Streams). Experience with observability tools (Prometheus, OpenTelemetry, Cloud... 
    Suggested

    Photon

    Dallas, TX
    3 days ago
  •  ...Senior Python Developer / Gen AI Developer Grow your career as a Senior Python / Gen AI...  ...impact problems/projects through in-depth evaluation of complex business processes, system...  ...business, such as architecture and infrastructure, integrate to accomplish business... 
    Suggested
    Full time
    Contract work
    Temporary work
    Worldwide
    Flexible hours
    2 days per week
    3 days per week

    Motion Recruitment

    Irving, TX
    1 day ago
  • $50 per hour

     ...Description Job Description Python - Senior Developers - AI Training - USA About...  ...Network to help train and evaluate cutting-edge AI models. If...  ..., or Flask. Testing & Tools: proficiency with pytest,...  ...this crucial human data infrastructure, Prolific is positioning... 
    Suggested
    Work from home
    Flexible hours

    Prolific Academic Ltd

    Dallas, TX
    26 days ago
  • $74 - $78 per hour

     ...Sr Python Developer Global Financial Firm located in Irving...  ...through in-depth evaluation of complex business processes...  ...as architecture and infrastructure, integrate to...  ...Responsibilities AI Agent Development: Build...  ...generative models. Tool Orchestration: Define... 
    Contract work
    Work at office
    Immediate start
    Remote work

    Axelon

    Irving, TX
    1 day ago
  • $55 per hour

     ...modern AI systems are tested and evaluated? This is a flexible, project‑...  ...for QAs for autonomous AI agents for a new project focused on...  ...closely with QA, writers, or developers to suggest refinements or edge...  ...future of AI while building tools that benefit everyone. Requirements... 
    Permanent employment
    Part time
    Freelance
    Remote work
    Flexible hours

    Mindrift

    Dallas, TX
    1 day ago
  •  ...Senior hands-on Python development experience...  ...Senior Python Developer is a senior level...  ...orchestrating AI agents using frameworks...  ...through in-depth evaluation of complex business...  ...architecture and infrastructure, integrate to...  ...Define and manage API/tool access for AI... 

    Rose International

    Irving, TX
    7 days ago
  • $170k - $190k

    A leading AI solutions company is seeking a Solutions Architect to improve production-grade AI infrastructure and agent systems in Dallas, TX. You will collaborate with enterprise customers, design scalable infrastructure, and optimize AI applications. The ideal candidate... 
    Flexible hours

    LangChain

    Dallas, TX
    2 days ago
  •  ...job Lead Software Engineer (Python Developer) Our employees...  ...development methodologies, tools, and programming languages....  ...requirements. Partner with Infrastructure to identify and deploy optimal...  ...issues. Research and evaluate technical solutions consistent... 
    Full time
    Work at office
    Flexible hours

    MyHR Rescue

    Dallas, TX
    1 day ago
  •  ...Senior Healthcare Commissioning Agent LJC is seeking a Senior...  ...activities for critical hospital infrastructure, coordinate multidisciplinary...  ...design through occupancy. Develop and execute commissioning...  ...testing of building systems. Evaluate system performance and... 
    For contractors
    Flexible hours

    CRG

    Dallas, TX
    3 days ago
  • $55 per hour

     ...AI innovation firm in Dallas is seeking QAs for autonomous AI agents to ensure the quality of complex systems and scenarios. This flexible...  ...skills and attention to detail. Candidates should be adept at evaluating scenarios and documenting findings. The role offers competitive... 
    Remote job
    Flexible hours

    Mindrift

    Dallas, TX
    5 days ago
  • $141k - $212k

    Staff Engineer, Test & Evaluation Engineer role is responsible for defining, planning, and executing all integration testing...  ...Experience with test automation frameworks and data analysis tools (e.g., MATLAB, Python, LabVIEW, Minitab). Clearance: Ability to obtain and... 
    Temporary work
    Part time

    BlackCube Labs

    Dallas, TX
    4 days ago
  •  ...Public Key Infrastructure (PKI) Engineer Contract...  ...Infrastructure (PKI) Engineer to develop and implement...  ...integrating security tools to enhance system...  ...scripting and automation (Python, PowerShell, Bash)...  ...management processes Evaluate and improve data protection... 
    Contract work

    Genius Road

    Dallas, TX
    3 days ago
  •  ...means building agents that do real things...  ...as primary AI infrastructure. You will build...  ...top of those tools and extend them...  ..., and maintain MCP (Model Context...  ...providers • Develop skills, plugins...  ...AWS services • Evaluate and implement...  ...with production Python and SQL; additional... 

    Braviant Holdings

    Addison, TX
    5 days ago
  •  ...NTT DATA's Client is seeking a Python Gen AI Developer to design, implement, and optimize generative...  ...machine learning algorithms and tools Integrate generative AI models into...  ...and management, managed edge-to-cloud infrastructure services, BPO, systems integration and... 
    Contract work

    The Nippon Telegraph and Telephone Corporation (NTT)

    Irving, TX
    5 days ago
  • $40 - $60 per hour

     ...Lead Developer The team is hiring a Lead Developer and you will be responsible for working...  ...Fidelity's or vendor Data Management tools and evaluate new technologies. You will possess a...  ...knowledge of Unix OS /Shell scripting Basic Python knowledge is required Good working... 
    Hourly pay

    Cedent Life Talent

    Dallas, TX
    2 days ago
  •  ...| Southwest Airlines Job Title: Python Developer (Pytest, AWS, Terraform) Location...  ...frameworks, and managing cloud infrastructure using Infrastructure-as-Code (IaC) practices...  ...~ Familiarity with containerization tools like Docker (Kubernetes is a plus).... 
    Contract work
    H1b
    Local area

    Concord IT Systems

    Dallas, TX
    3 days ago
  •  ...innovative solutions for facility and infrastructure challenges. With locations...  ...acoustic analyses and develop design recommendations for healthcare...  ...-standard software. Evaluate and design for vibration and...  ...acoustic modeling and analysis tools such as EASE, Odeon, or... 
    Temporary work
    For contractors
    Work at office
    Local area
    Immediate start
    Flexible hours

    Smith Seckman Reid

    Dallas, TX
    7 days ago
  •  ...organization, apply now. NTT DATA's Client is currently seeking a Python Developer to join their team in Irving, Texas (US-TX), United States...  ..., implementation and management of applications, infrastructure and connectivity. We are one of the leading providers of digital... 

    The Nippon Telegraph and Telephone Corporation (NTT)

    Irving, TX
    1 day ago
  • $130k - $150k

     ...Senior Product Manager, AI Agents Work Type: Full-...  ...and capabilities. Develop single and multi-agent...  ...supporting RAG, tool integration, human-in-...  ...Governance, Data & Infrastructure Alignment Collaborate...  ...Lead A/B testing, agent evaluation, and post-deployment analysis... 
    Full time

    SelectMinds

    Dallas, TX
    1 day ago
  •  ...organization, apply now. We are currently seeking a Hybrid Python Developer to join our team in Irving, Texas (US-TX), United States (...  .... We are one of the world's leading AI and digital infrastructure providers, with unmatched capabilities in enterprise-scale... 
    Work at office
    Remote work
    Flexible hours

    NTT Data Americas, Inc.

    Irving, TX
    17 days ago
  •  ...team is looking for a skilled Python Developer for ML with deep expertise...  ...integrations while enabling AI-driven agents to perform complex workflows...  ...of ML algorithms, model evaluation techniques, and MLOps...  ...Experience with observability tools (Prometheus, OpenTelemetry,... 
    Shift work

    Photon

    Dallas, TX
    2 days ago
  •  ...1. Resolve Escalated Incidents Efficiently By Leveraging Infrastructure As Code (Iac) Practices Combined With Powershell Scripting, Ensuring...  ...Plans By Applying Powershell For Automation And Python For Data Analysis, Ensuring Compliance With Human Error Standards... 

    Prophecy Technologies

    Dallas, TX
    5 days ago
  • $105k - $150k

     ...runtime challenges by developing agentic AI...  ...Design and implement tool-calling agents that combine...  ...) following MCP protocol. Engineer...  ...Productionize LLMs: Build evaluation framework for...  ...more languages (Python, C/C++, Go, Java...  ...on cloud infrastructure (ideally AWS), including... 
    Full time

    eClercx

    Dallas, TX
    5 days ago
  • A tech company in Dallas, Texas, is seeking an experienced backend developer with over 8 years in Python to develop and maintain essential tools for AI systems. Responsibilities include managing database systems, ensuring safe AI operations, and implementing complex reasoning... 

    Ethereum Technologies LLC

    Dallas, TX
    2 days ago
  •  ...Lead Developer (Python, GenAI) As a Lead Developer (Python, GenAI),...  ...and ethics. Technology Evaluation: Proactively evaluate emerging...  ...experience building services and tools within a data engineering...  ...workflows, including Infrastructure as Code, containerization (... 
    Local area

    Luminant

    Irving, TX
    2 days ago
  •  ...company in Dallas is looking for a skilled backend developer to optimize AI agents through maintenance of backend tools and implementation of complex reasoning...  ...candidate will have over 8 years of experience with Python, familiarity with AI frameworks like LangChain,... 

    Robotics Prcocess Automation, LLC

    Dallas, TX
    3 days ago
  •  ...efficient, self-contained microservices using Python 3.x and modern web frameworks (FastAPI,...  ...tests. Configure logging and monitoring tools (Prometheus, Grafana, ELK stack) to...  ...cloud environments (AWS, Azure, or GCP) and infrastructure-as-code principles. ~ Solid... 

    NTT DATA

    Irving, TX
    4 days ago
  •  ...one or more languages (Python, C/C++, Go, Java);...  ...deployment/serving, evaluation and monitoring, data...  ...applications using RAG and tool-using agents (vector retrieval,...  ...and operating on cloud infrastructure (ideally AWS),...  ...enforcement) following MCP protocol. Engineer robust... 

    JSG (Johnson Service Group, Inc.)

    Dallas, TX
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to MCP & Tools Python Developer - Agent Evaluation Infrastructure. Be the first to apply!