MCP & Tools Python Developer - Agent Evaluation Infrastructure
$80 per hourMindrift
Get AI‑powered advice on this job and more exclusive features. This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency. What We Do At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting‑edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real‑world expertise from across the globe. Who We’re Looking For Calling all security researchers, engineers, and penetration testers with a strong foundation in problem‑solving, offensive security, and AI‑related risk assessment. If you thrive on digging into complex systems, uncovering hidden vulnerabilities, and thinking creatively under constraints, join us! We’re looking for someone who can bring a hands‑on approach to technical challenges, whether breaking into systems to expose weaknesses or building secure tools and processes. We value contributors with a passion for continuous learning, experimentation, and adaptability. About The Project We’re on the hunt for hands‑on Python engineers for a new project focused on developing Model Context Protocol (MCP) servers and internal tools for running and evaluating agent behavior. You’ll implement base methods for agent action verification, integrate with internal and client infrastructures, and help fill tooling gaps across the team. What You’ll Be Doing Developing and maintaining MCP‑compatible evaluation servers Implementing logic to check agent actions against scenario definitions Creating or extending tools that writers and QAs use to test agents Working closely with infrastructure engineers to ensure compatibility Occasionally helping with test writing or debug sessions when needed Although we’re only looking for experts for this current project, contributors with consistent high‑quality submissions may receive an invitation for ongoing collaboration across future projects. Requirements 4+ years of Python development experience, ideally in backend or tools Solid experience building APIs, testing frameworks, or protocol‑based interfaces Understanding of Docker, Linux CLI, and communication Ability to integrate new tools into existing infrastructures Familiarity with how LLM agents are prompted, executed, and evaluated Clear documentation and communication skills – you’ll work with QA and writers We also value applicants who have: Experience with Model Context Protocol (MCP) or similar structured agent‑server interfaces Knowledge of FastAPI or similar async web frameworks Experience working with LLM logs, scoring functions, or sandbox environments Ability to support dev environments (devcontainers, CI configs, linters) JS experience Benefits Get paid for your expertise, with rates that can go up to $80/hour depending on your skills, experience, and project needs Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments Participate in an advanced AI project and gain valuable experience to enhance your portfolio Influence how future AI models understand and communicate in your field of expertise #J-18808-Ljbffr Mindrift
- ...Backend/Agent Engineer – Python, AI Agents, LangGraph, MCP We are seeking an experienced Backend... ...in designing, developing, and maintaining robust... ...workflow orchestration tools such as LangGraph. Experience... ...pipelines, and cloud infrastructure (AWS, GCP, Azure). Knowledge...Suggested
$74 - $78 per hour
...problems/projects through in-depth evaluation of complex business... ...architecture blueprint. Develop standards for coding, testing... ...areas such as architecture and infrastructure to accomplish business goals... ...Experience: 8 years in Python and AI, with 2 years focused...Suggested- ...Python Developer Python Developer 3 to 4 Years (Framework Development & Agentic AI nice to have... ...integrations while enabling AI-driven agents to perform complex workflows securely and... ...Streams). Experience with observability tools (Prometheus, OpenTelemetry, Cloud...Suggested
- ...Senior Python Developer / Gen AI Developer Grow your career as a Senior Python / Gen AI... ...impact problems/projects through in-depth evaluation of complex business processes, system... ...business, such as architecture and infrastructure, integrate to accomplish business...SuggestedFull timeContract workTemporary workWorldwideFlexible hours2 days per week3 days per week
$50 per hour
...Description Job Description Python - Senior Developers - AI Training - USA About... ...Network to help train and evaluate cutting-edge AI models. If... ..., or Flask. Testing & Tools: proficiency with pytest,... ...this crucial human data infrastructure, Prolific is positioning...SuggestedWork from homeFlexible hours$74 - $78 per hour
...Sr Python Developer Global Financial Firm located in Irving... ...through in-depth evaluation of complex business processes... ...as architecture and infrastructure, integrate to... ...Responsibilities AI Agent Development: Build... ...generative models. Tool Orchestration: Define...Contract workWork at officeImmediate startRemote work$55 per hour
...modern AI systems are tested and evaluated? This is a flexible, project‑... ...for QAs for autonomous AI agents for a new project focused on... ...closely with QA, writers, or developers to suggest refinements or edge... ...future of AI while building tools that benefit everyone. Requirements...Permanent employmentPart timeFreelanceRemote workFlexible hours- ...Senior hands-on Python development experience... ...Senior Python Developer is a senior level... ...orchestrating AI agents using frameworks... ...through in-depth evaluation of complex business... ...architecture and infrastructure, integrate to... ...Define and manage API/tool access for AI...
$170k - $190k
A leading AI solutions company is seeking a Solutions Architect to improve production-grade AI infrastructure and agent systems in Dallas, TX. You will collaborate with enterprise customers, design scalable infrastructure, and optimize AI applications. The ideal candidate...Flexible hours- ...job Lead Software Engineer (Python Developer) Our employees... ...development methodologies, tools, and programming languages.... ...requirements. Partner with Infrastructure to identify and deploy optimal... ...issues. Research and evaluate technical solutions consistent...Full timeWork at officeFlexible hours
- ...Senior Healthcare Commissioning Agent LJC is seeking a Senior... ...activities for critical hospital infrastructure, coordinate multidisciplinary... ...design through occupancy. Develop and execute commissioning... ...testing of building systems. Evaluate system performance and...For contractorsFlexible hours
$55 per hour
...AI innovation firm in Dallas is seeking QAs for autonomous AI agents to ensure the quality of complex systems and scenarios. This flexible... ...skills and attention to detail. Candidates should be adept at evaluating scenarios and documenting findings. The role offers competitive...Remote jobFlexible hours$141k - $212k
Staff Engineer, Test & Evaluation Engineer role is responsible for defining, planning, and executing all integration testing... ...Experience with test automation frameworks and data analysis tools (e.g., MATLAB, Python, LabVIEW, Minitab). Clearance: Ability to obtain and...Temporary workPart time- ...Public Key Infrastructure (PKI) Engineer Contract... ...Infrastructure (PKI) Engineer to develop and implement... ...integrating security tools to enhance system... ...scripting and automation (Python, PowerShell, Bash)... ...management processes Evaluate and improve data protection...Contract work
- ...means building agents that do real things... ...as primary AI infrastructure. You will build... ...top of those tools and extend them... ..., and maintain MCP (Model Context... ...providers • Develop skills, plugins... ...AWS services • Evaluate and implement... ...with production Python and SQL; additional...
- ...NTT DATA's Client is seeking a Python Gen AI Developer to design, implement, and optimize generative... ...machine learning algorithms and tools Integrate generative AI models into... ...and management, managed edge-to-cloud infrastructure services, BPO, systems integration and...Contract work
$40 - $60 per hour
...Lead Developer The team is hiring a Lead Developer and you will be responsible for working... ...Fidelity's or vendor Data Management tools and evaluate new technologies. You will possess a... ...knowledge of Unix OS /Shell scripting Basic Python knowledge is required Good working...Hourly pay- ...| Southwest Airlines Job Title: Python Developer (Pytest, AWS, Terraform) Location... ...frameworks, and managing cloud infrastructure using Infrastructure-as-Code (IaC) practices... ...~ Familiarity with containerization tools like Docker (Kubernetes is a plus)....Contract workH1bLocal area
- ...innovative solutions for facility and infrastructure challenges. With locations... ...acoustic analyses and develop design recommendations for healthcare... ...-standard software. Evaluate and design for vibration and... ...acoustic modeling and analysis tools such as EASE, Odeon, or...Temporary workFor contractorsWork at officeLocal areaImmediate startFlexible hours
- ...organization, apply now. NTT DATA's Client is currently seeking a Python Developer to join their team in Irving, Texas (US-TX), United States... ..., implementation and management of applications, infrastructure and connectivity. We are one of the leading providers of digital...
$130k - $150k
...Senior Product Manager, AI Agents Work Type: Full-... ...and capabilities. Develop single and multi-agent... ...supporting RAG, tool integration, human-in-... ...Governance, Data & Infrastructure Alignment Collaborate... ...Lead A/B testing, agent evaluation, and post-deployment analysis...Full time- ...organization, apply now. We are currently seeking a Hybrid Python Developer to join our team in Irving, Texas (US-TX), United States (... .... We are one of the world's leading AI and digital infrastructure providers, with unmatched capabilities in enterprise-scale...Work at officeRemote workFlexible hours
- ...team is looking for a skilled Python Developer for ML with deep expertise... ...integrations while enabling AI-driven agents to perform complex workflows... ...of ML algorithms, model evaluation techniques, and MLOps... ...Experience with observability tools (Prometheus, OpenTelemetry,...Shift work
- ...1. Resolve Escalated Incidents Efficiently By Leveraging Infrastructure As Code (Iac) Practices Combined With Powershell Scripting, Ensuring... ...Plans By Applying Powershell For Automation And Python For Data Analysis, Ensuring Compliance With Human Error Standards...
$105k - $150k
...runtime challenges by developing agentic AI... ...Design and implement tool-calling agents that combine... ...) following MCP protocol. Engineer... ...Productionize LLMs: Build evaluation framework for... ...more languages (Python, C/C++, Go, Java... ...on cloud infrastructure (ideally AWS), including...Full time- A tech company in Dallas, Texas, is seeking an experienced backend developer with over 8 years in Python to develop and maintain essential tools for AI systems. Responsibilities include managing database systems, ensuring safe AI operations, and implementing complex reasoning...
- ...Lead Developer (Python, GenAI) As a Lead Developer (Python, GenAI),... ...and ethics. Technology Evaluation: Proactively evaluate emerging... ...experience building services and tools within a data engineering... ...workflows, including Infrastructure as Code, containerization (...Local area
- ...company in Dallas is looking for a skilled backend developer to optimize AI agents through maintenance of backend tools and implementation of complex reasoning... ...candidate will have over 8 years of experience with Python, familiarity with AI frameworks like LangChain,...
- ...efficient, self-contained microservices using Python 3.x and modern web frameworks (FastAPI,... ...tests. Configure logging and monitoring tools (Prometheus, Grafana, ELK stack) to... ...cloud environments (AWS, Azure, or GCP) and infrastructure-as-code principles. ~ Solid...
- ...one or more languages (Python, C/C++, Go, Java);... ...deployment/serving, evaluation and monitoring, data... ...applications using RAG and tool-using agents (vector retrieval,... ...and operating on cloud infrastructure (ideally AWS),... ...enforcement) following MCP protocol. Engineer robust...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to MCP & Tools Python Developer - Agent Evaluation Infrastructure. Be the first to apply!
- python programmer Dallas, TX
- senior python developer Dallas, TX
- full stack / python developer (remote) Dallas, TX
- python developer data analytics Dallas, TX
- python engineer Dallas, TX
- python developer Dallas, TX
- backend python developer Dallas, TX
- python developer remote Dallas, TX
- signing agent Dallas, TX
- showing agent Dallas, TX


