MCP & Tools Python Developer - Agent Evaluation Infrastructure
$80 per hourMindrift
Overview At Mindrift, innovation meets opportunity. Our platform, launched and powered by Toloka, connects domain experts with cutting‑edge AI projects from innovative tech clients. We believe in using the power of collective human intelligence to ethically shape the future of AI. Location requirement : This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency. What you’ll be doing Developing and maintaining MCP‑compatible evaluation servers. Implementing logic to check agent actions against scenario definitions. Creating or extending tools that writers and QAs use to test agents. Working closely with infrastructure engineers to ensure compatibility. Occasionally helping with test writing or debug sessions when needed. Requirements 4+ years of Python development experience, ideally in backend or tools. Solid experience building APIs, testing frameworks, or protocol‑based interfaces. Understanding of Docker, Linux CLI, and communication. Ability to integrate new tools into existing infrastructures. Familiarity with how LLM agents are prompted, executed, and evaluated. Clear documentation and communication skills – you’ll work with QA and writers. We also value applicants who have: Experience with Model Context Protocol (MCP) or similar structured agent‑server interfaces. Knowledge of FastAPI or similar async web frameworks. Experience working with LLM logs, scoring functions, or sandbox environments. Ability to support dev environments (devcontainers, CI configs, linters). JS experience. Benefits Get paid for your expertise, with rates that can go up to $80/hour depending on your skills, experience, and project needs. Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments. Participate in an advanced AI project and gain valuable experience to enhance your portfolio. Influence how future AI models understand and communicate in your field of expertise. Seniority level Mid‑Senior level Employment type Part‑time Job function Other Industries IT Services and IT Consulting Referrals increase your chances of interviewing at Mindrift by 2x. #J-18808-Ljbffr Mindrift
$80 per hour
...you’ll be doing Developing and maintaining MCP‑compatible evaluation servers.... ...Implementing logic to check agent actions against... ...Creating or extending tools that writers and... ...Working closely with infrastructure engineers to... ...Requirements ~4+ years of Python development...SuggestedPart timeFreelanceRemote workFlexible hours$80 per hour
...looking for someone who can design realistic and structured evaluation scenarios for LLM-based agents. You'll create test cases that simulate human‑performed... ...(gold paths) and scoring logic Basic experience with Python and JavaScript Curious and open to working with AI‑...SuggestedPart timeFreelanceRemote workFlexible hours$80 per hour
...modern AI systems are tested and evaluated? This is a flexible, project... ...for QAs for autonomous AI agents for a new project focused on... ...closely with QA, writers, or developers to suggest refinements or edge... ...future of AI while building tools that benefit everyone. Requirements...SuggestedPermanent employmentPart timeFreelanceRemote workFlexible hours$80 per hour
A leading AI consultancy is seeking a detail-oriented individual to design evaluation scenarios for LLM-based agents. This part-time, remote role allows you to create structured test cases while working flexibly around your commitments. The ideal candidate holds a relevant...SuggestedPart timeRemote work$50 per hour
...Description Job Description Python - Senior Developers - AI Training - USA About... ...Network to help train and evaluate cutting-edge AI models. If... ..., or Flask. Testing & Tools: proficiency with pytest,... ...this crucial human data infrastructure, Prolific is positioning...SuggestedWork from homeFlexible hours$80 per hour
...tech company specializing in AI is seeking QAs for autonomous AI agents to validate and enhance task structures. This flexible, project... ...skills and attention to detail, facilitating AI testing and evaluation without needing a coding background. The role offers competitive...Remote workFlexible hours- ...based in the US is seeking a Mid-Senior level developer. The role involves developing and maintaining evaluation servers, implementing logic for agent actions, and creating testing tools. Candidates should have 4+ years of Python experience, solid API development skills,...Part timeRemote work
$93.4k - $143.8k
...Systems Infrastructure & Network Security Operations Engineer HP seeks a skilled engineer... ...stakeholders. Continuous Process & Tool Improvement : Evaluate emerging trends (intent‑based... ...Automation & DevOps : Proficiency in Python, Ansible, Terraform, Git, Jenkins/GitLab...Temporary workLocal areaFlexible hours- ...Mechanical Engineering Manager – Energy Infrastructure Projects Houston, Texas, United... ...vibration, and environmental conditions. Evaluate failure modes and recommend design... ...to interpret engineering drawings and develop detailed reports and calculations....Local area
$170k - $190k
...solutions, has an immediate opening for a Machine Learning & AI Infrastructure Engineer that is fully remote. Summary: We are... ...administration skills, preferably Ubuntu. Experience with Bash and/or Python scripting. Strong troubleshooting and systems engineering...For contractorsLocal areaImmediate startRemote work- ...is invested in infrastructure globally, with... ...are committed to developing and sustaining... ...and implementing agent-based workflows... ...patterns and tools, context engineering... .... Design evaluation frameworks for... ...-grade Python, and experience... ...LangGraph ~ MCP ~ Langsmith...Work at officeWorldwideFlexible hours
- ...capabilities that support AI agents operating across... ..., state, memory, tool access,... ...execution contracts, evaluation hooks, safety constraints... ...proficiency in Python and integration of... ...standards (e.g., MCP, agent-to-agent... ...integrity of critical infrastructure, as defined in...Local area
$160k - $190k
...are seeking a Cloud & AI Infrastructure Engineer to design,... ...internal applications, AI agents, copilots, automations,... ...clear and practical way. Evaluate and introduce appropriate tools and patterns that... ...effectively with software developers, data engineers, and technical...Local area- ...Description Salary: DOE Position- Python AWS Developer/ Golang Developer Location- Plano,... ...high-quality and well-tested scripts/tooling. Practice a You Build It, You Own... ...extensibility in mind. Experience with infrastructure automation tools (Terraform or...Contract work
- ...position involves leading AI engineering workstreams to develop data infrastructure and embedding AI tools into engineering workflows. Applicants should have over 5 years of data engineering experience, strong Python and SQL skills, and familiarity with NLP systems. The...
- ...including equipment sizing, performance evaluation, and compliance with customer... ...understand the limitations of traditional grid infrastructure and the challenges faced by remote communities... ...-stricken areas. That's why we've developed a fleet of highly efficient, trailer-...Remote workNight shiftWeekend work
- ...innovative solutions for facility and infrastructure challenges. With locations... ...acoustic analyses and develop design recommendations for healthcare... ...-standard software. Evaluate and design for vibration and... ...acoustic modeling and analysis tools such as EASE, Odeon, or...Temporary workFor contractorsWork at officeLocal areaImmediate startFlexible hours
- ...join the Power, Energy and Infrastructure ("PEI") team based in Houston... ...geospatial technologies Lead evaluation and integration of emerging... ...to automate tasks (Python, R) ~ Strong quantitative... ...objectives, we focus on attracting, developing and retaining the best...Local area
$120 per hour
...Senior Python Developer (Contract) Role Role: Senior Python Developer Client: Well-established Trading Firm Compensation:... ...ensuring the stability and scalability of the firm's trading infrastructure. Key Responsibilities Architecting and delivering complex...Hourly payPermanent employmentContract workImmediate start- ...analytics, reporting and other tools and processes required to manager risk. Our team develop the firm's strategic calculation... ...development experience with Python # Experience communicating... ...discrimination. All applicants will be evaluated solely on the basis of their...
- ...Software Engineer with current Python skills. This is a contract... ...Exposure to data visualization tools and frameworks (e.g., Plotly,... ...· Partner with analysts to develop custom interactive dashboard... ...Work with the architecture and infrastructure teams in central IT to ensure...Contract workImmediate start
$55 per hour
...Freelance AI Trainer - Civil Engineering & Python 4 days ago Be among the first 25... ...Engineers with Python skills to train and evaluate AI models on realistic civil... ...Construction Engineering & Management ~ Infrastructure, Coastal, Earthquake, Sustainable Engineering...Part timeFreelanceRemote workFlexible hours$155k - $190k
...with customer data teams Develop and optimize SQL queries against... ...it. ~ Proficiency with Python and PySpark for data transformation... ...Environment: Our data infrastructure is built on modern cloud technologies... ...clinical decision support tools. This role offers deep...Live inFlexible hours3 days per week- ...Description Seeking a Right-of-Way Agent with 5+ years of midstream... ...Outlook, and online research tools commonly used in land and... ...within an assigned territory Develop access, temporary workspace,... ...large-scale midstream infrastructure assets Familiarity with land...Full timeTemporary workLocal area
- ...Description Job Description Volta Group is looking for an experienced Mineral Title Agent to support title research, ownership analysis, and curative efforts across energy and infrastructure projects. If you enjoy digging into complex chain of title, solving ownership...Remote work
$80 per hour
...companies, focused on testing, evaluating, and improving AI systems... ...is suited for a Senior Python developer with deep functional... ...tasks, and confidence using tools like Roo Code or Claude... ...~ Prior experience with agent evaluation platforms and MCP CLI Tools and Technologies...Permanent employmentTemporary workFreelanceRemote workFlexible hours- ...Description Volta Group is seeking a detail-oriented Surface Title Agent to support title research, curative documentation, and land services deliverables across renewable energy and infrastructure projects nationwide. This role is ideal for someone who thrives in...Remote work
$103.15k - $158.59k
...through prototyping, evaluation, deployment, and... ...Engineering, IT Infrastructure, domain SMEs,... ...grounding, tool integration, and agent orchestration on... ...humans and agents Develop and curate... ...Context Protocol (MCP) servers and equivalent... ...~ Strong Python skills, including...Full timeTemporary workWork at officeLocal areaFlexible hours- ...for a mid-level AI developer with strong... ...AI-assisted coding tools, and hands-on experience... ...and tools using Python, TypeScript/JavaScript... ...and configure MCP (Model Context Protocol... ...AI services Evaluate, integrate, and... ...platforms and data infrastructure such as Microsoft...Work at office
- ...AI/GenAI application evaluation, development, and deployment... ..., such as Bash and Python Strong experience... ...platform Designs, develops, evaluates, and deploys... ...software engineers, and IT infrastructure professionals... ...planning and reporting tools for executive and departmental...Local area
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to MCP & Tools Python Developer - Agent Evaluation Infrastructure. Be the first to apply!
- tool design engineer Houston, TX
- python programmer Houston, TX
- senior python developer Houston, TX
- full stack / python developer (remote) Houston, TX
- python developer data analytics Houston, TX
- python engineer Houston, TX
- python developer Houston, TX
- python developer part time Houston, TX
- python developer remote Houston, TX
- right of way agent Houston, TX



