Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

MCP & Tools Python Developer - Agent Evaluation Infrastructure

$80 per hour

Mindrift

Overview At Mindrift, innovation meets opportunity. Our platform, launched and powered by Toloka, connects domain experts with cutting‑edge AI projects from innovative tech clients. We believe in using the power of collective human intelligence to ethically shape the future of AI. Location requirement : This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency. What you’ll be doing Developing and maintaining MCP‑compatible evaluation servers. Implementing logic to check agent actions against scenario definitions. Creating or extending tools that writers and QAs use to test agents. Working closely with infrastructure engineers to ensure compatibility. Occasionally helping with test writing or debug sessions when needed. Requirements 4+ years of Python development experience, ideally in backend or tools. Solid experience building APIs, testing frameworks, or protocol‑based interfaces. Understanding of Docker, Linux CLI, and communication. Ability to integrate new tools into existing infrastructures. Familiarity with how LLM agents are prompted, executed, and evaluated. Clear documentation and communication skills – you’ll work with QA and writers. We also value applicants who have: Experience with Model Context Protocol (MCP) or similar structured agent‑server interfaces. Knowledge of FastAPI or similar async web frameworks. Experience working with LLM logs, scoring functions, or sandbox environments. Ability to support dev environments (devcontainers, CI configs, linters). JS experience. Benefits Get paid for your expertise, with rates that can go up to $80/hour depending on your skills, experience, and project needs. Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments. Participate in an advanced AI project and gain valuable experience to enhance your portfolio. Influence how future AI models understand and communicate in your field of expertise. Seniority level Mid‑Senior level Employment type Part‑time Job function Other Industries IT Services and IT Consulting Referrals increase your chances of interviewing at Mindrift by 2x. #J-18808-Ljbffr Mindrift

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the MCP & Tools Python Developer - Agent Evaluation Infrastructure in Houston, TX vacancy
  • $80 per hour

     ...you’ll be doing Developing and maintaining MCP‑compatible evaluation servers....  ...Implementing logic to check agent actions against...  ...Creating or extending tools that writers and...  ...Working closely with infrastructure engineers to...  ...Requirements ~4+ years of Python development... 
    Suggested
    Part time
    Freelance
    Remote work
    Flexible hours

    Mind Rift

    Houston, TX
    4 days ago
  • $80 per hour

     ...looking for someone who can design realistic and structured evaluation scenarios for LLM-based agents. You'll create test cases that simulate human‑performed...  ...(gold paths) and scoring logic Basic experience with Python and JavaScript Curious and open to working with AI‑... 
    Suggested
    Part time
    Freelance
    Remote work
    Flexible hours

    Mindrift

    Houston, TX
    2 days ago
  • $80 per hour

     ...modern AI systems are tested and evaluated? This is a flexible, project...  ...for QAs for autonomous AI agents for a new project focused on...  ...closely with QA, writers, or developers to suggest refinements or edge...  ...future of AI while building tools that benefit everyone. Requirements... 
    Suggested
    Permanent employment
    Part time
    Freelance
    Remote work
    Flexible hours

    Mindrift

    Houston, TX
    2 days ago
  • $80 per hour

    A leading AI consultancy is seeking a detail-oriented individual to design evaluation scenarios for LLM-based agents. This part-time, remote role allows you to create structured test cases while working flexibly around your commitments. The ideal candidate holds a relevant... 
    Suggested
    Part time
    Remote work

    Mindrift

    Houston, TX
    2 days ago
  • $50 per hour

     ...Description Job Description Python - Senior Developers - AI Training - USA About...  ...Network to help train and evaluate cutting-edge AI models. If...  ..., or Flask. Testing & Tools: proficiency with pytest,...  ...this crucial human data infrastructure, Prolific is positioning... 
    Suggested
    Work from home
    Flexible hours

    Prolific Academic Ltd

    Houston, TX
    22 days ago
  • $80 per hour

     ...tech company specializing in AI is seeking QAs for autonomous AI agents to validate and enhance task structures. This flexible, project...  ...skills and attention to detail, facilitating AI testing and evaluation without needing a coding background. The role offers competitive... 
    Remote work
    Flexible hours

    Mind Rift

    Houston, TX
    4 days ago
  •  ...based in the US is seeking a Mid-Senior level developer. The role involves developing and maintaining evaluation servers, implementing logic for agent actions, and creating testing tools. Candidates should have 4+ years of Python experience, solid API development skills,... 
    Part time
    Remote work

    Mind Rift

    Houston, TX
    4 days ago
  • $93.4k - $143.8k

     ...Systems Infrastructure & Network Security Operations Engineer HP seeks a skilled engineer...  ...stakeholders. Continuous Process & Tool Improvement : Evaluate emerging trends (intent‑based...  ...Automation & DevOps : Proficiency in Python, Ansible, Terraform, Git, Jenkins/GitLab... 
    Temporary work
    Local area
    Flexible hours

    Hewlett Packard Enterprise

    Houston, TX
    4 days ago
  •  ...Mechanical Engineering Manager – Energy Infrastructure Projects Houston, Texas, United...  ...vibration, and environmental conditions. Evaluate failure modes and recommend design...  ...to interpret engineering drawings and develop detailed reports and calculations.... 
    Local area

    On.Energy

    Houston, TX
    6 days ago
  • $170k - $190k

     ...solutions, has an immediate opening for a Machine Learning & AI Infrastructure Engineer that is fully remote. Summary: We are...  ...administration skills, preferably Ubuntu. Experience with Bash and/or Python scripting. Strong troubleshooting and systems engineering... 
    For contractors
    Local area
    Immediate start
    Remote work

    KORE1 Technologies

    Houston, TX
    4 days ago
  •  ...is invested in infrastructure globally, with...  ...are committed to developing and sustaining...  ...and implementing agent-based workflows...  ...patterns and tools, context engineering...  .... Design evaluation frameworks for...  ...-grade Python, and experience...  ...LangGraph ~ MCP ~ Langsmith... 
    Work at office
    Worldwide
    Flexible hours

    Vitol

    Houston, TX
    4 days ago
  •  ...capabilities that support AI agents operating across...  ..., state, memory, tool access,...  ...execution contracts, evaluation hooks, safety constraints...  ...proficiency in Python and integration of...  ...standards (e.g., MCP, agent-to-agent...  ...integrity of critical infrastructure, as defined in... 
    Local area

    MD Anderson Center

    Houston, TX
    1 day ago
  • $160k - $190k

     ...are seeking a Cloud & AI Infrastructure Engineer to design,...  ...internal applications, AI agents, copilots, automations,...  ...clear and practical way. Evaluate and introduce appropriate tools and patterns that...  ...effectively with software developers, data engineers, and technical... 
    Local area

    Hanwha Energy USA

    Houston, TX
    26 days ago
  •  ...Description Salary: DOE Position- Python AWS Developer/ Golang Developer Location- Plano,...  ...high-quality and well-tested scripts/tooling. Practice a You Build It, You Own...  ...extensibility in mind. Experience with infrastructure automation tools (Terraform or... 
    Contract work

    Integrated Technology Strategies, Inc.

    Houston, TX
    2 days ago
  •  ...position involves leading AI engineering workstreams to develop data infrastructure and embedding AI tools into engineering workflows. Applicants should have over 5 years of data engineering experience, strong Python and SQL skills, and familiarity with NLP systems. The... 

    Scribd, Inc.

    Houston, TX
    2 days ago
  •  ...including equipment sizing, performance evaluation, and compliance with customer...  ...understand the limitations of traditional grid infrastructure and the challenges faced by remote communities...  ...-stricken areas. That's why we've developed a fleet of highly efficient, trailer-... 
    Remote work
    Night shift
    Weekend work

    Gen-Systems

    Houston, TX
    19 days ago
  •  ...innovative solutions for facility and infrastructure challenges. With locations...  ...acoustic analyses and develop design recommendations for healthcare...  ...-standard software. Evaluate and design for vibration and...  ...acoustic modeling and analysis tools such as EASE, Odeon, or... 
    Temporary work
    For contractors
    Work at office
    Local area
    Immediate start
    Flexible hours

    Smith Seckman Reid

    Houston, TX
    3 days ago
  •  ...join the Power, Energy and Infrastructure ("PEI") team based in Houston...  ...geospatial technologies Lead evaluation and integration of emerging...  ...to automate tasks (Python, R) ~ Strong quantitative...  ...objectives, we focus on attracting, developing and retaining the best... 
    Local area

    Lazard

    Houston, TX
    1 day ago
  • $120 per hour

     ...Senior Python Developer (Contract) Role Role:  Senior Python Developer Client: Well-established Trading Firm Compensation:...  ...ensuring the stability and scalability of the firm's trading infrastructure. Key Responsibilities Architecting and delivering complex... 
    Hourly pay
    Permanent employment
    Contract work
    Immediate start

    Hunter Bond Ltd

    Houston, TX
    9 hours ago
  •  ...analytics, reporting and other tools and processes required to manager risk. Our team develop the firm's strategic calculation...  ...development experience with Python # Experience communicating...  ...discrimination. All applicants will be evaluated solely on the basis of their... 

    Donatech

    Houston, TX
    2 days ago
  •  ...Software Engineer with current Python skills. This is a contract...  ...Exposure to data visualization tools and frameworks (e.g., Plotly,...  ...· Partner with analysts to develop custom interactive dashboard...  ...Work with the architecture and infrastructure teams in central IT to ensure... 
    Contract work
    Immediate start

    Motion Recruitment

    Houston, TX
    3 days ago
  • $55 per hour

     ...Freelance AI Trainer - Civil Engineering & Python 4 days ago Be among the first 25...  ...Engineers with Python skills to train and evaluate AI models on realistic civil...  ...Construction Engineering & Management ~ Infrastructure, Coastal, Earthquake, Sustainable Engineering... 
    Part time
    Freelance
    Remote work
    Flexible hours

    Mind Rift

    Houston, TX
    4 days ago
  • $155k - $190k

     ...with customer data teams Develop and optimize SQL queries against...  ...it. ~ Proficiency with Python and PySpark for data transformation...  ...Environment: Our data infrastructure is built on modern cloud technologies...  ...clinical decision support tools. This role offers deep... 
    Live in
    Flexible hours
    3 days per week

    Qualified Health

    Houston, TX
    7 days ago
  •  ...Description Seeking a Right-of-Way Agent with 5+ years of midstream...  ...Outlook, and online research tools commonly used in land and...  ...within an assigned territory Develop access, temporary workspace,...  ...large-scale midstream infrastructure assets Familiarity with land... 
    Full time
    Temporary work
    Local area

    Purple Land Management

    Houston, TX
    2 days ago
  •  ...Description Job Description Volta Group is looking for an experienced Mineral Title Agent to support title research, ownership analysis, and curative efforts across energy and infrastructure projects. If you enjoy digging into complex chain of title, solving ownership... 
    Remote work

    Volta Group Global LLC

    Houston, TX
    25 days ago
  • $80 per hour

     ...companies, focused on testing, evaluating, and improving AI systems...  ...is suited for a Senior Python developer with deep functional...  ...tasks, and confidence using tools like Roo Code or Claude...  ...~ Prior experience with agent evaluation platforms and MCP CLI Tools and Technologies... 
    Permanent employment
    Temporary work
    Freelance
    Remote work
    Flexible hours

    Mind Rift

    Houston, TX
    4 days ago
  •  ...Description Volta Group is seeking a detail-oriented Surface Title Agent to support title research, curative documentation, and land services deliverables across renewable energy and infrastructure projects nationwide. This role is ideal for someone who thrives in... 
    Remote work

    Volta Group Global LLC

    Houston, TX
    25 days ago
  • $103.15k - $158.59k

     ...through prototyping, evaluation, deployment, and...  ...Engineering, IT Infrastructure, domain SMEs,...  ...grounding, tool integration, and agent orchestration on...  ...humans and agents Develop and curate...  ...Context Protocol (MCP) servers and equivalent...  ...~ Strong Python skills, including... 
    Full time
    Temporary work
    Work at office
    Local area
    Flexible hours

    Fervo Energy

    Houston, TX
    4 days ago
  •  ...for a mid-level AI developer with strong...  ...AI-assisted coding tools, and hands-on experience...  ...and tools using Python, TypeScript/JavaScript...  ...and configure MCP (Model Context Protocol...  ...AI services Evaluate, integrate, and...  ...platforms and data infrastructure such as Microsoft... 
    Work at office

    Jacobs

    Houston, TX
    5 days ago
  •  ...AI/GenAI application evaluation, development, and deployment...  ..., such as Bash and Python Strong experience...  ...platform Designs, develops, evaluates, and deploys...  ...software engineers, and IT infrastructure professionals...  ...planning and reporting tools for executive and departmental... 
    Local area

    Baylor Miraca Genetics Laboratories, LLC

    Houston, TX
    28 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to MCP & Tools Python Developer - Agent Evaluation Infrastructure. Be the first to apply!