MCP & Tools Python Developer - Agent Evaluation Infrastructure
$80 per hourMindrift
2 days ago Be among the first 25 applicants Get AI‑powered advice on this job and more exclusive features. This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency. What We Do At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting‑edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real‑world expertise from across the globe. Who We're Looking For Calling all security researchers, engineers, and penetration testers with a strong foundation in problem‑solving, offensive security, and AI‑related risk assessment. If you thrive on digging into complex systems, uncovering hidden vulnerabilities, and thinking creatively under constraints, join us! We're looking for someone who can bring a hands‑on approach to technical challenges, whether breaking into systems to expose weaknesses or building secure tools and processes. We value contributors with a passion for continuous learning, experimentation, and adaptability. About The Project We're on the hunt for hands‑on Python engineers for a new project focused on developing Model Context Protocol (MCP) servers and internal tools for running and evaluating agent behavior. You'll implement base methods for agent action verification, integrate with internal and client infrastructures, and help fill tooling gaps across the team. What you'll be doing Developing and maintaining MCP‑compatible evaluation servers Implementing logic to check agent actions against scenario definitions Creating or extending tools that writers and QAs use to test agents Working closely with infrastructure engineers to ensure compatibility Occasionally helping with test writing or debug sessions when needed Although we're only looking for experts for this current project, contributors with consistent high‑quality submissions may receive an invitation for ongoing collaboration across future projects. Requirements 4+ years of Python development experience, ideally in backend or tools Solid experience building APIs, testing frameworks, or protocol‑based interfaces Understanding of Docker, Linux CLI, and communication Ability to integrate new tools into existing infrastructures Familiarity with how LLM agents are prompted, executed, and evaluated Clear documentation and communication skills – you'll work with QA and writers We also value applicants who have: Experience with Model Context Protocol (MCP) or similar structured agent‑server interfaces Knowledge of FastAPI or similar async web frameworks Experience working with LLM logs, scoring functions, or sandbox environments Ability to support dev environments (devcontainers, CI configs, linters) JS experience Benefits Get paid for your expertise, with rates that can go up to $80/hour depending on your skills, experience, and project needs Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments Participate in an advanced AI project and gain valuable experience to enhance your portfolio Influence how future AI models understand and communicate in your field of expertise Seniority level Mid‑Senior level Employment type Part‑time Job function Other Industries IT Services and IT Consulting #J-18808-Ljbffr Mindrift
- Responsibilities As a Solution Developer on the GRC Tooling team, you will be responsible for designing... ...generation (RAG) Experience evaluating and operating AI systems, including... ...arrangement: Hybrid Area of work: Infrastructure & Technology Employment type: Regular...SuggestedTemporary workShift work
$99.6k - $234.6k
...The Principal AI Agent / ML Software... ...on Oracle Cloud Infrastructure (OCI). This... ...orchestration of LLMs, tools, APIs, memory, retrieval, evaluation, guardrails,... ...Protocol (MCP) integration, vector... ...models, AI developer tooling, and agentic... ...skills in Python and ability to...SuggestedTemporary workFlexible hours- Manufacturing Test Infrastructure Support Engineer role is... ...Management Identify, evaluate, and procure test‑related... ...racks, and diagnostic tools. Maintain inventory... ...and safety protocols. Develop and maintain SOPs and... ...SSH), and scripting (Python, Bash). Familiarity with...SuggestedFull time
$55 per hour
...modern AI systems are tested and evaluated? This is a flexible, project-... ...for QAs for autonomous AI agents for a new project focused on... ...closely with QA, writers, or developers to suggest refinements or edge... ...future of AI while building tools that benefit everyone. Requirements...SuggestedPermanent employmentPart timeFreelanceRemote workFlexible hours$60 per hour
A tech innovation company is seeking QAs for autonomous AI agents to validate and improve task structures and evaluate logic. The role requires excellent analytical thinking, attention to detail, and the ability to assess complex scenarios. Successful candidates can work...SuggestedRemote jobFlexible hours$80 per hour
...time opportunity focused on quality assurance for autonomous AI agents. You will analyze complex systems, review tasks for logic, and... ...analytical and detail-oriented skills, with experience in policy evaluation or logic puzzles preferred. Compensation can reach up to $80/...Remote jobPart timeFlexible hours$80 per hour
A tech company specializing in AI is seeking experienced Python engineers to join a part-time project developing innovative tools for agent evaluation. The role offers flexibility and the chance to work on cutting-edge AI technology. Candidates should possess 4+ years of...Remote jobPart time$55 per hour
A leading AI innovation firm is seeking QAs for autonomous AI agents to improve evaluation frameworks. Candidates should possess excellent analytical thinking and attention to detail to review tasks and define clear standards. This remote, flexible opportunity offers rates...Remote jobPart timeFlexible hours- ...Transformation team as an AI Agent & Automation Developer. You will work in a... ..., and modern AI infrastructure. Your work will... ...chatbots and automation tools. Collaborate with... ...performance, conduct evaluations, and implement... ...programming skills in Python (or similar) and experience...Work at officeLocal area
$99.6k - $223.4k
...is seeking a Senior AI Agent Engineer to build... ...balancing speed with safety, evaluation, and maintainability.... ...work across prompts, tools, code, APIs,... ...engineering workflows. Develop structured tool-calling... ...brings together the data, infrastructure, applications, and expertise...Temporary workFlexible hours- ...Studies Engineering, Energy and Infrastructure Projects (Raleigh, NC) page... ...optimized grid performance.* Develop and validate advanced system... ...successful project execution.* Evaluate renewable integration and... ...leading modeling and simulation tools like PSS/E, PSCAD, ETAP,...Work at office
- ...development company is seeking experienced quantitative professionals to evaluate AI-generated work and contribute to advancing AI systems. Responsibilities include assessing technical accuracy and developing quantitative problems to support AI training. Ideal candidates have...Remote jobFlexible hours
- ...Looking For: A Sr. Software Developer (Infrastructure) that will help us build the platform and tools that help Bandwidth engineers... ...systems, prompt libraries, MCP server integrations, and internal... ...~ Strong coding skills in Python, Go, TypeScript, or similar...
- ...innovative solutions for facility and infrastructure challenges. With locations... ...acoustic analyses and develop design recommendations for healthcare... ...-standard software. Evaluate and design for vibration and... ...acoustic modeling and analysis tools such as EASE, Odeon, or...Temporary workFor contractorsWork at officeLocal areaImmediate startFlexible hours
$90.4k - $173.3k
...SME, Primary Engineering, Energy and Infrastructure Projects (Raleigh, NC) LOCATION: Raleigh... ...you will apply your technical mastery to develop innovative Substation Layout Designs... ...AutoCAD and/or MicroStation, and use these tools to produce accurate, high-quality design...Full timeWork at office$148k - $356.5k
...Software Engineer, Metrics and Evaluation - Autonomous Vehicles page... ...technologies? NVIDIA is developing groundbreaking solutions in... ...machine learning engineers, infrastructure engineers and roboticists to... ...roles Highly proficient in Python and associated libraries (Pandas...Full timeRemote work- ...broad and complex infrastructure footprint,... ...platforms and tooling, improving reliability... ...platforms. Develop and iterate on... ...Proficiency in Python and/or... ...Ability to evaluate AI capabilities... ...Background building MCP (Model Context... ...that extend AI agent capabilities....
$90.4k - $173.3k
...SME, Secondary Engineering, Energy and Infrastructure Projects (Raleigh, NC) LOCATION: Raleigh... ...substation systems. You will develop specifications, bills of materials (BOMs... ...You have exposure to ETAP and SEL tools. You have strong skills in CAD software...Full timeWork at office$170.9k - $241.4k
Lead Security Tools Engineer Company Overview Docusign... ...consistency Lead the evaluation, testing, and... ...business needs evolve Develop and maintain documentation... ...and automation (e.g., Python, PowerShell, Go, or similar... ...Familiarity with infrastructure‑as‑code (IaC) tools (e...Contract workWork at officeRemote work2 days per week- ...North Carolina is seeking an experienced MLOps Engineer to join our Data and AI team. You will be responsible for implementing infrastructure solutions, ensuring alignment between business needs and data architecture, and collaborating with cross-functional teams. The...
- A leading engineering firm is seeking a Project Engineer for energy and infrastructure projects in Raleigh, NC. You will play a critical role in designing and delivering innovative power solutions. The ideal candidate will have a Bachelor's degree in electrical engineering...
$115.4k - $192.3k
...legal products and developer experiences.... ...workflows and agent-based capabilities... ...workflows, tool calling, and lightweight... ..., LlamaIndex, MCP/A2A, OpenAI... ...in building evaluation, monitoring, and... ...skills in Python and experience... ...optimization, LLM serving infrastructure, or AI...Local area- ...based in Raleigh, North Carolina, to shape high-impact power infrastructure projects across the Americas. In this role, you'll apply your... ...experience, or a Master’s with 2-3 years. Proficiency in CAD tools and the Microsoft Office Suite is essential. Professional Engineering...Work at office
- Dewberry is seeking a Senior Mechanical Engineer to join its Thermal Infrastructure department in Raleigh, NC. This role involves designing thermal infrastructure systems and collaborating with a team of engineers across disciplines. The ideal candidate has at least 15...
$40 per hour
A technology company is seeking a DevOps Engineer to evaluate AI models and improve their performance. The position is remote, offering... ...candidates will have proficiency in programming languages such as Python or JavaScript, and a detail-oriented mindset. The role offers...Remote jobHourly pay$80 per hour
...world. Over 35,000 AI developers, researchers, and... ...Network to help train and evaluate the next generation of... ...& Data: advanced Python (NumPy, Pandas, Scikit... ...Vertex AI), or specialized tools like Weights & Biases... ...crucial human data infrastructure, Prolific is...Hourly payWork from homeFlexible hours$150k - $250k
A tech company specializing in AI agents seeks a Deployed Engineer in Raleigh, NC. This... ...owning technical wins during pre-sales evaluations, and helping clients deploy agent-based... ...years in a relevant technical field, strong Python and JavaScript skills, and experience...- Bandwidth Recruitment in Raleigh, NC, is looking for a Sr. Software Developer (Infrastructure) to enhance AI integration and developer tooling. This role demands 5+ years in web services, strong skills in Python or Go, and experience with cloud infrastructure like AWS. The...
$90.4k - $173.3k
...JOB TITLE:** **Traction Power Engineering Manager, Energy and Infrastructure Projects (Raleigh, NC)****LOCATION: Raleigh, NC (Hybrid)**... ...by collaborating with business development and bid teams to develop viable design concepts, technical clarifications, RFQ responses...Full timeWork at office$30 per hour
...Platform Engineer to enhance AI model quality. This role involves evaluating AI outputs through coding and writing tasks. Candidates must be proficient in programming languages like JavaScript or Python and have a strong grasp of algorithms. The position offers flexibility...Remote jobHourly pay
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to MCP & Tools Python Developer - Agent Evaluation Infrastructure. Be the first to apply!
- tool design engineer Raleigh, NC
- python programmer Raleigh, NC
- python developer data analytics Raleigh, NC
- python engineer Raleigh, NC
- python developer Raleigh, NC
- full stack / python developer (remote) Raleigh, NC
- commissioning agent Raleigh, NC
- cruise agent Raleigh, NC
- state farm agent Raleigh, NC
- airport agent Raleigh, NC


