part-time Senior LLM Evaluation / Agent Reliability 2-5hrAdvisor
Shopint
Position Summary Shopint is looking for a part‑time Senior LLM Evaluation / Agent Reliability Advisor. We are building a focused evaluation product for commerce‑related AI agents, with early case evidence and feedback from principal‑level AI evaluation, reliability, and safety leaders. We are looking for someone to pressure‑test and refine our evaluation approach as we prepare our first pilot‑ready custom diagnostic package. Ideal Background Senior Applied Scientist, LLM Eval Lead, Agent Evaluation Lead, AI Quality Lead, or similar Experience with LLM evaluation, agent reliability, RAG/tool‑use evals, rubric design, human evaluation, or regression testing Comfortable turning messy early‑stage cases into serious eval / regression assets Key Responsibilities Review rubric quality and case structure Pressure‑test logic Review pilot‑ready case cards and diagnostic outputs Other Details Time: 2–5 hours/week to start. Compensation: flexible — paid advisory, equity, part‑time contributor role, or future core/founding role depending on fit and commitment. #J-18808-Ljbffr Shopint
- Shopint is seeking a part-time Senior LLM Evaluation / Agent Reliability Advisor based in Seattle, Washington. This role focuses on reviewing evaluation approaches... ...and agent reliability. The position requires roughly 2-5 hours per week, with flexible compensation options...Part timeSeniorFlexible hours
$202.16k - $368.22k
...development of multi-agent tools, models,... ...and build a stable, reliable Agent infrastructure... ...multimodal, search, graph, LLM, Agent etc. to... ...experience - Familiar with 1-2 areas in natural... .... Base pay is one part of the Total Package... ...days of Paid Personal Time (prorated upon hire...SeniorTemporary workLocal areaOverseas- ...Senior Principal AI Agent / ML Software Engineer The Senior Principal AI Agent... ..., APIs, memory, retrieval, evaluation, guardrails, and cloud... ...to ship, scale, and operate reliable, secure, observable, and cost... ...~ Deep understanding of LLM application patterns, including...Senior
- ...Senior Software Engineer - AI Coding Agents At NiCE, we don't limit our... ...Engineer, you will be part of a team... ...making Work on LLM integrations, prompt... ...to real-time analytics and interactive... ...performance, reliability, and observability Build evaluation and...Senior
- ...languages such as Go, Java, or TypeScript. This role involves shaping how the organization builds and scales agentic AI while ensuring reliability, governance, and security across various teams. Experience with enterprise data systems and AI platforms is essential. #J-18808-...SeniorRemote work
- A leading technology firm based in Seattle is seeking a senior software engineer with over 10 years of experience to lead initiatives in... ...candidate will have a deep expertise in Python, experience with LLM-based systems, and a strong understanding of distributed systems...SeniorRemote job
$55k - $151.47k
...Applicable Time Type: Full time... ...Opportunity As part of the People... ...Generative AI agents and data pipelines, promoting reliability, scalability, and... .... As a Senior Associate you will... ...Science At least 2 years of... ...platform Executing LLM evaluation frameworks...SeniorFull timeWork experience placementH1bRemote work$204k - $259k
...Senior Machine Learning Engineer – VLM/LLM Evaluation Waymo is an autonomous driving technology company... ...driving conditions. As part of our work, we also... ...realism of embodied AI agents Partner with cross-functional... ...company match Paid Time Off: 20 days of vacation...SeniorFull timeTemporary workRemote work$151.28k - $183.32k
...Summary: As a Senior Application... ...patterns for agent context... ...without giving up reliability,... ...approved frontier LLM models and APIs... ...Observability, and Evaluation: Create... ...for a full-time employee (FTE... ...time, up to 2 paid volunteer... ...full and part-time who are...Part timeSeniorHourly payFull timeTemporary workFor contractorsSummer workLive inWork at officeLocal areaRemote workFlexible hoursShift work$160k - $200k
...about AI to design and build LLM powered systems in Seattle, Washington... ...AI systems, designing reliable LLM architectures, and maintaining... ..., retrieval systems, and evaluation frameworks. Flexible time off and meaningful equity are part of the benefits package. #J-18...SeniorFlexible hours$176.76k - $232k
...responsibilities As a Senior AI/ML Engineer,... ...rigorous evaluation frameworks, and... ...and engineering of LLM and GenAI systems... ...accuracy, performance, reliability, and responsible... ...visa at this time for this role.... ...internal equity. As part of our total rewards...Part timeSeniorPermanent employmentContract workWork visa- ...LLM/Prompt-Context Engineer – Fullstack Python (AI Agents, LangGraph, Context Engineering) Location – 1st... ...you will play a critical part in architecting context-rich... ...: Design, optimize, and evaluate prompts for LLMs to achieve precise, reliable, and contextually...Remote work
$130k - $300k
...more information, please .Senior Staff Machine Learning Engineer, AI Agent Platform page is... ...deployment, and hosting of LLM-based AI agents. This includes... ...lifecycle management, evaluation frameworks, skill... ...AuthZ) that makes AI agents reliable for long-running workflows...SeniorHourly payWork experience placement$50 - $60 per hour
...— whether you’re looking to contribute part-time alongside a current position, pursue it... ...new version of the AI smarter and more reliable. To succeed in this position, you should... ...diverse and complex problems and evaluate their outputs Evaluate the quality produced...SeniorHourly payContract workWork experience placementRemote workFlexible hours$163.62k - $212.71k
...Authorization Notice: At this time, iSpot does not... .... What You’ll Be Part Of: iSpot.tv is... .../Principal Site Reliability Engineer to drive... ...team consists of senior engineers who work... ...the governance of LLM‑driven tooling to... ...Experience with evaluating and rolling out GenAI...Part timePermanent employmentFull timeWork experience placementWork at officeLocal areaImmediate startRemote workWork from homeFlexible hoursShift work3 days per week1 day per week$114.41k - $145.02k
Senior Property Asset Management Program Specialist... ..., DNRP-Staff Full- or Part-Time Full Time Hours/Week 4... ...members in assessing, evaluating, researching, and strategizing... ...that services over 2.3 million King County... ...connection where they can reliably perform work and remain...Part timeSeniorFull timeInterim roleWork at officeRemote workShift work$113.45k - $133.47k
...Job Type: Full-Time Job Number: 202... ...The role of the Senior Management Analyst... ...reviews, analyzes and evaluates various issues; determines... ...the challenge of reliable childcare, the City... ...(MEBT) with 6.2% City matching contribution... ...be prorated for part-time positions)...Part timeSeniorFull timeContract workFor contractorsWork experience placementCasual workWork at officeLocal areaFlexible hoursAfternoon shift- ...Senior Manager Of Software Engineering When you mentor... ...-case delivery, including LLM integration approaches, evaluation strategies, prompt/context... ...experience. In addition, 2 + years of experience leading... ...), with a track record of reliability and cost-aware scaling...SeniorFlexible hoursShift work
$173.5k - $234.7k
...systems, including LLM orchestration, Agent/Skill creation... ...generation, evaluation, and... ...teams and operate reliably in production.... ...strategy, mentoring senior engineers, and... ...! A big part of how we care... ...Full and part-time employees have... ...employees and about 2.5 weeks for...Part timeFull timeTemporary workWork experience placementLocal areaFlexible hours$175k - $245k
...Senior Software Engineer II - Applied AI and Evaluations (Remote Eligible) -REMOTE, USA-... ...our intelligent agent platform. As we... ...intersection of LLM evaluation, prompt... ..., with at least 2 years working directly... ...You will be part of a fast‑moving... ...for full‑time employees 401(k)...SeniorFull timeTemporary workLocal areaImmediate startRemote work- ...and proficiency in completing projects for the company. As a part-time team member, you are offered identity theft protection and 401... ...assignments. Valid driver's license, clean driving record, reliable transportation, and valid automobile insurance. Reliability...Part timeCasual workFlexible hoursShift work
$139.5k - $258.1k
Senior Applied Scientist - AI Evaluation & Quality Systems Seattle, Washington... ...the AI and LLM features behind... ...that generates reliable ground truth and... ...the autonomous QA agents that make those... ...human judgment over time Develop anomaly... ...base pay is one part of our total...SeniorRelocationShift work$21.3 per hour
...We are immediately hiring part time Field Representatives in your area! Are you a military spouse and looking for supplemental income... ...assignments. Valid driver's license, clean driving record, reliable transportation, and valid automobile insurance....Part timeExtra incomeImmediate startFlexible hoursShift work$147.08k - $178.22k
...strategy, the Senior Manager of Intelligence... ...in rigorous evaluation science. This... ...for AI agents, while building... ...agent reasoning, reliability, and generalization... ...above for a full‑time employee (FTE)... ...time, up to 2 paid volunteer... ...employees full and part‑time who are...Part timeSeniorHourly payFull timeTemporary workFor contractorsSummer workLive inWork at officeLocal areaRemote workFlexible hoursShift work$42.56 per hour
...outstanding opportunity for a Senior Social Worker within the... ...Department . WORK SCHEDULE • Part-Time / 20 hours per week •... ...environmental, equipment and services) by evaluating the options available and... ...: Evening Shift Premium - $2.00/hour; Weekend Shift...Part timeSeniorHourly payFull timeTemporary workWork experience placementWork at officeShift workRotating shiftWeekend workAfternoon shift$120k - $127k
...meaningful difference and want to be part of the future of rehabilitation... ..., and Vision plans to Full-Time team members. We offer Dental... ...10-minute conversation within 2-3 business days, depending on... ...earned outside of the U.S. must be evaluated to be the U.S. equivalent to a...Part timeSeniorFull timeReliefH1bWork from homeVisa sponsorshipRelocation packageFree visa- ...have a team of over 2,000 of the... ...responsible for the reliability, performance, security... ...What you'll do As a Senior/Staff Software Engineer... ..., providing real-time visibility into availability... ...) for safe AI agent access: Design and... .../ML technologies, LLM integration, or...SeniorWorldwide
$147.3k - $193.3k
...business friction. The Senior Cybersecurity Engineer... ...security quality and reliability, and mentor junior engineers... ...for complex systems, evaluating attacker behaviour... ...employment visa at this time for this role. Compensation... ...internal equity. As part of our total rewards...Part timeSeniorPermanent employmentWork at officeWork visa$108k - $216k
...clean, efficient, and reliable code using... ...Enablement: Integrates AI agents and ML components... ...functional logic. Evaluates trade-offs and designs... ..., and different parts of the business when... ...life insurance. Paid time off benefits... ...related area. Option 2: 5 years’ experience...Part timeSeniorFull timeTemporary workLocal area$147.3k - $193.3k
...dedicated to building secure, reliable, and performant... ...Engineer, you will work as part of a global team... ...Responsibilities: As a Senior Software Engineer II,... ...technical requirements, evaluate implementation options,... ...employment visa at this time for this role....Part timeSeniorPermanent employmentWork visa
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to part-time Senior LLM Evaluation / Agent Reliability 2-5hrAdvisor. Be the first to apply!


