AI Evaluation Engineer: NLP for Contracts
Ironclad Inc
Ironclad Inc. is seeking an AI Evaluation Engineer to enhance contract management through AI. Located in San Francisco, the role involves analyzing datasets, designing feedback loops, and ensuring continuous improvement of ML systems. Ideal candidates will have a quantitative degree, 1–3 years of ML experience, and strong skills in SQL and Python. Benefits include comprehensive health coverage and flexible leave policies, making it an exciting opportunity to contribute significantly to AI solutions in an evolving industry. #J-18808-Ljbffr
- ...Ironclad, located in San Francisco, is seeking an AI Evaluation Engineer to join their team. This role involves analyzing datasets, designing... ...years of experience in ML or data science, particularly in NLP applications. Strong SQL and Python skills are required. Ironclad...SuggestedContract work
$150k - $250k
...A cutting-edge healthcare tech company is seeking an AI Engineer to build and evaluate AI agents, particularly in medical billing. The role requires a strong background in ML, NLP, and Python packages such as TensorFlow and PyTorch, with at least 6 years of industry experience...Suggested- A leading research accelerator for AI located in San Francisco is looking for a software engineer to evaluate and enhance AI-generated code. The role requires at least... ...excellent communication skills. The position is contract-based, requiring 10 to 40 hours of engagement...SuggestedContract workRemote work
$50 - $150 per hour
A leading AI company is seeking a software engineer to review and evaluate model-generated code. This contract role requires several years of software engineering experience, particularly as a full-stack engineer at notable tech firms. You will assess code quality and...SuggestedHourly payContract workFlexible hours- Drata is seeking a Senior Applied Research Engineer to enhance the quality of AI systems through rigorous evaluation and experimentation. This role emphasizes applied research... ...in applied research and ML, especially in NLP. Responsibilities include collaborating with AI...Suggested
- ...talented individuals in San Francisco to develop AI-platform tools using NLP, Machine Learning, and Deep Learning... ...solving skills. The role involves mentoring research engineers, leading research projects, and evaluating products. This is a fantastic opportunity for those...
$150k - $250k
...Distyl AI Job Posting Distyl is an applied AI technology company partnering with... ...At Distyl, we build AI systems using Evaluation-Driven Development —an approach where evaluation... ...behavior in production. AI Evaluation Engineers focus on designing and implementing the...Work at office3 days per week- ...Obsidian is seeking a candidate for a role focused on evaluating AI-generated content. You will assess Malayalam AI-generated responses... ...contribute to improving AI response quality. The position is contract-based and offers a unique opportunity to work on cutting-edge...Contract work
$150k - $180k
...AI Evaluations Engineer – HealthcareLocation: Remote, located in the USType: Full-timeDepartment: EngineeringReports to: Director Of EngineeringResponsibilitiesBuild and maintain infrastructure and tooling for the AI evaluations platform used by internal teams, including...Remote workFlexible hours- ...A pioneering AI technology firm based in San Francisco is seeking an AI Engineer to own the evaluation infrastructure for AI agents. This role requires designing automated pipelines and building observability systems, ensuring agent performance meets enterprise standards...Remote workFlexible hours
- ...A cutting-edge AI firm in San Francisco is seeking a Research Engineer to develop evaluation systems and benchmarking pipelines for language models. Candidates should have a strong background in applied research, coding skills, and familiarity with ML models. You will...
$180k - $240k
...looking for a Senior Agentic AI Engineer to join our team. You will... ...AI techniques to streamline contract workflows , with a strong focus... ...legal workflow systems. Evaluate third-party models, frameworks... ..., or Azure. Knowledge of NLP techniques, RAG systems, and...Full timeContract workWork at officeRemote workWorldwideFlexible hours- ...A leading AI CRM company in California seeks an experienced AI/ML professional to develop innovative AI applications. The role demands over 5 years in NLP with skills in deep learning and ML Ops. You will collaborate with founders, prototype advanced AI experiences, and...Flexible hours
- ...© 2025 Repovive, Inc. All rights reserved. Back to Jobs Apply Now Compensation Not listed Posted April 25, 2026 Required Skills AI evaluation data pipelines agent instrumentation Requirements Mid/Senior Visa Sponsorship Not mentioned Relocation Not mentioned About the Role...RelocationVisa sponsorship
- ...Jack & Jill is seeking a Founding AI/ML Engineer in San Francisco. The role involves building Generative Engine Optimization systems for AI search. Ideal candidates have strong NLP fundamentals and experience in production ML systems. You'll collaborate with world-class...
$240k - $280k
A leading software monitoring company is seeking a Senior Software Engineer on its AI/ML team to build evaluation infrastructure for measuring the performance of AI systems. This role involves designing datasets, creating benchmarks, and ensuring AI features behave reliably...$150k - $180k
...A health technology company is seeking a skilled infrastructure engineer to build and maintain AI evaluation tooling. The ideal candidate has over 5 years of experience in software engineering with a focus on backend systems and production-grade infrastructure. This role...Remote workFlexible hours- ...Solutions is looking for a ServiceNow Specialty Software Engineer in San Francisco, CA. This 12-month contract-to-hire role involves designing and developing... ...Catalog development, workflow automation, and integrating AI capabilities. Candidates should have over 5 years of...Contract work
- A leading AI contracting platform in San Francisco seeks a Generative AI Application Developer to drive the development of innovative legal AI features. You will work on building and deploying scalable solutions using TypeScript, Node.js, and React. The ideal candidate...Contract work
- ...located in San Francisco is seeking an innovative Quality Engineer for their AI products. This role blends ops, strategy, and analytics to... ...leading labs, and ensure user satisfaction through effective evaluation baselines. Competitive salary and benefits offered, with a...
- B Capital seeks a talented individual for an AI Evaluation role in San Francisco. This position involves conducting critical comparative analysis, refining evaluation systems, and collaborating with various teams to enhance model capabilities. The ideal candidate will have...
- A fast-growing AI company seeks a Software Engineer to focus on Model Evaluation & Benchmarking. This role involves building evaluation systems for multimodal AI, ensuring reliable performance. The ideal candidate will possess strong Python programming skills, familiarity...
- ...A technology firm in San Francisco is seeking a Research Engineer to enhance AI model quality. The ideal candidate will build benchmarks, datasets, and evaluation loops to ensure effective performance on critical tasks. This role requires strong programming skills and...
$214k - $300k
Monograph is seeking an engineer to build and improve AI evaluation systems aimed at increasing shipping quality for AI tools. You will enhance scalable eval runners, improve benchmarks, and ensure reliability in distributed systems. Strong engineering fundamentals and...$192k - $237.1k
A leading compliance software company in San Francisco is seeking an Applied AI Engineer to innovate compliance automation through applied research and evaluation. This role emphasizes experimentation over production engineering, requiring strong skills in information retrieval...- ...not demos, and not single‑step automations. In this contract role, you’ll design and ship multi‑step AI workflows that integrate with real business systems... ...layer mindset). Partner with IT/Security and Data Engineering counterparts to align on infrastructure, access...Full timeContract work
$146.4k - $235.38k
...created by the #1 company in e-signature and contract lifecycle management (CLM). What you... ...a talented and results-oriented AI Engineer to focus on delivering intelligent, trusted... ...orchestration Implement robust Evaluations (e.g., Arize) to monitor and improve model...Contract workWork at officeLocal areaRemote work2 days per week- ...AI Prompt Engineer San Francisco, CA (On-Site M-F) Our client is an early-stage, AI-native... ...prompts, sub-agent architectures, and evaluation harnesses to iteratively improve automation... ...~2+ years of experience with AI/ML, NLP, or prompt engineering in production, including...
$150k
Tzafon is seeking a skilled engineer to enhance their machine intelligence systems in San Francisco. As part of the team, you'll be responsible for building evaluation infrastructure, designing data pipelines, and implementing fine-tuning processes. Ideal candidates have...- Cacheflow is seeking a Senior Applied Research Engineer to enhance the effectiveness of our AI systems through focused research and experimentation. This role... ...in applied research and a strong foundation in NLP and information retrieval. Benefits include stock equity...Flexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to AI Evaluation Engineer: NLP for Contracts. Be the first to apply!
- senior ai engineer San Francisco, CA
- ai ml engineer San Francisco, CA
- ai engineer remote San Francisco, CA
- ai engineer San Francisco, CA
- ai prompt engineer San Francisco, CA
- ai developer San Francisco, CA
- ai research engineer San Francisco, CA
- machine learning ai engineer San Francisco, CA
- ai network engineer
- ai agent engineer

