Machine Learning Eval Engineer
Reducto
About Reducto
Reducto helps AI teams ingest real-world enterprise data with state-of-the-art accuracy.
Most enterprise data, from financial statements to health records, is locked in unstructured file formats like PDFs and spreadsheets. We train vision models to read those documents the way a human would, enabling teams to build products, train models, and automate processes at scale. We've grown rapidly, increasing revenue 8x YoY and partnering with hundreds of companies, from leading AI teams like Harvey, Vanta, and Scale to enterprise customers across FAANG and top trading firms. Reducto has raised over $100M from world-class investors including a16z, Benchmark, and First Round Capital.
The Opportunity
As an ML Eval Engineer, you'll play a key role in building the evaluation systems and benchmarks that make Reducto's models better over time. You'll collaborate closely with our ML, platform, and GTM teams to identify model weaknesses, design strong benchmarks, and create metrics and tooling that surface new failure modes as we scale. This is a high-impact role where you'll help define how model quality is measured at Reducto and shape the systems we use to improve it.
What You'll Do
- Design, build, and maintain evaluation benchmarks that reveal where our models perform well and where they fail.
- Develop metrics, heuristics, and workflows to automatically identify new failure modes across large and messy real-world datasets.
- Partner closely with other ML engineers to turn evaluation insights into model improvements and better training priorities.
- Work hands-on with unstructured enterprise data, including PDFs, spreadsheets, and other difficult document formats, to uncover edge cases and hard examples.
- Build lightweight internal and user-facing tools, including simple interfaces in Python frameworks like Flask, to help teams inspect results, analyze model behavior, and communicate evaluation outcomes.
- Collaborate with customers and internal teams to understand real-world data needs and create bespoke benchmarks that highlight Reducto's strengths.
Who You Are
- Hold yourself to a high bar for quality and precision.
- Enjoy solving complex problems and building from first principles.
- Have strong Python skills and can independently build clean, reliable technical solutions. Bonus points for product and frontend experience!
- Are comfortable working with data infrastructure such as AWS S3 and OLAP or analytics systems like Tinybird.
- Love getting your hands dirty with unstructured data and chasing down difficult failure cases.
- Operate well in fast-changing, high-growth environments.
- Collaborate effectively across technical and non-technical teams.
- Take full ownership from strategy through execution.
- Have experience at an early-stage or high-growth startup.
- Have some background in product thinking and can build simple, polished user-facing interfaces.
- Are comfortable working directly with customers to understand their workflows and data needs.
- Have experience in AI/ML, data infrastructure, enterprise software, or document understanding systems.
- Care deeply about combining technical excellence with business impact.
What Reducto Does
- Accurately extract text and tables even with nonstandard layouts
- Automatically convert graphs to tabular data and summarize images in documents
- Extract important fields from complex forms with simple, natural language instructions
- Build powerful retrieval pipelines using Reducto's document metadata
- Intelligently chunk information using the document's layout data
Benefits
- Unlimited PTO: We believe great work requires recharging.
- Lunch: Receive a free lunch to eat with your teammates daily at the office.
- Reimbursed Transportation: Provide us with your receipts and we'll take care of the costs.
- Insurance: Generous health insurance covering medical, dental, and vision.
- Health and Wellness Budget: We provide up to $150/mo reimbursement for health and wellness spending, such as gym memberships or fitness classes.
- Parental Leave: Work with us to build a leave schedule that works for you and your family.
Vacancy posted 1 day ago

