Staff Engineer, Trustworthy ML Evaluation Platform
AI Chopping Block, Inc.
AI Chopping Block, Inc. is seeking a Member of Technical Staff to build and maintain the evaluation platform used across Magic. You will develop infrastructure for large-scale evaluations and ensure correctness in measurements that influence key research decisions. The ideal candidate should have strong software engineering skills, attention to detail, and experience with machine learning systems. This is an opportunity to contribute to advanced AI research in a collaborative environment. #J-18808-Ljbffr AI Chopping Block, Inc.
$220.8k - $298.8k
...seeking an Applied AI Engineer to drive the quality and... ...experimentation, and evaluation. In this role, you... ...to deliver accurate, trustworthy results. This is a research... .... Drata's compliance platform is document-heavy: VRM... ..., agents, and classic ML: chunking, embedding...PlatformFlexible hours$220.8k - $298.8k
# Staff Applied Research EngineerHybrid -... ...seeking an Applied AI Engineer to drive the... ...experimentation, and evaluation. In this role, you... ...deliver accurate, trustworthy results.This is a... ...Drata's compliance platform is document-heavy:... ...agents, and classic ML: chunking,...PlatformWork at officeImmediate startWorldwideMonday to FridayFlexible hours- ...advisory workflows. As a Staff AI Engineer, you'll shape the... ...organization Establish evaluation standards,... ...roadmap Create reusable platforms, patterns, and tooling... ...years leading complex AI/ML systems Deep... ...reasoning transparent and trustworthy Complex domains: Navigating...PlatformRemote workFlexible hours
$200k
Dormont Manufacturing Co is seeking a Member of Technical Staff to manage our internal evaluations platform, essential for improving AI model performance. You... .... The ideal candidate has strong software engineering fundamentals and experience with machine learning systems...Platform$240k - $260k
...extraction) to deliver trustworthy consumer health... ..., inference, evaluation, and analytics (batch... ...with the Product, Engineering, and Data teams and... ...use cases. (Staff scope) Lead architecture... ...and reusable platforms. Examples of problems... ...production ML systems (or equivalent...PlatformWork at officeRemote work- ...team of former Scale AI engineers and operators. In less... ...engineers build the pipelines, platforms, and models that... ...this role As a Staff Product Engineer at David... .... Deploy and evaluate LLM- and DSP-based solutions... ...engineering, data engineering, ML, and signal processing....PlatformWork at office
$200k - $250k
...Overview Build and operate the ML platform that powers AppFolio’s AI-native Real Estate platform... ...data pipelines, GPU orchestration, and evaluation. Productionize research prototypes with... ..., TensorRT, Triton, or similar). Prior staff‑level role in a company with a...PlatformRemote work- ...and most valuable kind of AI engineering there is. What You'll Do Architect... ...Lead the rebuilding of core platform systems — including our... ...generation (RAG), and agent evaluation. Experience building reliable... ...production AI systems. Classical ML experience — supervised/...PlatformLive inRemote work
- ...Role As a Founding Applied AI Engineer, you'll be instrumental in building... ...foundation of Kastle's AI platform. You'll fine-tune large... ...AI performance and compliance Evaluate LLMs and AI Agents: Build high... ...years building and deploying ML/AI systems in production environments...PlatformLive inRelocation package
- ...team is a new 5-person engineering team with a singular mission... .... We’re hiring a Staff/Principal IC who has already... ..., infrastructure, or platform engineering systems -... ...mobile tooling, and AI/ML systems without losing... ...architect the test harness and evaluate the output. You...PlatformShift work
$200k - $400k
...leading conversational AI platform empowering every brand to deliver... ..., orchestration, and evaluation in order to make our agents... ...scale. About the Role As a Staff Research Engineer, you’ll be responsible for... ...years of experience in AI/ML engineering or research. Prior...PlatformFull timeWork at officeLocal area$260k - $280k
...Department Technology, Engineering Compensation $260K – $2... .... We are looking for a Staff AI Engineer to join the GenAI + Discovery Platform team at Strava, a team... ...search and retrieval to evaluation and ROI frameworks. This... ...Working fluidly with ML engineers, data engineers...PlatformFull timeWork at officeWorldwideFlexible hours3 days per week- Senior Staff Machine Learning Engineer, Post Training Remote - USA Airbnb was born... ...Service to Marketing we rely on ML to ensure that guests and... ..., RAG/Search, LLM evaluation and testing automation, feedback... ...contribute to Airbnb’s ML platform architecture and strategy....PlatformWork experience placementRemote work
$190k - $285k
...we are seeking multiple GenAI Engineers from junior levels to more... ...products, and strengthening our platform architecture to enable... ...domains. Design and implement ML pipelines for data preprocessing... ...hyperparameter tuning, and model evaluation, enabling rapid...PlatformWork at officeLocal areaWorldwide$112k - $269k
Summary Yelp engineering culture is driven by our values... ...edge Machine Learning (ML) and Artificial... ...geographical locations. As a Staff‑level ML Engineer on... ...contributing to the ML platforms these models rely on.... ...prompt engineering and evaluation. A Bachelor’s Degree...PlatformRemote jobWork experience placementLocal area$250k - $350k
...scribe. We’re building the AI intelligence platform that restores humanity to healthcare... ...just getting started. The Role: As a Staff ML Engineer on the Frontier AI team at Ambience,... ...to ML libraries, benchmarks, or evaluation frameworks. Compensation We offer a base...PlatformWork at officeImmediate startRemote workFlexible hours3 days per week- ...Staff AI Engineer Goodfin is an AI-native wealth platform focused on private markets. We're building intelligent, agentic... ...accredited investors research, evaluate, and act on private investment opportunities... ..." looks like when traditional ML metrics fall short. Trust as...Platform
$160k - $300k
...mission is to revolutionize how engineering decisions are made, turning... ...About the Role As a Senior / Staff Infrastructure Engineer at... ...that powers our intelligence platform. You’ll be responsible for secure... ...distributed systems) Exposure to ML infra Personality & Values:...PlatformWork at officeVisa sponsorshipFlexible hours- ...a web application that distills complex ML signals, building automation tools that run... ...for: We’re looking for an experienced engineer to help shape our architecture, strengthen... ...maintain both internal and external facing API platforms and endpoints to enable customer...Platform
$279.2k - $390.9k
Team The ML Indexing & Retrieval Platform team at Reddit is responsible for building and scaling the core... ...-generation ML Indexing & Retrieval engine, integrating capabilities across lexical... ...us. We will use this information to evaluate your application for employment or...PlatformFor contractorsWork experience placementFlexible hours- ...looking for an experienced Staff Machine Learning Engineer eager to join EvenUp's... ...proprietary claims-intelligence platform, with a focus on machine... ...Working alongside senior ML engineers, data scientists... ...including embedding pipelines, evaluation frameworks, and...PlatformFull timeTemporary workLocal areaHome officeFlexible hours
- ...mission is to build the shared ML and AI infrastructure that... ...to production serving, evaluation, and monitoring. As part of... ...helping establish the core AI platform that enables innovation across Plaid. As a Staff Machine Learning Engineer, you will lead the technical...PlatformWork experience placementLocal areaImmediate start
- ...through our data annotation platform, generative AI solutions,... ...Role Overview As a Senior Staff Forward Deployed AI Engineer on our Enterprise team,... ...and data sources Implement evaluation frameworks to measure agent... ...customer data scientists, ML engineers, and software developers...Platform
- Senior Staff Machine Learning Engineer, Data & Eval United States AI and ML are at the heart of the Airbnb product. From Trust... ...optimization, RAG/Search, LLM evaluation and testing automation, feedback... ...building robust evaluation platforms for agent behavior validation...PlatformWork experience placementCasual workLive inWork at officeRemote work
$180.6k - $315k
...around the world. The Enterprise ML Research Lab works on the... ...our enterprise clients. As a Staff Agent Post-Training MLRE, you... ...our next-gen Agent RL training platform. You’ll build out the platform... ...to ensure a fair and thorough evaluation of all applicants. About Us:...PlatformFull time- ...decisions, backed by robust evaluation frameworks. Direct customer... ...economics research, were founding engineers at unicorn startups, and... ...languages such as Python, Go and ML/NLP libraries such as PyTorch... ...plus Experience with cloud platforms (e.g., AWS, Azure, GCP) and...PlatformFull timeContract work
$252k - $315k
...through our data annotation platform, generative AI solutions,... .... Role Overview As a Staff Forward Deployed AI Engineer on our Enterprise team, you... ...and data sources Implement evaluation frameworks to measure agent... ...customer data scientists, ML engineers, and software...PlatformFull time- # Staff AI Software EngineerRemote - US**Our Mission... ...looking for a **Staff AI Engineer** to help shape how... ...closely with product, platform, and security teams to... ...workflow orchestration to evaluation tooling.### Raise the Quality... ...working directly on ML/AI systems.* Real...PlatformWork at officeImmediate startRemote workWorldwideFlexible hours
$200k - $275k
...disrupt crime. TRM's platforms enable investigators to... ...more secure. The AI Engineering Team is chartered with... ...also deeply involved in evaluating and integrating... ...market. As a Senior or Staff AI Infrastructure Engineer... ...infrastructure for AI/ML systems. You will: Build...PlatformWorldwide- Sift Science, Inc. is looking for an experienced engineer to drive architectural improvements and enhance their fraud-fighting technologies. You will work on API development for seamless customer integration and collaborate extensively with cross-functional teams. The...Platform
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Staff Engineer, Trustworthy ML Evaluation Platform. Be the first to apply!
- assistant civil engineer San Francisco, CA
- engineering aide San Francisco, CA
- assistant mechanical engineer San Francisco, CA
- assistant engineering manager San Francisco, CA
- project engineer assistant project manager San Francisco, CA
- senior staff systems engineer San Francisco, CA
- staff automation engineer San Francisco, CA
- staff design engineer San Francisco, CA
- staff security engineer San Francisco, CA
- staff engineer San Francisco, CA


