Staff Engineer, Evals Platform & Model Benchmarking

$200k

Magic

Magic, located in San Francisco, is seeking a Member of Technical Staff to build the internal evaluations platform that supports critical company decisions. You will design, implement, and validate evaluation tasks for large-scale systems, ensuring correctness and reproducibility. The role is pivotal for research decisions and product quality, with a compensation range between $200K - $550K, including equity and benefits like unlimited paid time off and health insurance. #J-18808-Ljbffr Magic

Apply

Vacancy posted 4 days ago

Similar jobs that could be interesting for youBased on the Staff Engineer, Evals Platform & Model Benchmarking in San Francisco, CA vacancy

Staff Engineer: Foundation Model API & GPU Inference
$192k - $260k
A leading data and AI company is seeking a Staff Engineer to design and implement core systems for Foundation Model Serving. The ideal candidate will have over 10 years of experience in building large-scale distributed systems and will collaborate closely across teams...
Suggested
Databricks Inc.
San Francisco, CA
3 days ago
Model Engineer - Member of Technical Staff
...scale clients. Now, we’re assembling a founding core engineering team to build and train models that understand these systems, optimize operations, anticipate... ...from the ground up. Think in systems, not just benchmarks. Are excited to model the physical world and...
Suggested
Meter
San Francisco, CA
3 days ago
Staff ML Engineer: End-to-End Model Training & Deployment
Parallel is seeking a professional who will own the training pipeline behind models that support both the search stack and agents. Responsibilities include building pathways from product usage to high-quality training data, rigorously fine-tuning models, and shipping them...
Suggested
Parallel
San Francisco, CA
1 day ago
Staff ML Inference Engineer — Model Efficiency (Remote)
Jaide Health is seeking an engineer for their Model Efficiency team in San Francisco. The role focuses on building reliable ML systems while enhancing core performance metrics across model execution. You'll work with advanced performance techniques such as GPU/CUDA optimizations...
Suggested
Remote job
Jaide Health
San Francisco, CA
1 day ago
Senior Staff Engineer - AI Safety & Model Evaluation
Xcede is looking for a Member of Technical Staff focused on AI Safety to lead red-teaming efforts and ensure the robustness of next-... ...Applicants should have deep expertise in LLM safety, strong software engineering skills, and relevant academic qualifications in AI or related...
Suggested
Xcede
San Francisco, CA
2 days ago
Staff ML Engineer: End-to-End Model Training & Deployment
Parallel Web Systems in Palo Alto is seeking a professional to own the training pipeline behind models that power both their search stack and agents. The role involves building connections from product usage to training data, fine-tuning models, and ensuring safe deployment...
Parallel Web Systems
San Francisco, CA
4 days ago
Staff ML Engineer: End-to-End Model Training & Deployment
Parallel Bio in San Francisco is looking for a candidate to own the training pipeline behind models essential for both their search stack and agents. You will be responsible for building pathways from real product usage to high-quality training data while rigorously fine...
Parallel Bio
San Francisco, CA
1 day ago
Research Program Manager - Model Evals and Safety
...time Location Type On-site Department Engineering Our Mission Reflection’s mission is to... ...to all . We’re developing open weight models for individuals, agents, enterprises, and... ...foundational role. Reflection is building model evals and safety from the ground up, and this...
Full time
Relocation package
aijoblist
San Francisco, CA
3 days ago
Staff Engineer - Platform Engineering
$174k - $223k
...Scottsdale, San Francisco, Chicago, or New York follow a hybrid work model to allow for a more collaborative working environment.... ...for employment Visa sponsorship. Overall Purpose A Staff Platform Engineer defines and drives technical strategy for platform and runtime...
Hourly pay
Work at office
Immediate start
Visa sponsorship
Work visa
Flexible hours
Dormont Manufacturing Company
San Francisco, CA
2 days ago
Senior Model Serving Engineer - Low-Latency AI Platform
A leading data and AI company in San Francisco is seeking a Staff Engineer to design and implement systems for their AI/ML Model Serving platform. You will collaborate with product, infrastructure, and research teams to ensure high-performance system delivery. The ideal...
Menlo Ventures
San Francisco, CA
1 day ago
Staff Engineer, AI Quality & Platform Productivity
$253k - $308k
Harper Group, based in San Francisco, is seeking a Staff Engineer to lead efforts in engineering productivity and AI quality. This role involves establishing CI/CD quality gates, integration test harnesses, and developing automated PR preflights that enhance coding efficiency...
Harper Group
San Francisco, CA
1 day ago
Staff Engineer - AI Integrations & API Platforms
...design and build robust API integrations within the rapidly evolving AI ecosystem. You will create seamless connections between our platform and third-party AI tools by building custom nodes and plugins. The ideal candidate is well-versed in the modern AI landscape, has...
Parallel Web Systems
San Francisco, CA
2 days ago
Staff Engineer, Scalable Identity & Platform
A tech company specialized in identity management is looking for staff-level engineers in San Francisco, California. Candidates should have a strong background in scalable product development and proficiency in technologies like Next.js, JavaScript, TypeScript, and Go....
Clerk, Inc.
San Francisco, CA
4 days ago
Staff Engineer, AI Commerce Platform
...Francisco is seeking a Member of Technical Staff to build core systems and own product... ...and moving the mission from prototype to platform in a talent-dense team. The ideal... ...development, API design, and possesses a strong engineering culture. You will have the opportunity...
Getcatalog
San Francisco, CA
4 days ago
Staff Engineer, AI Infrastructure & Platform
...superintelligence stack, enabling end-to-end reinforcement learning at frontier scale. This hybrid role covers infrastructure and platform development, focusing on distributed systems, high-performance networking, and cloud orchestration. You will design and implement...
Prime-Intellect
San Francisco, CA
2 days ago
Staff Engineer, Developer Experience & DX Platform
$176k - $253k
...in San Francisco, is looking for a Senior Member of Technical Staff to enhance developer experience through optimizing CI/CD processes... ...performance and involves building an efficient development platform that integrates closely with internal teams. The ideal applicant...
Harper Group
San Francisco, CA
5 days ago
Staff Engineer, Trustworthy ML Evaluation Platform
...Chopping Block, Inc. is seeking a Member of Technical Staff to build and maintain the evaluation platform used across Magic. You will develop infrastructure... .... The ideal candidate should have strong software engineering skills, attention to detail, and experience with...
AI Chopping Block, Inc.
San Francisco, CA
5 days ago
Staff Engineer — AI & Platform Tools
$150k - $300k
Alumni Ventures is looking for a passionate engineer based in San Francisco to join our innovative team. In this role, you will develop cutting-edge social networking experiences and collaborate closely with product and research teams. Ideal candidates have over 3 years...
Alumni Ventures
San Francisco, CA
1 day ago
AI Product Manager - End-to-End Model Launches & Evals
$305k
Anthropic is looking for a Product Manager for Claude Code's model performance team in San Francisco. As a Product Manager, you will... ...end model launches, implement evaluations, and collaborate with engineers and researchers. The ideal candidate has an engineering...
Anthropic
San Francisco, CA
2 days ago
Staff Engineer - Enterprise AI Agents & Platform
...in San Francisco, is seeking a Member of Technical Staff to contribute to its cutting-edge AI platform. In this role, you will be responsible for shipping... ...ideal candidate has a strong background in full-stack engineering or deep infrastructure knowledge, with 1-15 years...
Work experience placement
Ersilia
San Francisco, CA
1 day ago
Staff Engineer - Voice AI Platform & Agent Tech
Vapi is hiring a full-stack engineer in San Francisco, California, to build innovative voice AI products. You will develop features on our platform, ensuring excellent user experiences and leveraging AI tools. Must have 5+ years of product development experience, proficiency...
Flexible hours
Vapi
San Francisco, CA
4 days ago
Senior Staff Engineer, AI Agent Harness Platform
$187k - $264k
Harper Group is seeking a Senior Member of Technical Staff in San Francisco to focus on building innovative agent-loop primitives and advanced harness infrastructure. The role demands strong software development skills along with experience in production environments, aimed...
Harper Group
San Francisco, CA
1 day ago
Staff Engineer, AI Product & Platform Lead
Perplexity is hiring engineers to develop innovative AI products that enhance human productivity. The role requires building, launching, and owning systems that empower users through AI. Ideal candidates will have over 4 years of software engineering experience, strong...
Aimling
San Francisco, CA
2 days ago
Staff AI Engineer, ATS Platform - Recruiting Intelligence
A leading technology company in San Francisco is seeking a Staff Engineer to join their Applicant Tracking System (ATS) team. This role focuses on designing and building AI-driven features that enhance the recruiting process. The ideal candidate brings extensive software...
Rippling
San Francisco, CA
1 day ago
Staff Product Engineer Voice Platform & APIs
A tech startup specializing in voice technologies is looking for a Product Engineer to manage voice agent projects. You will ramp up on the technology, handle large projects end-to-end, and engage with customers to create valuable APIs. This role offers a competitive salary...
Flexible hours
VAPI
San Francisco, CA
5 days ago
Staff Systems Engineer - AI Platform & Distributed Infra
deCircle is seeking an engineer to design and implement core systems for its agentic AI platform. This role involves building production systems, ensuring reliable cloud-native infrastructure, and developing secure execution environments. The ideal candidate has over 3...
deCircle
San Francisco, CA
1 day ago
Staff Product Engineer, Voice Platform (Hybrid)
$200k - $280k
A leading voice technology company in San Francisco is looking for a Product Engineer to drive product initiatives and enhance the voice agent pipeline. This role involves ramping up quickly on current projects and taking ownership of large initiatives. Ideal candidates...
Flexible hours
Slope
San Francisco, CA
4 days ago
Staff Backend Engineer — AI-Driven GTM Platform
$230k - $285k
A fast-growing tech company in San Francisco is looking for a Software Engineer to shape the product roadmap and engineering foundation. You will work closely with founding members, leveraging the latest AI technologies in a collaborative environment. The ideal candidate...
Unify
San Francisco, CA
4 days ago
Product Manager, Claude Code Model Performance San Francisco, CA | New York City, NY
$305k
...committed researchers, engineers, policy experts, and business... ...on Claude Code's model performance team, you will... ...end‑to‑end, build evals that measure what matters... ...developers, and competitive benchmarks into clear priorities... ..., we expect all staff to be in one of our offices...
Visa sponsorship
Anthropic
San Francisco, CA
2 days ago
Senior Software Engineer, AI Platform
$226k - $306k
...Senior Software Engineer, AI Platform San Francisco, US (Hybrid) Mixpanel... ...from production and evals, given the context of the invocation... ...and securely leverage AI models. Agent Orchestration:... ...by role and level and are benchmarked to the SF Bay Area Technology...
Contract work
Remote work
Mixpanel
San Francisco, CA
4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff Engineer, Evals Platform & Model Benchmarking. Be the first to apply!