Staff Research Engineer - LLM Post-Training & Evaluation

Gravity Engineering Services Pvt Ltd.

Gravity Engineering Services Pvt Ltd. is seeking a Member of Technical Staff in San Francisco, California. In this role, you will design and build the infrastructure necessary for models to learn from production workflows continually. You will manage end-to-end experiments related to data, training, and system evaluation, working closely with the company's founders. The ideal candidate will have a strong background in large language models and be equipped with experience in experimental design, preferably with a Master's or PhD in a related field. This role is critical in shaping both the technology and the organizational culture. #J-18808-Ljbffr Gravity Engineering Services Pvt Ltd.

Apply

Vacancy posted 4 days ago

Similar jobs that could be interesting for youBased on the Staff Research Engineer - LLM Post-Training & Evaluation in San Francisco, CA vacancy

Staff Machine Learning Research Engineer, Agent Post-training - Enterprise GenAI
$264.8k - $331k
...out state of the art post-training algorithms to reach the... ...The Enterprise ML Research Lab works on the front... ...enterprise clients. As a Staff Agent Post-Training MLRE... ...: ~5+ years of LLM training in a production... ...a fair and thorough evaluation of all applicants....
Training
Full time
Scale AI
San Francisco, CA
3 days ago
Member of Technical Staff, Research Engineer
...generation systems, run evaluations, inspect model... ...world trajectories or researcher hypotheses, materialize... ...AI research, systems engineering, and model evaluation.... ...Have experience with RL, LLM agents, computer‑use agents, evals, post‑training, synthetic data, simulation...
Training
Plato
San Francisco, CA
2 days ago
Staff RL Research Engineer — Post-Training Environments
$200k
...San Francisco is seeking a Software Engineer for their RL Research & Environments team. The role... ...on designing and improving data and evaluation systems to enhance model capabilities... ...position is an opportunity to influence post-training strategies as part of a fast-paced...
Training
SupportFinity™
San Francisco, CA
2 days ago
Staff Research Engineer, Model Efficiency
...intelligence to serve humanity. We’re training and deploying frontier... .... Cohere is a team of researchers, engineers, designers, and more, who are... ...responsible for pushing the limits of LLM inference efficiency across... ...preferred locations. As a Staff Research Engineer, you will...
Training
Full time
Work at office
Remote work
Flexible hours
Cohere
San Francisco, CA
3 days ago
Senior Staff ML Engineer (Driver Understanding and Evaluation)
...large-scale distributed training and data processing ,... ...experience in ML/RL research and application , (Desirable... ...and using metrics for evaluating complex AI systems , (... ...and software engineers who are passionate about... ...models and Generative AI (LLM/VLM) solutions. These...
Training
Waymo
San Francisco, CA
3 days ago
Research Engineer, Model Evaluations - Remote-Friendly Impact
$320k
Anthropic in New York City is seeking a Research Engineer to develop evaluations for Claude’s capabilities. The ideal candidate should have strong Python... ...for running evaluations, and debugging results during training runs. The role offers a hybrid work model and...
Training
Remote job
Menlo Ventures
San Francisco, CA
5 days ago
Member of Technical Staff, ML Research Engineer
...preference and judgment. That lets us evaluate models on what people actually care about... ...About the Role We’re looking for an ML Research Engineer to help us build better ways to... ...analyses What We’re Looking For Experience training, fine-tuning, or evaluating models, including...
Training
Arcada Labs Incorporated
San Francisco, CA
2 days ago
Staff Engineer - LLM Inference & Serving at Scale
$150k - $300k
Prime Intellect is looking for a skilled ML Systems Engineer to build and optimize LLM serving infrastructure and inference systems. This hybrid role... ...to the scalability of their reinforcement learning training. Successful candidates will have over 3 years of experience...
Training
Relocation package
Prime Intellect
San Francisco, CA
4 days ago
Staff Engineer, RL Infrastructure & AI Evaluation
$180k
...focused on AI is seeking experienced software engineers to develop robust data pipelines and automation... ...frameworks. This role involves creating and maintaining evaluation tasks and improving operational procedures for RL training. The ideal candidate has extensive experience...
Training
xAI
San Francisco, CA
4 days ago
Staff Research Engineer - Pre-Training for Open LLMs
...Capital in San Francisco is seeking talented individuals for AI research roles focused on open superintelligence. Candidates will... ...in Computer Science or a related field, possess solid software engineering skills, and have experience with large-scale systems. The position...
Training
B Capital
San Francisco, CA
1 day ago
Staff Applied Research Engineer
$220.8k - $298.8k
# Staff Applied Research EngineerHybrid - San FranciscoApply**Our Mission & Values... ...is seeking an Applied AI Engineer to drive the quality and... ...research, experimentation, and evaluation. In this role, you will... ...systems**: cross-encoders, LLM-based rerankers, learning-to...
Work at office
Immediate start
Worldwide
Monday to Friday
Flexible hours
Careers at Drata
San Francisco, CA
1 day ago
Staff Applied Research Engineer
$220.8k - $298.8k
...automation. Drata is seeking an Applied AI Engineer to drive the quality and... ...of our AI systems through rigorous research, experimentation, and evaluation. In this role, you will optimize retrieval... ...reranking systems: cross‑encoders, LLM‑based rerankers, learning‑to‑rank,...
Flexible hours
Drata
San Francisco, CA
2 days ago
Autonomy Safety Evaluations Research Engineer
$315k
We are looking for Research Engineers to build “gold standard” evaluations for catastrophic risks, in order... ...for the way we train, deploy, and secure our... ...capabilities. Using our post training infrastructure... ...Currently, we expect all staff to be in one of our offices...
Training
Currently hiring
Work at office
Immediate start
Home office
Visa sponsorship
Relocation package
Anthropic
San Francisco, CA
3 days ago
Research Engineer
...infrastructure / Reinforcement Learning (RL) training data & evaluations Compensation: Competitive (range... ...Our partner is hiring a Research Engineer to help scale the quality assurance... ...Familiarity with modern AI tooling and LLM capabilities Equal Opportunity &...
Training
Remote work
talentpluto
San Francisco, CA
13 days ago
Applied Research Engineer - Member of Technical Staff
$200k - $400k
About the Company Pilots don’t train with real passengers. Surgeons... ...based on real humans. Our research pioneered the field of AI-based... ...Role As a Member of Technical Staff (MTS) in Research, you will work across the stack to train, evaluate, deploy, and monitor our models...
Training
Flexible hours
Simile
San Francisco, CA
3 days ago
Staff Research Engineer: AI Task & Curriculum Architect
...unique role at the intersection of AI research and systems engineering. You will design experiments, build task generation systems, and evaluate model failures. This is a hands-on role... ...background in reinforcement learning, LLM agents, and model behavior analysis, and...
Plato
San Francisco, CA
2 days ago
Staff Machine Learning Engineer (Research Scientist) - DFAI
...pretraining to production serving, evaluation, and monitoring. As part... ...across Plaid. As a Staff Machine Learning Engineer, you will lead the... ...pipelines that translate research into production impact. You... ...architecture design, distributed training, serving infrastructure, monitoring...
Training
Work experience placement
Local area
Immediate start
Plaid Inc
San Francisco, CA
3 days ago
Research Engineer
...everything Gamma creates. As our Research Engineer, you'll design evaluation frameworks that measure AI output... ...experience with prompt engineering, LLM experimentation, and systematic evaluation... ...improvements. Experience with post‑training techniques for LLMs including...
Training
Work at office
Work from home
Gamma
San Francisco, CA
5 days ago
Applied Research Engineer
...layer for AI agents. As a Senior Applied Research Engineer, you'll explore novel approaches to... ...engineers who can run rigorous experiments, train and evaluate models, and ship the result as... ...work in retrieval, memory systems, or LLM evaluation. Tech stack Python, Rust/C++...
Training
Work experience placement
Zep AI (YC W24)
San Francisco, CA
3 days ago
Research Engineer
$200k - $250k
Research Engineer Location San Francisco (On-site) Compensation $200,000 - $250,000 + variable... ...engineering: dataset curation, model training and evaluation, retrieval and tool use, safety and... ...Track record shipping applied ML or LLM features to real users, not just prototypes...
Training
Lotus Health AI
San Francisco, CA
2 days ago
Research Engineer, Frontier Red Team (Autonomy)
$320k
...growing group of committed researchers, engineers, policy experts, and... ..., you’ll build and evaluate model organisms of... ...AI. Create evals and training environments to... ...building and working with LLM‑based agents or autonomous... ...: We expect all staff to be in one of our offices...
Training
Relocation
Visa sponsorship
Anthropic
San Francisco, CA
1 day ago
Research Engineer — Search/IR
$180k - $270k
Research Engineer (Focused on Search/IR) You'll own and advance the search... ...reliably convert URLs into LLM‑ready markdown or structured... .../IR improvements with model training and broader product strategy... ...production implications. We'll evaluate on technical depth,...
Training
Full time
Temporary work
Remote work
Firecrawl
San Francisco, CA
3 days ago
Research Engineer - Evals
$160k - $240k
Research Engineer — Evals Location: San Francisco, CA (Hybrid) OR Remote (... ...Overview You'll build the evaluation systems that tell us whether... ...URL into clean, structured, LLM-ready data reliably — is hard... ...reporting layer — they're a training signal. You'll work closely...
Training
Full time
Temporary work
Work at office
Remote work
AI Chopping Block, Inc.
San Francisco, CA
5 days ago
Research Engineer, Codex
...coding. We operate across research, engineering, product, and... ...research insights into model training, alignment, and evaluation. Hunt down and address inefficiencies... ...—from agent behavior to LLM inference to container... ...you believe this job posting is non-compliant, please...
Training
Work at office
Relocation package
OpenAI
San Francisco, CA
4 days ago
Research Engineer - Benchmarking, Evals & Failure Analysis
...vast talent network trains frontier AI models... ...ll work alongside researchers, operators, and AI... ...As a Research Engineer at Mercor, you’ll... ...benchmarking pipelines, evaluation systems, and... ...improvements for post-training, RLVR, and... ...Build and operate LLM evaluation systems...
Training
Work at office
Mercor
San Francisco, CA
5 days ago
Research Engineer, AI Observability
$320k - $405k
...growing group of committed researchers, engineers, policy experts, and... ...About the Team As AI training and deployments scale... ...Are familiar with LLM application... ...context engineering, evaluation, orchestration) Enjoy... ...Currently, we expect all staff to be in one of our offices...
Training
Work at office
Visa sponsorship
Flexible hours
Menlo Ventures
San Francisco, CA
5 days ago
Research - engineering
$315k
...growing group of committed researchers, engineers, policy experts, and... ...and Scaling team trains our production... ...training dynamics and evaluation infrastructure Design... ...experience training LLM\'s or working extensively... ...Currently, we expect all staff to be in one of our...
Training
Full time
Work at office
Visa sponsorship
Flexible hours
Weekend work
Afternoon shift
Menlo Ventures
San Francisco, CA
5 days ago
Research Engineer — Reinforcement Learning
$180k - $270k
Research Engineer (Focused on RL) You'll bring reinforcement learning to Firecrawl... ...core product — building the training infrastructure, reward... ...RL approaches and modern LLM agent systems. If you care as... ...the systems that train and evaluate Firecrawl's models. You'll own...
Training
Full time
Temporary work
Remote work
Firecrawl
San Francisco, CA
3 days ago
Staff Forward Deployed Engineer
$265k - $295k
...with frontier AI lab researchers to create evaluations, publish benchmarks,... ...Work together with engineers, scientists, operators... ...data-intensive post-training techniques. We believe... ...About the Role As a Staff Forward Deployed Engineer... ...strategy for LLM-powered or AI-native...
Training
Full time
Work at office
Remote work
Flexible hours
Handshake
San Francisco, CA
19 days ago
Staff Data Quality Engineer, LLM Post-Training
B Capital is seeking a data engineer to ensure high data quality for training AI models. You will own the upstream data quality for LLM post-training and design automated QA methods in a collaborative environment. Ideal candidates will have strong engineering skills, a...
Training
B Capital
San Francisco, CA
4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff Research Engineer - LLM Post-Training & Evaluation. Be the first to apply!