Production RL & LLM Fine-Tuning Researcher

Baseten

A leading AI platform company in San Francisco is seeking a talented individual to work on training models that leverage reinforcement learning. This role involves designing post-training pipelines, building training environments, and interacting with clients to improve model performance. Ideal candidates will have hands-on experience with LLM fine-tuning, an understanding of production ML systems, and strong problem-solving skills. The company promotes a diverse and inclusive workplace, offering competitive compensation and generous benefits. #J-18808-Ljbffr Baseten

Apply

Vacancy posted 3 days ago

Similar jobs that could be interesting for youBased on the Production RL & LLM Fine-Tuning Researcher in San Francisco, CA vacancy

Artificial Intelligence Researcher
...AI Researcher – Video World Generation San Francisco (Bay Area) Help... ...video creation RL techniques for more adaptive... ...generation Required: Strong production experience with large multimodal... ...): Experience training or fine-tuning diffusion models Background...
Suggested
DeepRec.ai
San Francisco, CA
4 days ago
Applied Researcher
$150k
Haize Labs gets LLM apps out of POCs and into production. We eliminate the risk and improve the reliability of... ...-testing them. We are looking for Research Engineers to help develop our reliability... ...evaluation. A subset of this is fine. Annual Salary $150,000 - $600,000...
Suggested
Visa sponsorship
Enboarder
San Francisco, CA
1 day ago
Post-Training Applied Researcher
...Gamma and Writer. By uniting applied AI research, flexible infrastructure, and... ...AI to bring cutting‑edge models into production. We're growing quickly and recently raised... ...are looking for people with hands‑on LLM fine‑tuning and RL experience. Researchers who are excited...
Suggested
Flexible hours
Shift work
Baseten
San Francisco, CA
3 days ago
Security Researcher
...Engineering Security Researcher Contract €50k – €100k Remote / In-person San Francisco... ...bugs in real systems. You understand LLM‑specific attack surfaces — prompt... ...that companies trust to run AI agents in production. The stakes are real. Direct access to...
Suggested
Contract work
Remote work
Flexible hours
OpenCompany
San Francisco, CA
4 days ago
Machine Learning Researcher, Multimodal LLMs
$180k - $260k
...Machine Learning Researcher, Multimodal LLMs Location: San Francisco... ...next-generation multimodal LLM stack, combining speech, text... ...them all the way from idea to production. At Bland, we're not just... ...understanding of prompting, fine-tuning, and alignment techniques...
Suggested
Work at office
Remote work
Bland AI
San Francisco, CA
1 day ago
AI Researcher, Core ML (Turbo)
$200k - $280k
...engines) and post-training / RL systems. We build and... ...that can run at production scale. Our mandate is... ...creating many knobs to tune across the RL algorithm... ...using these to train or fine-tune real models. Model... .... Have a solid research foundation in your area...
Full time
Together AI
San Francisco, CA
2 days ago
AI Researcher
$250k - $325k
...agentic RAG [2023] Large-scale LLM-based legal fact extraction [... ...: Why, What, and Who Why AI Researchers are the engine of innovation... ...t nice-to-haves, they're the product. Your research will translate... .... Explore and apply advanced fine-tuning, PEFT, and distillation...
Contract work
Work at office
Immediate start
Remote work
Ivo Inc.
San Francisco, CA
3 days ago
Multimodal LLM Researcher: Real-Time Speech & Tools
$180k - $260k
BLAND is seeking a Machine Learning Researcher focused on multimodal LLM technology. The role involves developing conversational AI models that integrate... ...a strong background in machine learning, LLMs, and product intuition to enhance user interactions. Compensation ranges...
BLAND
San Francisco, CA
5 days ago
Applied Researcher I at Capital One - San Francisco, CA, United States
$195k - $222.5k
Overview Applied Researcher I Overview: At Capital One, we... ...with breakthrough product experiences and scalable... ...Engineering or related fields LLM PhD focus on NLP or... ..., Instruction-Tuning, Dialogue-Finetuning, Parameter... ...Experience deploying a fine-tuned large language...
Full time
Part time
Local area
Victrays
San Francisco, CA
4 days ago
Senior / Staff Machine Learning Research Scientist, Agents
$302.4k - $378k
...team, part of Scale's Research organization, brings together... ...agent environments and RL reward signals,... ...not only expertise in LLM agents and planning algorithms... ...a research setting or product development. Strong... ...with open source LLM fine-tuning or involvement in bespoke...
Full time
Scale AI
San Francisco, CA
6 days ago
Machine Learning Researcher
$180k - $280k
...Machine Learning Researcher Kiddom is a groundbreaking educational platform that promotes... ...Retrieval-Augmented Generation (RAG), fine-tuning, and evaluation of large language model... ...diverse team of engineers, designers, product managers, and educators who are driven...
Permanent employment
Full time
Local area
Flexible hours
Kiddom
San Francisco, CA
1 day ago
Remote AI Analytics & LLM Researcher
A leading AI research accelerator in San Francisco is looking for candidates proficient in English and analytical skills to assist in training large language models. This role demands independence, creativity, and the ability to work flexibly in a remote environment. Ideal...
Remote work
The10minutecareersolution
San Francisco, CA
4 days ago
Applied AI Researcher, Benchmarking
$150k - $250k
...goods, and global social organizations. We research and deploy technologies that power AI-... ...reliable execution of AI systems, and products that transform mission-critical workflows... ...systems using models rather than training or fine-tuning them. Ideal candidates have expertise in...
Work at office
3 days per week
Distyl AI
San Francisco, CA
1 day ago
Research Scientist (Intern)
...models and verifiers: Developing fine‑grained supervision over... ...Contributing to alignment and oversight research - figuring out how to reliably... ...across the research-to-production lifecycle. Evaluation: Contributing... ...language models, but strong RL backgrounds from other domains...
Full time
Internship
Xterraai
San Francisco, CA
2 days ago
RL Algorithms Research Scientist - Post-LLM Learning
$300k
...develop new reinforcement learning algorithms for post-training language models in San Francisco. The successful candidate will own a research agenda, establish baselines for evaluating efficiency, and collaborate with peers to innovate algorithmic solutions. Candidates...
Vmax
San Francisco, CA
5 days ago
Applied Researcher, Audio Understanding
...engineering paired with a design-minded product engineering team to build and ship... ...experts in AI. The Role As a Senior Applied Researcher in Audio Understanding, you will be... ...systems. Build large scale pre-training and fine-tuning datasets for audio understanding...
Work at office
Relocation package
Cartesia
San Francisco, CA
2 days ago
Machine Learning Researcher, Audio
$160k - $250k
Machine Learning Researcher, Audio Location: San Francisco, CA or Remote (US) About Bland... ...from theory to large‑scale training to production inference systems serving millions of... ...Advance Speech‑to‑Text Modeling Build and fine‑tune large scale ASR systems robust to...
Work at office
Remote work
Bland
San Francisco, CA
5 days ago
Applied AI Researcher, AI Systems
$150k - $250k
...goods, and global social organizations. We research and deploy technologies that power AI-... ...reliable execution of AI systems, and products that transform mission-critical workflows... ...using models rather than training or fine-tuning them. Ideal candidates have expertise in...
Work at office
3 days per week
Distyl AI
San Francisco, CA
5 days ago
AI Anime Researcher - LLM
...Building We’ve developed an in-house LLM storytelling system that blends AI... ...Project Astra, and top-tier AI researchers. As an early member of this... ...balance research exploration with product-focused development Or, putting it in RL jargon, exploration and exploitation...
Work at office
Visa sponsorship
Spellbrush
San Francisco, CA
28 days ago
Sr. Principal Security Researcher - Wildfire
$162.7k - $263.18k
Job Summary As a Security Researcher on the WildFire Team, you will play a crucial role in... ...deep threat knowledge into scalable, productized detection capabilities to combat cyber... ...Qualifications Experience building or fine‑tuning AI agents to autonomously triage alerts...
Visa sponsorship
Work visa
Shift work
Palo Alto Networks
San Francisco, CA
2 days ago
Legal AI Researcher: Groundbreaking LLMs in Production
Ivo is looking for an AI Researcher to push the boundaries of legal technology using LLMs and advanced AI techniques. You'll be a core... ...innovation team, translating research into industry-changing product features. Your creative solutions will revolve around complex...
Ivo
San Francisco, CA
2 days ago
AI Researcher — Legal LLMs in Production
Icehouseventures seeks an AI researcher in San Francisco to drive innovation in legal technology. This role requires owning research roadmaps, designing experiments, and collaborating with engineering to deliver impactful AI solutions for legal professionals. The ideal...
Icehouseventures
San Francisco, CA
2 days ago
Applied Researcher: Agentic AI & RL Systems
Patronus AI is seeking an Applied Researcher to drive foundational research on agentic AI systems in San Francisco. This position emphasizes innovation in reinforcement learning and scalable oversight, requiring a strong educational background in related fields and experience...
Patronus AI
San Francisco, CA
4 days ago
Researcher, Recursive Self-Improvement Preparedness
$295k
...and deploying advanced AI systems. As a Researcher for loss of control mitigations, you... ...controllable model behavior across OpenAI’s products and internal deployments. This role... ...familiar with methods for training and fine‑tuning large language models, including distillation...
Full time
Work at office
Local area
Relocation package
Flexible hours
Slope
San Francisco, CA
2 days ago
Principal AI Researcher
.... We are seeking a Principal AI Researcher to join Veracode's AI & Innovation Research... ...projects for improving Veracode's product portfolio or creating new products through... ...on experience with Context Engineering, Fine-Tuning (e.g. LoRA), Retrieval-Augmented-...
Worldwide
Veracode
San Francisco, CA
6 days ago
Staff Machine Learning Research Scientist, LLM Evals
$264.8k - $331k
...(LLMs). We are building industry-leading LLM evals, setting new standards for model performance... ...next generation of AI capabilities. Our Research teams work with the industry's leading AI... ...world's most important decisions. Our products provide the high-quality data and full-...
Full time
DiversityJobs Inc
San Francisco, CA
13 days ago
AI Research Scientist: RL & Scientific Reasoning
Xterraai, based in San Francisco, is seeking research scientists to develop innovative AI systems that reason about complex scientific... ...engineering, allowing you to take ownership from ideation to production. Locally and remotely, you will focus on reinforcement learning...
Remote work
Xterraai
San Francisco, CA
2 days ago
Lead Machine Learning Researcher
...runs on real machines, not benchmarks. About the role As an AI Researcher at Droyd, you’ll own meaningful parts of the learning and... ...directions that improve speed, reliability, or capability Develop fine‑tuning and optimization methods tailored to robotics workloads...
Droyd
San Francisco, CA
1 day ago
Staff ML Research Scientist, LLM Evaluations & Benchmarks
A leading AI evaluation company is looking for a Staff Machine Learning Research Scientist to advance LLM evaluation methodologies. This role involves designing benchmarks, collaborating with teams, and mentoring others. Ideal candidates have significant experience in NLP...
Scale AI, Inc.
San Francisco, CA
3 days ago
Forum Intern: Hybrid News Producer & Researcher
...offering a paid internship in San Francisco to assist in the production of the 'Forum' radio show. The internship runs from July 6, 20... ..., requiring 16 hours of work per week. Interns will engage in research, interviews, and production tasks while developing journalism...
16 hours
Internship
Work at office
Remote work
KQED
San Francisco, CA
4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Production RL & LLM Fine-Tuning Researcher. Be the first to apply!