Production RL & LLM Fine-Tuning Researcher
Baseten
A leading AI platform company in San Francisco is seeking a talented individual to work on training models that leverage reinforcement learning. This role involves designing post-training pipelines, building training environments, and interacting with clients to improve model performance. Ideal candidates will have hands-on experience with LLM fine-tuning, an understanding of production ML systems, and strong problem-solving skills. The company promotes a diverse and inclusive workplace, offering competitive compensation and generous benefits. #J-18808-Ljbffr Baseten
- ...AI Researcher – Video World Generation San Francisco (Bay Area) Help... ...video creation RL techniques for more adaptive... ...generation Required: Strong production experience with large multimodal... ...): Experience training or fine-tuning diffusion models Background...Suggested
$150k
Haize Labs gets LLM apps out of POCs and into production. We eliminate the risk and improve the reliability of... ...-testing them. We are looking for Research Engineers to help develop our reliability... ...evaluation. A subset of this is fine. Annual Salary $150,000 - $600,000...SuggestedVisa sponsorship- ...Gamma and Writer. By uniting applied AI research, flexible infrastructure, and... ...AI to bring cutting‑edge models into production. We're growing quickly and recently raised... ...are looking for people with hands‑on LLM fine‑tuning and RL experience. Researchers who are excited...SuggestedFlexible hoursShift work
- ...Engineering Security Researcher Contract €50k – €100k Remote / In-person San Francisco... ...bugs in real systems. You understand LLM‑specific attack surfaces — prompt... ...that companies trust to run AI agents in production. The stakes are real. Direct access to...SuggestedContract workRemote workFlexible hours
$180k - $260k
...Machine Learning Researcher, Multimodal LLMs Location: San Francisco... ...next-generation multimodal LLM stack, combining speech, text... ...them all the way from idea to production. At Bland, we're not just... ...understanding of prompting, fine-tuning, and alignment techniques...SuggestedWork at officeRemote work$200k - $280k
...engines) and post-training / RL systems. We build and... ...that can run at production scale. Our mandate is... ...creating many knobs to tune across the RL algorithm... ...using these to train or fine-tune real models. Model... .... Have a solid research foundation in your area...Full time$250k - $325k
...agentic RAG [2023] Large-scale LLM-based legal fact extraction [... ...: Why, What, and Who Why AI Researchers are the engine of innovation... ...t nice-to-haves, they're the product. Your research will translate... .... Explore and apply advanced fine-tuning, PEFT, and distillation...Contract workWork at officeImmediate startRemote work$180k - $260k
BLAND is seeking a Machine Learning Researcher focused on multimodal LLM technology. The role involves developing conversational AI models that integrate... ...a strong background in machine learning, LLMs, and product intuition to enhance user interactions. Compensation ranges...$195k - $222.5k
Overview Applied Researcher I Overview: At Capital One, we... ...with breakthrough product experiences and scalable... ...Engineering or related fields LLM PhD focus on NLP or... ..., Instruction-Tuning, Dialogue-Finetuning, Parameter... ...Experience deploying a fine-tuned large language...Full timePart timeLocal area$302.4k - $378k
...team, part of Scale's Research organization, brings together... ...agent environments and RL reward signals,... ...not only expertise in LLM agents and planning algorithms... ...a research setting or product development. Strong... ...with open source LLM fine-tuning or involvement in bespoke...Full time$180k - $280k
...Machine Learning Researcher Kiddom is a groundbreaking educational platform that promotes... ...Retrieval-Augmented Generation (RAG), fine-tuning, and evaluation of large language model... ...diverse team of engineers, designers, product managers, and educators who are driven...Permanent employmentFull timeLocal areaFlexible hours- A leading AI research accelerator in San Francisco is looking for candidates proficient in English and analytical skills to assist in training large language models. This role demands independence, creativity, and the ability to work flexibly in a remote environment. Ideal...Remote work
$150k - $250k
...goods, and global social organizations. We research and deploy technologies that power AI-... ...reliable execution of AI systems, and products that transform mission-critical workflows... ...systems using models rather than training or fine-tuning them. Ideal candidates have expertise in...Work at office3 days per week- ...models and verifiers: Developing fine‑grained supervision over... ...Contributing to alignment and oversight research - figuring out how to reliably... ...across the research-to-production lifecycle. Evaluation: Contributing... ...language models, but strong RL backgrounds from other domains...Full timeInternship
$300k
...develop new reinforcement learning algorithms for post-training language models in San Francisco. The successful candidate will own a research agenda, establish baselines for evaluating efficiency, and collaborate with peers to innovate algorithmic solutions. Candidates...- ...engineering paired with a design-minded product engineering team to build and ship... ...experts in AI. The Role As a Senior Applied Researcher in Audio Understanding, you will be... ...systems. Build large scale pre-training and fine-tuning datasets for audio understanding...Work at officeRelocation package
$160k - $250k
Machine Learning Researcher, Audio Location: San Francisco, CA or Remote (US) About Bland... ...from theory to large‑scale training to production inference systems serving millions of... ...Advance Speech‑to‑Text Modeling Build and fine‑tune large scale ASR systems robust to...Work at officeRemote work$150k - $250k
...goods, and global social organizations. We research and deploy technologies that power AI-... ...reliable execution of AI systems, and products that transform mission-critical workflows... ...using models rather than training or fine-tuning them. Ideal candidates have expertise in...Work at office3 days per week- ...Building We’ve developed an in-house LLM storytelling system that blends AI... ...Project Astra, and top-tier AI researchers. As an early member of this... ...balance research exploration with product-focused development Or, putting it in RL jargon, exploration and exploitation...Work at officeVisa sponsorship
$162.7k - $263.18k
Job Summary As a Security Researcher on the WildFire Team, you will play a crucial role in... ...deep threat knowledge into scalable, productized detection capabilities to combat cyber... ...Qualifications Experience building or fine‑tuning AI agents to autonomously triage alerts...Visa sponsorshipWork visaShift work- Ivo is looking for an AI Researcher to push the boundaries of legal technology using LLMs and advanced AI techniques. You'll be a core... ...innovation team, translating research into industry-changing product features. Your creative solutions will revolve around complex...
- Icehouseventures seeks an AI researcher in San Francisco to drive innovation in legal technology. This role requires owning research roadmaps, designing experiments, and collaborating with engineering to deliver impactful AI solutions for legal professionals. The ideal...
- Patronus AI is seeking an Applied Researcher to drive foundational research on agentic AI systems in San Francisco. This position emphasizes innovation in reinforcement learning and scalable oversight, requiring a strong educational background in related fields and experience...
$295k
...and deploying advanced AI systems. As a Researcher for loss of control mitigations, you... ...controllable model behavior across OpenAI’s products and internal deployments. This role... ...familiar with methods for training and fine‑tuning large language models, including distillation...Full timeWork at officeLocal areaRelocation packageFlexible hours- .... We are seeking a Principal AI Researcher to join Veracode's AI & Innovation Research... ...projects for improving Veracode's product portfolio or creating new products through... ...on experience with Context Engineering, Fine-Tuning (e.g. LoRA), Retrieval-Augmented-...Worldwide
$264.8k - $331k
...(LLMs). We are building industry-leading LLM evals, setting new standards for model performance... ...next generation of AI capabilities. Our Research teams work with the industry's leading AI... ...world's most important decisions. Our products provide the high-quality data and full-...Full time- Xterraai, based in San Francisco, is seeking research scientists to develop innovative AI systems that reason about complex scientific... ...engineering, allowing you to take ownership from ideation to production. Locally and remotely, you will focus on reinforcement learning...Remote work
- ...runs on real machines, not benchmarks. About the role As an AI Researcher at Droyd, you’ll own meaningful parts of the learning and... ...directions that improve speed, reliability, or capability Develop fine‑tuning and optimization methods tailored to robotics workloads...
- A leading AI evaluation company is looking for a Staff Machine Learning Research Scientist to advance LLM evaluation methodologies. This role involves designing benchmarks, collaborating with teams, and mentoring others. Ideal candidates have significant experience in NLP...
- ...offering a paid internship in San Francisco to assist in the production of the 'Forum' radio show. The internship runs from July 6, 20... ..., requiring 16 hours of work per week. Interns will engage in research, interviews, and production tasks while developing journalism...16 hoursInternshipWork at officeRemote work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Production RL & LLM Fine-Tuning Researcher. Be the first to apply!
- court researcher San Francisco, CA
- data collection researcher San Francisco, CA
- survey researcher San Francisco, CA
- security researcher San Francisco, CA
- qualitative researcher San Francisco, CA
- academic researcher San Francisco, CA
- criminal researcher San Francisco, CA
- researcher San Francisco, CA
- legal researcher San Francisco, CA
- machine learning researcher San Francisco, CA



