Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Production RL & LLM Fine-Tuning Researcher

Baseten

A leading AI platform company in San Francisco is seeking a talented individual to work on training models that leverage reinforcement learning. This role involves designing post-training pipelines, building training environments, and interacting with clients to improve model performance. Ideal candidates will have hands-on experience with LLM fine-tuning, an understanding of production ML systems, and strong problem-solving skills. The company promotes a diverse and inclusive workplace, offering competitive compensation and generous benefits. #J-18808-Ljbffr Baseten

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Production RL & LLM Fine-Tuning Researcher in San Francisco, CA vacancy
  •  ...AI Researcher – Video World Generation San Francisco (Bay Area) Help...  ...video creation RL techniques for more adaptive...  ...generation Required: Strong production experience with large multimodal...  ...): Experience training or fine-tuning diffusion models Background... 
    Suggested

    DeepRec.ai

    San Francisco, CA
    4 days ago
  • $150k

    Haize Labs gets LLM apps out of POCs and into production. We eliminate the risk and improve the reliability of...  ...-testing them. We are looking for Research Engineers to help develop our reliability...  ...evaluation. A subset of this is fine. Annual Salary $150,000 - $600,000... 
    Suggested
    Visa sponsorship

    Enboarder

    San Francisco, CA
    1 day ago
  •  ...Gamma and Writer. By uniting applied AI research, flexible infrastructure, and...  ...AI to bring cutting‑edge models into production. We're growing quickly and recently raised...  ...are looking for people with hands‑on LLM fine‑tuning and RL experience. Researchers who are excited... 
    Suggested
    Flexible hours
    Shift work

    Baseten

    San Francisco, CA
    3 days ago
  •  ...Engineering Security Researcher Contract €50k – €100k Remote / In-person San Francisco...  ...bugs in real systems. You understand LLM‑specific attack surfaces — prompt...  ...that companies trust to run AI agents in production. The stakes are real. Direct access to... 
    Suggested
    Contract work
    Remote work
    Flexible hours

    OpenCompany

    San Francisco, CA
    4 days ago
  • $180k - $260k

     ...Machine Learning Researcher, Multimodal LLMs Location: San Francisco...  ...next-generation multimodal LLM stack, combining speech, text...  ...them all the way from idea to production. At Bland, we're not just...  ...understanding of prompting, fine-tuning, and alignment techniques... 
    Suggested
    Work at office
    Remote work

    Bland AI

    San Francisco, CA
    1 day ago
  • $200k - $280k

     ...engines) and post-training / RL systems. We build and...  ...that can run at production scale. Our mandate is...  ...creating many knobs to tune across the RL algorithm...  ...using these to train or fine-tune real models. Model...  .... Have a solid research foundation in your area... 
    Full time

    Together AI

    San Francisco, CA
    2 days ago
  • $250k - $325k

     ...agentic RAG [2023] Large-scale LLM-based legal fact extraction [...  ...: Why, What, and Who Why AI Researchers are the engine of innovation...  ...t nice-to-haves, they're the product. Your research will translate...  .... Explore and apply advanced fine-tuning, PEFT, and distillation... 
    Contract work
    Work at office
    Immediate start
    Remote work

    Ivo Inc.

    San Francisco, CA
    3 days ago
  • $180k - $260k

    BLAND is seeking a Machine Learning Researcher focused on multimodal LLM technology. The role involves developing conversational AI models that integrate...  ...a strong background in machine learning, LLMs, and product intuition to enhance user interactions. Compensation ranges... 

    BLAND

    San Francisco, CA
    5 days ago
  • $195k - $222.5k

    Overview Applied Researcher I Overview: At Capital One, we...  ...with breakthrough product experiences and scalable...  ...Engineering or related fields LLM PhD focus on NLP or...  ..., Instruction-Tuning, Dialogue-Finetuning, Parameter...  ...Experience deploying a fine-tuned large language... 
    Full time
    Part time
    Local area

    Victrays

    San Francisco, CA
    4 days ago
  • $302.4k - $378k

     ...team, part of Scale's Research organization, brings together...  ...agent environments and RL reward signals,...  ...not only expertise in LLM agents and planning algorithms...  ...a research setting or product development. Strong...  ...with open source LLM fine-tuning or involvement in bespoke... 
    Full time

    Scale AI

    San Francisco, CA
    6 days ago
  • $180k - $280k

     ...Machine Learning Researcher Kiddom is a groundbreaking educational platform that promotes...  ...Retrieval-Augmented Generation (RAG), fine-tuning, and evaluation of large language model...  ...diverse team of engineers, designers, product managers, and educators who are driven... 
    Permanent employment
    Full time
    Local area
    Flexible hours

    Kiddom

    San Francisco, CA
    1 day ago
  • A leading AI research accelerator in San Francisco is looking for candidates proficient in English and analytical skills to assist in training large language models. This role demands independence, creativity, and the ability to work flexibly in a remote environment. Ideal... 
    Remote work

    The10minutecareersolution

    San Francisco, CA
    4 days ago
  • $150k - $250k

     ...goods, and global social organizations. We research and deploy technologies that power AI-...  ...reliable execution of AI systems, and products that transform mission-critical workflows...  ...systems using models rather than training or fine-tuning them. Ideal candidates have expertise in... 
    Work at office
    3 days per week

    Distyl AI

    San Francisco, CA
    1 day ago
  •  ...models and verifiers: Developing fine‑grained supervision over...  ...Contributing to alignment and oversight research - figuring out how to reliably...  ...across the research-to-production lifecycle. Evaluation: Contributing...  ...language models, but strong RL backgrounds from other domains... 
    Full time
    Internship

    Xterraai

    San Francisco, CA
    2 days ago
  • $300k

     ...develop new reinforcement learning algorithms for post-training language models in San Francisco. The successful candidate will own a research agenda, establish baselines for evaluating efficiency, and collaborate with peers to innovate algorithmic solutions. Candidates... 

    Vmax

    San Francisco, CA
    5 days ago
  •  ...engineering paired with a design-minded product engineering team to build and ship...  ...experts in AI. The Role As a Senior Applied Researcher in Audio Understanding, you will be...  ...systems. Build large scale pre-training and fine-tuning datasets for audio understanding... 
    Work at office
    Relocation package

    Cartesia

    San Francisco, CA
    2 days ago
  • $160k - $250k

    Machine Learning Researcher, Audio Location: San Francisco, CA or Remote (US) About Bland...  ...from theory to large‑scale training to production inference systems serving millions of...  ...Advance Speech‑to‑Text Modeling Build and fine‑tune large scale ASR systems robust to... 
    Work at office
    Remote work

    Bland

    San Francisco, CA
    5 days ago
  • $150k - $250k

     ...goods, and global social organizations. We research and deploy technologies that power AI-...  ...reliable execution of AI systems, and products that transform mission-critical workflows...  ...using models rather than training or fine-tuning them. Ideal candidates have expertise in... 
    Work at office
    3 days per week

    Distyl AI

    San Francisco, CA
    5 days ago
  •  ...Building We’ve developed an in-house LLM storytelling system that blends AI...  ...Project Astra, and top-tier AI researchers. As an early member of this...  ...balance research exploration with product-focused development Or, putting it in RL jargon, exploration and exploitation... 
    Work at office
    Visa sponsorship

    Spellbrush

    San Francisco, CA
    28 days ago
  • $162.7k - $263.18k

    Job Summary As a Security Researcher on the WildFire Team, you will play a crucial role in...  ...deep threat knowledge into scalable, productized detection capabilities to combat cyber...  ...Qualifications Experience building or fine‑tuning AI agents to autonomously triage alerts... 
    Visa sponsorship
    Work visa
    Shift work

    Palo Alto Networks

    San Francisco, CA
    2 days ago
  • Ivo is looking for an AI Researcher to push the boundaries of legal technology using LLMs and advanced AI techniques. You'll be a core...  ...innovation team, translating research into industry-changing product features. Your creative solutions will revolve around complex... 

    Ivo

    San Francisco, CA
    2 days ago
  • Icehouseventures seeks an AI researcher in San Francisco to drive innovation in legal technology. This role requires owning research roadmaps, designing experiments, and collaborating with engineering to deliver impactful AI solutions for legal professionals. The ideal... 

    Icehouseventures

    San Francisco, CA
    2 days ago
  • Patronus AI is seeking an Applied Researcher to drive foundational research on agentic AI systems in San Francisco. This position emphasizes innovation in reinforcement learning and scalable oversight, requiring a strong educational background in related fields and experience... 

    Patronus AI

    San Francisco, CA
    4 days ago
  • $295k

     ...and deploying advanced AI systems. As a Researcher for loss of control mitigations, you...  ...controllable model behavior across OpenAI’s products and internal deployments. This role...  ...familiar with methods for training and fine‑tuning large language models, including distillation... 
    Full time
    Work at office
    Local area
    Relocation package
    Flexible hours

    Slope

    San Francisco, CA
    2 days ago
  •  .... We are seeking a Principal AI Researcher to join Veracode's AI & Innovation Research...  ...projects for improving Veracode's product portfolio or creating new products through...  ...on experience with Context Engineering, Fine-Tuning (e.g. LoRA), Retrieval-Augmented-... 
    Worldwide

    Veracode

    San Francisco, CA
    6 days ago
  • $264.8k - $331k

     ...(LLMs). We are building industry-leading LLM evals, setting new standards for model performance...  ...next generation of AI capabilities. Our Research teams work with the industry's leading AI...  ...world's most important decisions. Our products provide the high-quality data and full-... 
    Full time

    DiversityJobs Inc

    San Francisco, CA
    13 days ago
  • Xterraai, based in San Francisco, is seeking research scientists to develop innovative AI systems that reason about complex scientific...  ...engineering, allowing you to take ownership from ideation to production. Locally and remotely, you will focus on reinforcement learning... 
    Remote work

    Xterraai

    San Francisco, CA
    2 days ago
  •  ...runs on real machines, not benchmarks. About the role As an AI Researcher at Droyd, you’ll own meaningful parts of the learning and...  ...directions that improve speed, reliability, or capability Develop fine‑tuning and optimization methods tailored to robotics workloads... 

    Droyd

    San Francisco, CA
    1 day ago
  • A leading AI evaluation company is looking for a Staff Machine Learning Research Scientist to advance LLM evaluation methodologies. This role involves designing benchmarks, collaborating with teams, and mentoring others. Ideal candidates have significant experience in NLP... 

    Scale AI, Inc.

    San Francisco, CA
    3 days ago
  •  ...offering a paid internship in San Francisco to assist in the production of the 'Forum' radio show. The internship runs from July 6, 20...  ..., requiring 16 hours of work per week. Interns will engage in research, interviews, and production tasks while developing journalism... 
    16 hours
    Internship
    Work at office
    Remote work

    KQED

    San Francisco, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Production RL & LLM Fine-Tuning Researcher. Be the first to apply!