Research Scientist — Truthful LLM Fine-Tuning
$280k - $425k
Dormont Manufacturing Co
Dormont Manufacturing Co is seeking a Research Scientist/Engineer to join the Finetuning Alignment team in San Francisco. The candidate will lead efforts to minimize AI hallucinations and enhance truthfulness in language models, collaborating with researchers and engineers. The role requires strong Python programming skills and an MS/PhD in a related field, and it offers a competitive salary between $280,000 and $425,000. It involves designing robust systems to maintain high standards of accuracy and honesty in AI.
- ...development of general-purpose robots. In this role, you will design fine-tuning and adaptation strategies for various robotic tasks, while... ...to apply your skills in a dynamic environment that bridges research with real-world applications. Join us on our mission to revolutionize...
Suggested
- ...This role is for an experienced scientist who thrives both in... ...and deep content extraction. Research, evaluate, and integrate the... ...validate use cases, including LLM orchestration, context engineering... .... Experience optimizing and fine-tuning large models, knowledge of quantization...
Suggested
$300k - $320k
...quickly growing group of committed researchers, engineers, policy experts,... ...an exceptional Research Scientist to join our Life Sciences team... ...Hands‑on experience training or fine‑tuning ML models (LLMs, protein... ...experience Experience with LLM post‑training: RLHF, RL from...
Suggested, Visa sponsorship
$350k
Anthropic in San Francisco is seeking a Research Scientist/Engineer dedicated to enhancing truthfulness in language models. You will spearhead efforts to minimize inaccuracies and promote robust, honest AI systems, ensuring high standards are maintained. This role involves...
Suggested
- ...We are looking for an Applied Scientist to own the models that decide... ...a day. Ad selection inside LLM chats is a genuinely new problem... ...for the publisher. This is a research role with a production... ...preference modeling, or LLM fine-tuning. Background in auction theory...
Suggested, Relocation
$280k
...quickly growing group of committed researchers, engineers, policy experts,... ...describe yourself as both a scientist and an engineer. As a... ...including Interpretability, Fine-Tuning, and the Frontier Red Team. Our... ...evaluate the effectiveness of novel LLM-generated jailbreaks. Write...
Contract work, For contractors, For subcontractor, Work at office, Relocation, Visa sponsorship, Work visa, Flexible hours
$300k
...develop new reinforcement learning algorithms for post-training language models in San Francisco. The successful candidate will own a research agenda, establish baselines for evaluating efficiency, and collaborate with peers to innovate algorithmic solutions. Candidates...
- Research Scientist / Machine Learning Scientist Location: SF Bay Area / Hybrid / Remote Type: Full-Time About the Role: The Client is seeking... ...scale models, including reward models, preference models, and fine-tuning LLMs with methods like RLHF, DPO, and contrastive learning....
Full time, Remote work
- Open role Founding Applied Scientist San Francisco (On-site) About... ...solved before. This is a hybrid Research and Engineering role. You... .... You will define the ground truth, design the benchmarks that measure... ...Responsibilities Train and fine-tune deep learning models (PyTorch...
Work at office, Relocation package
- ...with AI. About the role AI research at WRITER isn't just about publishing... .... As a staff AI research scientist, you'll be at the center of... ...experiments using supervised fine-tuning, reinforcement learning from... ...model improvement — including LLM-as-judge frameworks,...
Full time, Work at office, Local area, Flexible hours
$400k
...and early commercial traction across several high‑profile industry verticals. Role As a Senior Research Scientist, your focus is post‑training — curating data, fine‑tuning pre‑trained speech models, and building the evaluation infrastructure that validates it all. You...
Relocation package, Shift work
$200k - $250k
...Center for AI Safety (CAIS) is a leading research and advocacy organization focused on... ...Safety Action Fund. As a Senior Research Scientist here, you will lead and execute high‑impact... ...for quality and outcomes. Train and fine‑tune large transformer models across domains....
Work at office, Local area
$181.1k - $318.4k
Apple Inc. is seeking a Senior Applied Researcher in San Francisco to drive innovation in Generative AI and advanced NLP systems. This role involves architecting LLM-powered systems and leading research in large-scale representation learning, semantic modeling, and more...
$181.1k - $318.4k
...nexus of these systems—where deep applied research, advanced machine learning, and large... ...demonstrated expertise in Generative AI, LLM architectures, and advanced NLP systems.... ...dimensional, unstructured data. Drive LLM fine‑tuning, evaluation, safety alignment, and...
Relocation
$250k - $325k
...favor of agentic RAG [2023] Large-scale LLM‑based legal fact extraction [2024] A... ...] The Role: Why, What, and Who Why: AI Researchers are the engine of innovation at Ivo. You... ...production. Explore and apply advanced fine‑tuning, PEFT, and distillation techniques to make...
Contract work, Immediate start
$50 per hour
...experience required Role Overview: Help fine-tune large language models (like ChatGPT) using... ...how well AI solves them, and work with researchers to build better benchmarks.... ...on cutting-edge AI projects with leading LLM companies. Pay rate: $50+/hour (depends...
Contract work, Remote work, Flexible hours
- ...Process reward models and verifiers: Developing fine‑grained supervision over intermediate... ...: Contributing to alignment and oversight research - figuring out how to reliably supervise models on geological tasks where ground truth is expensive, delayed, or ambiguous....
Full time, Internship
$54 - $60 per hour
...systems using their own data, with technologies ranging from fine-tuning LLMs for enterprise domains, to a platform for building compound... ...problems lie in enterprise domains, behind closed doors. Our research team's goal is to push the frontier of "domain adaptation" - how...
Hourly pay, Internship
- ...provider in San Francisco is looking for a highly skilled Computer Scientist to enhance the performance of Large Language Models (LLMs),... ...expertise in advanced prompt engineering and knowledge of various LLM frameworks, making it ideal for innovative professionals eager to...
- ...large pretrained robot models into production-ready systems via fine-tuning, reinforcement learning, steering, human feedback, task... ...ML or controls, and all the places in between. This is where research meets reality. You’ll be responsible for: Designing fine-tuning...
- Overview We are Genmo, a research lab dedicated to building open, state-of-the-art models... ...We are seeking an exceptional Research Scientist to join our team, focusing on alignment... ...Design and implement supervised fine-tuning and reinforcement learning from human feedback...
Relocation
- ...the real world. Created by researchers from UC Berkeley’s SkyLab, our... ..., and Discord. We seek truth, move fast, and value craftsmanship... ...variety of Machine Learning Scientist to help advance how we... ...models, preference models, and fine-tuning LLMs with methods like RLHF,...
Permanent employment, Work at office
- ...Perplexity Perplexity is seeking top‑tier AI Research Scientists and Engineers to advance our AI... ...Products Team (Vertical) : Concentrate on fine‑tuning and optimizing models for our Deep... ...products Stay current with the latest LLM research, especially in model training,...
- ...are seeking a high-caliber AI Research Engineer / Scientist to join our specialized... ...combined with hands-on experience tuning Large Language Models (LLMs)... ...and maintain the supervised fine-tuning (SFT) and post-... ...Self-Supervised Learning, and LLM fine-tuning techniques). Systems...
$50 per hour
...experience required Role Overview: Help fine-tune large language models (like ChatGPT) using... ...how well AI solves them, and work with researchers to build better benchmarks.... ...on cutting-edge AI projects with leading LLM companies. Pay rate: $50+/hour (depends...
Remote job, Contract work, Flexible hours
$400k
Trades Workforce Solutions, based in San Francisco, is seeking a Senior Research Scientist to work on cutting-edge voice AI challenges. The role involves curating data, fine-tuning speech models, and building evaluation infrastructures. Ideal candidates will have a PhD...
Relocation package
$147.6k - $274k
...discovery and development. Roche’s Research and Early Development... ...Intelligence (AI) to assist our scientists in both pRED and gRED to... ...You will develop autonomous, LLM‑driven agentic workflows that... ...small‑molecule drug design. Fine‑tune foundation models for drug discovery...
Local area, Worldwide, Relocation package
- ...deployment. We are founded by leading scientists in robot reinforcement learning (ex-Nvidia... .... Fifty of the world's best robotics researchers are already building the future at... ...robot learning Vision Language Model Fine-Tuning (SFT and RL-based) Transformer-based 3...
$114.2k - $306.6k
...are not duplicating efforts. Salesforce Research advances state-of-the-art AI techniques,... ...Research is looking for outstanding AI Research Scientists / Research Engineers. Our team discovers... ...areas: Agentic AI & Reasoning: LLM-powered agents, reinforcement learning,...
Full time
- A leading AI CRM company is seeking an AI Research Scientist/Research Engineer in San Francisco. You will focus on advancing AI techniques, developing models, and solving real-world enterprise problems. Ideal candidates hold a Ph.D. or have strong AI product experience...


