Model Training Engineer: RLHF & LLM Fine-Tuning

$175k - $350k

Inflection AI

A pioneering AI company is seeking a Model Training Engineer to design and scale post-training pipelines for large language models. The ideal candidate will have hands-on experience with training and fine-tuning large transformer models. Responsibilities include end-to-end workflow contributions, alignment techniques prototyping, and automating training at scale. Competitive salary range is $175,000 – $350,000 per year based on experience and location, along with equity and benefits including unlimited paid time off. #J-18808-Ljbffr Inflection AI

Apply

Vacancy posted 5 days ago

Similar jobs that could be interesting for youBased on the Model Training Engineer: RLHF & LLM Fine-Tuning in Palo Alto, CA vacancy

Remote Senior LLM Fine-Tuning Engineer
Bright Vision Technologies is seeking an LLM Engineer to design and operationalize workflows for large language models. The role involves deep experience with modern training stacks, dataset construction, and evaluation methodologies. Ideal candidates possess a Master’s...
Training
Remote job
Full time
Bright Vision Technologies
Newark, CA
2 days ago
Member of Technical Staff - Model Training
$175k - $350k
...with human-centered AI models that unite emotional intelligence... ...the Role As a Model Training engineer, you will design, build,... ...that turn a general LLM into a brand-fluent, production... .... Your innovations in fine-tuning and preference optimization (RLHF, DPO, GRPO, RLAIF) will...
Training
Inflection AI
Palo Alto, CA
9 hours ago
Sr. ML Production Model Automation Engineer, Siri Speech
$181.1k - $318.4k
...Apple. These are multimodal models that power Siri on-device speech... ...researchers and modeling engineers train models, iterate on data... ...corpora, and stack supervised fine-tuning, LoRA adapter training, and... ...multi-agent workflows using LLM-native tooling for on-device...
Training
Relocation
Apple
Cupertino, CA
3 days ago
Machine Learning Engineer, LLM Post-Training
$150k - $230k
...hands‑on Machine Learning Engineer to drive the post‑training of our large language models, with a strong emphasis... ...(CPT), supervised fine‑tuning (SFT), and RL — along with... ...primary focus (e.g., RLHF, PPO, GRPO, DPO, and related... ...Requirements Hands‑on LLM post‑training...
Training
Full time
GoTo Meeting
Mountain View, CA
3 days ago
Principal High-Performance LLM Training Engineer
$272k - $431.25k
...is seeking a Principal Engineer to drive the performance of large‑scale AI training and post‑training workloads... ...frontier‑scale LLM workloads running on thousands... ...software, tools, models, benchmarks, and analysis... ...reinforcement learning, fine‑tuning, or other post‑training...
Training
NVIDIA Gruppe
Santa Clara, CA
4 days ago
LLM Engineer
$100k - $150k
...re looking for a skilled LLM Engineer to join our dynamic team... ..., and operationalize fine-tuning workflows for large language models across supervised, preference... ...experience with modern training stacks, careful dataset... ...using supervised, DPO, RLHF, and related techniques....
Training
Full time
H1b
Local area
Immediate start
Remote work
Visa sponsorship
Work visa
Bright Vision Technologies
Newark, CA
6 days ago
Principal ML Architect - Security AI & Advanced Model Systems
$254k - $349.25k
...requires deep expertise in model architecture, training, fine‑tuning, and distillation ,... ...experiments for: Alignment (RLHF/RLAIF) in security‑sensitive... ...architectures and modern LLM techniques Retrieval‑... ..., CASB, etc.) Systems & Engineering Experience designing high...
Training
Flexible hours
Proofpoint
Sunnyvale, CA
4 days ago
Technical Program Manager, Model Alignment and Deployment
About the role Model Alignment and Deployment is a critical... ...spanning our Post-Training, Safety Engineering, Trust & Safety, ML... ..., alignment workflows (RLHF, DPO, Constitutional AI... ...alignment concepts (supervised fine-tuning, RLHF, AI safety frameworks, LLM evaluation) as well as...
Training
Character.AI
Redwood City, CA
3 days ago
Deep Learning Engineer
$170k - $260k
...for the Deep Learning Engineer role at GenBio AI Base... ...Large Biological Models (LBM), we are pioneering... ...biomedicine, with our LBM training leading to ground‑... ...team and leadership in LLM and generative AI... ...pipeline, pre‑training, fine‑tuning, serving) Build and...
Training
Full time
Work at office
GenBio AI
Palo Alto, CA
9 hours ago
AIML Researcher/Engineer - Foundation Model Post-Training
...knit group of researchers and engineers responsible for building large scale frontier foundation models at Apple. We believe the most... ...role shaping the future of our LLM efforts, specifically in transforming... .... You will tackle core training challenges in instruction following...
Training
Apple Inc.
Cupertino, CA
4 days ago
Model Serving Engineer
$100k - $150k
...we’re looking for a skilled Model Serving Engineer to join our dynamic team and... ...latency, throughput, and cost. Tune GPU utilization, memory... ...and KV cache strategies for LLM serving workloads.... ...including recruitment, hiring, training, compensation, promotion, transfer...
Training
Full time
H1b
Local area
Immediate start
Remote work
Visa sponsorship
Work visa
Bright Vision Technologies
Newark, CA
7 days ago
Member of Technical Staff - Voice Model
$150k
...highly motivated, and focused on engineering excellence. This... ...You will join the Grok Voice Model team to help build the world... ...scenarios. We own the full training pipeline: massive data curation... ...enhancements through supervised fine-tuning, reinforcement learning, and...
Training
Temporary work
Pantera Capital
Palo Alto, CA
5 days ago
Member of Technical Staff, Model Training
Job You will own the training pipeline behind the models that power both Parallel’s search stack and Parallel’s agents. On the search side, that means... ...from real product usage to high‑quality training data, fine‑tune and evaluate these models rigorously, and ship them...
Training
Work at office
Visa sponsorship
Parallel Web Systems
Palo Alto, CA
1 day ago
LLM Post-Training Engineer - RL, SFT & Data Pipelines
GoTo Meeting is seeking a hands-on Machine Learning Engineer to drive the post-training of large language models focusing on reinforcement learning. You will own... ...including continuous pre-training and supervised fine-tuning. The ideal candidate has significant experience...
Training
GoTo Meeting
Mountain View, CA
3 days ago
ML Engineer - Inference & Model Deployment
...100x better job search engine: fast, comprehensive, honest... ...powerful AI and ML models into fast, reliable... ...and integrate researcher-trained model checkpoints into... ...efficient training and fine‑tuning workflows where needed,... ...optimization, or modern LLM/embedding/ranking model...
Training
Relocation package
HiringCafe
Cupertino, CA
1 day ago
Staff ML Engineer: End-to-End Model Training & Deployment
...Systems in Palo Alto is seeking a professional to own the training pipeline behind models that power both their search stack and agents. The role... ...connections from product usage to training data, fine-tuning models, and ensuring safe deployment. Ideal candidates have...
Training
Parallel Web Systems
Palo Alto, CA
2 days ago
Research Engineer, Frontier Safety Mitigations, DeepMind
$174k - $252k
...leading applied ML projects, including LLM training, inference, and fine-tuning. Experience using AI coding agents... ..., automated red-teaming, or model interpretability and probes. About... ...AI. We are looking for a research engineer for the Frontier Safety Mitigation...
Training
Full time
Google Inc.
Mountain View, CA
1 day ago
Senior Research Engineer, Foundation Model Training Infrastructure
$224k - $356.5k
...searching for a senior or principal engineer who specializes in building... ...for large‑scale foundation model training in the Generalist Embodied... ...model training and fine-tuning on massive datasets. Implement... ...experience at building large-scale LLM and multimodal LLM training...
Training
Full time
NVIDIA Gruppe
Santa Clara, CA
4 days ago
Senior DL Engineer: Edge Model Optimization & Inference
...a skilled professional to enhance the performance of large-scale models through advanced optimization techniques in Santa Clara, California. Candidates should have a strong background in DL model training and deployment, ideally with a PhD or equivalent experience in Computer...
Training
NVIDIA Gruppe
Santa Clara, CA
4 days ago
Senior/Staff ML Engineer - Speech & NLP Systems
$155k - $207k
...industry-veteran scientists and engineers. As a Senior/Staff Machine... ...-scale SID / ASR / NLP / LLM systems that power mission-critical... ...and implementation of training, fine-tuning, post-training, and inference... ...large language and speech models using PyTorch and/or JAX, making...
Training
Permanent employment
Cacheflow
Mountain View, CA
5 days ago
Senior Datacenter Performance Model Engineer
This software engineering role involves developing datacenter‑scale performance‑modeling and prediction tools for AI researchers running AI workloads in GPU clusters.... ...frameworks such as PyTorch and TensorFlow, distributed training and inference. Knowledge of GPU cluster job...
Training
NVIDIA Corporation
Santa Clara, CA
1 day ago
Senior Deep Learning Engineer - Model Evaluation & AI Systems
$224k - $356.5k
...performance computing. As a Senior / Principal Deep Learning Engineer — Model Evaluation & AI Systems, you will play a meaningful role in crafting... ...roadmap, and sharing best practices.* Work alongside model training, inference, and product divisions to provide trusted...
Training
NVIDIA Corporation
Santa Clara, CA
5 days ago
Senior High-Performance LLM Training Engineer
...analyze, profile, and optimize AI training workloads on innovative... ...Computer Science, Electrical Engineering or Computer Engineering and 5... ...Proven experience analyzing and tuning application performance &... ...and system‑level performance modelling. Programming skills in C++, Python...
Training
Work experience placement
NVIDIA Gruppe
Santa Clara, CA
4 days ago
Senior AI Systems Performance Engineer San Jose, California, United States
Senior AI Systems Performance Engineer Palo Alto, California,... ...AI platform, from chip to model, optimized for enterprise... ...can be easily and securely fine‑tuned using customer data for greater... ...Hands‑on experience with LLM or multimodal model training and inference. Background...
Training
Full time
Temporary work
Local area
Flexible hours
SambaNova
Palo Alto, CA
2 days ago
Inference Optimization Engineer United States - Remote · Remote
$198k - $286k
...across kernels, the inference engine, and distributed systems so that... ...of Modular Cloud, delivering LLM performance on the Pareto... ...highly customized LLM inference tuned to specific customer use cases... ...are not limited to education, training, work experience, business needs...
Training
Remote job
Work experience placement
Work at office
Local area
Flexible hours
Modular Mailing Systems, Inc.
Los Altos, CA
3 days ago
Deep Learning Senior Engineer, End-To-End Autonomous Driving
...NVIDIA, we are seeking exceptional engineers to join our autonomous driving team... ...together! What You’ll Be Doing Design and train innovative large‑scale models—including generative, imitation,... ...systems. Build, pre‑train, and fine‑tune LLM/VLM/VLA systems for deployment in real...
Training
NVIDIA Gruppe
Santa Clara, CA
4 days ago
Runtime Engineer San Jose, California, United States
...want to hear from you. Runtime Engineer Palo Alto, California, United... ...AI platform, from chip to model, optimized for enterprise and... ...that can be easily and securely fine‑tuned using customer data for... ...flow applications such as ML training and inference and HPC applications...
Training
Full time
Temporary work
Local area
Flexible hours
SambaNova
Palo Alto, CA
5 days ago
AI Computer Vision Engineer Jobs
Software Engineer, Computer Vision and Deep Learning... ...collection techniques for training Deep Neural Nets,... ...the-art deep learning models for perception of urban... ...strategies for their effective fine‑tuning and adaptation to... ...detection, multimodal LLM/LVM extraction,...
Training
Shift work
AI Chopping Block, Inc.
Palo Alto, CA
3 days ago
Member of Technical Staff — Diffusion Model
...of Technical Staff — Diffusion Model About the Role RadixArk is seeking... ...research thinking with strong engineering execution — from designing novel algorithms to training and deploying models at scale.... ...GitHub stars, the fastest open LLM serving engine), and developed Miles...
Training
Flexible hours
RadixArk
Palo Alto, CA
5 days ago
Research Scientist - Vision Language Model
$150k
...Institute of Foundation Models We are a dedicated... ...-edge foundation model training, alongside world-class... ..., data scientists, and engineers, tackling the most fundamental... ...-training, instruction tuning, reasoning, and agents.... ...pre-training, fine-tuning, evaluation, or...
Training
Institute of Foundation Models
Sunnyvale, CA
1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Model Training Engineer: RLHF & LLM Fine-Tuning. Be the first to apply!