Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Model Training Engineer: RLHF & LLM Fine-Tuning

$175k - $350k

Inflection AI

A pioneering AI company is seeking a Model Training Engineer to design and scale post-training pipelines for large language models. The ideal candidate will have hands-on experience with training and fine-tuning large transformer models. Responsibilities include end-to-end workflow contributions, alignment techniques prototyping, and automating training at scale. Competitive salary range is $175,000 – $350,000 per year based on experience and location, along with equity and benefits including unlimited paid time off. #J-18808-Ljbffr Inflection AI

Vacancy posted 5 days ago
Similar jobs that could be interesting for youBased on the Model Training Engineer: RLHF & LLM Fine-Tuning in Palo Alto, CA vacancy
  • Bright Vision Technologies is seeking an LLM Engineer to design and operationalize workflows for large language models. The role involves deep experience with modern training stacks, dataset construction, and evaluation methodologies. Ideal candidates possess a Master’s... 
    Training
    Remote job
    Full time

    Bright Vision Technologies

    Newark, CA
    2 days ago
  • $175k - $350k

     ...with human-centered AI models that unite emotional intelligence...  ...the Role As a Model Training engineer, you will design, build,...  ...that turn a general LLM into a brand-fluent, production...  .... Your innovations in fine-tuning and preference optimization (RLHF, DPO, GRPO, RLAIF) will... 
    Training

    Inflection AI

    Palo Alto, CA
    9 hours ago
  • $181.1k - $318.4k

     ...Apple. These are multimodal models that power Siri on-device speech...  ...researchers and modeling engineers train models, iterate on data...  ...corpora, and stack supervised fine-tuning, LoRA adapter training, and...  ...multi-agent workflows using LLM-native tooling for on-device... 
    Training
    Relocation

    Apple

    Cupertino, CA
    3 days ago
  • $150k - $230k

     ...hands‑on Machine Learning Engineer to drive the post‑training of our large language models, with a strong emphasis...  ...(CPT), supervised fine‑tuning (SFT), and RL — along with...  ...primary focus (e.g., RLHF, PPO, GRPO, DPO, and related...  ...Requirements Hands‑on LLM post‑training... 
    Training
    Full time

    GoTo Meeting

    Mountain View, CA
    3 days ago
  • $272k - $431.25k

     ...is seeking a Principal Engineer to drive the performance of large‑scale AI training and post‑training workloads...  ...frontier‑scale LLM workloads running on thousands...  ...software, tools, models, benchmarks, and analysis...  ...reinforcement learning, fine‑tuning, or other post‑training... 
    Training

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $100k - $150k

     ...re looking for a skilled LLM Engineer to join our dynamic team...  ..., and operationalize fine-tuning workflows for large language models across supervised, preference...  ...experience with modern training stacks, careful dataset...  ...using supervised, DPO, RLHF, and related techniques.... 
    Training
    Full time
    H1b
    Local area
    Immediate start
    Remote work
    Visa sponsorship
    Work visa

    Bright Vision Technologies

    Newark, CA
    6 days ago
  • $254k - $349.25k

     ...requires deep expertise in model architecture, training, fine‑tuning, and distillation ,...  ...experiments for: Alignment (RLHF/RLAIF) in security‑sensitive...  ...architectures and modern LLM techniques Retrieval‑...  ..., CASB, etc.) Systems & Engineering Experience designing high... 
    Training
    Flexible hours

    Proofpoint

    Sunnyvale, CA
    4 days ago
  • About the role Model Alignment and Deployment is a critical...  ...spanning our Post-Training, Safety Engineering, Trust & Safety, ML...  ..., alignment workflows (RLHF, DPO, Constitutional AI...  ...alignment concepts (supervised fine-tuning, RLHF, AI safety frameworks, LLM evaluation) as well as... 
    Training

    Character.AI

    Redwood City, CA
    3 days ago
  • $170k - $260k

     ...for the Deep Learning Engineer role at GenBio AI Base...  ...Large Biological Models (LBM), we are pioneering...  ...biomedicine, with our LBM training leading to ground‑...  ...team and leadership in LLM and generative AI...  ...pipeline, pre‑training, fine‑tuning, serving) Build and... 
    Training
    Full time
    Work at office

    GenBio AI

    Palo Alto, CA
    9 hours ago
  •  ...knit group of researchers and engineers responsible for building large scale frontier foundation models at Apple. We believe the most...  ...role shaping the future of our LLM efforts, specifically in transforming...  .... You will tackle core training challenges in instruction following... 
    Training

    Apple Inc.

    Cupertino, CA
    4 days ago
  • $100k - $150k

     ...we’re looking for a skilled Model Serving Engineer to join our dynamic team and...  ...latency, throughput, and cost. Tune GPU utilization, memory...  ...and KV cache strategies for LLM serving workloads....  ...including recruitment, hiring, training, compensation, promotion, transfer... 
    Training
    Full time
    H1b
    Local area
    Immediate start
    Remote work
    Visa sponsorship
    Work visa

    Bright Vision Technologies

    Newark, CA
    7 days ago
  • $150k

     ...highly motivated, and focused on engineering excellence. This...  ...You will join the Grok Voice Model team to help build the world...  ...scenarios. We own the full training pipeline: massive data curation...  ...enhancements through supervised fine-tuning, reinforcement learning, and... 
    Training
    Temporary work

    Pantera Capital

    Palo Alto, CA
    5 days ago
  • Job You will own the training pipeline behind the models that power both Parallel’s search stack and Parallel’s agents. On the search side, that means...  ...from real product usage to high‑quality training data, fine‑tune and evaluate these models rigorously, and ship them... 
    Training
    Work at office
    Visa sponsorship

    Parallel Web Systems

    Palo Alto, CA
    1 day ago
  • GoTo Meeting is seeking a hands-on Machine Learning Engineer to drive the post-training of large language models focusing on reinforcement learning. You will own...  ...including continuous pre-training and supervised fine-tuning. The ideal candidate has significant experience... 
    Training

    GoTo Meeting

    Mountain View, CA
    3 days ago
  •  ...100x better job search engine: fast, comprehensive, honest...  ...powerful AI and ML models into fast, reliable...  ...and integrate researcher-trained model checkpoints into...  ...efficient training and fine‑tuning workflows where needed,...  ...optimization, or modern LLM/embedding/ranking model... 
    Training
    Relocation package

    HiringCafe

    Cupertino, CA
    1 day ago
  •  ...Systems in Palo Alto is seeking a professional to own the training pipeline behind models that power both their search stack and agents. The role...  ...connections from product usage to training data, fine-tuning models, and ensuring safe deployment. Ideal candidates have... 
    Training

    Parallel Web Systems

    Palo Alto, CA
    2 days ago
  • $174k - $252k

     ...leading applied ML projects, including LLM training, inference, and fine-tuning. Experience using AI coding agents...  ..., automated red-teaming, or model interpretability and probes. About...  ...AI. We are looking for a research engineer for the Frontier Safety Mitigation... 
    Training
    Full time

    Google Inc.

    Mountain View, CA
    1 day ago
  • $224k - $356.5k

     ...searching for a senior or principal engineer who specializes in building...  ...for large‑scale foundation model training in the Generalist Embodied...  ...model training and fine-tuning on massive datasets. Implement...  ...experience at building large-scale LLM and multimodal LLM training... 
    Training
    Full time

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  •  ...a skilled professional to enhance the performance of large-scale models through advanced optimization techniques in Santa Clara, California. Candidates should have a strong background in DL model training and deployment, ideally with a PhD or equivalent experience in Computer... 
    Training

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $155k - $207k

     ...industry-veteran scientists and engineers. As a Senior/Staff Machine...  ...-scale SID / ASR / NLP / LLM systems that power mission-critical...  ...and implementation of training, fine-tuning, post-training, and inference...  ...large language and speech models using PyTorch and/or JAX, making... 
    Training
    Permanent employment

    Cacheflow

    Mountain View, CA
    5 days ago
  • This software engineering role involves developing datacenter‑scale performance‑modeling and prediction tools for AI researchers running AI workloads in GPU clusters....  ...frameworks such as PyTorch and TensorFlow, distributed training and inference. Knowledge of GPU cluster job... 
    Training

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • $224k - $356.5k

     ...performance computing. As a Senior / Principal Deep Learning Engineer — Model Evaluation & AI Systems, you will play a meaningful role in crafting...  ...roadmap, and sharing best practices.* Work alongside model training, inference, and product divisions to provide trusted... 
    Training

    NVIDIA Corporation

    Santa Clara, CA
    5 days ago
  •  ...analyze, profile, and optimize AI training workloads on innovative...  ...Computer Science, Electrical Engineering or Computer Engineering and 5...  ...Proven experience analyzing and tuning application performance &...  ...and system‑level performance modelling. Programming skills in C++, Python... 
    Training
    Work experience placement

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • Senior AI Systems Performance Engineer Palo Alto, California,...  ...AI platform, from chip to model, optimized for enterprise...  ...can be easily and securely fine‑tuned using customer data for greater...  ...Hands‑on experience with LLM or multimodal model training and inference. Background... 
    Training
    Full time
    Temporary work
    Local area
    Flexible hours

    SambaNova

    Palo Alto, CA
    2 days ago
  • $198k - $286k

     ...across kernels, the inference engine, and distributed systems so that...  ...of Modular Cloud, delivering LLM performance on the Pareto...  ...highly customized LLM inference tuned to specific customer use cases...  ...are not limited to education, training, work experience, business needs... 
    Training
    Remote job
    Work experience placement
    Work at office
    Local area
    Flexible hours

    Modular Mailing Systems, Inc.

    Los Altos, CA
    3 days ago
  •  ...NVIDIA, we are seeking exceptional engineers to join our autonomous driving team...  ...together! What You’ll Be Doing Design and train innovative large‑scale models—including generative, imitation,...  ...systems. Build, pre‑train, and fine‑tune LLM/VLM/VLA systems for deployment in real... 
    Training

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  •  ...want to hear from you. Runtime Engineer Palo Alto, California, United...  ...AI platform, from chip to model, optimized for enterprise and...  ...that can be easily and securely fine‑tuned using customer data for...  ...flow applications such as ML training and inference and HPC applications... 
    Training
    Full time
    Temporary work
    Local area
    Flexible hours

    SambaNova

    Palo Alto, CA
    5 days ago
  • Software Engineer, Computer Vision and Deep Learning...  ...collection techniques for training Deep Neural Nets,...  ...the-art deep learning models for perception of urban...  ...strategies for their effective fine‑tuning and adaptation to...  ...detection, multimodal LLM/LVM extraction,... 
    Training
    Shift work

    AI Chopping Block, Inc.

    Palo Alto, CA
    3 days ago
  •  ...of Technical Staff — Diffusion Model About the Role RadixArk is seeking...  ...research thinking with strong engineering execution — from designing novel algorithms to training and deploying models at scale....  ...GitHub stars, the fastest open LLM serving engine), and developed Miles... 
    Training
    Flexible hours

    RadixArk

    Palo Alto, CA
    5 days ago
  • $150k

     ...Institute of Foundation Models We are a dedicated...  ...-edge foundation model training, alongside world-class...  ..., data scientists, and engineers, tackling the most fundamental...  ...-training, instruction tuning, reasoning, and agents....  ...pre-training, fine-tuning, evaluation, or... 
    Training

    Institute of Foundation Models

    Sunnyvale, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Model Training Engineer: RLHF & LLM Fine-Tuning. Be the first to apply!