Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Model Training Engineer: RLHF & LLM Fine-Tuning

$175k - $350k

Inflection AI

A pioneering AI company is seeking a Model Training Engineer to design and scale post-training pipelines for large language models. The ideal candidate will have hands-on experience with training and fine-tuning large transformer models. Responsibilities include end-to-end workflow contributions, alignment techniques prototyping, and automating training at scale. Competitive salary range is $175,000 – $350,000 per year based on experience and location, along with equity and benefits including unlimited paid time off. #J-18808-Ljbffr Inflection AI

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Model Training Engineer: RLHF & LLM Fine-Tuning in Palo Alto, CA vacancy
  • $175k - $350k

     ...Model Training Engineer At Inflection AI, our public benefit mission is to harness the...  ...pipelines that turn a general LLM into a brand-fluent, production-...  ...assistant. Your innovations in fine-tuning and preference optimization (RLHF, DPO, GRPO, RLAIF) will directly... 
    Training
    Full time

    Humanx

    Palo Alto, CA
    2 days ago
  • $175k - $350k

     ...Inflection AI's foundation model, proving that AI can be...  ...Role As a Model Training engineer, you will design, build,...  ...pipelines that turn a general LLM into a brand-fluent,...  .... Your innovations in fine-tuning and preference optimization (RLHF, DPO, GRPO, RLAIF) will... 
    Training

    Inflection AI

    Palo Alto, CA
    2 days ago
  • $148.91k - $252k

     ...Machine Learning Engineer - LLM, AI & Robotics Santa Clara, CA...  ...Job Responsibilities Fine-tune the pre-trained LLM on particular use cases...  ...small LLMs, weight sharing, model quantization, etc.) that can...  ...technologies, such as SFT, RLHF, prompt engineering, RAG, etc... 
    Training
    Full time

    XPENG

    Santa Clara, CA
    2 days ago
  • $155k - $235k

     ...Senior Model Based Systems Engineer Arlington, VA, Mountain View, CA, San Diego, CA We're a combat...  ...Team, No Tourists. Whether you're tuning a control law at 2 a.m. or heat-treating...  ...) work experience, education and/or training, critical skills, and/or business considerations... 
    Training
    Full time
    Work experience placement
    Local area
    Relocation package

    Atropos Inc

    Mountain View, CA
    2 days ago
  • $272k - $431.25k

     ...is seeking a Principal Engineer to drive the performance of large‑scale AI training and post‑training workloads...  ...frontier‑scale LLM workloads running on thousands...  ...software, tools, models, benchmarks, and analysis...  ...reinforcement learning, fine‑tuning, or other post‑training... 
    Training

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $254k - $349.25k

     ...requires deep expertise in model architecture, training, fine-tuning, and distillation ,...  ...experiments for: Alignment (RLHF/RLAIF) in security-...  ...architectures and modern LLM techniques Retrieval-augmented...  ...CASB, etc.) Systems & Engineering Experience designing... 
    Training
    Flexible hours

    Proofpoint

    Sunnyvale, CA
    1 day ago
  • $254k - $349.25k

     ...requires deep expertise in model architecture, training, fine‑tuning, and distillation ,...  ...experiments for: Alignment (RLHF/RLAIF) in security‑sensitive...  ...architectures and modern LLM techniques Retrieval‑...  ..., CASB, etc.) Systems & Engineering Experience designing high... 
    Training
    Flexible hours

    Proofpoint

    Sunnyvale, CA
    3 days ago
  •  ...About The Role Model Alignment And Deployment Is...  ...Effort Spanning Our Post-Training, Safety Engineering, Trust & Safety, ML...  ..., Alignment Workflows (RLHF, DPO, Constitutional AI...  ...Alignment Concepts (Supervised Fine-Tuning, RLHF, AI Safety Frameworks, LLM Evaluation) As Well As... 
    Training

    Character

    Redwood City, CA
    6 days ago
  • $215.28k - $364.32k

     ...Staff Machine Learning Engineer - Foundation Model Santa Clara, CA...  ...infrastructure experts to design, train, and deploy large-...  ...pretraining and fine-tuning strategies leveraging...  ...driving models, or LLM/VLM architectures (e....  ...Familiarity with RLHF/DPO/GRPO, trajectory... 
    Training
    Full time

    XPENG

    Santa Clara, CA
    1 day ago
  • $100k - $200k

     ...meticulous and innovative AI/LLM Test Engineer to join our cutting-edge AI...  ...and safety of Large Language Models (LLMs) in real-world product...  ...ML engineers to validate fine-tuned models and optimize prompts...  ...LLM fine-tuning (e.g., LoRA, RLHF), or optimization techniques... 
    Full time
    Contract work
    For contractors
    Work experience placement

    OPPO US Research Center

    Palo Alto, CA
    2 days ago
  • $281k - $356k

     ...Senior Staff Software Engineer, Model Post Training Waymo is an autonomous driving technology company...  ...join our team to lead the post-training LLM for autonomous driving. You will be...  ...applications. This may include fine-tuning strategies, reinforcement learning techniques... 
    Training
    Full time
    Remote work

    Waymo

    Mountain View, CA
    4 days ago
  • $180k

     ...looking for a candidate passionate about truth-seeking AI and model building. You will tackle critical AI modeling challenges and get...  ...in this hands-on role. Join a motivated team invested in engineering excellence and the pursuit of knowledge. #J-18808-Ljbffr xAI
    Training

    xAI

    Palo Alto, CA
    1 day ago
  •  ...that fuels it, recursively accelerating the path to artificial superintelligence. We are interested in best-in-class engineers to focus on a variety of challenges relating to scaling, low-level optimization, and core infrastructure for LLM training and inference.... 
    Training

    Ricursive Intelligence

    Palo Alto, CA
    4 days ago
  • $170k - $260k

     ...visionary scientists, engineers, and entrepreneurs are...  ...modal Large Biological Models (LBM), we are pioneering...  ...biomedicine, with our LBM training leading to ground-...  ...team and leadership in LLM and generative AI position...  ...Performance Optimization: Fine-tune GPU clusters and deep... 
    Training
    Work at office

    GenBio AI

    Palo Alto, CA
    4 days ago
  • $184k - $287.5k

     ...are now looking for a Senior High-Performance LLM Training Engineer! NVIDIA is seeking experienced engineers...  ...architecture. Proven experience analyzing and tuning application performance & system-level performance modeling. Programming skills in C++, Python, and CUDA... 
    Training
    Work experience placement

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $28.55 - $47.96 per hour

     ...Serviceisintegrating AI tools - LLM assistants, copilots,...  ...technicians, mobile service, field engineers, and back-office teams....  ...context hints ~ Scheduled training sessions and roundtable for...  ...workflow, when to use retrieval vs. fine-tuning, how todesignreliability and... 
    Training
    Full time
    Temporary work
    Part time
    Internship
    Local area
    Flexible hours

    Tesla

    Palo Alto, CA
    4 days ago
  • $301.75k - $355k

     ...Senior Director for the Model LifeCycle team will...  ...Life Cycle Team. Manage fine‑tuning systems for large...  ...and maintain end‑to‑end training pipelines for Large Language...  ...in Computer Science, Engineering, or a related field....  ...preprints in the LLM post‑training space Proficiency... 
    Training
    Temporary work

    Crusoe

    Sunnyvale, CA
    3 days ago
  • $109k - $157k

     ...intelligence specialist with experience in LLM and agent harnesses for industrial engineering, system engineering and/or chemical...  ...solutions in which approaches/models can apply new and existing project workflows Create/train/evaluate agentic AI models, features,... 
    Training
    Internship
    Flexible hours

    X Development, LLC

    Mountain View, CA
    3 days ago
  •  ...LLMs, our proprietary models, and a sophisticated Agentic...  ...Moveworks' Reasoning Engine and natural language...  ...infrastructure needed to fine-tune, evaluate, and serve...  ...preserving datasets for model training and evaluation. You...  ...alignment with RLHF/RLAIF/DPO), agent evaluation... 
    Training
    Work at office
    Immediate start
    Remote work
    Flexible hours

    ServiceNow

    Mountain View, CA
    2 days ago
  • $174k - $252k

     ...leading applied ML projects, including LLM training, inference, and fine-tuning. Experience using AI coding agents...  ..., automated red-teaming, or model interpretability and probes. About...  ...AI. We are looking for a research engineer for the Frontier Safety Mitigation... 
    Training
    Full time

    Google Inc.

    Mountain View, CA
    17 hours ago
  • $224k - $356.5k

     ...searching for a senior or principal engineer who specializes in building...  ...for large‑scale foundation model training in the Generalist Embodied...  ...model training and fine-tuning on massive datasets. Implement...  ...experience at building large-scale LLM and multimodal LLM training... 
    Training
    Full time

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  •  ...Senior AI Systems Performance Engineer Palo Alto, California,...  ...AI platform, from chip to model, optimized for enterprise...  ...be easily and securely fine-tuned using customer data for greater...  ...Hands-on experience with LLM or multimodal model training and inference.... 
    Training

    SambaNova Systems

    Palo Alto, CA
    2 days ago
  •  ...a skilled professional to enhance the performance of large-scale models through advanced optimization techniques in Santa Clara, California. Candidates should have a strong background in DL model training and deployment, ideally with a PhD or equivalent experience in Computer... 
    Training

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $152k - $241.5k

    Responsibilities Build performance modeling and prediction tools for AI workloads at Data...  ...like PyTorch and TensorFlow, distributed training and inference Knowledge of GPU cluster...  ...What makes a candidate stand out Proven SW engineering experience experience in deploying SW at... 
    Training

    NVIDIA Gruppe

    Santa Clara, CA
    17 hours ago
  • $224k - $356.5k

     ...performance computing. As a Senior / Principal Deep Learning Engineer — Model Evaluation & AI Systems, you will play a meaningful role in crafting...  ...roadmap, and sharing best practices. Work alongside model training, inference, and product divisions to provide trusted... 
    Training

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $152k - $241.5k

     ...moving multifaceted software team! This software engineering role involves developing datacenter scale performance modeling and predictions tools for AI researchers...  ...frameworks like PyTorch and TensorFlow, distributed training and inference. Knowledge of GPU cluster job... 
    Training

    NVIDIA

    Santa Clara, CA
    3 days ago
  •  ...100x better job search engine: fast, comprehensive, honest...  ...powerful AI and ML models into fast, reliable...  ...and integrate researcher-trained model checkpoints into...  ...efficient training and fine-tuning workflows where needed,...  ...optimization, or modern LLM/embedding/ranking model... 
    Training
    Full time
    Relocation package

    HiringCafe

    Cupertino, CA
    2 days ago
  • $184k - $287.5k

     ...platform upon which every new AI-powered application is built. We are seeking a senior vision language model engineer to design and build agentic data and training workflows for Autonomous Vehicles, Robotics, and Medical applications. The right person for this role brings... 
    Training

    NVIDIA

    Santa Clara, CA
    17 hours ago
  •  ...rethinking the operating model including...  ...guide research on prompt engineering techniques to improve...  ...exploring and utilizing LLM orchestration and agentic...  ...for prompt-based model training, evaluation, and optimization...  ...with model fine-tuning techniques. Knowledge... 
    Training

    Chase

    Palo Alto, CA
    2 days ago
  •  ...NVIDIA, we are seeking exceptional engineers to join our autonomous driving team...  ...together! What You’ll Be Doing Design and train innovative large‑scale models—including generative, imitation,...  ...systems. Build, pre‑train, and fine‑tune LLM/VLM/VLA systems for deployment in real... 
    Training

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Model Training Engineer: RLHF & LLM Fine-Tuning. Be the first to apply!