Model Training Engineer: RLHF & LLM Fine-Tuning
$175k - $350kInflection AI
A pioneering AI company is seeking a Model Training Engineer to design and scale post-training pipelines for large language models. The ideal candidate will have hands-on experience with training and fine-tuning large transformer models. Responsibilities include end-to-end workflow contributions, alignment techniques prototyping, and automating training at scale. Competitive salary range is $175,000 – $350,000 per year based on experience and location, along with equity and benefits including unlimited paid time off. #J-18808-Ljbffr Inflection AI
- Bright Vision Technologies is seeking an LLM Engineer to design and operationalize workflows for large language models. The role involves deep experience with modern training stacks, dataset construction, and evaluation methodologies. Ideal candidates possess a Master’s...TrainingRemote jobFull time
$175k - $350k
...with human-centered AI models that unite emotional intelligence... ...the Role As a Model Training engineer, you will design, build,... ...that turn a general LLM into a brand-fluent, production... .... Your innovations in fine-tuning and preference optimization (RLHF, DPO, GRPO, RLAIF) will...Training$181.1k - $318.4k
...Apple. These are multimodal models that power Siri on-device speech... ...researchers and modeling engineers train models, iterate on data... ...corpora, and stack supervised fine-tuning, LoRA adapter training, and... ...multi-agent workflows using LLM-native tooling for on-device...TrainingRelocation$150k - $230k
...hands‑on Machine Learning Engineer to drive the post‑training of our large language models, with a strong emphasis... ...(CPT), supervised fine‑tuning (SFT), and RL — along with... ...primary focus (e.g., RLHF, PPO, GRPO, DPO, and related... ...Requirements Hands‑on LLM post‑training...TrainingFull time$272k - $431.25k
...is seeking a Principal Engineer to drive the performance of large‑scale AI training and post‑training workloads... ...frontier‑scale LLM workloads running on thousands... ...software, tools, models, benchmarks, and analysis... ...reinforcement learning, fine‑tuning, or other post‑training...Training$100k - $150k
...re looking for a skilled LLM Engineer to join our dynamic team... ..., and operationalize fine-tuning workflows for large language models across supervised, preference... ...experience with modern training stacks, careful dataset... ...using supervised, DPO, RLHF, and related techniques....TrainingFull timeH1bLocal areaImmediate startRemote workVisa sponsorshipWork visa$254k - $349.25k
...requires deep expertise in model architecture, training, fine‑tuning, and distillation ,... ...experiments for: Alignment (RLHF/RLAIF) in security‑sensitive... ...architectures and modern LLM techniques Retrieval‑... ..., CASB, etc.) Systems & Engineering Experience designing high...TrainingFlexible hours- About the role Model Alignment and Deployment is a critical... ...spanning our Post-Training, Safety Engineering, Trust & Safety, ML... ..., alignment workflows (RLHF, DPO, Constitutional AI... ...alignment concepts (supervised fine-tuning, RLHF, AI safety frameworks, LLM evaluation) as well as...Training
$170k - $260k
...for the Deep Learning Engineer role at GenBio AI Base... ...Large Biological Models (LBM), we are pioneering... ...biomedicine, with our LBM training leading to ground‑... ...team and leadership in LLM and generative AI... ...pipeline, pre‑training, fine‑tuning, serving) Build and...TrainingFull timeWork at office- ...knit group of researchers and engineers responsible for building large scale frontier foundation models at Apple. We believe the most... ...role shaping the future of our LLM efforts, specifically in transforming... .... You will tackle core training challenges in instruction following...Training
$100k - $150k
...we’re looking for a skilled Model Serving Engineer to join our dynamic team and... ...latency, throughput, and cost. Tune GPU utilization, memory... ...and KV cache strategies for LLM serving workloads.... ...including recruitment, hiring, training, compensation, promotion, transfer...TrainingFull timeH1bLocal areaImmediate startRemote workVisa sponsorshipWork visa$150k
...highly motivated, and focused on engineering excellence. This... ...You will join the Grok Voice Model team to help build the world... ...scenarios. We own the full training pipeline: massive data curation... ...enhancements through supervised fine-tuning, reinforcement learning, and...TrainingTemporary work- Job You will own the training pipeline behind the models that power both Parallel’s search stack and Parallel’s agents. On the search side, that means... ...from real product usage to high‑quality training data, fine‑tune and evaluate these models rigorously, and ship them...TrainingWork at officeVisa sponsorship
- GoTo Meeting is seeking a hands-on Machine Learning Engineer to drive the post-training of large language models focusing on reinforcement learning. You will own... ...including continuous pre-training and supervised fine-tuning. The ideal candidate has significant experience...Training
- ...100x better job search engine: fast, comprehensive, honest... ...powerful AI and ML models into fast, reliable... ...and integrate researcher-trained model checkpoints into... ...efficient training and fine‑tuning workflows where needed,... ...optimization, or modern LLM/embedding/ranking model...TrainingRelocation package
- ...Systems in Palo Alto is seeking a professional to own the training pipeline behind models that power both their search stack and agents. The role... ...connections from product usage to training data, fine-tuning models, and ensuring safe deployment. Ideal candidates have...Training
$174k - $252k
...leading applied ML projects, including LLM training, inference, and fine-tuning. Experience using AI coding agents... ..., automated red-teaming, or model interpretability and probes. About... ...AI. We are looking for a research engineer for the Frontier Safety Mitigation...TrainingFull time$224k - $356.5k
...searching for a senior or principal engineer who specializes in building... ...for large‑scale foundation model training in the Generalist Embodied... ...model training and fine-tuning on massive datasets. Implement... ...experience at building large-scale LLM and multimodal LLM training...TrainingFull time- ...a skilled professional to enhance the performance of large-scale models through advanced optimization techniques in Santa Clara, California. Candidates should have a strong background in DL model training and deployment, ideally with a PhD or equivalent experience in Computer...Training
$155k - $207k
...industry-veteran scientists and engineers. As a Senior/Staff Machine... ...-scale SID / ASR / NLP / LLM systems that power mission-critical... ...and implementation of training, fine-tuning, post-training, and inference... ...large language and speech models using PyTorch and/or JAX, making...TrainingPermanent employment- This software engineering role involves developing datacenter‑scale performance‑modeling and prediction tools for AI researchers running AI workloads in GPU clusters.... ...frameworks such as PyTorch and TensorFlow, distributed training and inference. Knowledge of GPU cluster job...Training
$224k - $356.5k
...performance computing. As a Senior / Principal Deep Learning Engineer — Model Evaluation & AI Systems, you will play a meaningful role in crafting... ...roadmap, and sharing best practices.* Work alongside model training, inference, and product divisions to provide trusted...Training- ...analyze, profile, and optimize AI training workloads on innovative... ...Computer Science, Electrical Engineering or Computer Engineering and 5... ...Proven experience analyzing and tuning application performance &... ...and system‑level performance modelling. Programming skills in C++, Python...TrainingWork experience placement
- Senior AI Systems Performance Engineer Palo Alto, California,... ...AI platform, from chip to model, optimized for enterprise... ...can be easily and securely fine‑tuned using customer data for greater... ...Hands‑on experience with LLM or multimodal model training and inference. Background...TrainingFull timeTemporary workLocal areaFlexible hours
$198k - $286k
...across kernels, the inference engine, and distributed systems so that... ...of Modular Cloud, delivering LLM performance on the Pareto... ...highly customized LLM inference tuned to specific customer use cases... ...are not limited to education, training, work experience, business needs...TrainingRemote jobWork experience placementWork at officeLocal areaFlexible hours- ...NVIDIA, we are seeking exceptional engineers to join our autonomous driving team... ...together! What You’ll Be Doing Design and train innovative large‑scale models—including generative, imitation,... ...systems. Build, pre‑train, and fine‑tune LLM/VLM/VLA systems for deployment in real...Training
- ...want to hear from you. Runtime Engineer Palo Alto, California, United... ...AI platform, from chip to model, optimized for enterprise and... ...that can be easily and securely fine‑tuned using customer data for... ...flow applications such as ML training and inference and HPC applications...TrainingFull timeTemporary workLocal areaFlexible hours
- Software Engineer, Computer Vision and Deep Learning... ...collection techniques for training Deep Neural Nets,... ...the-art deep learning models for perception of urban... ...strategies for their effective fine‑tuning and adaptation to... ...detection, multimodal LLM/LVM extraction,...TrainingShift work
- ...of Technical Staff — Diffusion Model About the Role RadixArk is seeking... ...research thinking with strong engineering execution — from designing novel algorithms to training and deploying models at scale.... ...GitHub stars, the fastest open LLM serving engine), and developed Miles...TrainingFlexible hours
$150k
...Institute of Foundation Models We are a dedicated... ...-edge foundation model training, alongside world-class... ..., data scientists, and engineers, tackling the most fundamental... ...-training, instruction tuning, reasoning, and agents.... ...pre-training, fine-tuning, evaluation, or...Training
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Model Training Engineer: RLHF & LLM Fine-Tuning. Be the first to apply!

