Model Training
$175k - $350kHumanx
Model Training Engineer
At Inflection AI, our public benefit mission is to harness the power of AI to improve human well-being and productivity. The next era of AI will be defined by agents we trust to act on our behalf. We're pioneering this future with human-centered AI models that unite emotional intelligence (EQ) and raw intelligence (IQ)—transforming interactions from transactional to relational, to create enduring value for individuals and enterprises alike. Our work comes to life in two ways today: Pi, your personal AI, designed to be a kind and supportive companion that elevates everyday life with practical assistance and perspectives. Platform — large-language models (LLMs) and APIs that enable builders, agents, and enterprises to bring Pi-class emotional intelligence into experiences where empathy and human understanding matter most. We are building toward a future of AI agents that earn trust, deepen understanding, and create aligned, long-term value for all.
About the Role
As a Model Training engineer, you will design, build, and scale the post-training pipelines that turn a general LLM into a brand-fluent, production-ready assistant. Your innovations in fine-tuning and preference optimization (RLHF, DPO, GRPO, RLAIF) will directly improve reliability, alignment, and cost.
This is a good role for you if you:
- Have hands-on experience training and fine-tuning large transformer models on multi-GPU / multi-node clusters.
- Are fluent in PyTorch and its ecosystem tools (Torchtune, FSDP, DeepSpeed) and enjoy digging into distributed-training internals, mixed precision, and memory-efficiency tricks.
- Have shipped or published work in RLHF, DPO, GRPO, or RLAIF and understand their practical trade-offs.
- Care deeply about training tools, pipelines, and reproducibility—you automate the boring parts so you can iterate on the fun parts.
- Balance research curiosity with product pragmatism—you know when to run an ablation and when to ship.
- Communicate crisply with both technical and non-technical teammates.
Responsibilities include:
- Contribute to end-to-end post-training workflows—dataset curation, hyper-parameter search, evaluation, and rollout—using PyTorch, Torchtune, FSDP/DeepSpeed, and our internal orchestration stack.
- Prototype and compare alignment techniques (e.g., curriculum RL, multi-objective reward modeling, tool-use fine-tuning) and push the best ideas into production.
- Automate training at scale: build robust pipeline components, tools, scripts, and dashboards so experiments are reproducible and easy to trace.
- Define the metrics that matter; run A/B tests and iterate quickly to meet aggressive quality targets.
- Collaborate with inference, safety, and product teams to land improvements in customer-facing systems.
Employee Pay Disclosures At Inflection AI, we aim to attract and retain the best employees and compensate them in a way that appropriately and fairly values their individual contributions to the company. For this role, Inflection AI estimates a starting annual base salary will fall in the range of approximately $175,000 - $350,000 depending on experience.
Interview Process Apply: Please apply on Linkedin or our website for a specific role. After speaking with one of our recruiters, you'll enter our structured interview process, which includes the following stages:
- Hiring Manager Conversation – An initial discussion with the hiring manager to assess fit and alignment.
- Technical Interview – A deep dive with an Inflection Engineer to evaluate your technical expertise.
- Onsite Interview – A comprehensive assessment, including: A domain-specific interview A system design interview A final conversation with the hiring manager
- Depending on the role, we may also ask you to complete a take-home exercise or deliver a presentation. For non-technical roles, be prepared for a role-specific interview, such as a portfolio review.
- We aim to provide feedback within one week of your final interview.
Required Skills Python PyTorch RLHF DPO GRPO RLAIF
Job Type: full-time
Experience Level: mid
Salary Range: $170,000 - $300,000
Location: Palo Alto, CA
$57.69 per hour
...robotics. Role We are seeking a highly motivated intern to build state-of-the-art vision foundation models for industrial robotics. You will design, train and test your own vision foundation models with the goal of achieving state-of-the-art results in a given...TrainingFull timeInternshipLocal area- ...Job Title: CW Research on Large Vehicle Data Model - Summer Intern (99W210) About Kyyba: Founded in 1998 and headquartered... ...and develop foundation models, including pretraining and post-training, leveraging language supervision, and enhancing multimodal...TrainingSummer internshipVisa sponsorshipWork visa
$190k - $210k
...manager to lead the validation and closure of quantum architectural models against hardware system data - a vital activity in our progress... ...a candidate's qualifications including relevant education and training, competencies, experience, geographic location, and business...TrainingFull timeShift work$150k
...their teammates. ABOUT THE ROLE: You will join the Grok Voice Model team to help build the world's best voice AI. We deliver smooth... ...across devices and real-time scenarios. We own the full training pipeline: massive data curation, premium audio processing, frontier...TrainingTemporary work$170k - $216k
...Machine Learning Engineer, Model Optimization Waymo is an autonomous driving technology company with the mission to be the world... ...from large scale real-world data, to (2) develop models and model training at scale, to (3) analyze real-world behavior and develop...TrainingFull timeRemote work$180k
...teammates. ABOUT THE ROLE: As a multimodal engineer on the Imagine Model Team, you will develop cutting-edge AI experiences beyond text,... ...for video). Responsibilities span data curation, modeling, training, inference serving, and product integration, covering both...TrainingTemporary work$124k
...What to Expect At Tesla AI, we're not just training models, we're building the foundation models that power the future of real-world autonomy. Your work will directly control millions of Tesla vehicles and robotaxis on the road today, Optimus humanoid robots in factories...TrainingHourly payFull timeTemporary workImmediate startFlexible hours$190k - $250k
...the world works. We are developing large-scale generative world models that learn to predict realistic, physically consistent futures... ...This capability serves as the foundation for scalable closed-loop training, validation, and long-tail scenario generation, and is...TrainingTemporary workWork at officeVisa sponsorshipFlexible hours$140k - $390k
...What to Expect Our team designs, trains, and deploys large-scale neural networks optimized for inference on compute-constrained edge... .../ custom AI ASIC). This role sits at the intersection of ML modeling and hardware-aware systems engineering - you will architect and...TrainingHourly payFull timeTemporary workFlexible hours$281k - $356k
...Senior Staff Software Engineer, Model Post Training Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver...TrainingFull timeRemote work$204k - $259k
...role you will report to a Technical Lead Manager. You will: * Conduct applied foundation model research and development * Design compelling experiments by training and evaluating large deep learning models * Present results to peers and leadership * Write...TrainingFull timeRemote work$122.93k - $145.39k
...ML Engineer – Healthcare Data Curation & Model Workflows School of Medicine, Stanford, California, United States Research... ...build robust and modular software engineering infrastructures for training and inference of ML models that can be used for a variety of downstream...TrainingHourly payFull time$139.9k - $274.8k
...global well-being. We are hiring a Product Manager to own AI model security - the discipline of making our frontier models... ...security threats - prompt injection, data exfiltration, jailbreaking, training data extraction, zero-day exploit generation, model poisoning,...TrainingOngoing contractWork at officeLocal area$180k
...teammates. About the Role You will join the Grok Chat Model team, focused on the foundational Grok models powering chat. We... ...seamless chat systems. Develop and optimize scalable inference and training pipelines for production reliability at massive scale. Build...TrainingFull timeTemporary workShift work$254k - $349.25k
...leveraging state-of-the-art LLMs/SLMs and advanced machine learning techniques . This role requires deep expertise in model architecture, training, fine-tuning, and distillation , combined with a strong understanding of security domains such as threat detection,...TrainingFlexible hours$224k - $356.5k
...performance computing. As a Senior / Principal Deep Learning Engineer — Model Evaluation & AI Systems, you will play a meaningful role in... ...roadmap, and sharing best practices. Work alongside model training, inference, and product divisions to provide trusted evaluation...Training$174.72k - $295.68k
...Senior Machine Learning Engineer - Foundation Model Santa Clara, CA XPENG is a leading smart technology company at the forefront of... ...perception and planning engineers, and infrastructure experts to design, train, and deploy large-scale multi-modal models that unify vision,...TrainingFull time$150k
...About the Institute of Foundation Models We are a dedicated research lab for building, understanding, using, and risk-managing... ...opportunity to work on the core of cutting-edge foundation model training, alongside world-class researchers, data scientists, and engineers...TrainingVisa sponsorship- ...Description Job Description About the Institute of Foundation Models We are a dedicated research lab for building, understanding,... ...to work on the core of cutting-edge foundation model training, alongside world-class researchers, data scientists, and engineers...Training
$215.28k - $364.32k
...Staff Machine Learning Engineer – Autonomous Driving Model Quantization & Deployment Santa Clara, CA XPENG is a leading smart technology... ...VLMs). Model Compression: Apply and innovate in PTQ (Post-Training Quantization), QAT (Quantization-Aware Training), and pruning...TrainingFull time$184k - $287.5k
...platform upon which every new AI-powered application is built. We are seeking a senior vision language model engineer to design and build agentic data and training workflows for Autonomous Vehicles, Robotics, and Medical applications. The right person for this role brings...Training$165.2k - $223.6k
...frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and Acceleration team is at the forefront of running a wide range of models and supporting novel architecture alongside maximizing their performance...TrainingWork experience placementInternshipLocal areaFlexible hours$172.43k - $230.95k
...Senior Software Engineer For The Ai Model Lifecycle Team Crusoe is on a mission to accelerate the abundance of energy and intelligence... ...cost-efficient scaling. Implement and maintain end-to-end training pipelines for Large Language Models. RFT and Reinforcement...TrainingTemporary work$181.1k - $318.4k
...Infrastructure Engineer, Compute (TPU Scheduling) - Foundation Model Work Locations (2) Submit Resume Apple is where individual... ...and enable reliable, efficient execution of large-scale training and inference jobs. This role spans scheduling algorithms, cluster...TrainingRelocation$165k - $185k
...research, our AI research in Silicon Valley focuses on Foundation Models, Big Data Visual Analytics, Explainable AI (XAI), Natural... ...equivalent graduate research experience on foundation models, including training, fine-tuning, and prompting ~ In-depth experiences in deep...TrainingWork experience placementWorldwide- ...Description Job Description About the Institute of Foundation Models We are a dedicated research lab for building, understanding, using... ...to work on the core of cutting-edge foundation model training, alongside world-class researchers, data scientists, and engineers...TrainingLive inImmediate startVisa sponsorship
- ...device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to... ...GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups....Training
$152k - $241.5k
...engineering role involves developing datacenter scale performance modeling and predictions tools for AI researchers running AI workloads... ...Learning frameworks like PyTorch and TensorFlow, distributed training and inference. Knowledge of GPU cluster job scheduling (Slurm or...TrainingFull time$126k - $423k
...WILL: * Conduct research on pretraining world-action foundation model with various world modalities including vision and physics... ...vision, robotics systems, and distributed machine learning model training NICE TO HAVE: Hands-on experience in at least one of the...TrainingFull timeFor contractorsFor subcontractorCasual workWork at officeImmediate startRemote workDay shift- Enterprise Medical is partnering with a membership-based primary care organization that has multiple clinics across Northern California to add an Internal Medicine physician to its expanding Palo Alto team. This opportunity offers the rare combination of clinical excellence...Work at office
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Model Training. Be the first to apply!



