Member of Technical Staff - Post-Training and RL

$180k

xAI

Job Description

About xAI

xAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company's mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All employees are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.

ABOUT THE ROLE:

You will work on the most critical post-training and reinforcement learning challenges at any given time — including reward modeling, preference optimization (RLHF/DPO), and RL for improving reasoning, truthfulness, and real-world capabilities.
You will get clarity on your first project before an offer.

BASIC QUALIFICATIONS:

You believe truth-seeking AI is the most important and challenging problem.
You are obsessed with building incredibly useful models through post-training and RL techniques.
You are a power user of AI models and eager to push the boundaries of what's possible with reinforcement learning and alignment methods.
If you have previously worked on post-training or RLHF, or have trained models used by millions of people, that is a big plus, but relevant experience is not required.
You take pride in your work and thrive in meritocratic environments.

COMPENSATION AND BENEFITS:

$180,000 - $600,000 USD

Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks.

xAI is an equal opportunity employer. For details on data processing, view our Recruitment Privacy Notice.


Vacancy posted a month ago

