Member of Technical Staff - Post-Training and RL

Gravity Engineering Services Pvt Ltd.

ABOUT THE ROLE:

You will work on the most critical post-training and reinforcement learning challenges at any given time — including reward modeling, preference optimization (RLHF/DPO), and RL for improving reasoning, truthfulness, and real-world capabilities. You will get clarity on your first project before an offer.

BASIC QUALIFICATIONS:

You believe truth-seeking AI is the most important and challenging problem. You are obsessed with building incredibly useful models through post-training and RL techniques. You are a power user of AI models and eager to push the boundaries of what’s possible with reinforcement learning and alignment methods. If you have previously worked on post-training or RLHF, or trained models used by millions of people, it’s a big plus, but relevant experience is not required. You take pride in your work and thrive in meritocratic environments.

Vacancy posted 4 days ago
