Member of Technical Staff - Post-Training and RL

$180k

x.ai

ABOUT xAI:

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands‑on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All employees are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.

ABOUT THE ROLE:

You will work on the most critical post‑training and reinforcement learning challenges at any given time — including reward modeling, preference optimization (RLHF/DPO), and RL for improving reasoning, truthfulness, and real‑world capabilities. You will get clarity on your first project before an offer.

BASIC QUALIFICATIONS:

You believe truth‑seeking AI is the most important and challenging problem. You are obsessed with building incredibly useful models through post‑training and RL techniques. You are a power user of AI models and eager to push the boundaries of what’s possible with reinforcement learning and alignment methods. If you have previously worked on post‑training or RLHF, or trained models used by millions of people, it’s a big plus, but relevant experience is not required. You take pride in your work and thrive in meritocratic environments.

COMPENSATION AND BENEFITS:

$180,000 - $600,000 USD

Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long‑term disability insurance, life insurance, and various other discounts and perks.

Vacancy posted 5 days ago
Location: Palo Alto, CA
