Member of Technical Staff - RL Infrastructure
$180kXai
Member Of Technical Staff - RL Infrastructure
Palo Alto, CA
About XAI
XAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company's mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All employees are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.
About The Role
XAI is seeking experienced software engineers to create robust data pipelines, comprehensive evaluations for benchmarking LLMs, and automation frameworks to increase the productivity of researchers and engineers.
Typical problems you will deal with include the following:
- We have a new agentic model capability that we'd like to improve. How do we design an efficient and robust environment for the agent to perform actions in?
- Evaluations and observability are a core part of knowing what we need to improve in our models. What new features can we add into our evaluation framework to ease the workflow of researchers & engineers and increase observability?
- A new open-source evaluation dataset has been released and researchers would like to track our models performance on it. How should we onboard it into our internal evaluation framework?
- Datasets have been collected that require complex pre-processing to prepare it for large-scale RL training. How do we standardize our preprocessing pipelines to minimize dataset onboarding time?
- A researcher on the team has an idea for how to augment a dataset to produce additional training data. How should we go about creating the data augmentation pipeline?
Responsibilities
- Creating and maintaining frameworks for agent, data, and model evaluation tasks.
- Building environments for AI agents.
- Tools for automating common workflows.
- Improving alerts, metrics and error handling on large scale RL jobs.
- Refactoring existing agent, data, eval, training frameworks for better modularity.
- Designing operation procedures and coding standards to streamline the transition from small scale experimentation to large scale RL training.
- Writing unit tests, CI/CD frameworks to support rapid development cycles.
Basic Qualifications
- Experience building and maintaining frameworks that are used by many engineers.
- Experience in building high-performance sandboxes, virtual machines, and simulations.
- Experience building full-stack apps for automating workflows and data visualization.
- Experience in rapid iteration of research to production cycles.
- Experience in test automation, CI/CD.
Compensation And Benefits
$180,000 - $440,000 USD
Base salary is just one part of our total rewards package at XAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks.
XAI is an equal opportunity employer. For details on data processing, view our Recruitment Privacy Notice.
$300k
Member of Technical Staff - RL Infrastructure About V max V max is an applied research lab developing AI capable of open-ended learning. We are building systems to exceed humans in all capacities by optimising beyond the local maxima of learning from human expertise. About...SuggestedWork at officeLocal area$300k
Member of Technical Staff - RL Algorithms About V max V max is an applied research lab developing AI capable of open-ended learning. We are building... ..., evals, interpretability, reward modeling, and infrastructure to turn algorithmic ideas into reliable training systems...SuggestedWork at officeLocal areaShift work- ...research lab building the foundational infrastructure to train specialized AI agents. We... ...systems are not designed for long-running RL environments, persistent agent... ...feel like one seamless system. As a Member of Technical Staff, Infrastructure / DevOps, you will own...Suggested
$200k
Member of Technical Staff, RL Research & Environments Posted Feb 28, 2026 | Full-time | Advanced (5-10 yrs) Magic’s mission is to build safe... ...measurably improve user‑facing behavior. You will own the infrastructure and experimental workflows that connect product...SuggestedFull timeRelocationVisa sponsorship- ...Building environments for AI agents , Tools for automating common workflows , Improving alerts, metrics and error handling on large scale RL jobs , Refactoring existing agent, data, eval, training frameworks for better modularity , Designing operation procedures and coding...Suggested
- ...observe their code. We are responsible for designing, building, and scaling core infrastructure that powers a high-volume data platform for AI applications. We are looking for team members who love building enabling systems that empower our engineers and power our rapidly...Work at office
$200k - $350k
...About the job Pantheon - Member of Technical Staff: Infrastructure Member of Technical Staff: Infrastructure Posted by Transparent Search Group on behalf of Pantheon . About Pantheon Autonomous physical labor Website: The role We are...H1bRemote workVisa sponsorship$256k - $276k
...World" graphic novel to understand the bigger picture and our vision at Postman. The Opportunity As a Member of Technical Staff on AI Infrastructure, you will build and maintain the foundational systems and distributed infrastructure that power AI model post training...Work at officeFlexible hours3 days per week- ...full ownership of NeoSigma's platform infrastructure - lead architectural decisions and design... ...enterprise customers Own the technical relationship with enterprise customers... ...Visa sponsorship As a founding member, you'll help define the technical foundation...Visa sponsorshipFlexible hours
- Member of Technical Staff, ML Infrastructure & Inference Overview We are a cutting-edge AI infrastructure company is building a scalable cloud platform designed for next-generation machine learning workloads ($80M series A). As AI systems continue to grow in complexity...
- Member of Technical Staff, Infrastructure and Training Systems Location: SF Bay Area or Tokyo, Japan Type: Full-time About Radical Numerics Radical Numerics is an AI lab bringing the rigor of distributed systems, model architecture, and numerics research to the challenges...Full time
$180k - $300k
...world ML tasks. The present bottleneck is the lack of high-quality RL training environments. Our first step is to build RL... ...has previous experience on Anthropic’s data team building data infrastructure, and datasets behind Claude. We are partnering with leading AI...Visa sponsorshipRelocation package- ...DeepMind, OpenAI, Google Brain, Meta, Character.AI, Anthropic and beyond. Role Overview Reflection.AI is looking for a Member of Technical Staff - Infrastructure Security to secure our geographically diverse multi-cloud Kubernetes and cloud environments. In this role, you’ll...Relocation package
$150k - $250k
...people take ownership, grow together, and share both the challenges and the wins. What you'll do Build the supercomputing infrastructure that runs our agents. Our agents tackle long-horizon, high-performance workloads, and you'll design the cloud compute,...Work at officeRemote workFlexible hours- ...into upside. Make high-conviction bets - Try and fail. But succeed an unfair amount. Job: You will build, operate, and scale our infrastructure, including our infrastructure around large language models, and ensure that our systems are reliable and cost-efficient as we...
$250k
...V max is an applied research lab working at the frontier of reinforcement learning (RL). We are building new techniques for leveraging RL with Large Language Models (LLMs). Our research contributes directly to our RL platform, which automates the engineering involved...Work at office$180k
...Member Of Technical Staff - Inference Palo Alto, CA About Xai Xai's mission is to create AI... ...will own everything from distributed infrastructure (global KV cache, continuous batching... ...research on scaling test-time compute, RL rollout, and model-hardware co-design...Temporary work$160k - $270k
...About Mandolin Nearly every disease will become treatable in our lifetimes. Mandolin is laying the clinical and financial infrastructure to get groundbreaking treatments to patients faster, powered by AI agents. Mandolin partners closely with the largest healthcare...Full timeWork at officeLocal area$200k
...-scale pre-training, domain-specific RL, ultra-long context, and inference-time... ...'s most important decisions. As a Member of Technical Staff on Evals, you will build both the platform... ...themselves. You'll develop infrastructure for large-scale evaluations, data ablations...Visa sponsorshipRelocation package$10k
...Hiring This Role Vapi runs live phone calls — when something breaks, callers hear it. We're building cell-based, multi-region infrastructure to drive 99.99% call completion, and this hire owns the foundation: multi-cluster Kubernetes on EKS, a stateful data plane (...Flexible hours- ...About Us Sieve is the only AI research lab exclusively focused on video data. We combine exabyte-scale video infrastructure, novel video understanding techniques, and dozens of data sources to develop datasets that push the frontier of video modeling. Video makes...
$10k
...millions of calls. It's freakin hard and a lot of fun. What You'll Do: 30 Day: You'll ramp on our multi-cluster, multi-cloud infrastructure. 60 Day: You'll deliver a new service like Anycast Global Router. 90 Day: You'll own a domain like GPU inference clusters....Flexible hoursShift work$300k
Member of Technical Staff - Mechanistic Interpretability About V max V max is an applied research lab... ...representations evolve during RL and post-training, and use these insights... ...improve training objectives. Develop infrastructure for reproducible, large-scale experiments...Work at officeLocal area$150k - $300k
Building Open Superintelligence Infrastructure Prime Intellect is building... ...and pair it with the full rl post-training stack: environments... ...our RL training stack. Core Technical Responsibilities LLM Serving... ...development and encourage team members to contribute to the broader...Work at officeRemote workVisa sponsorshipRelocation packageFlexible hoursShift work- ...SDK , API surfaces , and integration infrastructure that help customers understand, debug... ...interfaces that sit on top of our leading RL infrastructure. You’ll work directly... ...We’re building our team of founding Members of Technical Staff to design the frontier of continually...
- About Mandolin Nearly every disease will become treatable in our lifetimes. Mandolin is laying the clinical and financial infrastructure to get groundbreaking treatments to patients faster, powered by AI agents. Mandolin partners closely with the largest healthcare institutions...Local area
- .... Successful candidates typically come from staff or principal-level roles and are recognized for establishing technical direction, leading large-scale initiatives,... ...teams use to right‑size space and budgets. This infrastructure already powers 16,000 workplaces and 9,000+...Work at officeLocal areaMonday to Thursday
$150k - $350k
About Us Sieve is the only AI research lab exclusively focused on video data. We combine exabyte-scale video infrastructure, novel video understanding techniques, and dozens of data sources to develop datasets that push the frontier of video modeling. Video makes up 80...- ...Hands‑on experience building or significantly enhancing distributed compute platforms, orchestration systems, or high‑performance infrastructure at scale Ability to thrive in a fast‑paced, meritocratic environment with full ownership, high standards, and a focus on...
- ...precedents to copy from. About the Role Members of Technical Staff (MTS) are the senior engineers who... ...at its core. Multi‑tenant data infrastructure across very different portcos. Event‑... ...observability. Background in offline RL, contextual bandits, or sequential decision...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Member of Technical Staff - RL Infrastructure. Be the first to apply!
- IT assistant San Francisco, CA
- desktop support analyst San Francisco, CA
- senior IT support technician San Francisco, CA
- personal computer support technician San Francisco, CA
- technical analyst San Francisco, CA
- customer support technician San Francisco, CA
- tech assistant San Francisco, CA
- technical support assistant San Francisco, CA
- customer support analyst San Francisco, CA
- remote (work from home) technical support representative San Francisco, CA


