Member of Technical Staff - RL Infrastructure [data, evals, agent]
$180k, xAI
About xAI

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All employees are expected to have strong communication skills and to share knowledge with their teammates concisely and accurately.

About the role

xAI is seeking experienced software engineers to create robust data pipelines, comprehensive evaluations for benchmarking LLMs, and automation frameworks to increase the productivity of researchers and engineers. Typical problems you will deal with include the following:

- We have a new agentic model capability that we’d like to improve. How do we design an efficient and robust environment for the agent to perform actions in?
- Evaluations and observability are a core part of knowing what we need to improve in our models. What new features can we add to our evaluation framework to ease the workflow of researchers and engineers and increase observability?
- A new open-source evaluation dataset has been released and researchers would like to track our models’ performance on it. How should we onboard it into our internal evaluation framework?
- Datasets have been collected that require complex pre-processing to prepare them for large-scale RL training. How do we standardize our preprocessing pipelines to minimize dataset onboarding time?
- A researcher on the team has an idea for how to augment a dataset to produce additional training data. How should we go about creating the data augmentation pipeline?

Responsibilities

- Creating and maintaining frameworks for agent, data, and model evaluation tasks.
- Building environments for AI agents.
- Building tools for automating common workflows.
- Improving alerts, metrics, and error handling on large-scale RL jobs.
- Designing operating procedures and coding standards to streamline the transition from small-scale experimentation to large-scale RL training.
- Writing unit tests and CI/CD frameworks to support rapid development cycles.

Basic qualifications

- Experience building and maintaining frameworks that are used by many engineers.
- Experience building high-performance sandboxes, virtual machines, and simulations.
- Experience building full-stack apps for automating workflows and data visualization.
- Experience iterating rapidly through research-to-production cycles.
- Experience in test automation and CI/CD.

Compensation and benefits
$180,000 - $440,000 USD
Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short- and long-term disability insurance, life insurance, and various other discounts and perks.
$300k
Member of Technical Staff - RL Infrastructure About V max V max is an applied research lab developing AI capable of open-ended learning. We are building... ...distributed rollouts, training orchestration, inference, evals, data pipelines, observability, and reliability. You will... [Data, Work at office, Local area]
$300k
Member of Technical Staff - RL Algorithms About V max V max is an applied... ...on environments, evals, interpretability, reward modeling, and infrastructure to turn algorithmic... ...experiment tracking, data pipelines, and debugging... ...language models, or agents. Experience with... [Data, Work at office, Local area, Shift work]
- ...develop the simulation infrastructure to build worlds (e.g.... ..., logic, and agents that can perceive and... ...We're looking for a Member of Technical Staff - Embodied Agents to... ...Policy optimization RL and imitation learning... ...worlds Synthetic data pipelines Agent evaluation... [Data]
$300k
...engineers who can turn RL research ideas... ...systems, evals, environment and... ...language model based agents. Translate research... ..., rewards, data, infrastructure, or evals. Build... ...RL projects. Own technically ambiguous projects... ...other technical team members. Clear written... [Data, Full time, Work at office, Local area]
$200k
Member of Technical Staff, RL Research & Environments Posted Feb 28, 2026 | Full-time | Advanced (5-10... ...team, you will design and operate the data, evaluation, and environment systems... ...‑facing behavior. You will own the infrastructure and experimental workflows that... [Data, Full time, Relocation, Visa sponsorship]
$150k - $220k
Founding Member of Technical Staff (MTS), Bay Area, CA, Full-time... ..., security, and infrastructure best practices.... ...optimization loops for AI agents, from evaluation pipelines and data/trace... ...services for training, evals, telemetry, and... ...Experience with RL, reward modeling,... [Data]
- ...cross-portfolio data lake on open table... ...that makes the data agent-readable. An... ...About the Role Members of Technical Staff (MTS) are the senior... ...Multi‑tenant data infrastructure across very different... ...and evals. The harness that... ...Background in offline RL, contextual bandits... [Data]
$150k - $300k
...it with the full RL post‑training stack... ...sandboxes, verifiable evals, and our async RL... ...: Building the infrastructure to serve LLMs... ...training stack. Core Technical Responsibilities LLM... ...: Rust, C++. Data & Observability: Kafka... ...encourage team members to contribute to the... [Data, Work at office, Remote work, Visa sponsorship, Relocation package, Flexible hours, Shift work]
$150k - $300k
...Superintelligence Infrastructure Prime Intellect... ...with the full RL post-training... ...sandboxes, verifiable evals, and our async... ...jobs. Core Technical... ...control-plane agents that watch pods... ...fundamentals (data/tensor/pipeline... ...encourage team members to contribute to... [Data, Work at office, Local area, Remote work, Visa sponsorship, Relocation package, Flexible hours]
- Responsibilities Develop LM/VLM-powered agents that autonomously interface... ...-quality, diverse physical data by orchestrating simulation,... ...with simulation and infrastructure teams to co-develop APIs, GUIs... ...agents Deep curiosity and strong technical ownership, with a track... [Data, Remote job]
- ...Anchorage Digital is building infrastructure that enables the world’s... ...Responsibilities Contribute to the technical direction of our... ...and assist or teach other team members when possible. Qualifications... ...science fundamentals (algorithms, data structures, systems design), formal... [Data]
- ...efficiently across deployment targets, from data center accelerators to on-device... ...product we sell, but the internal data and agent infrastructure that makes Liquid run at the speed of a... ...with prompt engineering, tool use, evals, and the practical limits of what models... [Data]
- ...Our first product is a set of APIs for AIs to do more with web data. We are a fully in-person team based in both San Francisco... ...an unfair amount. Job: You will build, operate, and scale our infrastructure, including our infrastructure around large language models, and... [Data]
- ...surfaces, and integration infrastructure that help customers... ...top of our leading RL infrastructure. You’... ...-powered products or agent systems - you... ...SDKs Experience with data pipelines, telemetry... ...our team of founding Members of Technical Staff to design the frontier... [Data]
- ...Mandolin is laying the clinical and financial infrastructure to get groundbreaking treatments to patients faster, powered by AI agents. Mandolin partners closely with the largest... ...while processing sensitive healthcare data. We’re looking for a DevSecOps leader who can... [Data, Local area]
$150k - $300k
About Us Sieve is the only AI research lab exclusively focused on video data. We combine exabyte-scale video infrastructure, novel video understanding techniques, and dozens of data sources to develop datasets that push the frontier of video modeling. Video makes up 80%... [Data]
- ...semantic search, RAG, and agents. We believe that our... ...all the compute, data, and talent available... ...remote-friendly! As a Member of Technical Staff, you will: Design... ...both on the SFT and the RL regime. Research,... ...supercompute and data infrastructure. Learn from and work... [Data, Full time, Work at office, Remote work, Flexible hours]
- ...candidates typically come from staff or principal-level roles and... ...recognized for establishing technical direction, leading large-scale... ...size space and budgets. This infrastructure already powers 16,000... ...connects people, spaces, and data end‑to‑end. Join our lean, product... [Data, Work at office, Local area, Monday to Thursday]
$180k - $350k
...Open Superintelligence Infrastructure Prime Intellect is building... ...pair it with the full RL post-training stack:... ...sandboxes, verifiable evals, and our async RL trainer... ...models, training data, and the compute that powers... ...validate our defenses. Core Technical Responsibilities... [Data, Full time, Work at office, Remote work, Visa sponsorship, Relocation package, Flexible hours]
$150k - $300k
...Intellect builds the infrastructure that frontier AI... ...frontier scale, from RL and SFT to tool use, agent workflows, and deployment... ...into clear technical requirements that guide... ...and deploy agents, evals, and harnesses for real... ...evaluations and/or synthetic data generation.... [Data, Remote work, Visa sponsorship, Relocation package, Flexible hours]
$200k - $400k
...Infrastructure Engineer Opportunity We are looking for an Infrastructure Engineer who thrives... ...across AWS and GCP to support global data residency and failover requirements.... ...Communication: Ability to write clear technical specs for both internal teams and external... [Data, Flexible hours]
$200k - $300k
...engineer to join our highly driven Comet Agents engineering team. The Comet Agents team... ...agentic capabilities; Designing optimal data representations and modes of interaction... ...and leverage cutting‑edge AI models, infrastructure, and browser technologies to advance the... [Data, Full time, Flexible hours]
$100k - $300k
...AI Lab building the next generation of AI agents for cybersecurity. AI has fundamentally... ...Taskforce" assesses petabytes of enterprise data to remediate these issues before critical... ...‑impact environments, love solving hard technical problems, and want to see the tangible... [Data]
- Overview At Composio, we are building infrastructure that allows agents to communicate with the tools you... .... What you'll do build large evals with real tool calling data, measuring where models suck in... ...SFT on our agentic traces and RL models on top of our agentic harness... [Data]
$210k - $385k
...Full time Location Type Hybrid Department Data Science Compensation $210K - $385K •... ...In this role, you will build specialized evals to improve answer quality across Perplexity... ...changes, collaborating closely with technical leadership to measure and improve Answer... [Data, Full time, Local area]
$150k - $350k
...Job Description Member of Technical Staff, Applied Research, Sieve Location: San Francisco, CA (Onsite) Compensation... ...only AI research company exclusively focused on video data infrastructure and video intelligence. Their thesis: high-quality video... [Data, Full time, H1b, Visa sponsorship]
- ...working quietly on a SOTA personal agent that learns what real people... ...a few of them. We have the data, we have the revenue, we have... ...train on and unique usage to RL on possess strong opinions... ...that this role should be renamed "member of tomo staff" Tomo [Data, Immediate start]
- Member of Technical Staff, ML Systems Mirendil Mirendil is a tech-first company focused on solving... ...reinforcement learning, reasoning systems, and infrastructure for large-scale experiments. Our team... ..., throughput, cost) Developing data pipelines and evaluation tooling... [Data]
- ...commerce layer for AI - the missing infrastructure that lets agents not just search the web, but... ...discover and buy online. Role As a Member of Technical Staff, you will ship core systems, set engineering... ...with sharp product taste and data intuition. AI-native. Move fast,... [Data, Work at office]
$225k
...frontier-scale pre-training, domain-specific RL, ultra-long context, and inference-time... ...will design and operate the distributed infrastructure that trains Magic’s long-context models... ...distributed training across large GPU clusters (data, tensor, pipeline parallelism) Optimize... [Data, Relocation, Visa sponsorship]


