Research Member of Technical Staff- Training Systems

Rhoda AI

At Rhoda AI, we’re building the next generation of generalist intelligent robots. We own the full robotics stack from high-performance hardware and robot systems to the infrastructure and state-of-the-art foundation world models that control our robots. Our robots are designed to be generalists capable of operating in complex, real-world environments and handling long-tail edge cases, made possible by our cutting edge research and end-to-end system design. We've raised over $400M and are investing aggressively in model research, infrastructure, hardware development, and manufacturing scale-up to make generalist robotics a reality. We're looking for a Staff / Principal ML Training Systems Engineer to own training systems performance end-to-end. You will define how our models train at scale — driving efficiency, scalability, and correctness across large-scale multimodal training. This is a core systems role, not infrastructure support. Your work directly determines how efficiently we use compute, how well models scale across thousands of GPUs, and how quickly research can iterate. What You'll Do Own training performance end-to-end Diagnose and improve performance of large-scale multimodal training (vision, video, proprioception, actions, language) Build systematic performance attribution: step-time decomposition (compute vs communication vs input pipeline), scaling curves across cluster sizes, and bottleneck identification and prioritization Drive measurable gains in: Distributed efficiency (comm/compute overlap, bucketization, topology-aware mapping, parallelism strategies) Compute efficiency (kernel hotspots, operator fusion, attention optimization, framework/runtime overhead) Memory efficiency (activation checkpointing, sequence packing/bucketing, fragmentation reduction) Design training systems (not just tune them) Define and evolve parallelism strategies: data / tensor / pipeline / sharding / hybrid approaches Improve execution efficiency through communication scheduling and overlap, graph capture and execution optimization, and runtime-level improvements Contribute to and extend training frameworks where needed Make performance observable and measurable Establish source-of-truth performance metrics: step-time breakdowns, MFU / throughput / scaling efficiency Build tools to identify bottlenecks quickly, track performance across model families, and compare scaling behavior across configurations Develop regression detection: microbenchmarks, performance baselines, and automated detection of efficiency regressions Partner deeply with researchers Work side-by-side with research scientists and research engineers — no silos Translate model innovations into scalable, efficient implementations Advise on training tradeoffs for robotics world models: long-horizon sequences, rollout/evaluation cadence, multimodal and variable-length data Collaborate on cluster-level efficiency Work with infrastructure/SRE teams to improve utilization across large distributed jobs, impact of network and collective performance on training, and topology-aware job placement and scaling behavior What We're Looking For Proven track record improving large-scale distributed training performance Deep hands-on experience with modern ML stacks (PyTorch required; JAX a plus) Strong understanding of data / tensor / pipeline parallelism, sharded training (FSDP / ZeRO-style), communication patterns and overlap strategies, and scaling behavior across large GPU clusters Strong systems intuition — ability to reason across compute, communication, and memory bottlenecks Exceptional debugging and measurement ability: turn “training is slow” into clear bottlenecks, experiments, and validated improvements High ownership mindset and comfort in a fast-moving environment Nice to Have (But Not Required) GPU kernel or compiler-level experience (CUDA, Triton, graph capture, operator fusion) Experience with multimodal or video training (variable-length sequences, packing/bucketing) Experience working on large-scale training frameworks or distributed runtimes Familiarity with cluster topology, networking, and large-scale scheduling effects Why This Role Direct leverage on research velocity — every efficiency gain you make accelerates model iteration across the entire research team Own the scalability and performance of large-scale multimodal training for real-world embodied intelligence, not static benchmarks Improvements you make compound across every training run the company executes — high ownership, high impact, small elite team #J-18808-Ljbffr Rhoda AI

Apply

Vacancy posted 2 days ago

Similar jobs that could be interesting for youBased on the Research Member of Technical Staff- Training Systems in Mountain View, CA vacancy

Member of Technical Staff — Pre-Training AI Systems
$180k
A leading AI research organization in Palo Alto seeks engineers to develop and optimize AI systems that understand the universe. Qualified candidates will engage with complex challenges, such as training large neural networks and innovating inference stacks. Required experience...
Technical training
Pantera Capital
Palo Alto, CA
2 days ago
Research Member of Technical Staff - Training Platform
...from high-performance hardware and robot systems to the infrastructure and state-of-the-... ...cases, made possible by our cutting edge research and end-to-end system design. We've... ...Research Engineer to build and maintain the training platform that powers our model development...
Technical training
Rhoda AI
Mountain View, CA
3 days ago
Research Member of Technical Staff- Post-training & Robot Learning
...performance hardware and robot systems to the infrastructure... ...by our cutting edge research and end-to-end system... ...robot tasks. Post-training at Rhoda means taking... ...levels — from senior to staff. What You'll Do... ...are expected to define technical direction and drive research...
Technical training
Shift work
Rhoda AI
Mountain View, CA
3 days ago
Member of Technical Staff — TrainingPalo Alto, CA
About The Role RadixArk is seeking a Member of Technical Staff — Training to build and scale the systems that train frontier AI models. You will work on large-scale... ...infrastructure tooling Collaborate with model researchers to support frontier experiments Debug and...
Technical training
Flexible hours
RadixArk
Palo Alto, CA
1 day ago
Member of Technical Staff - Pre-Training
$180k
...xAI xAI’s mission is to create AI systems that can accurately understand... ...and as a result, all engineers and researchers share the title "Member of Technical Staff." We operate with a flat organizational... ...or open to relocation. Focus Training trillion‑parameter neural...
Technical training
Relocation
Pantera Capital
Palo Alto, CA
2 days ago
Member of Technical Staff — Training
RadixArk is seeking a Member of Technical Staff — Training to build and scale the systems that train frontier AI models. You will work on large-scale distributed... ...infrastructure tooling Collaborate with model researchers to support frontier experiments Debug and resolve...
Technical training
Flexible hours
RadixArk
Palo Alto, CA
4 days ago
Member of Technical Staff — TPU Systems (JAX / XLA / PALLAS)
$180k - $250k
Member of Technical Staff -- TPU Systems (JAX / XLA / PALLAS) About the Role RadixArk is looking for a TPU Systems... ...high-performance inference and training systems using JAX, XLA, and Pallas.... ...that powers leading AI companies and research labs. Join us in building...
Full time
Flexible hours
Dormont Manufacturing Co
Palo Alto, CA
2 days ago
Member of Technical Staff [Research]
About the Role As a Member of Technical Staff [Research] at NeoCognition , you’ll be part of the core team advancing... ...the frontier of LLM agents — systems that can reason, plan, and act... ...in the areas of LLM reasoning, post-training, and agentic system design. Develop...
NeoCognition Inc.
Palo Alto, CA
2 days ago
Member of Technical Staff - Recommendation Systems
$180k
Member of Technical Staff, Recommendation Systems About xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity... ...the user experience Write data pipelines and training jobs that continuously learn from product data. Iterate...
Temporary work
Relocation
xAI
Palo Alto, CA
3 days ago
Member of Technical Staff - Post-Training and RL
$180k
About xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge... .... ABOUT THE ROLE: You will work on the most critical post‑training and reinforcement learning challenges at any given time — including...
Technical training
Temporary work
Pantera Capital
Palo Alto, CA
3 days ago
Member of Technical Staff - RL Training Framework
$180k
ABOUT xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge... ...team is looking for an engineer to help develop our RL training framework. RESPONSIBILITIES Design and implement the systems...
Technical training
Temporary work
Pantera Capital
Palo Alto, CA
4 days ago
Member of Technical Staff (RL Training Framework)
The RL infrastructure team is looking for an engineer to help develop our RL training framework Design and implement the systems backing all RL workloads at xAI, from small scale ablations to production training runs Profile, debug, and optimize end-to-end training performance...
Technical training
Visa sponsorship
Flexible hours
xAI
Palo Alto, CA
1 day ago
Member of Technical Staff, Model Training
Job You will own the training pipeline behind the models that power both Parallel’s search stack and Parallel’s agents... ...models that serve all three. You care about your research being applied to product and systems that millions use. Compensation & benefits Competitive...
Technical training
Work at office
Visa sponsorship
Parallel Web Systems
Palo Alto, CA
2 days ago
Member of Technical Staff - Model Training
$175k - $350k
.... About the Role As a Model Training engineer, you will design, build... ...on the fun parts. Balance research curiosity with product... ...Communicate crisply with both technical and non-technical teammates.... ...improvements in customer-facing systems. Salary Range : $175,000 - $...
Technical training
Inflection AI
Palo Alto, CA
15 hours ago
Senior Member of Technical Staff - AI Research
$148.5k - $223.9k
...engineering, product, and AI-focused activities across agentic AI systems. Specific responsibilities are not enumerated as a separate... ...frameworks and strong ML fundamentals; experience debugging model training, evaluation, and inference pipelines. Infrastructure &...
Salesforce
Palo Alto, CA
15 hours ago
Member of Technical Staff — Web-scale IR & Distributed Systems
Parallel Web Systems in Palo Alto is looking for a skilled engineer to build and manage their large-scale full-text indexing systems.... ...distributed systems. Join us in our fully in-person team committed to solving complex technical challenges. #J-18808-Ljbffr Parallel Web Systems
Parallel Web Systems
Palo Alto, CA
2 days ago
Member of Technical Staff - Research Intern
About Architect Architect is an AI research and product lab for chip design. We build AI models and systems that can explore, design, optimize, and verify new hardware... ...Learning experiments (GRPO/PPO/DPO), training data mixes and reward signal explorations. Contribute...
Internship
Architect Labs
Palo Alto, CA
4 days ago
Senior Member of Technical Staff - AI Research
$148.5k - $223.9k
...future of Salesforce. Salesforce AI Research is looking for a Machine Learning... ...implement and iterate agentic AI systems with customers. With your strong technical competence, strategic thinking... ...track records, such as LLM, pre/post‑training, RL, agentic system. Prioritizes...
salesforce.com, inc.
Palo Alto, CA
2 days ago
Remote Senior Member of Technical Staff - AI & ML Research
$324k - $396k
Pantera Capital is seeking a Member of Technical Staff in Palo Alto, CA. This role involves introducing innovative techniques in AI, stabilizing large language model training, and working on hands-on technical problems. Candidates should have relevant experience and a Bachelor...
Remote job
Pantera Capital
Palo Alto, CA
15 hours ago
Member of Technical Staff (AI Researcher)
$200k - $300k
...Department AI Perplexity is seeking top-tier AI Research Scientists and Engineers to advance our... ...foundational model capabilities, post-training techniques, building RL infra and... ...with large-scale LLMs and Deep Learning systems Strong programming skills in Python/PyTorch...
Full time
Pantera Capital
Palo Alto, CA
3 days ago
Member of Technical Staff, Finance Research
$180k - $230k
...Job Description Job Description Member of Technical Staff, Finance Research 1 opening $6 - $8/hr Required Skills Financial research... ...at the intersection of large language models, agentic systems, and enterprise finance, with a focus on building rigorous...
Full time
Local area
Remote work
ESRhealthcare and EXEC STAFF RECRUITERS
Palo Alto, CA
1 day ago
Research Member of Technical Staff- Video Generation Modeling
...Full time Department Research At Rhoda AI, we’re building... ...scenarios unseen in training. We work at the... ...learning, robotics, and systems, with a research team... ...levels — from senior to staff — and welcome both research... ...expected to define technical direction and drive...
Full time
Rhoda AI
Palo Alto, CA
4 days ago
Research Member of Technical Staff- Dexterous Manipulation
...from high-performance hardware and robot systems to the infrastructure and state‑of‑the‑... ...cases, made possible by our cutting edge research and end‑to‑end system design. We've... ...contact‑rich manipulation tasks Design training strategies and data collection protocols...
Rhoda AI
Mountain View, CA
3 days ago
Research Member of Technical Staff - Data & Evaluation
...performance hardware and robot systems to the infrastructure... ...by our cutting edge research and end-to-end system... ...levels — from MTS-Staff What You’ll Do... ...Collaborate closely with pre-training and post-training... ...are expected to define technical direction and drive research...
Rhoda AI
Palo Alto, CA
3 days ago
Research Member of Technical Staff- Data Infrastructure
...performance hardware and robot systems to the infrastructure... ...by our cutting edge research and end-to-end system... ...that power our model training data pipeline, from... ...— from senior to staff. What You'll Do Architect... ...are expected to define technical direction and own architectural...
Immediate start
Rhoda AI
Palo Alto, CA
1 day ago
Research Member of Technical Staff- Deployment
...performance hardware and robot systems to the infrastructure... ...by our cutting edge research and end-to-end system... ...— from senior/MTS to staff. This is a customer-... ...Communicate technical findings clearly to both... ...modern ML pipelines: pre-training, fine-tuning, evaluation...
Rhoda ai
Mountain View, CA
15 hours ago
Research Member of Technical Staff- Robot Learning Data
...from high-performance hardware and robot systems to the infrastructure and state-of-the-art... ...cases, made possible by our cutting edge research and end-to-end system design. We've... ...quality of the demonstrations our models train on. What You’ll Do Design and implement...
Rhoda AI
Mountain View, CA
15 hours ago
Principal Member of Technical Staff
$220k - $300k
...We are looking for a Principal Member of Technical Staff to join our engineering organization as a... ...needs into scalable, high-performance systems. This role is ideal for someone who has... ...to open-source security tooling or research. Endor Labs is an Equal Opportunity Employer...
Endor Labs
Palo Alto, CA
1 day ago
Member of Technical Staff, Research Scientist
...purpose world models: causal, multimodal systems that learn to predict and interact... ...for We’re looking for a deeply technical and creative researcher who thrives on invention. You’re motivated... ..., learning objectives, and training paradigms that move beyond today’s diffusion...
Dormont Manufacturing Co
Palo Alto, CA
6 hours ago
Member of Technical Staff - Model Training
$180k
...Description Job Description About xAI xAI's mission is to create AI systems that can accurately understand the universe and aid humanity in... .... You are a power user of AI models. If you previously trained models used by millions of people it's a big plus, but modeling...
Technical training
Temporary work
xAI
Palo Alto, CA
18 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Research Member of Technical Staff- Training Systems. Be the first to apply!