Research Engineer - Reinforcement Learning (RL) Systems & Infrastructure (Seed Infra)

$244.8k

ByteDance

Responsibilitie

About the Team The Seed Infrastructures team oversees the distributed training, reinforcement learning framework, high-performance inference, and heterogeneous hardware compilation technologies for AI foundation models. Responsibilities - Design and build end-to-end reinforcement learning (RL) systems for large-scale models, covering rollout, training, evaluation, and deployment pipelines. - Develop scalable and fault-tolerant RL infrastructure that operates efficiently under dynamic workloads and heterogeneous compute environments. - Optimize distributed training performance across GPU clusters, improving throughput, resource utilization, and system stability. - Collaborate with cross-team researchers on targeted system-algorithm co-design to translate research ideas into robust, production-grade implementations. - Build tooling, monitoring, and debugging frameworks to ensure reliability and observability of large-scale RL training systems.

Qualification

Minimum Qualifications: - Strong background in distributed systems, large-scale ML systems, or deep learning infrastructure - Experience building or optimizing large-scale training systems (e.g., RL, LLM, multimodal models) - Solid engineering skills in Python/C++ and familiarity with modern ML stacks (PyTorch, distributed training frameworks, etc.) - Experience with GPU optimization, parallelism strategies, and system-level performance tuning - Understanding of reinforcement learning workflows (rollout, policy update, evaluation loops) Preferred Qualifications: - Experience with large-scale agent systems - Familiarity with system design under heterogeneous or dynamic workloads - Exposure to RL + LLM training or post-training pipeline

Job Information

[For Pay Transparency]Compensation Description (Annually)

The base salary range for this position in the selected city is $244800 - $450000 annually.

Compensation may vary outside of this range depending on a number of factors, including a candidate's qualifications, skills, competencies and experience, and location. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units.

Benefits may vary depending on the nature of employment and the country work location. Employees have day one access to medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short-term and long-term disability coverage, life insurance, wellbeing benefits, among others. Employees also receive 10 paid holidays per year, 10 paid sick days per year and 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure).

The Company reserves the right to modify or change these benefits programs at any time, with or without notice.

For Los Angeles County (unincorporated) Candidates:

Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state, and local laws including the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act. Our company believes that criminal history may have a direct, adverse and negative relationship on the following job duties, potentially resulting in the withdrawal of the conditional offer of employment:

1. Interacting and occasionally having unsupervised contact with internal/external clients and/or colleagues;

2. Appropriately handling and managing confidential information including proprietary and trade secret information and access to information technology systems; and

3. Exercising sound judgment.

About Doubao (Seed)

Established in 2023, the ByteDance Seed team is dedicated to pioneering new paths toward artificial general intelligence. We aspire to advance the frontier of intelligence to drive progress for both technology and society.

With a long-term vision for the AI sector, the Seed team's research spans MLLM, GenMedia, AI for Science, and Robotics. We maintain a global presence with laboratories and career opportunities across China, Singapore, and the United States. To date, we have launched industry-leading general foundation models and cutting-edge multimodal capabilities. Our technology powers over 50 application scenarios - including Doubao, Jimeng, TRAE, Dola and Dreamnia - and serves enterprise customers through Volcano Engine and BytePlus. Third-party data shows that the Doubao App ranks first in user volume in the Chinese market, while Doubao foundation models lead the industry in average daily token consumption.

Why Join ByteDance

Inspiring creativity is at the core of ByteDance's mission. Our innovative products are built to help people authentically express themselves, discover and connect - and our global, diverse teams make that possible. Together, we create value for our communities, inspire creativity and enrich life - a mission we work towards every day.

As ByteDancers, we strive to do great things with great people. We lead with curiosity, humility, and a desire to make impact in a rapidly growing tech company. By constantly iterating and fostering an "Always Day 1" mindset, we achieve meaningful breakthroughs for ourselves, our Company, and our users. When we create and grow together, the possibilities are limitless. Join us.

Diversity & Inclusion

ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

Reasonable Accommodation

ByteDance is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at

Apply

Vacancy posted 3 days ago

Similar jobs that could be interesting for youBased on the Research Engineer - Reinforcement Learning (RL) Systems & Infrastructure (Seed Infra) in San Jose, CA vacancy

Research Engineer - Multimodal Training Infrastructure (Seed Infra)
$244.8k
...Research Engineer – Multimodal Training Infrastructure (Seed Infra) Location: San Jose Team: Technology Employment... ...oversees the distributed training, reinforcement learning framework, high-performance... ...GPU clusters Investigate system reliability and resilience...
Suggested
Temporary work
Local area
ByteDance
San Jose, CA
2 days ago
Research Engineer - LLM Training Infrastructure - Seed Infra
$244.8k
...Team Information: The Seed Infrastructures team oversees the... ...training, reinforcement learning framework, high-performance... ...Responsibilities - Conduct research and development on... ...- Investigate system reliability and... ...leading or mentoring engineering teams or cross-functional...
Suggested
Temporary work
Local area
ByteDance
San Jose, CA
17 hours ago
Research Engineer - Reinforcement Learning, Self-Driving
$126k - $423k
...Research Engineer Applied Intuition, Inc. is... ...creating the digital infrastructure needed to bring... ..., operating systems, and autonomy.... ...tools and infra, researchers can... ...contribute to and learn from best practices... ...research on reinforcement learning (RL) and its...
Suggested
Full time
For contractors
For subcontractor
Casual work
Work at office
Immediate start
Remote work
Day shift
Applied Intuition
Sunnyvale, CA
4 days ago
Research Engineer - AI/RL Infrastructure
$126k - $423k
...Research Engineer (AI/RL Infrastructure) Applied Intuition, Inc. is powering the future... ...infrastructure, operating systems, and autonomy. Eighteen of... ...industry-leading tools and infra, researchers can access millions... ...will contribute to and learn from best practices in the...
Suggested
Full time
For contractors
For subcontractor
Casual work
Work at office
Immediate start
Remote work
Day shift
Applied Intuition
Sunnyvale, CA
4 days ago
Senior Staff Research Engineer - Reinforcement Learning for AI Agents
$244.14k - $413.16k
...Senior Staff Research Engineer – Reinforcement Learning for AI Agents Santa Clara, CA... ...Scientists to design learning systems that allow agents to plan... ...built on top of our agent infrastructure platform. Evaluation... ...Experience implementing RL algorithms such as PPO, Actor...
Suggested
Full time
XPENG
Santa Clara, CA
4 days ago
Research Scientist/ Research Engineer
$160k - $350k
Collinear is a research-focused AI... ...building systems that make intelligent... ...-training, RL,... ...trusted to learn fast, ask questions... ...scientists, and engineering leads High-... ..., scalable infrastructure that moves... ...experience with Reinforcement Learning (... ...: At a seed-backed startup...
Full time
Work at office
Local area
Immediate start
Relocation package
Flexible hours
Collinear AI, Inc.
Sunnyvale, CA
5 days ago
Senior / Staff AI Research Engineer, Data Infrastructure
...Senior / Staff AI Research Engineer, Data Infrastructure Milpitas, CA... ...build the data and learning engine behind RoboForce... ...Develop post-SFT reinforcement learning infrastructure... ...and retrieval systems for heterogeneous robot... ...reward modeling, or RL training loops (e.g...
Work at office
Visa sponsorship
RoboForce
Milpitas, CA
4 days ago
Research Engineer Intern (Ads ML Infra) - 2026 Start (PhD)
$60 per hour
...About the team The ads system at TikTok operates on... ...data, and machine learning is widely used to improve... ...efficient and stable infrastructure to collect and... ...enforcement, including feature engineering, feature store,... ...to our products and research, and to the organization...
Hourly pay
Summer work
Internship
Local area
Tik Tok
San Jose, CA
4 days ago
Research Engineer L5 - Machine Learning Efficiency
$100k
...for an exceptional applied research engineer to help us develop the technology... ...skills, a love of learning, a passion for solving problems... ...in training and serving infrastructure for LLMs and other large foundation... ...web-scale distributed systems. Experience in applied...
Hourly pay
Full time
Immediate start
Flexible hours
Netflix
Los Gatos, CA
5 days ago
HPE Labs - Principal AI and Machine Learning Research Engineer
* Synthesize research directions and vision based on... ...) via simulators or system coding/analysis. Research... ...** in Electrical Engineering, Computer Science, Data... ...in AI & Machine learning ( academic or industrial... ...enabled, mobile-friendly infrastructure. Many rely on a...
Work experience placement
Local area
Hewlett Packard Enterprise Development LP
Milpitas, CA
1 day ago
Software Engineer/Researcher, AI-Native Database Systems
$156k - $387.6k
...Software Engineer/Researcher, AI-Native Database Systems Location: San Jose Team: Infrastructure Employment Type: Regular Job Code: A16... ...backends. Drive innovations in learned index structures, self-... ..., and ANN indexing / AI infra or model-serving infrastructure...
Temporary work
Local area
ByteDance
San Jose, CA
5 days ago
Senior Research Engineer, Foundation Model Training Infrastructure
$224k - $356.5k
...a senior or principal engineer who specializes in building cutting‑edge infrastructure for large‑scale foundation... ...Embodied Agent Research (GEAR) group. Our team... ...models, large-scale robot learning, embodied AI, and... ...distributed training systems to support multi-modal...
Full time
NVIDIA Gruppe
Santa Clara, CA
4 days ago
Senior Research Engineer, Training Data Infrastructure in Foundation Models
Senior Research Engineer, Training Data Infrastructure in Foundation Models Cupertino, California, United States - Software and... ...depends on a data-centric approach to Machine Learning. Our objective is to engineer a large-scale system that acquires,ликಿತ್ರ, processes, and...
Apple Inc.
Cupertino, CA
5 days ago
Research Engineer
...Research Engineer, Foundation Models About the Opportunity... ...of large-scale AI systems. This role sits at... ...-of-the-art machine learning models. You will... ...datasets and training infrastructure to experimenting with... ...language models, reinforcement learning, long-context...
Visa sponsorship
Relocation package
Flexible hours
Acceler8 Talent
Santa Clara, CA
4 days ago
Senior Research Engineer - Foundation Models, Ads Integrity
$212.8k - $387.6k
...for strong machine learning engineers who are excited to grow... ...understanding system for TikTok monetization... ...seeking sophisticated Research Engineers who are... ...product, engineering, and infrastructure teams to... ...- Familiarity with reinforcement learning for autonomous...
Temporary work
Local area
Flexible hours
Shift work
Tik Tok
San Jose, CA
3 days ago
Senior Research Engineer, Controls
$150k - $200k
...growing teams. As a Research Engineer, you will deliver... ...development of machine-learning vehicle models and learning... ...real-world autonomy system challenges by... ...based approaches (DL/RL/IL) Work cross-functionally... ...Develop tools and infrastructure for dataset...
PlusAI, Inc.
Santa Clara, CA
3 days ago
Research Engineer (L5) - Growth and Commerce
$170k
...Data Science & Engineering team plays a critical... ...& machine learning to develop compelling... ...As a Senior Research Engineer, you will... ...optimization systems ML for Growth... ...development of better infrastructure for developing... ...causal ML or reinforcement learning is a plus...
Hourly pay
Full time
Immediate start
Remote work
Flexible hours
Netflix
Los Gatos, CA
5 days ago
Research Engineer/Scientist (all levels), Efficient Models
$212.8k - $387.6k
...The Vision-Applied Research team focuses on applied... ...for a Research Engineer / Scientist who can... ...developing methods and infrastructure for transferring capabilities... ...model design; or reinforcement learning training methods (e.... ...technology systems; and 3. Exercising...
Temporary work
Local area
Tik Tok
San Jose, CA
18 hours ago
Sr. Research Engineer/Scientist (all levels), World Models
$244.8k
...The Vision-Applied Research team focuses on... ...looking for Research Engineer / Scientists who... ...methods and infrastructure to train large-scale... ..., Machine Learning, or equivalent experience... ...simulation; or reinforcement learning for agentic... ...technology systems; and 3. Exercising...
Temporary work
Local area
Tik Tok
San Jose, CA
4 days ago
Senior Research Engineer - Autonomous Vehicles
Overview We are recruiting top research engineers in the Autonomous... ...intelligence topics, such as deep learning, reinforcement learning, and generative... ...and optimize simulation infrastructure (based on GPU-accelerated... ...distributed training systems with frameworks like...
Full time
NVIDIA Gruppe
Santa Clara, CA
4 days ago
Research Engineer - RL for Autonomous Driving
$126k - $423k
Decisive Point is seeking a passionate Research Engineer to join our Research Group focused on advancing physical AI and autonomy. Candidates should possess a strong background in reinforcement learning along with hands-on experience in applied engineering and research....
Decisive Point
Sunnyvale, CA
15 hours ago
Research Engineer / Scientist -AI for Databases
$156k - $387.6k
...Responsibilitie About the Team The Infrastructure System Lab is a hybrid research and engineering team dedicated to building the... ...optimization using machine learning, LLM-based developer tools... .... - Familiarity with LLM, reinforcement learning, neural architecture...
Temporary work
Local area
ByteDance
San Jose, CA
18 hours ago
Research Engineer (L5) - Growth and Commerce
$170k
Senior Research Engineer - Growth & Commerce Data Science... ...experimentation and machine learning. As a Senior... ...experience optimization systems and innovate on... ...development of better infrastructure for developing and deploying... ...with causal ML or reinforcement learning is a plus....
Hourly pay
Full time
Immediate start
Remote work
Flexible hours
Netflix, Inc.
Los Gatos, CA
4 days ago
Research Engineer, Machine Learning L6
$100k
...experiences. We are looking for exceptional applied machine learning engineers to help us develop technology to power future member... ...closely with cross‑functional partners to translate research findings into production systems. Qualifications Solid software development skills...
Hourly pay
Full time
Immediate start
Flexible hours
Netflix, Inc.
Los Gatos, CA
4 days ago
Machine Learning Research Engineer
$2,000 per month
...Machine Learning Research Engineer Cupertino, CA Etched is building AI chips that are hard-coded for individual model architectures. Our... ...organizations Strong Candidates May Also Have ML Systems Research and HW Co-design backgrounds Published...
Work at office
Relocation package
ETCHED LLC
Cupertino, CA
1 day ago
HPE Labs - Principal AI and Machine Learning Research Engineer
...HPE Labs - Principal AI and Machine Learning Research Engineer This role has been designed as 'Onsite' with an expectation that you will primarily... ...HPE's broader innovation community. Excellent software systems building skills are a significant plus. Role and...
Work experience placement
Work at office
Immediate start
Hewlett Packard Enterprise
Milpitas, CA
3 days ago
Senior Research Engineer, Microfluidic Systems, R&D
$87.5k - $162.5k
...use. By accelerating clinical research, streamlining workflows, and... ...in, result-out workflow. Our Systems Technology Group, a part of... ...team-oriented Senior Research Engineer to join the effort. In this role... ...a strong desire to learn and be challenged. Preferred...
Local area
Relocation package
F. Hoffmann-La Roche Gruppe
Santa Clara, CA
1 day ago
Machine Learning Research Engineer, Input Experience, Apple Intelligence
$147.4k - $272.1k
Cupertino, California, United States Machine Learning and AI Apple is where individual... ...seeking an outstanding Machine Learning Engineer to help build our next generation of products... ...other applications of machine learning systems. Minimum Qualifications PhD or MSc in...
Relocation
Apple
Cupertino, CA
4 days ago
Senior Machine Learning Research Engineer, NLP, Input Experience
$181.1k - $318.4k
Senior Machine Learning Research Engineer, NLP, Input Experience Cupertino, California, United States Machine Learning and AI Our team’s mission... .... Additionally, you’ll implement the building blocks and infrastructure that integrate these innovations into our production...
Relocation
Apple
Cupertino, CA
5 days ago
Research Engineer - Special Projects
$147.4k - $272.1k
...California, United States Machine Learning and AI We're seeking research engineers to build infrastructure for breakthrough innovations in AI agents, reinforcement learning, and simulation... ...quality data pipelines, simulation systems, and tooling that enable cutting...
Relocation
Apple Inc.
Santa Clara, CA
2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Research Engineer - Reinforcement Learning (RL) Systems & Infrastructure (Seed Infra). Be the first to apply!