Research Engineer - Reinforcement Learning (RL) Systems & Infrastructure (Seed Infra)
$244.8kByteDance
Responsibilitie
About the Team The Seed Infrastructures team oversees the distributed training, reinforcement learning framework, high-performance inference, and heterogeneous hardware compilation technologies for AI foundation models. Responsibilities - Design and build end-to-end reinforcement learning (RL) systems for large-scale models, covering rollout, training, evaluation, and deployment pipelines. - Develop scalable and fault-tolerant RL infrastructure that operates efficiently under dynamic workloads and heterogeneous compute environments. - Optimize distributed training performance across GPU clusters, improving throughput, resource utilization, and system stability. - Collaborate with cross-team researchers on targeted system-algorithm co-design to translate research ideas into robust, production-grade implementations. - Build tooling, monitoring, and debugging frameworks to ensure reliability and observability of large-scale RL training systems. Qualification Minimum Qualifications: - Strong background in distributed systems, large-scale ML systems, or deep learning infrastructure - Experience building or optimizing large-scale training systems (e.g., RL, LLM, multimodal models) - Solid engineering skills in Python/C++ and familiarity with modern ML stacks (PyTorch, distributed training frameworks, etc.) - Experience with GPU optimization, parallelism strategies, and system-level performance tuning - Understanding of reinforcement learning workflows (rollout, policy update, evaluation loops) Preferred Qualifications: - Experience with large-scale agent systems - Familiarity with system design under heterogeneous or dynamic workloads - Exposure to RL + LLM training or post-training pipeline Job Information [For Pay Transparency]Compensation Description (Annually) The base salary range for this position in the selected city is $244800 - $450000 annually.Compensation may vary outside of this range depending on a number of factors, including a candidate's qualifications, skills, competencies and experience, and location. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units.
Benefits may vary depending on the nature of employment and the country work location. Employees have day one access to medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short-term and long-term disability coverage, life insurance, wellbeing benefits, among others. Employees also receive 10 paid holidays per year, 10 paid sick days per year and 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure).
The Company reserves the right to modify or change these benefits programs at any time, with or without notice.
For Los Angeles County (unincorporated) Candidates:
Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state, and local laws including the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act. Our company believes that criminal history may have a direct, adverse and negative relationship on the following job duties, potentially resulting in the withdrawal of the conditional offer of employment:
1. Interacting and occasionally having unsupervised contact with internal/external clients and/or colleagues;
2. Appropriately handling and managing confidential information including proprietary and trade secret information and access to information technology systems; and
3. Exercising sound judgment. About Doubao (Seed) Established in 2023, the ByteDance Seed team is dedicated to pioneering new paths toward artificial general intelligence. We aspire to advance the frontier of intelligence to drive progress for both technology and society. With a long-term vision for the AI sector, the Seed team's research spans MLLM, GenMedia, AI for Science, and Robotics. We maintain a global presence with laboratories and career opportunities across China, Singapore, and the United States. To date, we have launched industry-leading general foundation models and cutting-edge multimodal capabilities. Our technology powers over 50 application scenarios - including Doubao, Jimeng, TRAE, Dola and Dreamnia - and serves enterprise customers through Volcano Engine and BytePlus. Third-party data shows that the Doubao App ranks first in user volume in the Chinese market, while Doubao foundation models lead the industry in average daily token consumption. Why Join ByteDance Inspiring creativity is at the core of ByteDance's mission. Our innovative products are built to help people authentically express themselves, discover and connect - and our global, diverse teams make that possible. Together, we create value for our communities, inspire creativity and enrich life - a mission we work towards every day.
As ByteDancers, we strive to do great things with great people. We lead with curiosity, humility, and a desire to make impact in a rapidly growing tech company. By constantly iterating and fostering an "Always Day 1" mindset, we achieve meaningful breakthroughs for ourselves, our Company, and our users. When we create and grow together, the possibilities are limitless. Join us.
Diversity & Inclusion
ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too. Reasonable Accommodation ByteDance is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at
Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Research Engineer - Reinforcement Learning (RL) Systems & Infrastructure (Seed Infra) in San Jose, CA vacancy
$244.8k
...Research Engineer – Multimodal Training Infrastructure (Seed Infra) Location: San Jose Team: Technology Employment... ...oversees the distributed training, reinforcement learning framework, high-performance... ...GPU clusters Investigate system reliability and resilience...SuggestedTemporary workLocal area$244.8k
...Team Information: The Seed Infrastructures team oversees the... ...training, reinforcement learning framework, high-performance... ...Responsibilities - Conduct research and development on... ...- Investigate system reliability and... ...leading or mentoring engineering teams or cross-functional...SuggestedTemporary workLocal area$126k - $423k
...Research Engineer Applied Intuition, Inc. is... ...creating the digital infrastructure needed to bring... ..., operating systems, and autonomy.... ...tools and infra, researchers can... ...contribute to and learn from best practices... ...research on reinforcement learning (RL) and its...SuggestedFull timeFor contractorsFor subcontractorCasual workWork at officeImmediate startRemote workDay shift$126k - $423k
...Research Engineer (AI/RL Infrastructure) Applied Intuition, Inc. is powering the future... ...infrastructure, operating systems, and autonomy. Eighteen of... ...industry-leading tools and infra, researchers can access millions... ...will contribute to and learn from best practices in the...SuggestedFull timeFor contractorsFor subcontractorCasual workWork at officeImmediate startRemote workDay shift$244.14k - $413.16k
...Senior Staff Research Engineer – Reinforcement Learning for AI Agents Santa Clara, CA... ...Scientists to design learning systems that allow agents to plan... ...built on top of our agent infrastructure platform. Evaluation... ...Experience implementing RL algorithms such as PPO, Actor...SuggestedFull time$160k - $350k
Collinear is a research-focused AI... ...building systems that make intelligent... ...-training, RL,... ...trusted to learn fast, ask questions... ...scientists, and engineering leads High-... ..., scalable infrastructure that moves... ...experience with Reinforcement Learning (... ...: At a seed-backed startup...Full timeWork at officeLocal areaImmediate startRelocation packageFlexible hours- ...Senior / Staff AI Research Engineer, Data Infrastructure Milpitas, CA... ...build the data and learning engine behind RoboForce... ...Develop post-SFT reinforcement learning infrastructure... ...and retrieval systems for heterogeneous robot... ...reward modeling, or RL training loops (e.g...Work at officeVisa sponsorship
$60 per hour
...About the team The ads system at TikTok operates on... ...data, and machine learning is widely used to improve... ...efficient and stable infrastructure to collect and... ...enforcement, including feature engineering, feature store,... ...to our products and research, and to the organization...Hourly paySummer workInternshipLocal area$100k
...for an exceptional applied research engineer to help us develop the technology... ...skills, a love of learning, a passion for solving problems... ...in training and serving infrastructure for LLMs and other large foundation... ...web-scale distributed systems. Experience in applied...Hourly payFull timeImmediate startFlexible hours- * Synthesize research directions and vision based on... ...) via simulators or system coding/analysis. Research... ...** in Electrical Engineering, Computer Science, Data... ...in AI & Machine learning ( academic or industrial... ...enabled, mobile-friendly infrastructure. Many rely on a...Work experience placementLocal area
$156k - $387.6k
...Software Engineer/Researcher, AI-Native Database Systems Location: San Jose Team: Infrastructure Employment Type: Regular Job Code: A16... ...backends. Drive innovations in learned index structures, self-... ..., and ANN indexing / AI infra or model-serving infrastructure...Temporary workLocal area$224k - $356.5k
...a senior or principal engineer who specializes in building cutting‑edge infrastructure for large‑scale foundation... ...Embodied Agent Research (GEAR) group. Our team... ...models, large-scale robot learning, embodied AI, and... ...distributed training systems to support multi-modal...Full time- Senior Research Engineer, Training Data Infrastructure in Foundation Models Cupertino, California, United States - Software and... ...depends on a data-centric approach to Machine Learning. Our objective is to engineer a large-scale system that acquires,ликಿತ್ರ, processes, and...
- ...Research Engineer, Foundation Models About the Opportunity... ...of large-scale AI systems. This role sits at... ...-of-the-art machine learning models. You will... ...datasets and training infrastructure to experimenting with... ...language models, reinforcement learning, long-context...Visa sponsorshipRelocation packageFlexible hours
$212.8k - $387.6k
...for strong machine learning engineers who are excited to grow... ...understanding system for TikTok monetization... ...seeking sophisticated Research Engineers who are... ...product, engineering, and infrastructure teams to... ...- Familiarity with reinforcement learning for autonomous...Temporary workLocal areaFlexible hoursShift work$150k - $200k
...growing teams. As a Research Engineer, you will deliver... ...development of machine-learning vehicle models and learning... ...real-world autonomy system challenges by... ...based approaches (DL/RL/IL) Work cross-functionally... ...Develop tools and infrastructure for dataset...$170k
...Data Science & Engineering team plays a critical... ...& machine learning to develop compelling... ...As a Senior Research Engineer, you will... ...optimization systems ML for Growth... ...development of better infrastructure for developing... ...causal ML or reinforcement learning is a plus...Hourly payFull timeImmediate startRemote workFlexible hours$212.8k - $387.6k
...The Vision-Applied Research team focuses on applied... ...for a Research Engineer / Scientist who can... ...developing methods and infrastructure for transferring capabilities... ...model design; or reinforcement learning training methods (e.... ...technology systems; and 3. Exercising...Temporary workLocal area$244.8k
...The Vision-Applied Research team focuses on... ...looking for Research Engineer / Scientists who... ...methods and infrastructure to train large-scale... ..., Machine Learning, or equivalent experience... ...simulation; or reinforcement learning for agentic... ...technology systems; and 3. Exercising...Temporary workLocal area- Overview We are recruiting top research engineers in the Autonomous... ...intelligence topics, such as deep learning, reinforcement learning, and generative... ...and optimize simulation infrastructure (based on GPU-accelerated... ...distributed training systems with frameworks like...Full time
$126k - $423k
Decisive Point is seeking a passionate Research Engineer to join our Research Group focused on advancing physical AI and autonomy. Candidates should possess a strong background in reinforcement learning along with hands-on experience in applied engineering and research....$156k - $387.6k
...Responsibilitie About the Team The Infrastructure System Lab is a hybrid research and engineering team dedicated to building the... ...optimization using machine learning, LLM-based developer tools... .... - Familiarity with LLM, reinforcement learning, neural architecture...Temporary workLocal area$170k
Senior Research Engineer - Growth & Commerce Data Science... ...experimentation and machine learning. As a Senior... ...experience optimization systems and innovate on... ...development of better infrastructure for developing and deploying... ...with causal ML or reinforcement learning is a plus....Hourly payFull timeImmediate startRemote workFlexible hours$100k
...experiences. We are looking for exceptional applied machine learning engineers to help us develop technology to power future member... ...closely with cross‑functional partners to translate research findings into production systems. Qualifications Solid software development skills...Hourly payFull timeImmediate startFlexible hours$2,000 per month
...Machine Learning Research Engineer Cupertino, CA Etched is building AI chips that are hard-coded for individual model architectures. Our... ...organizations Strong Candidates May Also Have ML Systems Research and HW Co-design backgrounds Published...Work at officeRelocation package- ...HPE Labs - Principal AI and Machine Learning Research Engineer This role has been designed as 'Onsite' with an expectation that you will primarily... ...HPE's broader innovation community. Excellent software systems building skills are a significant plus. Role and...Work experience placementWork at officeImmediate start
$87.5k - $162.5k
...use. By accelerating clinical research, streamlining workflows, and... ...in, result-out workflow. Our Systems Technology Group, a part of... ...team-oriented Senior Research Engineer to join the effort. In this role... ...a strong desire to learn and be challenged. Preferred...Local areaRelocation package$147.4k - $272.1k
Cupertino, California, United States Machine Learning and AI Apple is where individual... ...seeking an outstanding Machine Learning Engineer to help build our next generation of products... ...other applications of machine learning systems. Minimum Qualifications PhD or MSc in...Relocation$181.1k - $318.4k
Senior Machine Learning Research Engineer, NLP, Input Experience Cupertino, California, United States Machine Learning and AI Our team’s mission... .... Additionally, you’ll implement the building blocks and infrastructure that integrate these innovations into our production...Relocation$147.4k - $272.1k
...California, United States Machine Learning and AI We're seeking research engineers to build infrastructure for breakthrough innovations in AI agents, reinforcement learning, and simulation... ...quality data pipelines, simulation systems, and tooling that enable cutting...Relocation
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Research Engineer - Reinforcement Learning (RL) Systems & Infrastructure (Seed Infra). Be the first to apply!
Related searches
- ai research engineer San Jose, CA
- research engineer San Jose, CA
- research programmer San Jose, CA
- deep learning research engineer San Jose, CA
- research software engineer San Jose, CA
- senior research engineer San Jose, CA
- system engineer contract San Jose, CA
- application system engineer San Jose, CA
- senior windows systems engineer San Jose, CA
- lead system engineer San Jose, CA


