Senior RL Infra Engineer for Self-Improving Agents
$224k - $356.5kNVIDIA
We’re hiring an engineer to help us bring reinforcement learning to every agent team at NVIDIA. This is a rare chance to shape how autonomous, self‑improving agents learn and evolve across the enterprise. The role sits at the intersection of ML research and production engineering. What if every agent developer could add self‑improvement loops to their workflows without needing deep RL expertise? That’s the challenge here: evaluate emerging approaches, adapt them into enterprise‑ready blueprints, and make them available inside sandboxed execution environments with the security and governance the enterprise demands. We believe the best training and self‑evolving agent platforms come from people with diverse backgrounds and want this person to help us build ours. What you’ll be doing The work splits between creating enterprise‑ready RL capabilities and partnering with agent teams to put them into practice. Building RL cookbooks and environments Evaluate and adapt democratized RL approaches into reusable cookbooks and blueprints so agent developers can integrate self‑improvement loops (GRPO, DPO, PPO, RLAIF) on their own Design verifiable reward environments building on NeMo Gym, extending to domain‑specific environments for internal use cases Operationalize NVIDIA and third‑party training backends as production services inside Sandbox Integrate with NeMo Microservices (Curator, Customizer, Evaluator, Guardrails) to enable end‑to‑end data flywheel workflows for RL Infrastructure, reliability, and collaboration Lead data curation and active learning strategies to continuously improve training data quality Design RL training loops for agent self‑improvement: reward modeling, policy optimization, safety constraints Integrate with AI Factory GPU infrastructure for throughput, data locality, and multi‑node training Build observability for training runs and ensure workloads meet security and governance requirements Collaborate with platform, security, agent infrastructure, and internal customer teams on safe deployment of training outputs What we need to see MS in CS, ML, or related field (or equivalent experience) 10+ years of experience Experience operationalizing fine‑tuning methods (LoRA, SFT) and especially RL techniques (DPO, GRPO, PPO, RLAIF) into reusable cookbooks and self‑service workflows Familiarity with distributed training frameworks (e.g., Megatron, NeMo, DeepSpeed, FSDP, HF Accelerate) and ML ops skills covering pipeline automation, job orchestration, and GPU cluster management are important here Proficiency in Python, Go, Rust, or similar Background in CS, ML, or related field through formal education or equivalent experience Ways to stand out from the crowd Building RL environments or training recipes that other teams consumed as self‑service capabilities Familiarity with NVIDIA infrastructure (DGX, AI Factory, NVLink/InfiniBand), NeMo Microservices, or the evolving RL‑for‑agents ecosystem (rLLM, Agent Lightning, HUD, OpenRLHF, SkyRL) Experience with data curation, active learning, continuous learning loops, or data flywheel architectures also valued Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD – 356,500 USD. You will also be eligible for equity and benefits. Applications for this job will be accepted at least until April 2, 2026. This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes. NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. #J-18808-Ljbffr NVIDIA
$160.36k - $240.54k
...Senior Software Engineer – GenAI Infrastructure & Agent Systems for Engineering Efficiency Mountain View, California... ...(HQ) Who We Are Nuro is a self-driving technology company on a... ...knowledge discovery, and ML model improvement. We sit at the intersection of...Senior- CoreWeave is seeking a Senior Manager, Observability Engineering to lead a team focused on building and scaling observability systems for metrics, logs... ...will hire and manage a team while driving platform improvements. This role emphasizes operational ownership and...Senior
$136k - $264.5k
NVIDIA Gruppe is seeking a Senior Design Verification Engineer to enhance efficiency in their High Speed IO engineering teams. This role requires improving verification flows and collaborating with various engineers to deliver high-quality design solutions. The ideal candidate...Senior$152k - $241.5k
...understandings of our sophisticated, distributed infrastructure. As an engineer on our team, you will play a key role in building the next... ...their workflows and determine observability needs to help improve the overall efficiency of our chip development process. Compose...Senior$262k - $365k
Senior Staff Software Engineer, Infrastructure, Agents Infra Advanced Experience owning outcomes and decision making, solving ambiguous problems and influencing stakeholders; deep expertise in domain. Apply Bachelor’s degree or equivalent practical experience. 8 years...SeniorFull time$240k - $334k
Google Inc. is looking for a Senior Staff Technical Program Manager to lead AI Innovation and Research initiatives. In this role, you will... ...projects and drive the strategic goals associated with self-sustaining AI systems. The ideal candidate will have over 10 years...Senior- NVIDIA Gruppe is looking for an RL Frameworks Engineer in Santa Clara, California, to architect and build scalable RL post-training infrastructure. You will ensure efficient scaling from single GPU experimentation to production across thousands of nodes, while collaborating...Senior
$140k - $224.25k
...the brains of computers, robots, and self-driving cars that can understand the world... ..., creative, and highly motivated Senior SE-MEM Engineer to support high volume, fast ramp data... ...with MEM suppliers to conduct process improvement efforts involving engineering data examination...Senior$224k - $356.5k
...brains of computers, robots, and self-driving cars that can... ...developers to harness always-on agents in secure, efficient environments... ...quality and velocity of AI engineering practices. If you thrive on... ...assisted development tools to improve the engineering loop, while applying...Senior$184k - $287.5k
...We are looking for software engineers to join our development efforts in the area of dense... ...Finding and realizing opportunities to improve library quality, performance and maintainability... ...as the brain of computers, robots, and self-driving cars that can perceive and...SeniorRemote work$262k - $365k
A leading technology company in Sunnyvale, CA seeks a Senior Staff Software Engineer specializing in ML Infrastructure. The role involves designing back-end services and collaborating with AI teams. Candidates should have significant software development experience, particularly...Senior$224k - $356.5k
NVIDIA Corporation seeks a Diagnostics Software Infrastructure Engineer in Santa Clara, California. The ideal candidate possesses extensive DevOps experience, is proficient with CI/CD tools like Gitlab and Jenkins, and has a solid background in software development. Responsibilities...Senior$224k - $356.5k
NVIDIA is seeking a Deep Learning Engineer to advance the future of autonomous driving. You will explore and refine top-tier LLM/VLM models to enhance AI systems in real-world applications. The ideal candidate has a Master's or PhD in a related field, alongside substantial...Senior$152k - $241.5k
...as the brains of computers, robots, and self-driving cars that can understand the world... ...highly skilled and experienced HPC Cluster Engineer to design, deploy, and operate GPU... ...scalable automation solutions. Continuously improve infrastructure provisioning, management,...Senior$184k - $287.5k
...as the brain of computers, robots, and self-driving cars that can perceive and... ...the world.We are seeking an exceptional Senior Perception Engineer to help design and productize NVIDIA’s... ....What you’ll be doing: Develop and improve the technical design, architecture, and...Senior$152k - $241.5k
NVIDIA Gruppe is seeking a Senior Infrastructure Software Engineer for Deep Learning Libraries in Santa Clara, California. In this role, you'll design and develop scalable infrastructure for NVIDIA’s deep learning libraries, impacting various platforms, including autonomous...Senior$112.3k - $181.5k
...Job Summary Job Summary As a Senior Customer Trust Engineer, you will join our Governance, Risk &... ...customer audit program. Maintain and improve internal and external FAQs regarding... ...responses. Develop and maintain a central self-service portal to improve efficiency...SeniorFull timeWork at officeVisa sponsorshipWork visa$136k - $212.75k
...the brains of computers, robots, and self-driving cars that can understand the... ...join us today. We are now looking for a Senior Validation Engineer in the DGX Server Product Engineering... ...Track record of continuous process improvement with a passion for tools and automation...Senior$136k - $218.5k
...computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today,... .... Promote NVIDIA ideas to advise EDA tool development and improvement to maximize benefit. Review past problem cases, address issues...Senior$133k - $254k
...moves autonomously through a self‑managing urban transportation... ...system. Our AI Data Pipeline Engineers build up the core data processing... ...we deliver could highly improve the efficiency of ML model development... ..., ML application, and Cloud Infra to align data pipelines with...SeniorWork experience placement$136k - $218.5k
...as the brains of computers, robots, and self‑driving cars that can understand the world... ...technology. Leverage your expertise to improve timing convergence flows in... ...equivalent experience) in Electrical or Computer Engineering with 5 years’ experience or MS (or equivalent...Senior- Apptronik is seeking a Senior Mechanical Engineer to focus on continuous improvement for the Apollo humanoid robot. The ideal candidate will lead design optimization, conduct root cause analyses, and mentor junior engineers. A Bachelor's in Mechanical Engineering is required...Senior
- ...Senior Electrical Engineer At Omnitron Sensors, our team of award-winning industry veterans is pioneering... ...simplifies the assembly process to improve the performance of sensor applications... ...and verbal communication skills Self-motivation, out-of-the-box thinking, and...SeniorVisa sponsorship
$136k - $218.5k
...seeking an innovative Timing Methodology Engineer to help drive multi‑physics sign‑off strategies... ...AI applications. What you'll be doing Improve and validate flows for Prime‑Time, Prime‑... ...physical effects such as aging, self‑heating, thermal impact, IR drop, and more...Senior$168k - $264.5k
...the brains of computers, robots, and self-driving cars that can understand the world... .... We are now looking for a motivated Senior Timing Engineer (Circuits) to join our dynamic and... ...highly motivated individual who excels at improving design and timing quality and enjoys...SeniorShift work$184k - $287.5k
...NVIDIA is searching for a highly motivated engineer with a passion for system software... ...doing: Be responsible for maintaining and improving mission critical infrastructure and test... ...for us. If you're creative, passionate and self-motivated, we want to hear from you! NVIDIA...Senior$140k - $197k
...Senior Systems Engineer (Pre-Sales) Arista Networks is an industry leader in data-driven, client... ...an immediate need for a collaborative, self-motivated Senior Systems Engineer to partner... ...design guidelines and recommend improvements to customers for the networks they support...SeniorWork experience placementLocal areaImmediate start- ...NVIDIA, we are seeking exceptional engineers to join our autonomous driving... ...2.0— teaching an intelligent agent to drive. This next phase... ...and reinforcement learning—to improve the planning and reasoning capabilities... ...‑grade ML models for self‑driving, robotics, or related...Senior
$83.9k - $155.7k
...technologies and workflows that can be used to improve the lives of people in a wide variety... .... We are seeking a passionate Senior Systems Engineer to join our Systems Development Group... ...experiments. The candidate must be self-motivated and able to work well in a multidisciplinary...SeniorLocal areaRelocation package$152k - $241.5k
...longer science fiction. Today, a self-driving car powered by AI can... .... The GPU started out as the engine for simulating human... ...architectures. We are now looking for a Senior Integration Engineer,... ...Developing efficient mechanisms to improve utilization on computers with...SeniorNight shift
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior RL Infra Engineer for Self-Improving Agents. Be the first to apply!
- commissioning agent Santa Clara, CA
- remote chat agent Santa Clara, CA
- airport agent Santa Clara, CA
- agent Santa Clara, CA
- sourcing agent Santa Clara, CA
- executive protection agent Santa Clara, CA
- import export agent Santa Clara, CA
- state farm agent Santa Clara, CA
- cruise agent Santa Clara, CA
- senior game producer Santa Clara, CA

