Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior RL Infra Engineer for Self-Improving Agents

$224k - $356.5k

NVIDIA

We’re hiring an engineer to help us bring reinforcement learning to every agent team at NVIDIA. This is a rare chance to shape how autonomous, self‑improving agents learn and evolve across the enterprise. The role sits at the intersection of ML research and production engineering. What if every agent developer could add self‑improvement loops to their workflows without needing deep RL expertise? That’s the challenge here: evaluate emerging approaches, adapt them into enterprise‑ready blueprints, and make them available inside sandboxed execution environments with the security and governance the enterprise demands. We believe the best training and self‑evolving agent platforms come from people with diverse backgrounds and want this person to help us build ours. What you’ll be doing The work splits between creating enterprise‑ready RL capabilities and partnering with agent teams to put them into practice. Building RL cookbooks and environments Evaluate and adapt democratized RL approaches into reusable cookbooks and blueprints so agent developers can integrate self‑improvement loops (GRPO, DPO, PPO, RLAIF) on their own Design verifiable reward environments building on NeMo Gym, extending to domain‑specific environments for internal use cases Operationalize NVIDIA and third‑party training backends as production services inside Sandbox Integrate with NeMo Microservices (Curator, Customizer, Evaluator, Guardrails) to enable end‑to‑end data flywheel workflows for RL Infrastructure, reliability, and collaboration Lead data curation and active learning strategies to continuously improve training data quality Design RL training loops for agent self‑improvement: reward modeling, policy optimization, safety constraints Integrate with AI Factory GPU infrastructure for throughput, data locality, and multi‑node training Build observability for training runs and ensure workloads meet security and governance requirements Collaborate with platform, security, agent infrastructure, and internal customer teams on safe deployment of training outputs What we need to see MS in CS, ML, or related field (or equivalent experience) 10+ years of experience Experience operationalizing fine‑tuning methods (LoRA, SFT) and especially RL techniques (DPO, GRPO, PPO, RLAIF) into reusable cookbooks and self‑service workflows Familiarity with distributed training frameworks (e.g., Megatron, NeMo, DeepSpeed, FSDP, HF Accelerate) and ML ops skills covering pipeline automation, job orchestration, and GPU cluster management are important here Proficiency in Python, Go, Rust, or similar Background in CS, ML, or related field through formal education or equivalent experience Ways to stand out from the crowd Building RL environments or training recipes that other teams consumed as self‑service capabilities Familiarity with NVIDIA infrastructure (DGX, AI Factory, NVLink/InfiniBand), NeMo Microservices, or the evolving RL‑for‑agents ecosystem (rLLM, Agent Lightning, HUD, OpenRLHF, SkyRL) Experience with data curation, active learning, continuous learning loops, or data flywheel architectures also valued Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD – 356,500 USD. You will also be eligible for equity and benefits. Applications for this job will be accepted at least until April 2, 2026. This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes. NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. #J-18808-Ljbffr NVIDIA

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Senior RL Infra Engineer for Self-Improving Agents in Santa Clara, CA vacancy
  • $160.36k - $240.54k

     ...Senior Software Engineer – GenAI Infrastructure & Agent Systems for Engineering Efficiency Mountain View, California...  ...(HQ) Who We Are Nuro is a self-driving technology company on a...  ...knowledge discovery, and ML model improvement. We sit at the intersection of... 
    Senior

    Nuro

    Mountain View, CA
    21 hours ago
  • CoreWeave is seeking a Senior Manager, Observability Engineering to lead a team focused on building and scaling observability systems for metrics, logs...  ...will hire and manage a team while driving platform improvements. This role emphasizes operational ownership and... 
    Senior

    Dormont Manufacturing Co

    Sunnyvale, CA
    4 days ago
  • $136k - $264.5k

    NVIDIA Gruppe is seeking a Senior Design Verification Engineer to enhance efficiency in their High Speed IO engineering teams. This role requires improving verification flows and collaborating with various engineers to deliver high-quality design solutions. The ideal candidate... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $152k - $241.5k

     ...understandings of our sophisticated, distributed infrastructure. As an engineer on our team, you will play a key role in building the next...  ...their workflows and determine observability needs to help improve the overall efficiency of our chip development process. Compose... 
    Senior

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $262k - $365k

    Senior Staff Software Engineer, Infrastructure, Agents Infra Advanced Experience owning outcomes and decision making, solving ambiguous problems and influencing stakeholders; deep expertise in domain. Apply Bachelor’s degree or equivalent practical experience. 8 years... 
    Senior
    Full time

    Google Inc.

    Sunnyvale, CA
    2 days ago
  • $240k - $334k

    Google Inc. is looking for a Senior Staff Technical Program Manager to lead AI Innovation and Research initiatives. In this role, you will...  ...projects and drive the strategic goals associated with self-sustaining AI systems. The ideal candidate will have over 10 years... 
    Senior

    Google Inc.

    Mountain View, CA
    1 day ago
  • NVIDIA Gruppe is looking for an RL Frameworks Engineer in Santa Clara, California, to architect and build scalable RL post-training infrastructure. You will ensure efficient scaling from single GPU experimentation to production across thousands of nodes, while collaborating... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $140k - $224.25k

     ...the brains of computers, robots, and self-driving cars that can understand the world...  ..., creative, and highly motivated Senior SE-MEM Engineer to support high volume, fast ramp data...  ...with MEM suppliers to conduct process improvement efforts involving engineering data examination... 
    Senior

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $224k - $356.5k

     ...brains of computers, robots, and self-driving cars that can...  ...developers to harness always-on agents in secure, efficient environments...  ...quality and velocity of AI engineering practices. If you thrive on...  ...assisted development tools to improve the engineering loop, while applying... 
    Senior

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $184k - $287.5k

     ...We are looking for software engineers to join our development efforts in the area of dense...  ...Finding and realizing opportunities to improve library quality, performance and maintainability...  ...as the brain of computers, robots, and self-driving cars that can perceive and... 
    Senior
    Remote work

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $262k - $365k

    A leading technology company in Sunnyvale, CA seeks a Senior Staff Software Engineer specializing in ML Infrastructure. The role involves designing back-end services and collaborating with AI teams. Candidates should have significant software development experience, particularly... 
    Senior

    Google Inc.

    Sunnyvale, CA
    4 days ago
  • $224k - $356.5k

    NVIDIA Corporation seeks a Diagnostics Software Infrastructure Engineer in Santa Clara, California. The ideal candidate possesses extensive DevOps experience, is proficient with CI/CD tools like Gitlab and Jenkins, and has a solid background in software development. Responsibilities... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  • $224k - $356.5k

    NVIDIA is seeking a Deep Learning Engineer to advance the future of autonomous driving. You will explore and refine top-tier LLM/VLM models to enhance AI systems in real-world applications. The ideal candidate has a Master's or PhD in a related field, alongside substantial... 
    Senior

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $152k - $241.5k

     ...as the brains of computers, robots, and self-driving cars that can understand the world...  ...highly skilled and experienced HPC Cluster Engineer to design, deploy, and operate GPU...  ...scalable automation solutions. Continuously improve infrastructure provisioning, management,... 
    Senior

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $184k - $287.5k

     ...as the brain of computers, robots, and self-driving cars that can perceive and...  ...the world.We are seeking an exceptional Senior Perception Engineer to help design and productize NVIDIA’s...  ....What you’ll be doing: Develop and improve the technical design, architecture, and... 
    Senior

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $152k - $241.5k

    NVIDIA Gruppe is seeking a Senior Infrastructure Software Engineer for Deep Learning Libraries in Santa Clara, California. In this role, you'll design and develop scalable infrastructure for NVIDIA’s deep learning libraries, impacting various platforms, including autonomous... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $112.3k - $181.5k

     ...Job Summary Job Summary As a Senior Customer Trust Engineer, you will join our Governance, Risk &...  ...customer audit program. Maintain and improve internal and external FAQs regarding...  ...responses. Develop and maintain a central self-service portal to improve efficiency... 
    Senior
    Full time
    Work at office
    Visa sponsorship
    Work visa

    Palo Alto Networks

    Santa Clara, CA
    4 days ago
  • $136k - $212.75k

     ...the brains of computers, robots, and self-driving cars that can understand the...  ...join us today. We are now looking for a Senior Validation Engineer in the DGX Server Product Engineering...  ...Track record of continuous process improvement with a passion for tools and automation... 
    Senior

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $136k - $218.5k

     ...computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today,...  .... Promote NVIDIA ideas to advise EDA tool development and improvement to maximize benefit. Review past problem cases, address issues... 
    Senior

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $133k - $254k

     ...moves autonomously through a self‑managing urban transportation...  ...system. Our AI Data Pipeline Engineers build up the core data processing...  ...we deliver could highly improve the efficiency of ML model development...  ..., ML application, and Cloud Infra to align data pipelines with... 
    Senior
    Work experience placement

    42dot

    Sunnyvale, CA
    3 days ago
  • $136k - $218.5k

     ...as the brains of computers, robots, and self‑driving cars that can understand the world...  ...technology. Leverage your expertise to improve timing convergence flows in...  ...equivalent experience) in Electrical or Computer Engineering with 5 years’ experience or MS (or equivalent... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • Apptronik is seeking a Senior Mechanical Engineer to focus on continuous improvement for the Apollo humanoid robot. The ideal candidate will lead design optimization, conduct root cause analyses, and mentor junior engineers. A Bachelor's in Mechanical Engineering is required... 
    Senior

    jobs.frontdoordefense.com - Jobboard

    Mountain View, CA
    2 days ago
  •  ...Senior Electrical Engineer At Omnitron Sensors, our team of award-winning industry veterans is pioneering...  ...simplifies the assembly process to improve the performance of sensor applications...  ...and verbal communication skills Self-motivation, out-of-the-box thinking, and... 
    Senior
    Visa sponsorship

    Omnitron Sensors

    Santa Clara, CA
    3 days ago
  • $136k - $218.5k

     ...seeking an innovative Timing Methodology Engineer to help drive multi‑physics sign‑off strategies...  ...AI applications. What you'll be doing Improve and validate flows for Prime‑Time, Prime‑...  ...physical effects such as aging, self‑heating, thermal impact, IR drop, and more... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $168k - $264.5k

     ...the brains of computers, robots, and self-driving cars that can understand the world...  .... We are now looking for a motivated Senior Timing Engineer (Circuits) to join our dynamic and...  ...highly motivated individual who excels at improving design and timing quality and enjoys... 
    Senior
    Shift work

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $184k - $287.5k

     ...NVIDIA is searching for a highly motivated engineer with a passion for system software...  ...doing: Be responsible for maintaining and improving mission critical infrastructure and test...  ...for us. If you're creative, passionate and self-motivated, we want to hear from you! NVIDIA... 
    Senior

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $140k - $197k

     ...Senior Systems Engineer (Pre-Sales) Arista Networks is an industry leader in data-driven, client...  ...an immediate need for a collaborative, self-motivated Senior Systems Engineer to partner...  ...design guidelines and recommend improvements to customers for the networks they support... 
    Senior
    Work experience placement
    Local area
    Immediate start

    Arista Networks, Inc.

    Santa Clara, CA
    3 days ago
  •  ...NVIDIA, we are seeking exceptional engineers to join our autonomous driving...  ...2.0— teaching an intelligent agent to drive. This next phase...  ...and reinforcement learning—to improve the planning and reasoning capabilities...  ...‑grade ML models for self‑driving, robotics, or related... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $83.9k - $155.7k

     ...technologies and workflows that can be used to improve the lives of people in a wide variety...  .... We are seeking a passionate Senior Systems Engineer to join our Systems Development Group...  ...experiments. The candidate must be self-motivated and able to work well in a multidisciplinary... 
    Senior
    Local area
    Relocation package

    F. Hoffmann-La Roche Ltd

    Santa Clara, CA
    4 days ago
  • $152k - $241.5k

     ...longer science fiction. Today, a self-driving car powered by AI can...  .... The GPU started out as the engine for simulating human...  ...architectures. We are now looking for a Senior Integration Engineer,...  ...Developing efficient mechanisms to improve utilization on computers with... 
    Senior
    Night shift

    NVIDIA

    Santa Clara, CA
    21 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior RL Infra Engineer for Self-Improving Agents. Be the first to apply!