Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

AI Researcher - Reinforcement Learning

$200k - $300k
Full-time

1X

About 1X We’re building humanoid robots that work in home - doing the chores, handling the tasks, and giving people their time back. Simple, but it’s not. To do this right, we have to solve robotics, AI, manufacturing - at the same time, at scale, in a form factor that has to be safe enough to live with your family. If you’re inspired by this, you’ll thrive here. We’ve been at this since 2014 and we’re at the point where the hard problems are behind us and the hard work is in front of us. NEO is our flagship - a home robot designed to move, learn, and operate in the real world alongside real people. We’re not demoing it - we’re shipping it. We’re excited to meet you, if this excites you. If you’ve spent your career working on problems that matter and want to see them actually reach the world - this is that moment. We’re scaling, we’re hiring with intention, and we need people who want to build something that will genuinely change how humans spend their time - safely creating abundance for all. About the Team The Reinforcement Learning team teaches NEO new capabilities, training policies for manipulation and locomotion tasks across simulation and real-world environments, then deploying them into homes. We work at the intersection of algorithm development, sim-to-real transfer, and production deployment: our research is only successful when a policy runs reliably on a physical robot in the field. If you want to directly expand what a humanoid robot can do for people, this is that team. Your Charter Own the full pipeline from RL algorithm development through production deployment—training NEO on manipulation and locomotion tasks in simulation, closing the sim-to-real gap, and shipping policies that work reliably in real-world home environments. This is critical-path work: the range of tasks NEO can perform safely and reliably is a direct function of the quality of RL policies your team ships. You will collaborate closely with hardware, controls, data collection, and QA teams, and measure your impact by what NEO can do in the field. Key Outcomes Train and deploy RL policies for manipulation and locomotion tasks that perform reliably in real-world home environments measured by field task success rates, not just simulation benchmarks Advance sim-to-real transfer techniques that measurably narrow the gap between simulation training performance and real-world policy behavior, enabling faster iteration cycles Build training and evaluation infrastructure that lets the team iterate on policies faster with standardized benchmarks, automated regression detection, and clear connections between training metrics and field performance Partner with hardware, controls, data, and QA teams to ship RL-trained skills to production customer sites, owning the handoff from research to deployment Key Competencies Sim-to-real practitioner closing the sim-to-real gap on physical systems; understands domain randomization, reward shaping, and the engineering required to make simulated policies transfer reliably to real hardware RL algorithms depth with strong foundation in RL algorithms (PPO, SAC, TD-MPC, or similar); can choose the right approach for the task and modify or extend it when standard methods fall short Full-stack ownership owning data engineering, model architecture, and deployment; treats a promising training curve as the beginning of the job, not the end Effective cross-functional partner working closely with hardware, controls, QA, and data teams to translate RL research into deployed robot skills, and communicates technical constraints clearly across disciplines Minimum Requirements Strong Python and/or C++ with experience in large codebases and build tools (Bazel or equivalent) Proficiency with PyTorch for RL policy training and experimentation Hands-on experience with simulation platforms (Isaac Sim, MuJoCo, or equivalent) for policy training at scale Demonstrated experience training RL policies for manipulation or locomotion tasks, including addressing the sim-to-real gap on physical hardware Preferred Skills Experience with model-based RL or world-model-guided policy learning that leverages predictive models to improve sample efficiency Familiarity with imitation learning or learning from demonstration (behavior cloning, GAIL, IQL) as a complement or bootstrap to RL Experience deploying RL-trained policies to physical robots in production environments, including monitoring, failure analysis, and iterative improvement Background in legged locomotion, dexterous manipulation, or contact-rich control for physical systems Compensation Range $200,000 - $300,000 + Equity Benefits Comprehensive medical, dental, and vision coverage Generous paid time off, company holidays, and parental leave 401(k) plan with company match (100% on the first 3% of contributions, 50% on the next 2%) Flexible Spending Accounts (FSA) and Health Savings Accounts (HSA) options Commuter benefits (transit and parking) Short-term and long-term disability, and life insurance Employee Assistance Program (EAP) for mental health, financial, and personal support Onsite snacks and catered lunches Equal Opportunity Employer 1X is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender, gender identity or expression, sexual orientation, national origin, ancestry, citizenship, age, marital status, medical condition, genetic information, disability, military or veteran status, justice system impact, or any other characteristic protected under applicable federal, state, or local law.

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the AI Researcher - Reinforcement Learning in San Carlos, CA vacancy
  •  ...scientist who created a team consisting of AI pioneers, including CTO Jan Chorowski who...  ...(“the T” in ChatGPT) and a key researcher behind OpenAI’s reasoning models. Pathway...  ...with a strong track record in machine learning models research . This is an extremely... 
    Suggested
    Permanent employment
    Full time
    Contract work
    Immediate start
    Remote work
    Flexible hours

    Pathway Genomics

    Palo Alto, CA
    18 hours ago
  • $163.2k - $220k

     ...Natera is seeking a Staff Machine Learning Scientist – Agentic AI to join our AI team, an advanced R...  ...ReAct and Chain-of-Thought, alongside reinforcement fine-tuning (RFT) to ensure agents...  ...or more years of experience in AI research or engineering, with a proven track... 
    Suggested
    Work at office
    Immediate start
    Remote work
    Worldwide

    Natera

    San Carlos, CA
    13 days ago
  • ## Senior Staff Research Scientist, Agentic AI & RLApplylocations: East Palo Alto, CAtime type: Full timeposted on: Posted Todayjob requisition...  ...are hiring a **Staff Research Scientist, Agentic AI & Reinforcement Learning** — a dual-track role designed for exceptional... 
    Suggested

    Centific Global Solutions, Inc.

    Palo Alto, CA
    18 hours ago
  • $140k - $195k

     ...Technical Center About the Team : Our AI Research team, reporting directly to the Chief...  ...is pioneering how cutting-edge machine learning can transform the way vehicles are...  ...applications Experience with reinforcement learning for robotic control or process... 
    Suggested
    Work at office
    Local area
    Work from home
    Relocation package
    Flexible hours

    General Motors

    Mountain View, CA
    4 days ago
  •  ...Machine Learning Applied Researcher Job type: Full Time · Department: AI Research Lab · Work type: Hybrid San Mateo, California, United States About Archetype AI Archetype AI is developing the world’s first AI platform to bring AI into the real world. Formed by an exceptionally... 
    Suggested
    Full time

    Neara

    San Mateo, CA
    4 days ago
  •  ...About Centific Centific is a frontier AI data foundry that curates diverse, high‑quality...  ...‑turn, tool‑using agents with closed learning loops Design reward functions and verifiers...  ...review, engineering standards Mentor researchers and engineers; drive technical direction... 

    Centific Global Solutions, Inc.

    East Palo Alto, CA
    18 hours ago
  • $200k - $300k

     ...Type Full time Location Type Hybrid Department AI Perplexity is seeking top-tier AI Research Scientists and Engineers to advance our AI products...  ...Post-train SOTA LLMs using the latest supervised and reinforcement learning techniques (SFT/DPO/GRPO) Leverage our rich query... 
    Full time

    Pantera Capital

    Palo Alto, CA
    2 days ago
  • At Rhoda AI, we’re building the next generation of generalist...  ...possibly by our cutting edge research and end-to-end system design....  ...performance beyond what imitation learning alone achieves — reward...  ...Required) Hands-on experience with reinforcement learning — reward design,... 
    Shift work

    Rhoda ai

    Palo Alto, CA
    3 days ago
  • $250k - $350k

    About the Lab The 1X World Model Lab is an embodied AI research organization focused on pretraining the foundation models to accelerate the...  ..., infrastructure, and data engines that enable the robot to learn from experience and become increasingly capable in real-world environments... 
    Full time
    Local area

    1X

    San Carlos, CA
    1 day ago
  •  ...across the entire robotics stack. We're training state-of-the-art AI models that leverage our large-scale, high-quality, real-world...  ...more time on the things they value most. As a Machine Learning Research Engineer, you will work on the software and algorithms that enable... 

    Sunday

    Redwood City, CA
    1 day ago
  •  ...Autoscience Institute, we create AI systems that autonomously conduct AI research. Recently, we announced the first...  ...-time on-site role for a Machine Learning Research Scientist located in the...  ...models, including deep learning, reinforcement learning or genetic algorithms. This... 
    Full time
    Flexible hours

    Autoscience

    Menlo Park, CA
    7 days ago
  • $202.8k - $376.6k

     ...That's what makes us Roche. Advances in AI, data, and computational sciences are...  ...transforming drug discovery and development. Roche's Research and Early Development organisations at...  ...Our mission is to develop AI models that learn from biological data at unprecedented... 
    Local area
    Worldwide
    Relocation package

    Genentech

    South San Francisco, CA
    2 days ago
  • $152k - $218.5k

    At Toyota Research Institute (TRI), we’re on a mission to improve the quality of human life...  ...class team advancing the state of the art in AI, robotics, driving, and material sciences...  ...background in CAD Generation and Machine Learning or related fields, with demonstrated... 
    Local area
    Shift work

    Toyota Research Institute

    Los Altos, CA
    2 days ago
  • Toyota Research Institute is looking for a highly motivated postdoctoral researcher for a one-year position focused on Generative AI technologies in automotive product design. The ideal candidate...  ...in CAD Generation and Machine Learning. This role involves collaboration... 

    Toyota Research Institute

    Los Altos, CA
    2 days ago
  • $141.1k - $262.1k

     ...That's what makes us Roche. Advances in AI, data, and computational sciences are transforming...  ...drug discovery and development. Roche's Research and Early Development organisations at...  ...drug discovery with cutting‑edge machine learning (ML) techniques. We are seeking a Machine... 
    Work experience placement
    Local area
    Worldwide
    Relocation package

    NLP PEOPLE

    South San Francisco, CA
    3 days ago
  • $167.4k - $310.8k

     ...That's what makes us Roche. Advances in AI, data, and computational sciences are transforming...  ...drug discovery and development. Roche's Research and Early Development organisations at...  ...drug discovery with cutting‑edge machine learning (ML) techniques. We are seeking a Senior... 
    Local area
    Worldwide
    Relocation package

    NLP PEOPLE

    South San Francisco, CA
    2 days ago
  • Senior Machine Learning Scientist, AI for Drug Discovery (Frontier Research) Aug 6, 2025 Link to job posting: Advances in AI, data, and computational sciences are transforming drug discovery and development. Roche’s Research and Early Development organisations at Genentech... 
    Worldwide
    Relocation package

    The International Society for Bayesian Analysis

    South San Francisco, CA
    4 days ago
  • $168.1k - $312.3k

     ...Responsibilities Develop methods in self‑supervised representation learning and generative modeling. Contribute to publications and present...  ...and external scientific conferences. Collaborate and execute research that pushes the state of the art in machine learning. Directly... 
    Local area
    Worldwide

    F. Hoffmann-La Roche AG

    South San Francisco, CA
    2 days ago
  • $180k - $250k

     ...have an opening in our lab for a senior computer vision researcher, with 3D Reconstruction and Deep Learning expertise, to develop and improve the underlying...  ...a dynamic team of professionals in computer vision, AI, computational photography, AR, graphics, and design.... 
    Full time
    Remote work

    Geomagical Labs

    Palo Alto, CA
    4 days ago
  •  ...powered by a proprietary embodied AI foundation model that...  ...million, and former DeepMind research scientist Jason Ma. The company...  ...frontier of AI‑driven robotics! Learn more at dyna.co Position...  ...learning algorithms, such as reinforcement learning and imitation learning... 
    Temporary work

    Dyna Robotics

    Redwood City, CA
    3 days ago
  •  ...exciting journey. The mission of the Waymo Research team is to develop machine learning solutions addressing open problems in autonomous driving...  ...areas that we are currently focusing on include reinforcement learning, learning from demonstration, generative modeling... 
    Internship
    Summer internship
    Local area

    Waymo

    Mountain View, CA
    18 hours ago
  • $150k - $200k

     ...contribute to our clients' success. About AI & Digital Asset Solutions Financial...  ...is a newly formed, entrepreneurial research team within Franklin Templeton Investment...  ...field such as computer science, machine learning, statistics, mathematics, or a related discipline... 
    Local area

    Franklin Templeton

    San Mateo, CA
    3 days ago
  • $110.8k - $253.4k

     ...and products, accelerating the advancement of gaming technologies worldwide. Responsibilities: Research and analyze the latest advancements in gaming AI technologies, including but not limited to AI-driven 2D/3D content generation, intelligent NPCs, AI-assisted... 
    Worldwide
    Relocation package

    Tencent

    Palo Alto, CA
    1 day ago
  • $263.67k - $322.82k

     ...of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences-...  ...PhD) or equivalent degree in Statistics, Economics, Operations Research, Computer Science, Applied Math, Physics, Engineering, or other... 
    Full time
    Work experience placement
    H1b
    Work at office
    Local area
    Visa sponsorship
    Monday to Friday

    Roblox

    San Mateo, CA
    3 days ago
  • $170k - $220k

     ...Scientist, Machine Learning At Atomic AI, we build artificial intelligence to pioneer new frontiers in drug discovery. Our unique R&D platform...  ...results and present progress to colleagues in regular research meetings. Work within a collaborative, high-caliber, interdisciplinary... 
    Full time

    Atomic AI

    South San Francisco, CA
    3 days ago
  • $273k - $321k

     ...pathway. The Opportunity: We are seeking a Principal Machine Learning Scientist to lead the development of advanced machine learning...  ...early-stage drug discovery. Identify opportunities where AI and advanced analytics can meaningfully improve scientific decision... 
    Full time
    Local area

    REVOLUTION Medicines

    Redwood City, CA
    4 days ago
  • $278.53k - $345.04k

     ...of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences-...  ...at a record pace, we are seeking experienced machine learning researchers who thrive on solving complex challenges and designing scalable... 
    Full time
    Work experience placement
    H1b
    Work at office
    Local area
    Visa sponsorship
    Monday to Friday

    Roblox

    San Mateo, CA
    18 hours ago
  •  ...Machine Learning Scientist Recommender systems (e.g., image and content recommendations...  ...push notifications. Conduct applied research to improve recommender systems using...  ...traditional ML techniques, deep learning and reinforcement learning. Build scalable ML... 
    Work at office
    Remote work
    2 days per week

    Wayfair

    Mountain View, CA
    4 days ago
  • $157k - $188k

     ...We are looking for a mixed-method Vehicle Experience Researcher, which will be the glue between the digital interface and the physical cabin...  ..., and life insurance. We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing... 
    Temporary work
    Relocation package

    Zoox

    Foster, CA
    18 hours ago
  • $130k - $160k

     ...Senior User Researcher Our mission is to open the door to the life-changing impact of education. We build market-leading, online...  ...innovative education companies, we are also at the forefront of AI-first learning, combining artificial intelligence with the latest learning... 
    For contractors
    Work experience placement
    Work at office
    Local area

    Study.com

    Mountain View, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Researcher - Reinforcement Learning. Be the first to apply!