Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Reinforcement Learning Engineer

$100k - $150k
Full-time

Bright Vision Technologies

Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations. We leverage cutting-edge technologies to create scalable, secure, and user-friendly applications.

As we continue to grow, we’re looking for a skilled Reinforcement Learning Engineer to join our dynamic team and contribute to our mission of transforming business processes through technology.

This is a fantastic opportunity to join an established and well-respected organization offering tremendous career growth potential.

Job Title: Reinforcement Learning Engineer

Location: 100% Remote (Continental United States)

Position Type: In-house Bright Vision Technologies SOW engagement (no third-party client or vendor)

Salary: $100K - $150K

Experience: 6+ years

Sponsorship: No new H1B sponsorship available. H1B transfers welcomed for qualified candidates.

Employment Type: Full-time, direct W2 with Bright Vision Technologies (no C2C, no 1099, no third-party)

Engagement: Long-term, multi-year, aligned to the Bright Vision SOW delivery roadmap

Compensation: Competitive base salary commensurate with experience, plus benefits.

Employment Terms & Visa Policy

This is a 100% remote, full-time, direct W2 position with Bright Vision Technologies.

This role is part of Bright Vision Technologies’ in-house Statement of Work (SOW) engagement. The client, end customer, and employer for this position is Bright Vision Technologies — there is no third-party client, vendor, or implementation partner involved.

We do not engage in C2C, 1099, or third-party arrangements for this role

BUT STRICTLY NO C2C/1099/3RD PARTY COMPANIES. ALL OUR ROLES ARE W2 AND NO 3RD PARTY BROKERING PLEASE.

Candidates must be willing to work directly as a full-time W2 employee of Bright Vision Technologies and contribute to our in-house SOW deliverables.

No new H1B sponsorship is available for this role.

However, candidates who are currently on a valid H1B visa and require a transfer are welcome to apply. We will support H1B transfers for qualified candidates.

For every role, a technical coding assessment is mandatory. Please apply only if you are confident in your technical abilities and hands-on experience.

Job Summary

We are looking for a Reinforcement Learning Engineer to design, train, and deploy RL-based systems for high-impact decision-making problems where supervised learning alone is insufficient. The role requires deep familiarity with modern reinforcement learning algorithms, simulation environments, reward modeling, and the engineering complexity of training and evaluating policies at scale. The ideal candidate has both research depth and engineering pragmatism, with experience taking RL solutions out of the lab and into production where stability, safety, and ongoing improvement are critical.

Key Responsibilities

  • Design and implement reinforcement learning solutions for sequential decision-making problems in real and simulated environments
  • Develop, calibrate, and maintain simulation environments suitable for large-scale agent training
  • Implement and evaluate modern RL algorithms including policy gradient, actor-critic, off-policy, and offline RL methods
  • Engineer reward functions and shaping strategies that align agent behavior with desired outcomes and safety constraints
  • Apply offline RL and imitation learning techniques where exploration is costly or unsafe
  • Use RLHF, DPO, and related techniques for fine-tuning large language models when relevant
  • Build scalable training infrastructure for distributed RL, including efficient experience collection and replay systems
  • Optimize training stability and sample efficiency through algorithmic and engineering improvements
  • Design rigorous evaluation protocols, including out-of-distribution and adversarial test cases
  • Implement safety mechanisms such as constraint enforcement, conservative policies, and human-in-the-loop oversight
  • Collaborate with applied scientists and product teams to identify high-value RL use cases
  • Monitor deployed policies and models in production for drift, regression, and unintended behaviors, building the alerting and dashboards that surface issues before they meaningfully affect users
  • Document methodology, design decisions, and operational characteristics for internal stakeholders
  • Stay current with RL research and translate promising techniques into production-ready solutions

Required Qualifications

  • Master’s or PhD in Computer Science, Machine Learning, or a related field; or equivalent applied experience
  • Six or more years of combined RL research and engineering experience
  • Strong proficiency in Python and modern deep learning frameworks
  • Hands-on experience with at least one major RL library or in-house RL stack
  • Solid understanding of probability, optimization, and the theoretical foundations of RL
  • Experience designing and tuning reward functions in non-trivial environments
  • Familiarity with simulation environments and large-scale experience collection
  • Experience training neural network policies on GPU clusters
  • Strong written and verbal communication skills
  • Track record of shipping or publishing impactful RL work

Preferred Qualifications

  • Experience with RLHF for large language models
  • Familiarity with multi-agent RL or hierarchical RL
  • Exposure to robotics, control systems, or autonomous driving
  • Publications in RL or related research venues
  • Open-source contributions to RL libraries or environments

How To Apply

Would you like to know more about this opportunity?

For immediate consideration, please send your resume to ***email_hidden***

Learn more about Bright Vision Technologies at

We recognize that our people are our strength, and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company.

We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants’ and employees’ religious practices and beliefs, as well as mental health or physical disability needs.

Bright Vision Technologies is an Equal Opportunity Employer, including Disability/Veterans.

Position offered by “No Fee Agency.”

Equal Employment Opportunity (EEO) Statement

Bright Vision Technologies (BV Teck) is committed to equal employment opportunity (EEO) for all employees and applicants without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, veteran status, or any other protected status as defined by applicable federal, state, or local laws. This commitment extends to all aspects of employment, including recruitment, hiring, training, compensation, promotion, transfer, leaves of absence, termination, layoffs, and recall.

BV Teck expressly prohibits any form of workplace harassment or discrimination. Any improper interference with employees' ability to perform their job duties may result in disciplinary action up to and including termination of employment.

Powered by JazzHR

wc4LVWuXGt

Vacancy posted 28 days ago
Similar jobs that could be interesting for youBased on the Reinforcement Learning Engineer in Pleasanton, CA vacancy
  • $125.3k - $250.6k

     ...business, and our customers can achieve their full potential. As a Learning Architect - Fellow, you will be responsible for all aspects of...  ...materials include lectures, hands‑on labs, and assessments reinforcing learning. The primary objective of this position is to... 
    Suggested
    Remote job
    Temporary work
    Work at office
    Immediate start
    Worldwide
    Flexible hours

    Siemens Digital Industries Software

    Fremont, CA
    1 day ago
  • $150k - $250k

    Albert Invent is looking for an AI/ML expert to build scalable systems for training and fine-tuning generative models. You will implement RL-style methods and ship solutions that enhance automated workflows in autonomous vehicles. The role requires a strong educational ...
    Suggested

    Albert Invent

    Fremont, CA
    5 days ago
  • $175.53k - $267.06k

     ...opening for a Senior Guidance, Navigation & Control (GNC) Engineer to supportwork on projects and initiatives with a focus on...  ...as AGI STK, SSAPy, Battleschool or AFSIM; or with machine learning/reinforcement learning techniques and with LINUX/UNIX operating systems.... 
    Suggested
    Full time
    For contractors
    Relocation package
    Flexible hours

    Lawrence Livermore National Laboratory

    Livermore, CA
    5 days ago
  • $100k

     ...Job Description Job Description As a Senior Project Engineer, you’ll serve as a critical link between the field and project management...  ...of Microsoft Office (Word, Excel, Outlook). Ability to learn new programs. Strong written and verbal communication skills... 
    Suggested
    For contractors
    For subcontractor
    Work at office
    Night shift
    Day shift

    DeSilva Gates Construction

    Dublin, CA
    20 days ago
  •  ...a multi-service infrastructure consulting firm providing civil engineering and surveying services across California, the Pacific Northwest...  ...BKF offers competitive and award-winning benefits and perks. To learn more click here. BKF Engineers provides equal employment... 
    Suggested
    Bi-weekly pay
    Work at office
    Local area
    Remote work
    Flexible hours

    BKF

    Pleasanton, CA
    14 days ago
  •  ...atmosphere. Summary: Calyxo is seeking a Staff Systems Engineer to architect, validate, and own the system-level behavior of a...  ...of the system through early experimentation and focused learning cycles. Cross-Functional Technical Leadership: ~... 
    Shift work

    Calyxo USA Inc

    Pleasanton, CA
    5 days ago
  •  ...THE U.S. ARMY OR ARMY RESERVE*** As a Horizontal Construction Engineer, you'll operate heavy construction equipment like bulldozers,...  ...construction equipment from one site to another. Skills you'll learn align with Vehicle Operations, Maintenance & Repairs, and... 
    Full time
    Part time
    Immediate start
    Relocation package

    U.S. Army Reserves

    Dublin, CA
    1 day ago
  • Field Engineer Department: Operations Employment Type: Full Time Location: Pleasanton, CA Description The primary function of the Field...  ..., strives for increased responsibility and has a desire to learn Shows emotional maturity, punctuality and dependability Shows... 
    Full time
    For subcontractor

    Fres-co System USA, Inc.

    Pleasanton, CA
    2 days ago
  •  ...Job Title: Quality Engineer III Location: Pleasanton, CA Pay Range: $35 to $45/hr on W2 Mode: 12 Months / 100% Onsite What...  ...If this is a role that interests you and you'd like to learn more, click apply now and a recruiter will be in touch with you... 
    Weekly pay
    Temporary work
    Flexible hours

    Manpower Group Inc.

    Pleasanton, CA
    1 day ago
  • $28.84 - $57.69 per hour

     ...issues early to drive solutions to resolution. Leverage standard engineering technologies to provide technical support, participate in...  ...shows personal humility. Technical Curiosity/Willingness to Learn: Interest in seeking out new experiences, knowledge, and candid... 
    Hourly pay
    Temporary work
    For contractors
    Work experience placement
    For subcontractor
    Work at office
    Monday to Friday

    ACCO Engineered Systems

    Pleasanton, CA
    11 days ago
  •  ...Job Title: Systems Engineer Location: Santa Clara, CA Need PCR and Sequencing experience for this position. We are...  ...revolutionizing sequencing technology. As a Senior System Engineer, you will learn how to use the new Roche Single Molecule Sequencing platform... 
    Flexible hours
    Early shift

    Tranzeal

    Pleasanton, CA
    2 days ago
  • $75.58k - $107.97k

     ...internal department relationships (i.e. accounting, purchasing, engineering, construction, fabrication shops, legal, etc.). Collaborates...  ...shows personal humility. Technical Curiosity / Willingness to Learn: Interest in seeking out new experiences, knowledge, and candid... 
    Full time
    Temporary work
    For contractors
    For subcontractor
    Work at office
    Monday to Friday

    Acco Engineered Systems

    Pleasanton, CA
    1 day ago
  • $222.9k - $334.3k

     ...As a Principal Software Engineer at Workday, you will partner with engineering and product leadership to shape the platform’s long‑term...  ...the overall technical bar and fostering a culture of continuous learning and engineering excellence. Lead cloud infrastructure... 
    Contract work

    Workday, Inc.

    Pleasanton, CA
    1 day ago
  • $146.34k - $222.56k

     ...impact. Job Description We have an opening for a Systems Engineer to drive the adoption of Model-Based Systems Engineering (MBSE...  ...be fraudulent, we strongly recommend you do not respond. To learn more about recruitment scams: Equal Employment Opportunity... 
    Minimum wage
    For contractors
    Work at office
    Local area
    Work from home
    Relocation package
    Flexible hours
    1 day per week

    LLNL

    Livermore, CA
    5 days ago
  • $100k - $120k

    Overview Salary: $100,000 - $120,000 per year. As a project engineer on our team, you will be responsible for overseeing the technical,...  ...? Easy Apply now by clicking the "Easy Apply" button. Want to learn more about this role and Jobot? Click our Jobot logo and follow... 

    Jobot

    Pleasanton, CA
    5 days ago
  • $110k - $135k

     ..., geophysical exploration, and telecommunications. Our team of engineers, scientists, software developers, and operations professionals...  ...methodical bring-up and debugging. Nice to have (great to learn on the job): Signal integrity/EMI: impedance control,... 
    Permanent employment
    Internship
    Immediate start
    Flexible hours

    Vector Atomic

    Pleasanton, CA
    6 days ago
  • $69.5 - $72.86 per hour

     ...Services Construction Management Solutions provides strategic engineering, program management, construction management, and operational...  ...false positive rates. Collaborate with data analytics, machine learning, and engineering teams to validate and refine predictive... 
    Hourly pay
    Full time
    Temporary work
    Work at office

    MasTec

    Dublin, CA
    4 days ago
  •  ...Calyxo is seeking a highly skilled and innovative electrical engineer to help develop electronic subsystems in kidney stone treatment...  ...opportunity to shape our company culture. You will experience constant learning and dynamic challenges to help you grow and be the best version... 
    Contract work
    For contractors
    Shift work

    Calyxo USA Inc

    Pleasanton, CA
    5 days ago
  • $129.15k - $196.48k

     ...Job Description We have an opening for an Electrical Designer/Engineer who will prepare construction documents for new construction, remodeling...  ...be fraudulent, we strongly recommend you do not respond. To learn more about recruitment scams: Equal Employment Opportunity... 
    For contractors
    Internship
    Work at office
    Relocation package
    Flexible hours

    Lawrence Livermore National Laboratory

    Livermore, CA
    3 days ago
  • $105k - $115k

     ...Title: Substation Project Engineer Location: Sacramento, Oakland, or Fresno Department: Substation Work Type: hybrid K&A...  ...professional and personal growth is a top priority. Innovation & Learning: We encourage continuous learning and invest in your... 
    Temporary work
    For contractors
    Flexible hours

    K&A Engineering Consulting

    Pleasanton, CA
    5 days ago
  • $75k - $130k

     ...and other comprehensive benefits Department Overview GILLIG Engineering is responsible for the development, customization, and configuration...  ...and practices to determine appropriate action. Applies learned techniques and contributes to analysis and investigation to solve... 
    Full time
    Remote work
    Flexible hours

    GILLIG

    Livermore, CA
    3 days ago
  • $50 - $100 per hour

    Work Setting: 100% onsite engineering and manufacturing environment in Fremont, CA Employment Type: Contract (40 hours per week) Compensation...  ...photonics and semiconductor technology is seeking a Machine Learning Operations (MLOps) Engineer to help develop, deploy, and scale... 
    Hourly pay
    Full time
    Contract work
    For contractors

    Frey Consulting Group

    Fremont, CA
    2 days ago
  • $85k - $110k

     ...computer skills and understanding of computer-operated systems and engineering design tools (e.g., AutoCAD, VISIO, EDS) is required....  ...shows personal humility. Technical Curiosity / Willingness to Learn: Interest in seeking out new experiences, knowledge, and candid... 
    Temporary work
    Work experience placement

    Sunbelt Controls

    Pleasanton, CA
    3 days ago
  •  ...R&D Mechanical Engineer Primary Location US-CA-Livermore ID 2026-4578 Category Mechanical Engineer...  ...and well-being programs to support you and your family. To Learn More Visit: Essential Duties and Responsibilities The... 
    Contract work
    Temporary work
    For contractors
    Remote work

    Geosyntec Consultants

    Livermore, CA
    3 days ago
  •  ...on kidney stone treatment. We are looking for a Staff Systems Engineer to architect, validate, and own the system-level behavior of a...  ...or critical aspects through early experimentation and focused learning cycles. Cross‑Functional Technical Leadership Serve as the systems... 

    Calyxo, Inc.

    Pleasanton, CA
    1 day ago
  • $50 - $72 per hour

     ...Title This position is responsible for independently delivering engineering services, from conceptual design through construction...  ...shows personal humility. Technical Curiosity / Willingness to Learn: Interest in seeking out new experiences, knowledge, and candid... 
    Hourly pay
    Temporary work
    For contractors
    Work at office
    Local area
    Monday to Friday

    ACCO Engineered Systems

    Pleasanton, CA
    1 day ago
  • $76k - $88k

     ...dedicated to providing high-level professional services to our clients. A Graduate Engineer is expected to work on a team providing support to a licensed Civil Engineer while learning the company's standard practices and local agency codes. A large amount of time will... 
    Full time
    Local area
    Flexible hours

    Westwood Professional Services

    Pleasanton, CA
    5 days ago
  • Overview The Inventory Control Tower Business Intelligence Engineer role is a specialized function responsible for enabling enterprise-...  ...leveraging emerging technologies, business intelligence and AI/machine learning. Key responsibilities include establishing and maintaining... 
    Temporary work
    Work experience placement

    Kaiser Permanente

    Pleasanton, CA
    1 day ago
  • $129.15k - $196.48k

     ...impact. Job Description We have an opening for a Project Management Engineer to lead and support the execution of high-impact technical...  ...may be fraudulent, we strongly recommend you do not respond. To learn more about recruitment scams: Equal Employment Opportunity We... 
    Minimum wage
    Contract work
    For contractors
    Local area
    Work from home
    Relocation package
    Flexible hours
    1 day per week

    Physics World

    Livermore, CA
    5 days ago
  • $100k - $150k

     ...Job Description Job Description We seek a Senior Mechanical Engineer to join our facilities team in Fremont, CA. This role provides an excellent opportunity to learn on the job while working closely with the Facility Manager to support the operations of our four locations... 
    For contractors

    CalTek Staffing, Inc.

    Fremont, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Reinforcement Learning Engineer. Be the first to apply!