Reinforcement Learning Engineer
$100k - $150kBright Vision Technologies
Job Title: Reinforcement Learning Engineer
Location: 100% Remote (Continental United States)
Position Type: In-house Bright Vision Technologies SOW engagement (no third-party client or vendor)
Salary: $100K - $150K / Annum
Experience: 6+ years
Sponsorship: No new H1B sponsorship available. H1B transfers welcomed for qualified candidates.
Employment Type: Full-time, direct W2 with Bright Vision Technologies (no C2C, no 1099, no third-party)
Engagement: Long-term, multi-year, aligned to the Bright Vision SOW delivery roadmap
Compensation: Competitive base salary commensurate with experience, plus benefits.
Employment Terms & Visa Policy
This is a 100% remote, full-time, direct W2 position with Bright Vision Technologies.
This role is part of Bright Vision Technologies’ in-house Statement of Work (SOW) engagement. The client, end customer, and employer for this position is Bright Vision Technologies — there is no third-party client, vendor, or implementation partner involved.
We do not engage in C2C, 1099, or third-party arrangements for this role.
BUT STRICTLY NO C2C/1099/3RD PARTY COMPANIES. ALL OUR ROLES ARE W2 AND NO 3RD PARTY BROKERING PLEASE.
Candidates must be willing to work directly as a full-time W2 employee of Bright Vision Technologies and contribute to our in-house SOW deliverables.
No new H1B sponsorship is available for this role.
However, candidates who are currently on a valid H1B visa and require a transfer are welcome to apply. We will support H1B transfers for qualified candidates.
For every role, a technical coding assessment is mandatory. Please apply only if you are confident in your technical abilities and hands-on experience.
Job Summary
We are looking for a Reinforcement Learning Engineer to design, train, and deploy RL-based systems for high-impact decision-making problems where supervised learning alone is insufficient. The role requires deep familiarity with modern reinforcement learning algorithms, simulation environments, reward modeling, and the engineering complexity of training and evaluating policies at scale. The ideal candidate has both research depth and engineering pragmatism, with experience taking RL solutions out of the lab and into production where stability, safety, and ongoing improvement are critical.
Key Responsibilities
- Design and implement reinforcement learning solutions for sequential decision-making problems in real and simulated environments.
- Develop, calibrate, and maintain simulation environments suitable for large-scale agent training.
- Implement and evaluate modern RL algorithms including policy gradient, actor-critic, off-policy, and offline RL methods.
- Engineer reward functions and shaping strategies that align agent behavior with desired outcomes and safety constraints.
- Apply offline RL and imitation learning techniques where exploration is costly or unsafe.
- Use RLHF, DPO, and related techniques for fine-tuning large language models when relevant.
- Build scalable training infrastructure for distributed RL, including efficient experience collection and replay systems.
- Optimize training stability and sample efficiency through algorithmic and engineering improvements.
- Design rigorous evaluation protocols, including out-of-distribution and adversarial test cases.
- Implement safety mechanisms such as constraint enforcement, conservative policies, and human-in-the-loop oversight.
- Collaborate with applied scientists and product teams to identify high-value RL use cases.
- Monitor deployed policies and models in production for drift, regression, and unintended behaviors, building the alerting and dashboards that surface issues before they meaningfully affect users.
- Document methodology, design decisions, and operational characteristics for internal stakeholders.
- Stay current with RL research and translate promising techniques into production-ready solutions.
- Master’s or PhD in Computer Science, Machine Learning, or a related field; or equivalent applied experience.
- Six or more years of combined RL research and engineering experience.
- Strong proficiency in Python and modern deep learning frameworks.
- Hands-on experience with at least one major RL library or in-house RL stack.
- Solid understanding of probability, optimization, and the theoretical foundations of RL.
- Experience designing and tuning reward functions in non-trivial environments.
- Familiarity with simulation environments and large-scale experience collection.
- Experience training neural network policies on GPU clusters.
- Strong written and verbal communication skills.
- Track record of shipping or publishing impactful RL work.
- Experience with RLHF for large language models.
- Familiarity with multi-agent RL or hierarchical RL.
- Exposure to robotics, control systems, or autonomous driving.
- Publications in RL or related research venues.
- Open-source contributions to RL libraries or environments.
Would you like to know more about this opportunity?
For immediate consideration, please send your resume to View email address on brightvisiontechnologies.applytojob.com or contact us at Show phone number. Learn more about Bright Vision Technologies at
We recognize that our people are our strength, and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company.
We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants’ and employees’ religious practices and beliefs, as well as mental health or physical disability needs.
Bright Vision Technologies is an Equal Opportunity Employer, including Disability/Veterans.
Position offered by “No Fee Agency.”
Equal Employment Opportunity (EEO) Statement
Bright Vision Technologies (BV Teck) is committed to equal employment opportunity (EEO) for all employees and applicants without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, veteran status, or any other protected status as defined by applicable federal, state, or local laws. This commitment extends to all aspects of employment, including recruitment, hiring, training, compensation, promotion, transfer, leaves of absence, termination, layoffs, and recall.
BV Teck expressly prohibits any form of workplace harassment or discrimination. Any improper interference with employees' ability to perform their job duties may result in disciplinary action up to and including termination of employment.
- ...Bright Vision Technologies is seeking a remote Reinforcement Learning Engineer to design and implement RL solutions for decision-making problems. The ideal candidate should have a Master’s or PhD in Computer Science or a related field, with over 6 years of relevant experience...SuggestedFull timeRemote work
- ...Automation Engineer Location: Sayreville, NJ, US, 08872 Group: North American Steel Group Division: Maintenance CMC provides an excellent opportunity to learn the steel, construction reinforcement and ground stabilization industries and to grow in your career. Whether...SuggestedWork at officeLocal area
$90k - $100k
...speed, scalable fulfillment for industry leaders. As a Controls Engineer I, you will not only support these systems - you will help... ...cutting-edge automation systems. Curiosity and willingness to learn Interest in combining controls engineering with AI...SuggestedFull timeFor subcontractorInternshipWork at officeLocal areaFlexible hours$125.5k - $230.2k
...want it to go. Join EY and help to build a better working world. Technology – Data and Decision Science – AI Native Engineering AI/Machine Learning Engineer, Manager Consultant Power & Utilities Sector The opportunity Our Artificial Intelligence and Data...SuggestedFull timeWork experience placementSummer holidayFlexible hours$38 - $45 per hour
...Inspiration, Quality and Teamwork! Job Description The Support Engineer is a hands-on role that demands strong technical expertise,... ...travel as needed. Responsibilities Ability to learn and understand electrical and control aspects of installed systems...SuggestedHourly payFull timeLocal areaRemote workFlexible hours$75k - $89k
...Duties: Yes Salary Range: $75,000 - $89,000 The Project Engineer (PE) will be primarily responsible for obtaining, distributing... ...skills (written and oral). Flexibility and willingness to learn. Critical thinking and desire to solve problems. Required...Full timeFor subcontractorInternshipWork at office$65k - $70k
...and their patients. Take your next step to an altogether life-changing career. Learn about the Danaher Business System ( which makes everything possible. The Field Service Engineer is responsible for service, repair and/or installation of products or IT solutions...Work experience placementRemote workFlexible hoursNight shift$36.84 - $49.12 per hour
...often (in days) to receive an alert: Apply now Field Service Engineer Location: US - NJ - Somerset Requisition... ...difference-makers to join our growing team. Interested? Learn more at Overview Veeco is seeking a motivated Field Service...Hourly payWork at officeRelocationFlexible hoursShift work$91k - $113.7k
Job Description The Opportunity As a Field Service Engineer supporting Gas Insulated Switchgear (GIS), you work directly at customer substations... ...professionals across the US power industry Access to learning platforms and career development programs Competitive health and...For subcontractorWork at officeFlexible hours$40 - $45 per hour
...efficiency, clinical confidence, adaptive collaboration, and accelerated intelligence. Learn about the Danaher Business System ( which makes everything possible. The Field Service Engineer is responsible for the vital work of maintaining diagnostic tools, which directly...Hourly payRemote workRelocationMonday to FridayFlexible hours- ...Senior Application Security Engineer – AI & SAST Automation Our client, a leading organization in the technology and security... ...or PyTorch. ~ Experience with generative AI models, reinforcement learning, and autocode remediation models. ~ At least 2 years of experience...Remote work3 days per week
$30 per hour
...Quality and Teamwork! Job Description The Mechanical Project Engineer is a hands-on individual with the maturity and experience to... ...in time management; attention to detail and the ability to learn quickly ~ Excellence in communication (both verbal and written...Daily paidFull timeLocal areaFlexible hours$100k
...Details Tenure-track faculty positions, Electrical and Computer Engineering Department Electrical & Computer Engineering Salary $100,000... ...; Security, especially in information security and AI/machine learning security; AI for domain applications, especially next‑...Flexible hours- ...technologies and the lessons of aerospace engineering to other commercial applications. We... ...hydrogen) as well as light-weighting / reinforcing electric vehicles. The FUZE team is growing... ...A self-starter with a strong desire to learn new skills, and a willingness to ask questions...Permanent employmentFull time
- ...Job Description Job Description Automation Controls Engineer Automation Controls Engineer Position Overview The Automation Engineer will design, implement, commission and support manufacturing automation systems to improve production efficiency, quality and...Ongoing contractFor contractorsLocal area
- ...Job Description Job Description EG&R is seeking an Electrical Engineer with experience in the Power Industry (Power Plants / Substations) who is self-motivated and ready to contribute to the future success of EG&R Engineering PC. Responsibilities: Main...Temporary work
$60k - $75k
Overview Under the close supervision of a Principal Engineer or other designated supervisor, and while serving as a trainee and contributing... ...and samples construction materials for laboratory testing. Learns to lay out and conduct detailed inspections of public...TraineeshipWork at officeLocal area- ...identify software defects in electrified and internal combustion engine (ICE) powertrains, ensuring the delivery of robust, high-... ...who thrives in a fast-paced environment and is passionate about learning and problem-solving. This role is responsible for the validation...Full timeImmediate start
$155.66k - $225.16k
...insights on various topics and providing a unique platform to learn and connect with others. Poe : a platform providing millions... ...About the Team and Role: We’re hiring our first AI Automation Engineer to lead how we apply AI internally across the company. This is...Remote jobFull timeShift work- ...Consigli Construction Co. is seeking a Project Engineer located in New Brunswick, New Jersey. This full-time position involves managing project documentation, reporting on critical project information, and coordinating with subcontractors and vendors. The ideal candidate...Full timeFor subcontractorWork at office
$106.8k - $194.8k
...help to build a better working world. WAF Operations Solution Engineer PRACTICE DESCRIPTION: As a WAF Operations Solution... ...in a diverse and inclusive culture of globally connected teams. Learn more . We offer a comprehensive compensation and benefits...Summer holidayFlexible hours$110k - $120k
...organizational, and communication skills, focused on supporting BEUMER’s installed systems within the Customer Support division. The Controls Engineer II applies professional programming knowledge to troubleshoot, optimize, and enhance machine control systems—primarily within...Daily paidFull timeLocal areaRemote workFlexible hours$106.9k - $176.5k
...want it to go. Join EY and help to build a better working world. Technology – Data and Decision Science – AI Native Engineering AI/Machine Learning Engineer, Senior Consultant The opportunity Our Artificial Intelligence and Data team helps apply cutting...Full timeWork experience placementSummer holidayFlexible hours- ...Job Description Job Description The Automation Engineer is responsible for designing, implementing, and sustaining automated manufacturing systems that improve production efficiency, consistency, and cost performance. This role drives automation strategy by identifying...
- ...Why This Job CMC provides an excellent opportunity to learn the steel, construction reinforcement and ground stabilization industries and to grow in... ...assets Provide technical assistance in the scope of engineering, preventative and predictive maintenance, and other facets...Work at officeLocal area
$79.33k - $98.68k
...TRC has long set the bar for clients who require more than just engineering, combining science with the latest technology to devise... ...Design Firms in the United States. Please watch the video below to learn more: Benefits : TRC offers a competitive benefit package consisting...Full timeTemporary workPart timeFor contractorsWork at officeLocal area- ...business by fusing threat intelligence, monitoring, detection engineering, and response into one proactive, intelligence-led defense capability... ...enterprise. The team prioritizes collaboration, continuous learning, and innovation, with shared ownership of outcomes and a...Second jobLive inWorldwideFlexible hours
$123.58k - $149.75k
...Education : ~ Bachelors (or Masters) degree in science, engineering, packaging or a related field with a minimum of 6 years relevant... ...security numbers during our application or recruitment process. Learn more about protecting yourself at . Any data processed in connection...Hourly payFull timeContract workTemporary workPart timeFor contractorsSummer workLive inWork at officeLocal areaRemote workFlexible hoursShift work- ...Position Summary The Spear Group offers a full‑time Construction Project Engineer/Coordinator position supporting our life sciences/pharma manufacturing client on site in Carteret, NJ. The role supports the client’s owner ’s side CAPEX team in managing construction projects...Full timeImmediate start
$250k
...impact. Here, you’ll thrive in a culture that valuescontinuous learning, professional growth, and innovation—all while having fun... ...Responsibilities Project Management & Delivery Lead cross-functional engineering projects from concept through stabilization Develop detailed...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Reinforcement Learning Engineer. Be the first to apply!



