Reinforcement Learning Engineer
$100k - $150kBright Vision Technologies
Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations. We leverage cutting-edge technologies to create scalable, secure, and user-friendly applications.
As we continue to grow, we’re looking for a skilled Reinforcement Learning Engineer to join our dynamic team and contribute to our mission of transforming business processes through technology.
This is a fantastic opportunity to join an established and well-respected organization offering tremendous career growth potential.
Job Title: Reinforcement Learning Engineer
Location: 100% Remote (Continental United States)
Position Type: In-house Bright Vision Technologies SOW engagement (no third-party client or vendor)
Salary: $100K - $150K
Experience: 6+ years
Sponsorship: No new H1B sponsorship available. H1B transfers welcomed for qualified candidates.
Employment Type: Full-time, direct W2 with Bright Vision Technologies (no C2C, no 1099, no third-party)
Engagement: Long-term, multi-year, aligned to the Bright Vision SOW delivery roadmap
Compensation: Competitive base salary commensurate with experience, plus benefits.
Employment Terms & Visa Policy
This is a 100% remote, full-time, direct W2 position with Bright Vision Technologies.
This role is part of Bright Vision Technologies’ in-house Statement of Work (SOW) engagement. The client, end customer, and employer for this position is Bright Vision Technologies — there is no third-party client, vendor, or implementation partner involved.
We do not engage in C2C, 1099, or third-party arrangements for this role
BUT STRICTLY NO C2C/1099/3RD PARTY COMPANIES. ALL OUR ROLES ARE W2 AND NO 3RD PARTY BROKERING PLEASE.
Candidates must be willing to work directly as a full-time W2 employee of Bright Vision Technologies and contribute to our in-house SOW deliverables.
No new H1B sponsorship is available for this role.
However, candidates who are currently on a valid H1B visa and require a transfer are welcome to apply. We will support H1B transfers for qualified candidates.
For every role, a technical coding assessment is mandatory. Please apply only if you are confident in your technical abilities and hands-on experience.
Job Summary
We are looking for a Reinforcement Learning Engineer to design, train, and deploy RL-based systems for high-impact decision-making problems where supervised learning alone is insufficient. The role requires deep familiarity with modern reinforcement learning algorithms, simulation environments, reward modeling, and the engineering complexity of training and evaluating policies at scale. The ideal candidate has both research depth and engineering pragmatism, with experience taking RL solutions out of the lab and into production where stability, safety, and ongoing improvement are critical.
Key Responsibilities
- Design and implement reinforcement learning solutions for sequential decision-making problems in real and simulated environments
- Develop, calibrate, and maintain simulation environments suitable for large-scale agent training
- Implement and evaluate modern RL algorithms including policy gradient, actor-critic, off-policy, and offline RL methods
- Engineer reward functions and shaping strategies that align agent behavior with desired outcomes and safety constraints
- Apply offline RL and imitation learning techniques where exploration is costly or unsafe
- Use RLHF, DPO, and related techniques for fine-tuning large language models when relevant
- Build scalable training infrastructure for distributed RL, including efficient experience collection and replay systems
- Optimize training stability and sample efficiency through algorithmic and engineering improvements
- Design rigorous evaluation protocols, including out-of-distribution and adversarial test cases
- Implement safety mechanisms such as constraint enforcement, conservative policies, and human-in-the-loop oversight
- Collaborate with applied scientists and product teams to identify high-value RL use cases
- Monitor deployed policies and models in production for drift, regression, and unintended behaviors, building the alerting and dashboards that surface issues before they meaningfully affect users
- Document methodology, design decisions, and operational characteristics for internal stakeholders
- Stay current with RL research and translate promising techniques into production-ready solutions
Required Qualifications
- Master’s or PhD in Computer Science, Machine Learning, or a related field; or equivalent applied experience
- Six or more years of combined RL research and engineering experience
- Strong proficiency in Python and modern deep learning frameworks
- Hands-on experience with at least one major RL library or in-house RL stack
- Solid understanding of probability, optimization, and the theoretical foundations of RL
- Experience designing and tuning reward functions in non-trivial environments
- Familiarity with simulation environments and large-scale experience collection
- Experience training neural network policies on GPU clusters
- Strong written and verbal communication skills
- Track record of shipping or publishing impactful RL work
Preferred Qualifications
- Experience with RLHF for large language models
- Familiarity with multi-agent RL or hierarchical RL
- Exposure to robotics, control systems, or autonomous driving
- Publications in RL or related research venues
- Open-source contributions to RL libraries or environments
How To Apply
Would you like to know more about this opportunity?
For immediate consideration, please send your resume to ***email_hidden***
Learn more about Bright Vision Technologies at
We recognize that our people are our strength, and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company.
We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants’ and employees’ religious practices and beliefs, as well as mental health or physical disability needs.
Bright Vision Technologies is an Equal Opportunity Employer, including Disability/Veterans.
Position offered by “No Fee Agency.”
Equal Employment Opportunity (EEO) Statement
Bright Vision Technologies (BV Teck) is committed to equal employment opportunity (EEO) for all employees and applicants without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, veteran status, or any other protected status as defined by applicable federal, state, or local laws. This commitment extends to all aspects of employment, including recruitment, hiring, training, compensation, promotion, transfer, leaves of absence, termination, layoffs, and recall.
BV Teck expressly prohibits any form of workplace harassment or discrimination. Any improper interference with employees' ability to perform their job duties may result in disciplinary action up to and including termination of employment.
Powered by JazzHR
wc4LVWuXGt
$125.3k - $250.6k
...business, and our customers can achieve their full potential. As a Learning Architect - Fellow, you will be responsible for all aspects of... ...materials include lectures, hands‑on labs, and assessments reinforcing learning. The primary objective of this position is to...SuggestedRemote jobTemporary workWork at officeImmediate startWorldwideFlexible hours$150k - $250k
Albert Invent is looking for an AI/ML expert to build scalable systems for training and fine-tuning generative models. You will implement RL-style methods and ship solutions that enhance automated workflows in autonomous vehicles. The role requires a strong educational ...Suggested$175.53k - $267.06k
...opening for a Senior Guidance, Navigation & Control (GNC) Engineer to supportwork on projects and initiatives with a focus on... ...as AGI STK, SSAPy, Battleschool or AFSIM; or with machine learning/reinforcement learning techniques and with LINUX/UNIX operating systems....SuggestedFull timeFor contractorsRelocation packageFlexible hours$100k
...Job Description Job Description As a Senior Project Engineer, you’ll serve as a critical link between the field and project management... ...of Microsoft Office (Word, Excel, Outlook). Ability to learn new programs. Strong written and verbal communication skills...SuggestedFor contractorsFor subcontractorWork at officeNight shiftDay shift- ...a multi-service infrastructure consulting firm providing civil engineering and surveying services across California, the Pacific Northwest... ...BKF offers competitive and award-winning benefits and perks. To learn more click here. BKF Engineers provides equal employment...SuggestedBi-weekly payWork at officeLocal areaRemote workFlexible hours
- ...atmosphere. Summary: Calyxo is seeking a Staff Systems Engineer to architect, validate, and own the system-level behavior of a... ...of the system through early experimentation and focused learning cycles. Cross-Functional Technical Leadership: ~...Shift work
- ...THE U.S. ARMY OR ARMY RESERVE*** As a Horizontal Construction Engineer, you'll operate heavy construction equipment like bulldozers,... ...construction equipment from one site to another. Skills you'll learn align with Vehicle Operations, Maintenance & Repairs, and...Full timePart timeImmediate startRelocation package
- Field Engineer Department: Operations Employment Type: Full Time Location: Pleasanton, CA Description The primary function of the Field... ..., strives for increased responsibility and has a desire to learn Shows emotional maturity, punctuality and dependability Shows...Full timeFor subcontractor
- ...Job Title: Quality Engineer III Location: Pleasanton, CA Pay Range: $35 to $45/hr on W2 Mode: 12 Months / 100% Onsite What... ...If this is a role that interests you and you'd like to learn more, click apply now and a recruiter will be in touch with you...Weekly payTemporary workFlexible hours
$28.84 - $57.69 per hour
...issues early to drive solutions to resolution. Leverage standard engineering technologies to provide technical support, participate in... ...shows personal humility. Technical Curiosity/Willingness to Learn: Interest in seeking out new experiences, knowledge, and candid...Hourly payTemporary workFor contractorsWork experience placementFor subcontractorWork at officeMonday to Friday- ...Job Title: Systems Engineer Location: Santa Clara, CA Need PCR and Sequencing experience for this position. We are... ...revolutionizing sequencing technology. As a Senior System Engineer, you will learn how to use the new Roche Single Molecule Sequencing platform...Flexible hoursEarly shift
$75.58k - $107.97k
...internal department relationships (i.e. accounting, purchasing, engineering, construction, fabrication shops, legal, etc.). Collaborates... ...shows personal humility. Technical Curiosity / Willingness to Learn: Interest in seeking out new experiences, knowledge, and candid...Full timeTemporary workFor contractorsFor subcontractorWork at officeMonday to Friday$222.9k - $334.3k
...As a Principal Software Engineer at Workday, you will partner with engineering and product leadership to shape the platform’s long‑term... ...the overall technical bar and fostering a culture of continuous learning and engineering excellence. Lead cloud infrastructure...Contract work$146.34k - $222.56k
...impact. Job Description We have an opening for a Systems Engineer to drive the adoption of Model-Based Systems Engineering (MBSE... ...be fraudulent, we strongly recommend you do not respond. To learn more about recruitment scams: Equal Employment Opportunity...Minimum wageFor contractorsWork at officeLocal areaWork from homeRelocation packageFlexible hours1 day per week$100k - $120k
Overview Salary: $100,000 - $120,000 per year. As a project engineer on our team, you will be responsible for overseeing the technical,... ...? Easy Apply now by clicking the "Easy Apply" button. Want to learn more about this role and Jobot? Click our Jobot logo and follow...$110k - $135k
..., geophysical exploration, and telecommunications. Our team of engineers, scientists, software developers, and operations professionals... ...methodical bring-up and debugging. Nice to have (great to learn on the job): Signal integrity/EMI: impedance control,...Permanent employmentInternshipImmediate startFlexible hours$69.5 - $72.86 per hour
...Services Construction Management Solutions provides strategic engineering, program management, construction management, and operational... ...false positive rates. Collaborate with data analytics, machine learning, and engineering teams to validate and refine predictive...Hourly payFull timeTemporary workWork at office- ...Calyxo is seeking a highly skilled and innovative electrical engineer to help develop electronic subsystems in kidney stone treatment... ...opportunity to shape our company culture. You will experience constant learning and dynamic challenges to help you grow and be the best version...Contract workFor contractorsShift work
$129.15k - $196.48k
...Job Description We have an opening for an Electrical Designer/Engineer who will prepare construction documents for new construction, remodeling... ...be fraudulent, we strongly recommend you do not respond. To learn more about recruitment scams: Equal Employment Opportunity...For contractorsInternshipWork at officeRelocation packageFlexible hours$105k - $115k
...Title: Substation Project Engineer Location: Sacramento, Oakland, or Fresno Department: Substation Work Type: hybrid K&A... ...professional and personal growth is a top priority. Innovation & Learning: We encourage continuous learning and invest in your...Temporary workFor contractorsFlexible hours$75k - $130k
...and other comprehensive benefits Department Overview GILLIG Engineering is responsible for the development, customization, and configuration... ...and practices to determine appropriate action. Applies learned techniques and contributes to analysis and investigation to solve...Full timeRemote workFlexible hours$50 - $100 per hour
Work Setting: 100% onsite engineering and manufacturing environment in Fremont, CA Employment Type: Contract (40 hours per week) Compensation... ...photonics and semiconductor technology is seeking a Machine Learning Operations (MLOps) Engineer to help develop, deploy, and scale...Hourly payFull timeContract workFor contractors$85k - $110k
...computer skills and understanding of computer-operated systems and engineering design tools (e.g., AutoCAD, VISIO, EDS) is required.... ...shows personal humility. Technical Curiosity / Willingness to Learn: Interest in seeking out new experiences, knowledge, and candid...Temporary workWork experience placement- ...R&D Mechanical Engineer Primary Location US-CA-Livermore ID 2026-4578 Category Mechanical Engineer... ...and well-being programs to support you and your family. To Learn More Visit: Essential Duties and Responsibilities The...Contract workTemporary workFor contractorsRemote work
- ...on kidney stone treatment. We are looking for a Staff Systems Engineer to architect, validate, and own the system-level behavior of a... ...or critical aspects through early experimentation and focused learning cycles. Cross‑Functional Technical Leadership Serve as the systems...
$50 - $72 per hour
...Title This position is responsible for independently delivering engineering services, from conceptual design through construction... ...shows personal humility. Technical Curiosity / Willingness to Learn: Interest in seeking out new experiences, knowledge, and candid...Hourly payTemporary workFor contractorsWork at officeLocal areaMonday to Friday$76k - $88k
...dedicated to providing high-level professional services to our clients. A Graduate Engineer is expected to work on a team providing support to a licensed Civil Engineer while learning the company's standard practices and local agency codes. A large amount of time will...Full timeLocal areaFlexible hours- Overview The Inventory Control Tower Business Intelligence Engineer role is a specialized function responsible for enabling enterprise-... ...leveraging emerging technologies, business intelligence and AI/machine learning. Key responsibilities include establishing and maintaining...Temporary workWork experience placement
$129.15k - $196.48k
...impact. Job Description We have an opening for a Project Management Engineer to lead and support the execution of high-impact technical... ...may be fraudulent, we strongly recommend you do not respond. To learn more about recruitment scams: Equal Employment Opportunity We...Minimum wageContract workFor contractorsLocal areaWork from homeRelocation packageFlexible hours1 day per week$100k - $150k
...Job Description Job Description We seek a Senior Mechanical Engineer to join our facilities team in Fremont, CA. This role provides an excellent opportunity to learn on the job while working closely with the Facility Manager to support the operations of our four locations...For contractors
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Reinforcement Learning Engineer. Be the first to apply!



