AI Research Lead Post-Training & RLHF
1mind
1mind is seeking an AI Research Lead in San Francisco to define and drive the research agenda for their AI models. This role involves owning the research roadmap, designing experiments for sales LLMs, and collaborating with cross-functional teams. The ideal candidate will have significant experience in machine learning and a passion for building cutting-edge models. 1mind offers a competitive compensation package and the opportunity to work with a top-class team in a fast-paced, innovative environment. #J-18808-Ljbffr
- ...product knowledge. They can lead unlimited, simultaneous conversations... ...We’re looking for an AI Research Lead to define and drive 1mind... ...exploratory research into vertical post‑training for sales and GTM domains,... ...post‑training techniques — RLHF, DPO, reward modeling, and...TrainingFull timeLive inRelocationVisa sponsorship
$146.2k - $261.4k
...Research Lead - AI Cyber Testing & Evaluation RAND's Center on AI, Security, and Technology (CAST), part of the Global and Emerging Risks... ...Development Program (CNODP), Remote Interactive Operator Training (RIOT), Future Operator Readiness Growth and Enrichment (FORGE...TrainingWork experience placementRemote workWork from home$167.3k - $261.4k
...Type: Term (Fixed Term) RAND’s Center on AI, Security, and Technology (CAST), part... ...and strategic leader to serve as Senior Research Lead - AI Security Portfolio. CAST conducts cutting... ...secure infrastructure for AI model training and deployment. Familiarity with AI/ML hardware...TrainingFixed term contractRemote workWork from home$357k
...data, applications, processes, and AI into a single, governed platform. A... ...Responsibilities Workato's AI Research Lab is seeking an exceptional Lead AI Research Scientist to join our... ...experience with large-scale model training, transformer architectures, reinforcement...TrainingWork at officeRemote workFlexible hours$200k - $280k
...architectures, engines) and post-training / RL systems. We build... ...models (e.g., GRPO, RLHF/RLAIF, DPO-like methods... .... Have a solid research foundation in your area... ...About Together AI Together AI is a research... ...We have contributed to leading open-source research, models...TrainingFull time- ...A leading AI research lab is seeking a Research Scientist to focus on alignment and post-training techniques for advanced video generation models. The role involves designing evaluation frameworks, collaborating with teams, and mentoring researchers to enhance alignment...Training
- ...platform for evaluating how AI models perform in the real world. Created by researchers from UC Berkeley’s... ...centered model evaluations. Leading enterprises and AI labs... ...: Hands-on experience training large-scale models,... ...LLMs with methods like RLHF, DPO, and contrastive learning...TrainingPermanent employmentWork at officeWorldwide
- ...deeply integrated with hardware. Our AI team designs and ships the models that let... ...benchmarks. About the role As an AI Researcher at Droyd, you’ll own meaningful parts of... ...stack that power our robotic arms. You’ll train models, push them onto hardware, and...Training
- ...mission is to build multimodal AI to expand human imagination... ...vision. So, we are working on training and scaling up multimodal... ...Manager to partner closely with researchers and engineers building state-... ...Join Us Work alongside leading researchers pushing the frontier...Training
$175k - $300k
...the knowledge and tools to make AI work for their unique needs... ...fine-tuning API that empowers researchers and developers to customize frontier... ...Tinkerers full flexibility in training open weights models with their... ...a GTM Strategy & Operations lead to build the commercial engine...TrainingVisa sponsorshipWork visaRelocation package- ...Axon is seeking an AI/Technology Evangelist in San Francisco to drive the adoption and responsible use of AI across the organization. You will design training programs, identify high-impact AI use cases, and serve as the internal voice for Axon’s AI platform. The ideal...Training
- ...enables anyone to create, train, and deploy them. We... ...pair it with the full RL post-training stack:... ...RL trainer. We enable researchers, startups and enterprises... ...paradigm. Responsibilities Lead and participate in... ...resource utilization of AI inference workloads by...TrainingRemote workWorldwideVisa sponsorshipRelocation packageFlexible hours
$170k - $240k
...About Zip Zip is the AI platform for enterprise procurement — built for humans and agents... ...reach into every department, and own the training, enablement, and tooling that continues to... ...Zip as an industry leader in AI. You lead by doing — prototyping solutions, pressure...TrainingWork at officeHome officeFlexible hoursWeekend work- ...Anthropic is seeking a Research Lead for the Training Insights team to shape the evaluation of model capabilities. This hands-on leadership role involves... .... You will play a crucial role in transforming how AI capabilities are assessed, working collaboratively across various...TrainingRemote work
$92k - $115k
...Lead, CS AI Content Flex is a growth-stage, NYC headquartered FinTech company that is creating... ...out. Maintain version-controlled AI training content that is audit-ready and aligned... ...workplace. Offices Roles posted in New York, San Francisco, and Salt Lake...TrainingFull timeLocal areaRelocation packageFlexible hours2 days per week3 days per week- ...Research Lead, Training Insights Remote-Friendly (Travel Required) | San Francisco, CA; New York City, NY About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and...TrainingWork at officeRemote workVisa sponsorshipFlexible hoursShift work
- ...involves architecting end-to-end ML systems, leading technical roadmaps, and mentoring engineers.... ...and leadership is essential. Experience in ML platforms and distributed training is highly valued. Join a forward-thinking team shaping the future of AI. #J-18808-Ljbffr...Training
$10 per hour
...every freight decision. You will lead the Autonomous Freight Systems... ...-run one. You will lead an AI-first engineering team tasked... ...The range displayed on each job posting reflects the minimum and... ...and relevant education and / or training. The US base salary range for...Training$150k - $300k
...enables anyone to create, train, and deploy them. We... ...pair it with the full rl post-training stack:... ...RL trainer. We enable researchers, startups and enterprises... ...on our decentralizing AI training stack. If you... ...you. Responsibilities Lead and participate in novel...TrainingRemote workWorldwideVisa sponsorshipRelocation packageFlexible hours$185k - $215k
A leading software company in San Francisco seeks an experienced Enablement Program Manager. This role will be focused on evolving customer... ...practical enablement. Key responsibilities include designing training programs and partnering with leadership for performance...Training$110k - $170k
...program across regions and roles. This role emphasizes creating AI-forward learning experiences to expedite new hire productivity.... ...include building efficient onboarding frameworks, facilitating training sessions, and measuring readiness metrics. Candidates should have...Training- Hinge Health is hiring a Senior Project Manager / AI Implementations to lead the Growth Marketing Operations team. This role will drive AI tool... ...a quality-focused AI tools repository and facilitate team training, contributing to an AI-led growth strategy in a fast-paced...Training
- A leading AI-powered platform in San Francisco seeks a Customer Success Professional to own deployment and activation success across customer... ...strategies, building advocate networks, and facilitating training sessions. The position offers a hybrid work environment and...Training
- ...Autonomous Detection Layer designed to fight AI with AI. We aren't just "bolting on" a... ...Your Role: Detection Architect & Adversarial Lead You won't just be writing detections; you'... ...from the wild are instantly fed back into training sets to harden the system against the next...TrainingLive in
- .... Notable investors: Y Combinator, VP of Research at Google Deepmind, researchers at Anthropic... ..., Replit, Cohere, and Redis. shipd.ai - We turn data‑creation tasks into paid bounties... ...define and enforce quality standards and training systems Implement contributor engagement...TrainingFor contractors
$167.3k - $261.4k
...RAND Corporation in San Francisco is seeking a Senior Research Lead for its AI Security Portfolio to oversee research on AI systems. The role involves guiding technical teams, defining research agendas, and engaging with policymakers. Candidates should possess a Ph.D....$225k - $320k
Backed by leading Silicon Valley investors, Peregrine helps public safety organizations, state... ...unprecedented speed and accuracy. Our AI‑enabled platform turns siloed and... ...Peregrine’s AI infrastructure, including: Training and inference pipelines Model serving, versioning...TrainingLocal area$100k - $170k
Founding AI Implementation Lead -Minoa (San Francisco) San Francisco | $100-170k base + 0.3-1.0% equity... ...with their CRM and revenue stack, training their team, and driving adoption through... .... The chance to build Minoa's entire post‑sales function from scratch, at a...Training- ...future generations. About the Role We are hiring our first AI Enablement Lead to drive how AI is adopted across the company. This is a hands... ...a byproduct of building together. You will not be running a training program or writing curricula. You will be sitting next to...TrainingRelocation packageFlexible hours
- About WRITER WRITER is where the world's leading enterprises orchestrate AI-powered work. Our vision is to expand human capacity through superintelligence... ...programs, office hours, usage dashboards, customized trainings, and workshops Build champion communities Create...TrainingFull timeWork at officeLocal areaFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to AI Research Lead Post-Training & RLHF. Be the first to apply!
- research dietitian San Francisco, CA
- history research San Francisco, CA
- education policy research San Francisco, CA
- research pharmacist San Francisco, CA
- research professional San Francisco, CA
- student research intern San Francisco, CA
- research intern San Francisco, CA
- physics research San Francisco, CA
- pharmaceutical research San Francisco, CA
- cancer research San Francisco, CA


