Frontier AI Risk Evaluations Scientist
$197.4k - $246.75kScale AI, Inc.
A leading AI research organization in San Francisco is seeking a Research Scientist for Frontier Risk Evaluations. This role involves designing evaluation measures for assessing risks posed by advanced AI systems, working collaboratively with government agencies, and publishing scientific findings. Ideal candidates will have a strong background in machine learning research, experience in building ML pipelines, and excellent communication skills. This is a full-time position with a salary range of $197,400—$246,750 USD. #J-18808-Ljbffr Scale AI, Inc.
$216k - $270k
SCALE LABS, RESEARCH SCIENTIST — FRONTIER RISK EVALUATIONS As the leading data and evaluation partner for frontier AI companies, Scale plays an integral role in understanding the capabilities and safeguarding AI models and systems. Building on this expertise, Scale...RiskFull time$197.4k - $246.75k
Scale Labs, Research Scientist — AI Controls and Monitoring As the leading data and evaluation partner for frontier AI companies, Scale plays an integral role in understanding the... ...make informed, scientific decisions about AI risks and capabilities. Our research tackles...Risk$50 - $70 per hour
...technical talent with leading AI research labs. Headquartered in... ...and stakeholders to challenge frontier AI agents. Collaborate with... ...to refine task designs and evaluation criteria for environmental-science... ..., ESG reporting, or climate-risk analysis. ~ Day-to-day use of...RiskHourly payFull timeContract workSummer workRemote work$168.1k - $312.3k
...makes us Roche. Advances in AI, data, and computational... ...Intelligence (AI) to assist our scientists in both pRED and gRED to deliver... .... The Opportunity Frontier Research is dedicated to foundational... ...reusable code, running evaluations, and organizing results. Work...SuggestedLocal areaWorldwideRelocation package$120k - $200k
...V7 At V7, we’re building AI platforms that help humans do their best work, at incredible... ...you'll have You'll be embedded in our frontier AI projects team, working on projects that... ...and keep them on track through proactive risk identification, dependency management, and...RiskRemote work- ...On-site Department Technical About the Role Generative AI is transforming what's computationally possible—but it's... ...offers a path through these bottlenecks. As an ML Research Scientist, you'll work at the frontier of generative modeling and quantum acceleration,...Full timeCasual workVisa sponsorship
- A leading AI evaluation firm based in San Francisco seeks a Machine Learning Scientist to foster understanding of AI model performance. You'll engage in designing and analyzing comprehensive experiments while collaborating across teams. Applicants should possess a PhD...
- ...Research team works on high-risk, high-reward ideas that... ...the next decade of AI. Our goal is to advance... ...focus on future frontier models. Pushing the boundaries... ...world-class research scientists and engineers developing... ...infrastructure for training, evaluating, and integrating...RiskWork at officeRelocation package
$192.6k - $344.85k
## AI Research Manager/Scientist, Reinforcement LearningApplylocations: San Francisco... ...training workflows### ### Evaluation, Alignment & *Model*... ...explicit articulation of known risks and trade-offs### ###... ...in an AI research lab or frontier *model* organization* Background...RiskRemote work- Member of Technical Staff - Research Scientist Patronus AI is a frontier lab developing simulation research... ...and most influential research in AI evaluation like FinanceBench , Lynx , SimpleSafetyTests... ...clearly and proactively, flagging risks, blockers, or timeline changes early...Risk
$320k
...interpretable, and steerable AI systems. We want AI to be safe... ...systems. About The Team The Frontier Red Team (FRT) is a small, focused... ...closely with the Emerging Risks workstream to understand novel... ...on our team, you’ll build and evaluate model organisms of autonomous...RiskRelocationVisa sponsorship$75.2k - $124.1k
...related processes or tasks Compile and/or evaluate moderately complex data, computations,... ...aspects of delivery. Understand opportunity risk in relation to our Scope of Services Develop... ...in capital markets. Enabled by data, AI and advanced technology, EY teams help clients...RiskSummer holidayFlexible hours$250k - $325k
...was first‑to‑market with An AI agent that lives in MS Word and... ...deviation analysis to locate buried risk in huge contract databases [20... ...is to keep raising it. Push frontier techniques into production by... ...on legal‑specific tasks. Evaluate emerging work in agentic systems...RiskContract workWork at officeImmediate startRemote work$200k - $250k
The Center for AI Safety (CAIS) is a leading research... ...societal-scale risks from AI. We address AI’... ...As a Senior Research Scientist here, you will lead and... ...safety and reliability of frontier AI systems, taking ownership... ...needed to train and evaluate models at scale, and turn...RiskWork at officeLocal area$140k - $160k
...industry agendas. The Centre for AI Excellence (CAIE) is a global... ...is looking for a Lead for its Frontier AI Systems & Capabilities... ...productivity, and the nature of systemic risk. In this role, you will lead... ...system‑centric frameworks for evaluation, assurance, and deployment,...RiskRelocation packageShift work3 days per week$150k
Description Join Amazon's Frontier AI & Robotics team as a Member of Technical Staff, this Technical Program Manager will become the driving... ...decisions are one‑way or two‑way doors Own program‑level risk management, proactively identifying technical, schedule, and resource...RiskLocal areaDay shift- A technology firm in San Francisco is seeking a Research Engineer to enhance AI model quality. The ideal candidate will build benchmarks, datasets, and evaluation loops to ensure effective performance on critical tasks. This role requires strong programming skills and a...Risk
- ...leading technology company in San Francisco is seeking a Research Engineer for the Frontier Safety Loss of Control team. The role focuses on monitoring and controlling AI to mitigate risks associated with misaligned agents. Candidates should have a Bachelor’s degree in...Risk
$320k
...reliable, interpretable, and steerable AI systems. We want AI to be safe... ...: we don't have open Research Scientist (Emerging Risks) positions on the Frontier Red Team at this time. However, we... ...Scientist will focus on scoping, evaluating, red teaming, and defending...RiskWork at officeRelocationVisa sponsorshipFlexible hours- ...mitigate abuse and strategic risks to ensure a safe online ecosystem... ...goal of developing AI that benefits humanity. The... ...harms and misuse of AI at the frontier in a time of rapid, sustained... ...adapt AI tools for misuse and evaluate how product mechanics, incentives...Risk
$250k
About AfterQuery AfterQuery builds the training data and evaluation infrastructure that frontier AI labs use to make their models better. We work with the world's leading labs to design high signal datasets and run rigorous evaluations that go beyond static benchmarks....$150k - $250k
Research Scientist - Frontier Data — AfterQuery Location: San Francisco, CA (Onsite) Compensation: $150,000 - $250,000 base |... ...AfterQuery builds training data infrastructure and evaluation systems used by frontier AI labs to improve large language models and next-generation...Full timeVisa sponsorship- Fleet AI, Inc. is seeking a Research Scientist to join their core research team in San Francisco. This role focuses on investigating how environments... ...responsibilities include generating benchmarks to evaluate frontier models, automating environment construction for...
- ...the Team Our Cyber team builds AI systems and products that help... ...the safety and reliability of frontier models in security-sensitive... ...engineering, model training, evaluations, safeguards, and deployment to... ...understand cyber use cases, evaluate risk, and turn feedback into...RiskFull time
- ...software that operationalizes responsible AI governance at scale. We're a 4-month-old... ...We're seeking a Principal AI Security & Risk Researcher to join our founding research... ...generative AI and agentic systems Develop risk evaluation methodologies that adapt as threats...RiskPart timeRemote workFlexible hours
- A leading AI research organization is seeking to enhance safety protocols during model... ...candidate will design and implement techniques to evaluate and mitigate unsafe behaviors in AI models... ...advanced architectures and evaluating risks early in the training process, this role...Risk
$142.8k - $274.8k
...are at the forefront of driving AI applications and copilot... ...Job As aPrincipalApplied Scientist at Viva Engage, your role is to... ...data/labels modeling/prompting evaluation experimentation (A/B tests) deployment... ...into product implications, risks, and expected outcomes....RiskOngoing contractWork at officeLocal area- ...than Veracode! Veracode is a global leader in Application Risk Management for the AI era. Powered by trillions of lines of code scans and a... ...Veracode's products Design and implement experiments to evaluate the effectiveness of AI technologies through proof of concepts...RiskWorldwide
- ...AI Behavior Researcher - Child Safety and Mental Health Transluce is a fast-moving... ...lab building the public tech stack for AI evaluation and oversight. We are pioneering research... ...directly with leading AI researchers, frontier AI labs, and prominent child safety and mental...
- ...Clinician Scientist Abridge is looking for Clinician Scientists to drive the development of our AI-powered clinician tools. You will help shape... ...clinical tools across notes, risk adjustment (HCC capture),... ...contexts. Build and refine evaluation tools to streamline medical...RiskHourly payFull timeWork at officeRelocation packageFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Frontier AI Risk Evaluations Scientist. Be the first to apply!


