Research Scientist, Frontier Risk Evaluations
San Francisco, CA

Scale AI

Scale Labs: Research Scientist, Frontier Risk Evaluations

As the leading data and evaluation partner for frontier AI companies, Scale plays an integral role in understanding the capabilities of AI models and systems and in safeguarding them. Building on this expertise, Scale Labs has launched a new team focused on policy research to bridge the gap between AI research and global policymakers, enabling informed, scientific decisions about AI risks and capabilities. Our research tackles the hardest problems in agent robustness, AI control protocols, and AI risk evaluations to help governments, industry, and the public understand and mitigate AI risk while maximizing AI adoption. The team collaborates broadly across industry, the public sector, and academia and regularly publishes its findings. We are actively seeking talented researchers to join us in shaping this vision.

Job Responsibilities

- Design and build harnesses to test AI models and systems (including agents) for dangerous capabilities such as security vulnerability exploitation, CBRN uplift, and other high-risk activities.
- Work with government agencies or other labs to collectively scope and design evaluations to measure and mitigate risks posed by advanced AI systems.
- Publish evaluation methodologies and write technical reports for policymakers.

Qualifications

- Commitment to our mission of promoting safe, secure, and trustworthy AI deployments as frontier AI capabilities continue to advance.
- Practical experience conducting technical research collaboratively, building and instrumenting ML pipelines, writing evaluation harnesses, and turning research literature into working prototypes.
- A track record of published research in machine learning, particularly in generative AI.
- At least three years of experience addressing sophisticated ML problems, whether in a research setting or in product development.
- Strong written and verbal communication skills for working in a cross-functional team.

Nice to Have

- Experience crafting evaluations and benchmarks, or a background in data science roles related to LLM technologies.
- Experience with red-teaming or adversarial testing of AI systems.
- Familiarity with AI safety policy frameworks (e.g., NIST AI RMF, EU AI Act, Korea AI Basic Act).

Equal Employment Opportunity

We believe that everyone should be able to bring their whole selves to work, and we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity, Veteran status, or any other protected characteristic.

We are committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. If you need assistance and/or a reasonable accommodation in the application or recruiting process due to a disability, please contact us at View email address on click.appcast.io.

We comply with the United States Department of Labor's Pay Transparency provision.

We collect, retain and use personal data responsibly for our professional business purposes, including notifying you of job opportunities that may be of interest and sharing with our affiliates. Please see our privacy policy for additional information.


Vacancy posted 12 hours ago

