Research Scientist, Frontier Risk Evaluations
San Francisco, CA
Scale AI
Scale Labs: Research Scientist, Frontier Risk Evaluations

As the leading data and evaluation partner for frontier AI companies, Scale plays an integral role in understanding the capabilities of AI models and systems and in safeguarding them. Building on this expertise, Scale Labs has launched a new team focused on policy research to bridge the gap between AI research and global policymakers, enabling informed, scientific decisions about AI risks and capabilities. Our research tackles the hardest problems in agent robustness, AI control protocols, and AI risk evaluations to help governments, industry, and the public understand and mitigate AI risk while maximizing AI adoption. This team collaborates broadly across industry, the public sector, and academia and regularly publishes its findings. We are actively seeking talented researchers to join us in shaping this vision.

Job Responsibilities
- Design and build harnesses to test AI models and systems (including agents) for dangerous capabilities such as security vulnerability exploitation, CBRN uplift, and other high-risk activities.
- Work with government agencies or other labs to collectively scope and design evaluations to measure and mitigate risks posed by advanced AI systems.
- Publish evaluation methodologies and write technical reports for policymakers.

Qualifications
- Commitment to our mission of promoting safe, secure, and trustworthy AI deployments as frontier AI capabilities continue to advance.
- Practical experience conducting technical research collaboratively, building and instrumenting ML pipelines, writing evaluation harnesses, and turning research literature into working prototypes.
- A track record of published research in machine learning, particularly in generative AI.
- At least three years of experience addressing sophisticated ML problems, whether in a research setting or in product development.
- Strong written and verbal communication skills to operate in a cross-functional team.
Nice to Have
- Experience crafting evaluations and benchmarks, or a background in data science roles related to LLM technologies.
- Experience with red-teaming or adversarial testing of AI systems.
- Familiarity with AI safety policy frameworks (e.g., NIST AI RMF, EU AI Act, Korea AI Basic Act).

Equal Employment Opportunity

We believe that everyone should be able to bring their whole selves to work, and we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity, Veteran status, or any other protected characteristic.

We are committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. If you need assistance or a reasonable accommodation in the application or recruiting process due to a disability, please contact us at the email address shown on click.appcast.io.

We comply with the United States Department of Labor's Pay Transparency provision.

We collect, retain, and use personal data responsibly for our professional business purposes, including notifying you of job opportunities that may be of interest and sharing with our affiliates. Please see our privacy policy for additional information.

