Research Scientist, Frontier Risk Evaluations
San Francisco, CA

Scale AI

Scale Labs, Research Scientist, Frontier Risk Evaluations

As the leading data and evaluation partner for frontier AI companies, Scale plays an integral role in understanding the capabilities of AI models and systems and in safeguarding them. Building on this expertise, Scale Labs has launched a new team focused on policy research to bridge the gap between AI research and global policymakers, enabling informed, scientific decisions about AI risks and capabilities. Our research tackles the hardest problems in agent robustness, AI control protocols, and AI risk evaluations to help governments, industry, and the public understand and mitigate AI risk while maximizing AI adoption. The team collaborates broadly across industry, the public sector, and academia and regularly publishes its findings. We are actively seeking talented researchers to join us in shaping this vision.

Job Responsibilities
  • Design and build harnesses to test AI models and systems (including agents) for dangerous capabilities such as security vulnerability exploitation, CBRN uplift, and other high-risk activities.
  • Work with government agencies or other labs to collectively scope and design evaluations to measure and mitigate risks posed by advanced AI systems.
  • Publish evaluation methodologies and write technical reports for policymakers.

Qualifications
  • Commitment to our mission of promoting safe, secure, and trustworthy AI deployments as frontier AI capabilities continue to advance.
  • Practical experience conducting technical research collaboratively, building and instrumenting ML pipelines, writing evaluation harnesses, and turning research literature into working prototypes.
  • A track record of published research in machine learning, particularly in generative AI.
  • At least three years of experience addressing sophisticated ML problems, whether in a research setting or in product development.
  • Strong written and verbal communication skills to operate in a cross-functional team.

Nice to Have
  • Experience crafting evaluations and benchmarks, or a background in data science roles related to LLM technologies.
  • Experience with red-teaming or adversarial testing of AI systems.
  • Familiarity with AI safety policy frameworks (e.g., NIST AI RMF, EU AI Act, Korea AI Basic Act).

Equal Employment Opportunity

We believe that everyone should be able to bring their whole selves to work, and we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity, Veteran status, or any other protected characteristic.

We are committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. If you need assistance and/or a reasonable accommodation in the application or recruiting process due to a disability, please contact us.

We comply with the United States Department of Labor's Pay Transparency provision.

We collect, retain, and use personal data responsibly for our professional business purposes, including notifying you of job opportunities that may be of interest and sharing with our affiliates. Please see our privacy policy for additional information.

Vacancy posted 12 hours ago
