Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

AI Safety Lead: Red Team & Model Risk

Reflection

A leading AI research firm in San Francisco is seeking an experienced professional to own the red-teaming pipeline for their models, ensuring safety and alignment. The ideal candidate has a graduate degree in Computer Science or a related field, along with a deep understanding of LLM safety. This position offers top-tier compensation, comprehensive health benefits, and opportunities to grow in a dynamic startup environment. Join us to contribute to the frontier of open foundational models. #J-18808-Ljbffr Reflection

Vacancy posted 5 days ago
Similar jobs that could be interesting for youBased on the AI Safety Lead: Red Team & Model Risk in San Francisco, CA vacancy
  • $320k

     ...interpretable, and steerable AI systems. We want AI...  ...as a whole. Our team is a quickly...  ...Team The Frontier Red Team (FRT) is a small...  ...and ensuring safety with self‑improving...  ...with the Emerging Risks workstream to understand...  ...build and evaluate model organisms of autonomous... 
    Risk
    Relocation
    Visa sponsorship

    Anthropic

    San Francisco, CA
    5 days ago
  • About the Team The Human Data team at OpenAI...  ...and mitigating risks in advanced AI systems by designing...  ...researchers to strengthen model reliability and...  ...Manager, you will lead initiatives that test the safety and robustness of OpenAI...  ..., experiments, and red‑teaming campaigns.... 
    Risk
    Work at office
    Relocation package

    OpenAI

    San Francisco, CA
    3 days ago
  •  ...developing open weight models for individuals, agents...  ...even nation states. Our team of AI researchers and company...  ...the Role Own the red-teaming and adversarial...  ...Alignment team to translate safety findings into concrete...  ...release meets the lab’s risk thresholds before it... 
    Risk
    Relocation package

    Reflection

    San Francisco, CA
    5 days ago
  • $207k - $295k

    A leading AI research company in San Francisco is looking for a Senior Policy role focused on model safety. The candidate will design policies to ensure safe...  ...in ML policies and risk assessments. The job offers...  ...$295K. Join a pioneering team driving AI safety and ethical... 
    Risk

    OpenAI

    San Francisco, CA
    4 days ago
  • $144k - $164k

     ...Manager, Product Management, Gen AI Model Gateway At Capital One, we’re...  ...Generative AI Platform team is at the forefront of Capital...  ...Gen AI Foundation Models Ensure risk & regulatory requirements are...  ...the ability to influence and lead. Basic Qualifications: Bachelor... 
    Risk
    Full time
    Part time
    Local area

    Capital One National Association

    San Francisco, CA
    4 days ago
  • $125k - $190k

    About the Role FAR.AI is hiring a Technical...  ...frontier AI red‑teaming programmes. You will...  ...Kellin Pelrine (co‑leading and technical lead...  ...frontier labs, AI safety organisations, technical...  ...write status updates, risk memos, outreach to...  ...a field—the risk models, the landscape of labs... 
    Risk
    Full time
    Contract work
    For contractors
    Remote work
    Visa sponsorship
    Shift work

    Aisafety

    Berkeley, CA
    2 days ago
  • $144k - $187k

    Team Responsibilities The Factors Research Platform and Governance team is responsible...  ...that power MSCI's quantitative risk and factor model research. The team develops the systems...  ...research infrastructure that integrates AI to accelerate model development, validation... 
    Risk
    Flexible hours

    MSCI Inc.

    San Francisco, CA
    2 days ago
  • $207k - $295k

     ...About the Team Our Safety Systems team is at the forefront of...  ...driving our commitment to AI safety and fostering a...  ...Safety Systems, the Model Policy team aligns model...  ...should behave in high-risk or high-ambiguity contexts...  ...safeguards. Use red-teaming results, deployment... 
    Risk
    Work at office
    Work from home
    Relocation package
    Shift work

    OpenAI

    San Francisco, CA
    3 days ago
  •  ...believe the safest AI is the one that’s already...  ...a pod of AI Red-Teamers: human data...  ...experts who probe AI models with adversarial...  ...and generate the red-team data that makes AI...  ...and flag systemic risks Apply structure:...  ...at the frontier of safety Play a direct role... 
    Risk
    Contract work
    For contractors
    Remote work
    Flexible hours

    YO IT CONSULTING

    San Francisco, CA
    4 days ago
  • $130k - $205k

     ...Blackrock and Fidelity, and employs a team of 450 engineers and...  ...in Northern California, USA. Red Team Security Engineer Astranis...  ...security breach simulations. Lead and participate in purple team...  ...findings, articulate business risk, and provide actionable recommendations... 
    Risk
    Permanent employment
    Flexible hours

    Astranis

    San Francisco, CA
    5 days ago
  • $160k - $175k

     ...AI Governance Team, Responsible AI/ML Data Scientist...  ...solutions and business model Develop...  ...evaluation, testing, and risk mitigation that...  ...privacy preservation, safety protocols, and...  ...Consulting experience at a leading management or...  ...AI tooling, red teaming platforms,... 
    Risk
    Work at office
    Local area
    Remote work
    2 days per week

    Accordion USA

    San Francisco, CA
    1 day ago
  • $240.45k - $300.3k

     ...Learning Engineer - Model Evaluations, Public Sector...  ...The Public Sector ML team at Scale deploys advanced AI systems—including...  ...performance, robustness, and safety metrics, including...  ...stress tests and red-teaming workflows to...  ...that power the world's leading models, and help... 
    Full time

    Scale AI

    San Francisco, CA
    1 day ago
  •  ...developing open weight models for individuals,...  ...nation states. Our team of AI researchers and...  ...building model evals and safety from the ground up,...  ...capabilities, risks, and readiness for...  ...research and engineering leads across pre-training...  ...methodologies, red‑teaming, alignment... 
    Risk
    Relocation package

    AI Chopping Block, Inc.

    San Francisco, CA
    5 days ago
  •  ...cutting-edge multimodal foundation models that have the ability to...  ...Ventures, and prominent AI visionaries and founders such...  ...vital member of our ML Data Team - which leads the full spectrum of video-language...  ...models. Experience in red teaming, localization testing... 
    Work at office
    Worldwide
    Flexible hours

    Twelve Labs, Inc

    San Francisco, CA
    4 days ago
  • $144k - $198k

     ...for an experienced Trust & Safety (T&S) Strategy Lead to help protect the integrity...  ...frameworks. As the team's first strategy hire, you'll...  ...of relevant experience in a risk, operations, strategy, or...  ...including the latest enterprise AI tools, to help you work... 
    Risk
    Work experience placement
    Work at office
    Local area
    Remote work
    Monday to Friday
    Flexible hours
    3 days per week

    Faire Inc

    San Francisco, CA
    2 days ago
  • $132k - $165k

     ...Chicago, or New York follow a hybrid work model to foster collaboration. You must be...  ...sponsorship is not available. Overview The Senior Red Team Engineer is responsible for identifying...  .... Support the company’s commitment to risk management and protecting the integrity... 
    Risk
    Work at office
    Flexible hours

    Early Warning

    San Francisco, CA
    4 days ago
  • $86.5k - $166k

     ..., detect, contain, and remediate cyber threats. Those in the Red Team at PwC will focus on simulating realistic adversary activity through...  ...Security Management System (ISMS), Information Security Risk Assessments, Intellectual Curiosity, Intrusion Detection System... 
    Risk
    H1b
    Visa sponsorship
    Work visa
    Flexible hours

    PwC IT Services Co.

    San Francisco, CA
    1 day ago
  • $250.6k - $384.6k

    A leading automotive company in San Francisco is seeking an AI Safety Principal Engineer. This role involves leading AI safety strategies for autonomous vehicles, ensuring...  ...with industry standards, and mentoring a team of engineers. Ideal candidates should have over 1... 

    General Motors

    San Francisco, CA
    4 days ago
  • $132k - $165k

     ...Chicago, or New York follow a hybrid work model to allow for a more collaborative working...  ...sponsorship. Overall Purpose The Senior Red Team Engineer position within the Red Team,...  ...Threats. Support the company's commitment to risk management and protecting the integrity... 
    Risk
    Hourly pay
    Work at office
    Immediate start
    Visa sponsorship
    Work visa
    Flexible hours

    Early Warning Services LLC

    San Francisco, CA
    5 days ago
  • $280k - $308k

     ...practice. This leader will structure, lead, support, drive, and grow West...  ...lead on IT Operating Model and Outsourcing Advisory Services...  ...Client Delivery: Support and lead teams serving clients across...  ...plans, pricing estimates, and risk assessments for prospects. Attend... 
    Risk
    Full time
    Contract work
    Work at office
    Local area
    Immediate start
    Flexible hours

    West Monroe

    San Francisco, CA
    19 hours ago
  • $220k - $270k

     ...powering the next generation of AI products. We build the infrastructure, tools, and model access that teams need to move from idea to...  ...understanding, and the ability to lead complex, cross‑functional...  ...and capacity commitments, and risk‑buy frameworks. Partner Relationship... 
    Risk
    Contract work
    Temporary work
    Currently hiring
    Relocation
    Visa sponsorship

    fal

    San Francisco, CA
    1 day ago
  • $117.73k - $138.5k

     ...usage of complex statistical Treasury Risk and Pre-provision Net Revenue (PPNR) models. Regularly reviews model...  ...approaches - Demonstrated independence, team work and leadership skills - Strong...  ...such as those related to ethics, safety, or operational procedures. Applicants... 
    Risk
    Full time
    Temporary work
    Local area
    3 days per week

    U.S. Bank

    San Francisco, CA
    4 days ago
  • $212k - $318k

    Stripe is seeking a Global AML Lead to own and evolve its AML function across geographies, ensuring compliance and risk management. This role requires at least 15 years in financial...  ...Responsibilities include leading regional teams, driving AML transformation, and... 
    Risk
    Remote job

    Stripe

    San Francisco, CA
    4 days ago
  •  ...Member of Technical Staff, Model Efficiency Who are we? Our mission is to scale intelligence...  ...and enterprises who are building AI systems to power magical experiences like...  ...what’s best for our customers. Cohere is a team of researchers, engineers, designers, and... 
    Full time
    Work at office
    Remote work
    Flexible hours

    Cohere

    San Francisco, CA
    4 days ago
  •  ...Role: As a Research Engineer - Language Model Pre-Training , you'll shape our language...  ...work extremely closely with our pretraining team, who will integrate your insights into our...  ...all enjoy what we do and love discussing AI Benefits and Perks: Comprehensive... 
    Work at office
    Relocation package

    Zyphra

    San Francisco, CA
    25 days ago
  •  ...intelligence company based in San Francisco, California. The Role: As a Research Engineer - Model Architectures , you will be a core contributor to Zyphra’s AI Architecture Research Team. This will involve designing and rigorously testing novel model architectures and... 
    Work at office
    Relocation package

    Zyphra

    San Francisco, CA
    25 days ago
  • $192k - $272k

     ...talent-dense, high-agency AI safety team at Lila that will...  ...organization (science, model training, lab integration...  ...integration, etc) to prepare for risks from scientific...  ...in program management, leading cross-functional teams...  ...concepts — alignment, red-teaming, evaluations, responsible... 
    Risk
    Full time
    Work at office
    Local area
    Flexible hours

    Lila Sciences

    San Francisco, CA
    1 day ago
  • $140k - $160k

     ...Senior Software Engineer — Development Team Location: Remote - Bay Area (Occasional Office Visits to Carmel...  ...technical debt, scalability bottlenecks, and reliability risks before they become problems Leverage AI and LLM tooling as a force multiplier — you treat... 
    Risk
    Full time
    Live in
    Work at office
    Remote work

    GrabJobs

    San Francisco, CA
    5 days ago
  • $189k - $280k

     ...technology company is seeking a Senior Operator to enhance Ads Trust & Safety Operations. You will manage critical workflows, partner with cross-functional teams, and utilize data to identify risks. The ideal candidate has over 5 years in Trust & Safety and strong operational... 
    Risk

    OpenAI

    San Francisco, CA
    3 days ago
  •  ...for a Senior Manager of Issuing Risk Operations to lead and mentor the Risk Operations team. Responsibilities include overseeing...  ...strategies, implementing AI-driven processes, and managing stakeholder...  ...position offers a hybrid work model and a chance to influence global... 
    Risk

    Airwallex

    San Francisco, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Safety Lead: Red Team & Model Risk. Be the first to apply!