Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

AI Safety Lead: Red Team & Model Risk

Reflection

A leading AI research firm in San Francisco is seeking an experienced professional to own the red-teaming pipeline for their models, ensuring safety and alignment. The ideal candidate has a graduate degree in Computer Science or a related field, along with a deep understanding of LLM safety. This position offers top-tier compensation, comprehensive health benefits, and opportunities to grow in a dynamic startup environment. Join us to contribute to the frontier of open foundational models. #J-18808-Ljbffr Reflection

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the AI Safety Lead: Red Team & Model Risk in San Francisco, CA vacancy
  • $20 - $22 per hour

     ...and technical talent with leading AI research labs. Headquartered...  .... Position: AI Safety Experts — English & Malayalam...  ...Role Responsibilities Red team conversational AI models and agents. Perform jailbreaks...  ..., and flag systemic risks. Apply structure using... 
    Risk
    Contract work
    Summer work
    Remote work

    Mercor

    San Francisco, CA
    9 days ago
  •  ...Security Engineer to join our security team in San Francisco. This role involves planning and executing red team operations, as well as...  ..., applications, and AI systems. The ideal candidate has...  ...translating findings into clear risk narratives. #J-18808-Ljbffr Aimling
    Risk

    Aimling

    San Francisco, CA
    23 hours ago
  • $20 - $22 per hour

     ...and technical talent with leading AI research labs. Headquartered...  .... Position: AI Safety Experts — English & Telugu...  ...Role Responsibilities Red team conversational AI models and agents to identify jailbreaks...  ..., and flagging systemic risks. Apply structure by following... 
    Risk
    Hourly pay
    Weekly pay
    Contract work
    For contractors
    Summer work
    Remote work
    Flexible hours

    Mercor

    San Francisco, CA
    9 days ago
  • Mercor is seeking an AI Safety Expert with fluency in English and Marathi to identify...  ...in conversational AI models. As part of a remote team, you'll engage in red teaming efforts, generate high-quality data to mitigate risks, and ensure consistency in testing... 
    Risk
    Remote job
    Contract work

    Mercor

    San Francisco, CA
    23 hours ago
  • $20 - $22 per hour

    Mercor is seeking AI Safety Experts with fluency in English and Marathi to join our team remotely. The role involves identifying vulnerabilities in AI models through red teaming and generating valuable data for risk assessment. Successful candidates will have a strong background... 
    Risk
    Remote job
    Hourly pay

    Mercor

    San Francisco, CA
    23 hours ago
  • Mercor is seeking an AI Safety Expert fluent in English and Telugu, responsible for red teaming AI models to uncover vulnerabilities. This remote position offers flexible hours...  ...and the ability to clearly communicate risks. Join us in enhancing AI performance while working... 
    Risk
    Remote job
    Hourly pay
    Weekly pay
    Flexible hours

    Mercor

    San Francisco, CA
    23 hours ago
  • $20 - $22 per hour

    Mercor is hiring AI Safety Experts fluent in English and Punjabi to connect elite talent with leading AI labs. The role involves red teaming AI models, generating human data, and documenting risks with clear communication to stakeholders. Candidates will need experience... 
    Risk
    Remote job
    Hourly pay
    Contract work

    Mercor

    San Francisco, CA
    4 days ago
  • $29 per hour

    A leading AI talent connector based in San Francisco is looking for an AI Red Team Specialist. This role involves evaluating AI models for vulnerabilities and generating human data for testing. Candidates should have native-level fluency in English and Brazilian Portuguese... 
    Risk
    Remote job
    Hourly pay
    Full time
    Contract work
    Part time

    Mercor

    San Francisco, CA
    23 hours ago
  •  ...with a frontier AI research company...  ...weight foundation models with a mission...  ...accessible. Their team includes...  ...of the world's leading AI labs and technology...  ...adversarial research, red teaming, model...  ..., misuse risks, and alignment gaps...  ...build scalable safety evaluation frameworks... 
    Risk

    Xcede

    San Francisco, CA
    23 hours ago
  •  ...Research Program Manager to build foundational model evaluations and safety frameworks. This role demands a proven background in technical program management and AI safety. You will engage with research and engineering teams while establishing operational processes that... 

    B Capital

    San Francisco, CA
    1 day ago
  • $207k - $285k

    About the Team The Human Data team at OpenAI...  ...and mitigating risks in advanced AI systems by designing...  ...researchers to strengthen model reliability and...  ...Manager, you will lead initiatives that test the safety and robustness of...  ..., experiments, and red-teaming campaigns.... 
    Risk
    Work at office
    Relocation package

    OpenAI

    San Francisco, CA
    2 days ago
  • $207k - $295k

    A leading AI research company in San Francisco is looking for a Senior Policy role focused on model safety. The candidate will design policies to ensure safe...  ...in ML policies and risk assessments. The job offers...  ...$295K. Join a pioneering team driving AI safety and ethical... 
    Risk

    OpenAI

    San Francisco, CA
    1 day ago
  • $207k - $285k

    OpenAI is seeking a Technical Program Manager in San Francisco to lead initiatives that ensure the safety and robustness of its AI models. The role involves collaborating with diverse teams to turn risks into actionable plans. Ideal candidates will have experience in technical... 
    Risk

    OpenAI

    San Francisco, CA
    4 days ago
  • $144k - $164k

     ...Manager, Product Management, Gen AI Model Gateway At Capital One, we’re...  ...Generative AI Platform team is at the forefront of Capital...  ...Gen AI Foundation Models Ensure risk & regulatory requirements are...  ...the ability to influence and lead. Basic Qualifications: Bachelor... 
    Risk
    Full time
    Part time
    Local area

    Capital One National Association

    San Francisco, CA
    1 day ago
  • $125k - $190k

    About the Role FAR.AI is hiring a Technical...  ...frontier AI red‑teaming programmes. You will...  ...Kellin Pelrine (co‑leading and technical lead...  ...frontier labs, AI safety organisations, technical...  ...write status updates, risk memos, outreach to...  ...a field—the risk models, the landscape of labs... 
    Risk
    Full time
    Contract work
    For contractors
    Remote work
    Visa sponsorship
    Shift work

    Aisafety

    Berkeley, CA
    4 days ago
  • $144k - $187k

    Team Responsibilities The Factors Research Platform and Governance team is responsible...  ...that power MSCI's quantitative risk and factor model research. The team develops the systems...  ...research infrastructure that integrates AI to accelerate model development, validation... 
    Risk
    Flexible hours

    MSCI Inc.

    San Francisco, CA
    4 days ago
  • $207k - $295k

     ...About the Team Our Safety Systems team is at the forefront of...  ...driving our commitment to AI safety and fostering a...  ...Safety Systems, the Model Policy team aligns model...  ...should behave in high-risk or high-ambiguity contexts...  ...safeguards. Use red-teaming results, deployment... 
    Risk
    Work at office
    Work from home
    Relocation package
    Shift work

    OpenAI

    San Francisco, CA
    23 hours ago
  • Xcede is looking for a Member of Technical Staff focused on AI Safety to lead red-teaming efforts and ensure the robustness of next-generation AI systems. The selected candidate will design scalable safety frameworks, partner with researchers to define production safety... 

    Xcede

    San Francisco, CA
    23 hours ago
  • $111.63k - $132.5k

     ...investment management, risk management and advisory...  ...communication with a diverse team of partners strengthens...  ..., and the industry-leading iShares® ETFs. Key Responsibilities...  ...: Architect AI systems for investors...  ...about. Our hybrid work model BlackRock's hybrid... 
    Risk
    Apprenticeship
    Work at office
    Local area
    Work from home
    Flexible hours
    1 day per week

    BlackRock Services

    San Francisco, CA
    23 hours ago
  •  ...believe the safest AI is the one that’s already...  ...a pod of AI Red-Teamers: human data...  ...experts who probe AI models with adversarial...  ...and generate the red-team data that makes AI...  ...and flag systemic risks Apply structure: follow...  ...at the frontier of safety Play a direct role... 
    Risk
    Contract work
    For contractors
    Remote work
    Flexible hours

    YO IT Consulting

    San Francisco, CA
    4 days ago
  • $117.73k - $138.5k

     ...documents, and oversees usage of complex statistical Treasury Risk and Pre-provision Net Revenue (PPNR) models. Regularly reviews model monitoring reports. The...  ...management approaches - Demonstrated independence, team work and leadership skills - Strong project... 
    Risk
    Temporary work
    3 days per week

    U.S. Bancorp

    San Francisco, CA
    2 days ago
  • $130k - $205k

     ...Blackrock and Fidelity, and employs a team of 450 engineers and...  ...in Northern California, USA. Red Team Security Engineer Astranis...  ...security breach simulations. Lead and participate in purple team...  ...findings, articulate business risk, and provide actionable recommendations... 
    Risk
    Permanent employment
    Flexible hours

    Astranis

    San Francisco, CA
    22 days ago
  • $208k - $300k

     ...Learning Engineer - Model Evaluations, Public Sector...  ...to Apply? Join the team shaping the future of AI at Scale. Machine...  ...performance, robustness, and safety metrics, including...  ...stress tests and red‑teaming workflows to...  ...that power the world’s leading models, and help... 
    Full time

    Scale AI, Inc.

    San Francisco, CA
    3 days ago
  • $144k - $198k

     ...for an experienced Trust & Safety (T&S) Strategy Lead to help protect the integrity...  ...frameworks. As the team's first strategy hire, you'll...  ...of relevant experience in a risk, operations, strategy, or...  ...including the latest enterprise AI tools, to help you work... 
    Risk
    Work experience placement
    Work at office
    Local area
    Remote work
    Monday to Friday
    Flexible hours
    3 days per week

    Faire Inc

    San Francisco, CA
    4 days ago
  •  ...cutting-edge multimodal foundation models that have the ability to...  ...Ventures, and prominent AI visionaries and founders such...  ...vital member of our ML Data Team - which leads the full spectrum of video-language...  ...models. Experience in red teaming, localization testing... 
    Work at office
    Worldwide
    Flexible hours

    Twelve Labs, Inc

    San Francisco, CA
    1 day ago
  • $132k - $165k

     ...Chicago, or New York follow a hybrid work model to allow for a more collaborative working...  ...sponsorship. Overall Purpose The Senior Red Team Engineer position within the Red Team,...  ...Threats. Support the company's commitment to risk management and protecting the integrity... 
    Risk
    Hourly pay
    Work at office
    Immediate start
    Visa sponsorship
    Work visa
    Flexible hours

    Early Warning Services LLC

    San Francisco, CA
    2 days ago
  •  ...Technical Business Development (Model Labs) at fal, you will...  ...of integrating AI infrastructure, while ensuring...  ..., and the ability to lead complex, cross‑functional...  ...commitments, and risk‑buy frameworks. Partner...  ...coordination with technical teams to ensure seamless execution... 
    Risk
    Contract work
    Temporary work

    Gravity Engineering Services Pvt Ltd.

    San Francisco, CA
    3 days ago
  • $275k - $300k

     ...Postman is the world's leading API platform, used...  .... About the Team The Information...  ...pillars: Governance Risk & Compliance (GRC),...  ...team is the "red" pulse of this organization...  ...validation, AI-augmented adversary...  ...RAG pipelines, and model-serving infrastructure... 
    Risk
    Work at office
    Flexible hours
    3 days per week

    Postman

    San Francisco, CA
    23 hours ago
  • $212k - $318k

    Stripe is seeking a Global AML Lead to own and evolve its AML function across geographies, ensuring compliance and risk management. This role requires at least 15 years in financial...  ...Responsibilities include leading regional teams, driving AML transformation, and... 
    Risk
    Remote job

    Stripe

    San Francisco, CA
    1 day ago
  • $164.7k - $339.08k

     ...Possible. At Pinterest, AI isn't just a feature,...  ...Manager for the GenAI Safety team within Trust & Safety,...  ...harms before they emerge, red-teaming new AI features...  ...creation tools Threat Modeling & Red-Teaming: Lead proactive identification of risks, failure modes, and... 
    Risk
    Work at office
    Local area
    Remote work
    Relocation
    Relocation package

    Pinterest

    San Francisco, CA
    23 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Safety Lead: Red Team & Model Risk. Be the first to apply!