AI Safety Lead: Red Team & Model Risk

Reflection

A leading AI research firm in San Francisco is seeking an experienced professional to own the red-teaming pipeline for their models, ensuring safety and alignment. The ideal candidate has a graduate degree in Computer Science or a related field, along with a deep understanding of LLM safety. This position offers top-tier compensation, comprehensive health benefits, and opportunities to grow in a dynamic startup environment. Join us to contribute to the frontier of open foundational models.

Vacancy posted 2 days ago

Similar jobs that could be interesting for you
Based on the AI Safety Lead: Red Team & Model Risk in San Francisco, CA vacancy

AI Safety Expert - Red Team
$20 - $22 per hour
...and technical talent with leading AI research labs. Headquartered... .... Position: AI Safety Experts — English & Malayalam... ...Role Responsibilities Red team conversational AI models and agents. Perform jailbreaks... ..., and flag systemic risks. Apply structure using...
Risk
Contract work
Summer work
Remote work
Mercor
San Francisco, CA
9 days ago
Lead Offensive Security Engineer - Red Team & AI Security
...Security Engineer to join our security team in San Francisco. This role involves planning and executing red team operations, as well as... ..., applications, and AI systems. The ideal candidate has... ...translating findings into clear risk narratives.
Risk
Aimling
San Francisco, CA
23 hours ago
AI Safety Expert - Red Team
$20 - $22 per hour
...and technical talent with leading AI research labs. Headquartered... .... Position: AI Safety Experts — English & Telugu... ...Role Responsibilities Red team conversational AI models and agents to identify jailbreaks... ..., and flagging systemic risks. Apply structure by following...
Risk
Hourly pay
Weekly pay
Contract work
For contractors
Summer work
Remote work
Flexible hours
Mercor
San Francisco, CA
9 days ago
AI Safety Red Team Specialist — Remote Contract
Mercor is seeking an AI Safety Expert with fluency in English and Marathi to identify... ...in conversational AI models. As part of a remote team, you'll engage in red teaming efforts, generate high-quality data to mitigate risks, and ensure consistency in testing...
Risk
Remote job
Contract work
Mercor
San Francisco, CA
23 hours ago
Remote AI Safety Red Team Specialist (English & Marathi)
$20 - $22 per hour
Mercor is seeking AI Safety Experts with fluency in English and Marathi to join our team remotely. The role involves identifying vulnerabilities in AI models through red teaming and generating valuable data for risk assessment. Successful candidates will have a strong background...
Risk
Remote job
Hourly pay
Mercor
San Francisco, CA
23 hours ago
Remote AI Safety Red Team Specialist - English & Telugu
Mercor is seeking an AI Safety Expert fluent in English and Telugu, responsible for red teaming AI models to uncover vulnerabilities. This remote position offers flexible hours... ...and the ability to clearly communicate risks. Join us in enhancing AI performance while working...
Risk
Remote job
Hourly pay
Weekly pay
Flexible hours
Mercor
San Francisco, CA
23 hours ago
Remote AI Safety Red Team Expert (English & Punjabi)
$20 - $22 per hour
Mercor is hiring AI Safety Experts fluent in English and Punjabi to connect elite talent with leading AI labs. The role involves red teaming AI models, generating human data, and documenting risks with clear communication to stakeholders. Candidates will need experience...
Risk
Remote job
Hourly pay
Contract work
Mercor
San Francisco, CA
4 days ago
Remote AI Red Team Cyber Risk Assessor
$29 per hour
A leading AI talent connector based in San Francisco is looking for an AI Red Team Specialist. This role involves evaluating AI models for vulnerabilities and generating human data for testing. Candidates should have native-level fluency in English and Brazilian Portuguese...
Risk
Remote job
Hourly pay
Full time
Contract work
Part time
Mercor
San Francisco, CA
23 hours ago
Senior Member of Technical Staff - Model Safety
...with a frontier AI research company... ...weight foundation models with a mission... ...accessible. Their team includes... ...of the world's leading AI labs and technology... ...adversarial research, red teaming, model... ..., misuse risks, and alignment gaps... ...build scalable safety evaluation frameworks...
Risk
Xcede
San Francisco, CA
23 hours ago
Zero-to-One AI Model Eval & Safety Lead
...Research Program Manager to build foundational model evaluations and safety frameworks. This role demands a proven background in technical program management and AI safety. You will engage with research and engineering teams while establishing operational processes that...
B Capital
San Francisco, CA
1 day ago
Technical Program Manager - Adversarial Model Research
$207k - $285k
About the Team The Human Data team at OpenAI... ...and mitigating risks in advanced AI systems by designing... ...researchers to strengthen model reliability and... ...Manager, you will lead initiatives that test the safety and robustness of... ..., experiments, and red-teaming campaigns....
Risk
Work at office
Relocation package
OpenAI
San Francisco, CA
2 days ago
Model Policy Architect: AI Safety & Risk (Hybrid)
$207k - $295k
A leading AI research company in San Francisco is looking for a Senior Policy role focused on model safety. The candidate will design policies to ensure safe... ...in ML policies and risk assessments. The job offers... ...$295K. Join a pioneering team driving AI safety and ethical...
Risk
OpenAI
San Francisco, CA
1 day ago
Technical Program Manager, Safety & Model Evaluation (Hybrid)
$207k - $285k
OpenAI is seeking a Technical Program Manager in San Francisco to lead initiatives that ensure the safety and robustness of its AI models. The role involves collaborating with diverse teams to turn risks into actionable plans. Ideal candidates will have experience in technical...
Risk
OpenAI
San Francisco, CA
4 days ago
Manager, Product Management, Gen AI Model Gateway
$144k - $164k
...Manager, Product Management, Gen AI Model Gateway At Capital One, we’re... ...Generative AI Platform team is at the forefront of Capital... ...Gen AI Foundation Models Ensure risk & regulatory requirements are... ...the ability to influence and lead. Basic Qualifications: Bachelor...
Risk
Full time
Part time
Local area
Capital One National Association
San Francisco, CA
1 day ago
Technical Project Manager, Red Team
$125k - $190k
About the Role FAR.AI is hiring a Technical... ...frontier AI red‑teaming programmes. You will... ...Kellin Pelrine (co‑leading and technical lead... ...frontier labs, AI safety organisations, technical... ...write status updates, risk memos, outreach to... ...a field—the risk models, the landscape of labs...
Risk
Full time
Contract work
For contractors
Remote work
Visa sponsorship
Shift work
Aisafety
Berkeley, CA
4 days ago
Quantitative Researcher - Model Scaling
$144k - $187k
Team Responsibilities The Factors Research Platform and Governance team is responsible... ...that power MSCI's quantitative risk and factor model research. The team develops the systems... ...research infrastructure that integrates AI to accelerate model development, validation...
Risk
Flexible hours
MSCI Inc.
San Francisco, CA
4 days ago
Model Policy
$207k - $295k
...About the Team Our Safety Systems team is at the forefront of... ...driving our commitment to AI safety and fostering a... ...Safety Systems, the Model Policy team aligns model... ...should behave in high-risk or high-ambiguity contexts... ...safeguards. Use red-teaming results, deployment...
Risk
Work at office
Work from home
Relocation package
Shift work
OpenAI
San Francisco, CA
23 hours ago
Senior Staff Engineer - AI Safety & Model Evaluation
Xcede is looking for a Member of Technical Staff focused on AI Safety to lead red-teaming efforts and ensure the robustness of next-generation AI systems. The selected candidate will design scalable safety frameworks, partner with researchers to define production safety...
Xcede
San Francisco, CA
23 hours ago
Applied AI Engineer - Fundamental Equity Tech Investing Team
$111.63k - $132.5k
...investment management, risk management and advisory... ...communication with a diverse team of partners strengthens... ..., and the industry-leading iShares® ETFs. Key Responsibilities... ...: Architect AI systems for investors... ...about. Our hybrid work model BlackRock's hybrid...
Risk
Apprenticeship
Work at office
Local area
Work from home
Flexible hours
1 day per week
BlackRock Services
San Francisco, CA
23 hours ago
AI Red-Teamer Adversarial AI Testing English
...believe the safest AI is the one that’s already... ...a pod of AI Red-Teamers: human data... ...experts who probe AI models with adversarial... ...and generate the red-team data that makes AI... ...and flag systemic risks Apply structure: follow... ...at the frontier of safety Play a direct role...
Risk
Contract work
For contractors
Remote work
Flexible hours
YO IT Consulting
San Francisco, CA
4 days ago
Quantitative Model Validation Analyst
$117.73k - $138.5k
...documents, and oversees usage of complex statistical Treasury Risk and Pre-provision Net Revenue (PPNR) models. Regularly reviews model monitoring reports. The... ...management approaches - Demonstrated independence, team work and leadership skills - Strong project...
Risk
Temporary work
3 days per week
U.S. Bancorp
San Francisco, CA
2 days ago
Red Team Security Engineer
$130k - $205k
...Blackrock and Fidelity, and employs a team of 450 engineers and... ...in Northern California, USA. Red Team Security Engineer Astranis... ...security breach simulations. Lead and participate in purple team... ...findings, articulate business risk, and provide actionable recommendations...
Risk
Permanent employment
Flexible hours
Astranis
San Francisco, CA
22 days ago
Senior Machine Learning Engineer - Model Evaluations, Public Sector New York, NY
$208k - $300k
...Learning Engineer - Model Evaluations, Public Sector... ...to Apply? Join the team shaping the future of AI at Scale. Machine... ...performance, robustness, and safety metrics, including... ...stress tests and red‑teaming workflows to... ...that power the world’s leading models, and help...
Full time
Scale AI, Inc.
San Francisco, CA
3 days ago
Trust & Safety Strategy Lead
$144k - $198k
...for an experienced Trust & Safety (T&S) Strategy Lead to help protect the integrity... ...frameworks. As the team's first strategy hire, you'll... ...of relevant experience in a risk, operations, strategy, or... ...including the latest enterprise AI tools, to help you work...
Risk
Work experience placement
Work at office
Local area
Remote work
Monday to Friday
Flexible hours
3 days per week
Faire Inc
San Francisco, CA
4 days ago
Model Evaluation & Data Quality Lead
...cutting-edge multimodal foundation models that have the ability to... ...Ventures, and prominent AI visionaries and founders such... ...vital member of our ML Data Team - which leads the full spectrum of video-language... ...models. Experience in red teaming, localization testing...
Work at office
Worldwide
Flexible hours
Twelve Labs, Inc
San Francisco, CA
1 day ago
Sr. Red Team Engineer
$132k - $165k
...Chicago, or New York follow a hybrid work model to allow for a more collaborative working... ...sponsorship. Overall Purpose The Senior Red Team Engineer position within the Red Team,... ...Threats. Support the company's commitment to risk management and protecting the integrity...
Risk
Hourly pay
Work at office
Immediate start
Visa sponsorship
Work visa
Flexible hours
Early Warning Services LLC
San Francisco, CA
2 days ago
Technical Business Development (Model Labs)
...Technical Business Development (Model Labs) at fal, you will... ...of integrating AI infrastructure, while ensuring... ..., and the ability to lead complex, cross‑functional... ...commitments, and risk‑buy frameworks. Partner... ...coordination with technical teams to ensure seamless execution...
Risk
Contract work
Temporary work
Gravity Engineering Services Pvt Ltd.
San Francisco, CA
3 days ago
Principal Offensive Security Engineer
$275k - $300k
...Postman is the world's leading API platform, used... .... About the Team The Information... ...pillars: Governance Risk & Compliance (GRC),... ...team is the "red" pulse of this organization... ...validation, AI-augmented adversary... ...RAG pipelines, and model-serving infrastructure...
Risk
Work at office
Flexible hours
3 days per week
Postman
San Francisco, CA
23 hours ago
Global AML Leader - AI-Driven Risk & Compliance (Remote)
$212k - $318k
Stripe is seeking a Global AML Lead to own and evolve its AML function across geographies, ensuring compliance and risk management. This role requires at least 15 years in financial... ...Responsibilities include leading regional teams, driving AML transformation, and...
Risk
Remote job
Stripe
San Francisco, CA
1 day ago
Staff Product Manager (AI Safety)
$164.7k - $339.08k
...Possible. At Pinterest, AI isn't just a feature,... ...Manager for the GenAI Safety team within Trust & Safety,... ...harms before they emerge, red-teaming new AI features... ...creation tools Threat Modeling & Red-Teaming: Lead proactive identification of risks, failure modes, and...
Risk
Work at office
Local area
Remote work
Relocation
Relocation package
Pinterest
San Francisco, CA
23 hours ago
