AI Safety Lead: Red Team & Model Risk
Reflection
A leading AI research firm in San Francisco is seeking an experienced professional to own the red-teaming pipeline for their models, ensuring safety and alignment. The ideal candidate has a graduate degree in Computer Science or a related field, along with a deep understanding of LLM safety. This position offers top-tier compensation, comprehensive health benefits, and opportunities to grow in a dynamic startup environment. Join us to contribute to the frontier of open foundational models. #J-18808-Ljbffr Reflection
$20 - $22 per hour
...and technical talent with leading AI research labs. Headquartered... .... Position: AI Safety Experts — English & Malayalam... ...Role Responsibilities Red team conversational AI models and agents. Perform jailbreaks... ..., and flag systemic risks. Apply structure using...RiskContract workSummer workRemote work- ...Security Engineer to join our security team in San Francisco. This role involves planning and executing red team operations, as well as... ..., applications, and AI systems. The ideal candidate has... ...translating findings into clear risk narratives. #J-18808-Ljbffr AimlingRisk
$20 - $22 per hour
...and technical talent with leading AI research labs. Headquartered... .... Position: AI Safety Experts — English & Telugu... ...Role Responsibilities Red team conversational AI models and agents to identify jailbreaks... ..., and flagging systemic risks. Apply structure by following...RiskHourly payWeekly payContract workFor contractorsSummer workRemote workFlexible hours- Mercor is seeking an AI Safety Expert with fluency in English and Marathi to identify... ...in conversational AI models. As part of a remote team, you'll engage in red teaming efforts, generate high-quality data to mitigate risks, and ensure consistency in testing...RiskRemote jobContract work
$20 - $22 per hour
Mercor is seeking AI Safety Experts with fluency in English and Marathi to join our team remotely. The role involves identifying vulnerabilities in AI models through red teaming and generating valuable data for risk assessment. Successful candidates will have a strong background...RiskRemote jobHourly pay- Mercor is seeking an AI Safety Expert fluent in English and Telugu, responsible for red teaming AI models to uncover vulnerabilities. This remote position offers flexible hours... ...and the ability to clearly communicate risks. Join us in enhancing AI performance while working...RiskRemote jobHourly payWeekly payFlexible hours
$20 - $22 per hour
Mercor is hiring AI Safety Experts fluent in English and Punjabi to connect elite talent with leading AI labs. The role involves red teaming AI models, generating human data, and documenting risks with clear communication to stakeholders. Candidates will need experience...RiskRemote jobHourly payContract work$29 per hour
A leading AI talent connector based in San Francisco is looking for an AI Red Team Specialist. This role involves evaluating AI models for vulnerabilities and generating human data for testing. Candidates should have native-level fluency in English and Brazilian Portuguese...RiskRemote jobHourly payFull timeContract workPart time- ...with a frontier AI research company... ...weight foundation models with a mission... ...accessible. Their team includes... ...of the world's leading AI labs and technology... ...adversarial research, red teaming, model... ..., misuse risks, and alignment gaps... ...build scalable safety evaluation frameworks...Risk
- ...Research Program Manager to build foundational model evaluations and safety frameworks. This role demands a proven background in technical program management and AI safety. You will engage with research and engineering teams while establishing operational processes that...
$207k - $285k
About the Team The Human Data team at OpenAI... ...and mitigating risks in advanced AI systems by designing... ...researchers to strengthen model reliability and... ...Manager, you will lead initiatives that test the safety and robustness of... ..., experiments, and red-teaming campaigns....RiskWork at officeRelocation package$207k - $295k
A leading AI research company in San Francisco is looking for a Senior Policy role focused on model safety. The candidate will design policies to ensure safe... ...in ML policies and risk assessments. The job offers... ...$295K. Join a pioneering team driving AI safety and ethical...Risk$207k - $285k
OpenAI is seeking a Technical Program Manager in San Francisco to lead initiatives that ensure the safety and robustness of its AI models. The role involves collaborating with diverse teams to turn risks into actionable plans. Ideal candidates will have experience in technical...Risk$144k - $164k
...Manager, Product Management, Gen AI Model Gateway At Capital One, we’re... ...Generative AI Platform team is at the forefront of Capital... ...Gen AI Foundation Models Ensure risk & regulatory requirements are... ...the ability to influence and lead. Basic Qualifications: Bachelor...RiskFull timePart timeLocal area$125k - $190k
About the Role FAR.AI is hiring a Technical... ...frontier AI red‑teaming programmes. You will... ...Kellin Pelrine (co‑leading and technical lead... ...frontier labs, AI safety organisations, technical... ...write status updates, risk memos, outreach to... ...a field—the risk models, the landscape of labs...RiskFull timeContract workFor contractorsRemote workVisa sponsorshipShift work$144k - $187k
Team Responsibilities The Factors Research Platform and Governance team is responsible... ...that power MSCI's quantitative risk and factor model research. The team develops the systems... ...research infrastructure that integrates AI to accelerate model development, validation...RiskFlexible hours$207k - $295k
...About the Team Our Safety Systems team is at the forefront of... ...driving our commitment to AI safety and fostering a... ...Safety Systems, the Model Policy team aligns model... ...should behave in high-risk or high-ambiguity contexts... ...safeguards. Use red-teaming results, deployment...RiskWork at officeWork from homeRelocation packageShift work- Xcede is looking for a Member of Technical Staff focused on AI Safety to lead red-teaming efforts and ensure the robustness of next-generation AI systems. The selected candidate will design scalable safety frameworks, partner with researchers to define production safety...
$111.63k - $132.5k
...investment management, risk management and advisory... ...communication with a diverse team of partners strengthens... ..., and the industry-leading iShares® ETFs. Key Responsibilities... ...: Architect AI systems for investors... ...about. Our hybrid work model BlackRock's hybrid...RiskApprenticeshipWork at officeLocal areaWork from homeFlexible hours1 day per week- ...believe the safest AI is the one that’s already... ...a pod of AI Red-Teamers: human data... ...experts who probe AI models with adversarial... ...and generate the red-team data that makes AI... ...and flag systemic risks Apply structure: follow... ...at the frontier of safety Play a direct role...RiskContract workFor contractorsRemote workFlexible hours
$117.73k - $138.5k
...documents, and oversees usage of complex statistical Treasury Risk and Pre-provision Net Revenue (PPNR) models. Regularly reviews model monitoring reports. The... ...management approaches - Demonstrated independence, team work and leadership skills - Strong project...RiskTemporary work3 days per week$130k - $205k
...Blackrock and Fidelity, and employs a team of 450 engineers and... ...in Northern California, USA. Red Team Security Engineer Astranis... ...security breach simulations. Lead and participate in purple team... ...findings, articulate business risk, and provide actionable recommendations...RiskPermanent employmentFlexible hours$208k - $300k
...Learning Engineer - Model Evaluations, Public Sector... ...to Apply? Join the team shaping the future of AI at Scale. Machine... ...performance, robustness, and safety metrics, including... ...stress tests and red‑teaming workflows to... ...that power the world’s leading models, and help...Full time$144k - $198k
...for an experienced Trust & Safety (T&S) Strategy Lead to help protect the integrity... ...frameworks. As the team's first strategy hire, you'll... ...of relevant experience in a risk, operations, strategy, or... ...including the latest enterprise AI tools, to help you work...RiskWork experience placementWork at officeLocal areaRemote workMonday to FridayFlexible hours3 days per week- ...cutting-edge multimodal foundation models that have the ability to... ...Ventures, and prominent AI visionaries and founders such... ...vital member of our ML Data Team - which leads the full spectrum of video-language... ...models. Experience in red teaming, localization testing...Work at officeWorldwideFlexible hours
$132k - $165k
...Chicago, or New York follow a hybrid work model to allow for a more collaborative working... ...sponsorship. Overall Purpose The Senior Red Team Engineer position within the Red Team,... ...Threats. Support the company's commitment to risk management and protecting the integrity...RiskHourly payWork at officeImmediate startVisa sponsorshipWork visaFlexible hours- ...Technical Business Development (Model Labs) at fal, you will... ...of integrating AI infrastructure, while ensuring... ..., and the ability to lead complex, cross‑functional... ...commitments, and risk‑buy frameworks. Partner... ...coordination with technical teams to ensure seamless execution...RiskContract workTemporary work
$275k - $300k
...Postman is the world's leading API platform, used... .... About the Team The Information... ...pillars: Governance Risk & Compliance (GRC),... ...team is the "red" pulse of this organization... ...validation, AI-augmented adversary... ...RAG pipelines, and model-serving infrastructure...RiskWork at officeFlexible hours3 days per week$212k - $318k
Stripe is seeking a Global AML Lead to own and evolve its AML function across geographies, ensuring compliance and risk management. This role requires at least 15 years in financial... ...Responsibilities include leading regional teams, driving AML transformation, and...RiskRemote job$164.7k - $339.08k
...Possible. At Pinterest, AI isn't just a feature,... ...Manager for the GenAI Safety team within Trust & Safety,... ...harms before they emerge, red-teaming new AI features... ...creation tools Threat Modeling & Red-Teaming: Lead proactive identification of risks, failure modes, and...RiskWork at officeLocal areaRemote workRelocationRelocation package
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to AI Safety Lead: Red Team & Model Risk. Be the first to apply!
- safety lead San Francisco, CA
- technology risk San Francisco, CA
- risk assurance San Francisco, CA
- risk underwriter San Francisco, CA
- geopolitical risk San Francisco, CA
- safety technician San Francisco, CA
- construction safety San Francisco, CA
- safety San Francisco, CA
- entry level safety San Francisco, CA
- safety intern summer San Francisco, CA


