Software Engineer, Safeguards

$320k

Anthropic

Software Engineer, Safeguards

San Francisco, CA | New York City, NY

About Anthropic

Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

About the Role

We are looking for software engineers to help build safety and oversight mechanisms for our AI systems. As a software engineer on the Safeguards team, you will work to monitor models, prevent misuse, and ensure user well-being. This role will focus on building systems to detect unwanted model behaviors and prevent disallowed use of models. You will apply your technical skills to uphold our principles of safety, transparency, and oversight while enforcing our terms of service and acceptable use policies.

Responsibilities:

Develop monitoring systems to detect unwanted behaviors from our API partners and potentially take automated enforcement actions; surface these in internal dashboards to analysts for manual review
Build abuse detection mechanisms and infrastructure
Surface abuse patterns to our research teams to harden models at the training stage
Build robust and reliable multi-layered defenses for real-time improvement of safety mechanisms that work at scale

You May Be a Good Fit If You:

Bachelor's degree in Computer Science, Software Engineering or comparable experience
5-10+ years of experience in a software engineering position, preferably with a focus on integrity, spam, fraud, or abuse detection and mitigation
Proficiency in Python and Typescript
Ability to work across the stack
Strong communication skills and ability to explain complex technical concepts to non-technical stakeholders

Strong Candidates May Also:

Have experience building trust and safety detection mechanisms and intervention for AI/ML systems
Have experience with prompt engineering, jailbreak attacks, and other adversarial inputs
Have worked closely with operational teams to build custom internal tooling

Deadline to apply: None. Applications will be reviewed on a rolling basis.

The annual compensation range for this role is listed below. For sales roles, the range provided is the role's On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role.

Annual Salary:

$320,000 - $485,000 USD

Logistics

Minimum education: Bachelor's degree or an equivalent combination of education, training, and/or experience

Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience

Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position

Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.

Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.

We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team. Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links—visit anthropic.com/careers directly for confirmed position openings.

How We're Different

We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills.

The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.

Come Work With Us!

Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.

Apply

Vacancy posted 2 days ago

Similar jobs that could be interesting for youBased on the Software Engineer, Safeguards in New York, NY vacancy

(Senior) Software Engineer - Backend
...chains. Addressing the global climate challenge and safeguarding both humanity and the planet requires a... ...in close cooperation with other developers, data engineers and product managers. Promote agile and software best practices and develop effective solutions to...
Suggested
Remote work
GrabJobs
New York, NY
4 days ago
Full Stack Software Engineer - 2026 Summer Intern
...security posture management platform is engineered specifically to address modern data... ...it, and how it's being used, Symmetry safeguards your organization's data from misuse, insider... ...and external use Contribute to software development tasks under the guidance of...
Suggested
Summer work
Internship
Summer internship
Remote work
Hyperproof
New York, NY
4 days ago
Software Engineer, Safeguards Labs
$320k
...growing group of committed researchers, engineers, policy experts, and business leaders working... ...AI systems. About the role Safeguards Labs is a new team operating at the intersection... ...Safeguards teams. We're hiring software engineers to partner with our research...
Suggested
Work at office
Visa sponsorship
Flexible hours
anthropic
New York, NY
3 days ago
Software Engineer - Equities/ETFs
$200k
...Optiver is seeking Software Engineers specializing in the US cash equities and US ETF markets to help lead our continued rapid growth across... .... With a focus on continuous improvement, we prioritize safeguarding the health and efficiency of the markets for all participants...
Suggested
Work at office
Optiver
New York, NY
2 days ago
Software Engineer
$120k - $180k
...commitment to excellence. With a robust portfolio of products that safeguard thousands of businesses worldwide, we have been featured in... ...Product Management, Design, and other stakeholders within Engineering to maintain a high bar for quality in a fast-paced, iterative...
Suggested
Work at office
Worldwide
Flexible hours
Secureframe
New York, NY
3 days ago
Sr Software Engineer
...Job Title: Sr Software Engineer Duration: 6-12 months Location: NYC or San Francisco, CA Job Type: Contract to hire... ...They build and deploy software and biological AI systems to safeguard humanity. The same AI architectures that enable...
Contract work
Work at office
CoSourcing Partners
New York, NY
5 days ago
New Grad Software Engineer, Growth
...commitment to excellence. With a robust portfolio of products that safeguard thousands of businesses worldwide, we have been featured in... ...that powers trust on the internet. We're looking for talented engineers who are obsessed with building. This role sits at the...
Internship
Remote work
Worldwide
Flexible hours
Secureframe
New York, NY
3 days ago
Software Engineer
...commitment to excellence. With a robust portfolio of products that safeguard thousands of businesses worldwide, we have been featured in... ...Product Management, Design, and other stakeholders within Engineering to maintain a high bar for quality in a fast-paced, iterative...
Remote work
Worldwide
Flexible hours
Secureframe
New York, NY
1 day ago
Software Engineer - Credit
$200k
...Optiver is seeking Software Engineers specializing in US corporate bond and credit derivative markets to join the greenfield buildout of... ...conditions. With a focus on continuous improvement, we prioritise safeguarding the health and efficiency of the markets for all...
Work at office
Optiver
New York, NY
1 day ago
Full-Stack Software Engineer at Movo AI
...s network. The next step is to speak to Jack. Full-Stack Software Engineer Company Description Fast-growing voice AI startup Job Description... ...is paramount, and taking preventive measures is crucial to safeguarding your information from potential risks and unauthorized use....
Immediate start
Remote work
Jack & Jill
New York, NY
1 day ago
Software Engineer IV
...Job Title: Software Engineer IV Location: New York City, NY Onsite/ Remote: Day 1 Onsite at customer location... .... • Plan, coordinate, and implement security measures to safeguard information in computer files against accidental or unauthorized...
Remote work
Yochana
New York, NY
3 days ago
Senior Software Engineer- C13 - JERSEY CITY
$142.32k - $213.48k
...transaction services, and wealth management. Our core activities are safeguarding assets, lending money, making payments and accessing the... ...We are looking for a highly skilled and hands-on Senior Software Engineer to drive modern engineering practices and technology...
Full time
Work at office
Citigroup Inc
Jersey City, NJ
1 day ago
Senior Software Engineer
$181k - $213k
...more users is simple for an e-commerce or software company to do. But common web analytics... ...experience. For marketing, Freshpaint safeguards health information while helping our customers... ...optimize for them. Pair program with engineers in an iterative environment that gives...
Immediate start
Remote work
Fresh Paint Inc
New York, NY
4 days ago
Sr. Software Engineer, Threat Intelligence
$166.5k - $244.2k
...and AI workflows. Our mission is simple: safeguard the digital world and empower people to... ...lifecycle. The primary focus is software development, with additional responsibility... .... Collaborate closely with other engineering teams and threat intelligence stakeholders...
Flexible hours
Proofpoint
Jersey City, NJ
3 days ago
Senior Software Engineer
$160k - $240k
...With a robust portfolio of products that safeguard thousands of businesses worldwide, we... ...Product Management, Design, and other Engineering stakeholders to uphold a high quality standard... ...knowledge. ~5+ years of professional software development experience. ~ Proficient...
Work at office
Worldwide
Flexible hours
Shift work
Secureframe
New York, NY
2 days ago
New Grad Software Engineer, Product
$100k - $140k
...commitment to excellence. With a robust portfolio of products that safeguard thousands of businesses worldwide, we have been featured in... ...Product Management, Design, and other stakeholders within Engineering to solve real business problems Own projects end-to-end: scope...
Internship
Work at office
Worldwide
Flexible hours
Secureframe
New York, NY
3 days ago
Senior Software Engineer, Fullstack (Risk Engineering)
$172k - $210k
...Senior Software Engineer, Fullstack (Risk Engineering) New York, NY; San Francisco, CA; USA Flex is a growth-stage, NYC headquartered... ...is to enhance Flex app accessibility for customers while safeguarding against improper use and unauthorized access by identifying...
Full time
Local area
Relocation package
Flexible hours
2 days per week
3 days per week
FLEX Inc
New York, NY
3 days ago
Sr Software Engineer
$148.7k - $199.4k
Job Summary As a Senior Software Engineer (P3), you will lead the design and implementation of intelligent, AI-driven systems that safeguard the reliability and performance of Disney’s global streaming ecosystem. You will be a technical leader in developing agentic systems...
Disney
New York, NY
3 days ago
ML Software Engineer
$65 - $75 per hour
...75/Hr Job Description: You will operate as a hands-on engineering leader responsible for designing, building, and running production... ...Data Collection Disclaimer: At ApTask, we prioritize safeguarding your privacy. As part of our recruitment process, certain...
ApTask
Jersey City, NJ
1 day ago
Senior Software Engineer, Risk Engineering
$172k - $215k
...part of the team? About The Role Flex is looking for a Senior Software Engineer to join our Risk Platform team. This role involves... ...mission is to enhance Flex app accessibility for customers while safeguarding against improper use and unauthorized access by identifying...
Full time
Flexible hours
FLEX Inc
New York, NY
4 days ago
Software Engineer - Research Platform
$200k
...performance compute infrastructure and the software platform layered on top. ORP enables... ...including traders, researchers, and engineers to push their understanding of the market... ...continuous improvement, we prioritize safeguarding the health and efficiency of the markets...
Work at office
Optiver
New York, NY
1 day ago
Applied AI Engineer
...Job Title: Applied AI Engineer Duration: 6-12 months contract to hire Location: NYC or San Francisco... ...intelligence company. They build and deploy software and biological AI systems to safeguard humanity. The same AI architectures that enable...
Contract work
Work at office
CoSourcing Partners
New York, NY
5 days ago
Senior Staff Software Engineer (Backend)
...Senior Staff Software Engineer (Backend/ Python) at Applied Systems Mid-Level, Senior-Level Posted: 1 week ago Regular Full-Time About the... ...people and products enable millions of people worldwide to safeguard and protect what matters most. For 40 years, Applied Systems...
Full time
Local area
Remote work
Worldwide
Flexible hours
Inclusively
New York, NY
4 days ago
Systems Developer
$10k
...the oldest and largest health department in the U.S., dedicated to protecting and improving the health of NYC. Our mission is to safeguard the health of every resident and cultivate a city where everyone, regardless of age, background, or location, can achieve their optimal...
Work at office
Work from home
2 days per week
City of New York
Long Island City, NY
3 days ago
Full Stack Application Developer Vice President
$142.32k - $213.48k
...wealth management. Our core activities are safeguarding assets, lending money, making payments... ...cross-functional business users, QA engineers and BAs collaborating to deliver best in... ..., and solutions. Design and develop software applications using microservices cloud-...
Full time
Citigroup Inc
Jersey City, NJ
6 days ago
AI Solutions Engineer ($160K - $200K + Equity) at V7
$160k - $200k
AI Solutions Engineer ($160K - $200K + Equity) at V7 Jack & Jill | Jack & Jill | Posted Mar 7 Full-time New York Negotiable Advanced... ...is paramount, and taking preventive measures is crucial to safeguarding your information from potential risks and unauthorized use....
Full time
Jack & Jill
New York, NY
1 day ago
Sr Java Developer
$65 - $70 per hour
...current state or government issued ID during each interview. Candidate Data Collection Disclaimer: At ApTask, we prioritize safeguarding your privacy. As part of our recruitment process, certain Personally Identifiable Information (PII) may be requested by our...
ApTask
New York, NY
5 days ago
Web3 InfoSec Engineer - Secure Core Infra
...Framework Ventures in the United States is seeking a passionate Product Security Engineer to safeguard its core technologies. In this role, you will analyze and assess security issues, design and build security tools, and provide guidance for engineers on secure operational...
Flexible hours
Framework Ventures
New York, NY
4 days ago
Lead Mainframe Developer
$70 - $72 per hour
...current state or government issued ID during each interview. Candidate Data Collection Disclaimer: At ApTask, we prioritize safeguarding your privacy. As part of our recruitment process, certain Personally Identifiable Information (PII) may be requested by our...
ApTask
Union City, NJ
2 days ago
Web Application Developer
...current state or government issued ID during each interview. Candidate Data Collection Disclaimer: At ApTask, we prioritize safeguarding your privacy. As part of our recruitment process, certain Personally Identifiable Information (PII) may be requested by our...
ApTask
New York, NY
3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Software Engineer, Safeguards. Be the first to apply!