Software Engineer, Safeguards
Anthropic
San Francisco, CA | New York City, NY
About Anthropic
Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.
About the Role
We are looking for software engineers to help build safety and oversight mechanisms for our AI systems. As a software engineer on the Safeguards team, you will work to monitor models, prevent misuse, and ensure user well-being. This role will focus on building systems to detect unwanted model behaviors and prevent disallowed use of models. You will apply your technical skills to uphold our principles of safety, transparency, and oversight while enforcing our terms of service and acceptable use policies.
Responsibilities:
- Develop monitoring systems to detect unwanted behaviors from our API partners and potentially take automated enforcement actions; surface these in internal dashboards to analysts for manual review
- Build abuse detection mechanisms and infrastructure
- Surface abuse patterns to our research teams to harden models at the training stage
- Build robust and reliable multi-layered defenses for real-time improvement of safety mechanisms that work at scale
You May Be a Good Fit If You:
- Have a Bachelor's degree in Computer Science, Software Engineering, or comparable experience
- Have 5-10+ years of experience in a software engineering position, preferably with a focus on integrity, spam, fraud, or abuse detection and mitigation
- Are proficient in Python and TypeScript
- Can work across the stack
- Have strong communication skills and can explain complex technical concepts to non-technical stakeholders
Strong Candidates May Also:
- Have experience building trust and safety detection mechanisms and interventions for AI/ML systems
- Have experience with prompt engineering, jailbreak attacks, and other adversarial inputs
- Have worked closely with operational teams to build custom internal tooling
Deadline to apply: None. Applications will be reviewed on a rolling basis.
The annual compensation range for this role is listed below. For sales roles, the range provided is the role's On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role.
Annual Salary:
$320,000 - $485,000 USD
Logistics
Minimum education: Bachelor's degree or an equivalent combination of education, training, and/or experience
Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience
Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position
Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.
Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.
We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team.
Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links; visit anthropic.com/careers directly for confirmed position openings.
How We're Different
We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills.
The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.
Come Work With Us!
Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues.
Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.

