Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Software Engineer, Safeguards

$320k

Anthropic

Software Engineer, Safeguards

San Francisco, CA | New York City, NY

About Anthropic

Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

About the Role

We are looking for software engineers to help build safety and oversight mechanisms for our AI systems. As a software engineer on the Safeguards team, you will work to monitor models, prevent misuse, and ensure user well-being. This role will focus on building systems to detect unwanted model behaviors and prevent disallowed use of models. You will apply your technical skills to uphold our principles of safety, transparency, and oversight while enforcing our terms of service and acceptable use policies.

Responsibilities:
  • Develop monitoring systems to detect unwanted behaviors from our API partners and potentially take automated enforcement actions; surface these in internal dashboards to analysts for manual review
  • Build abuse detection mechanisms and infrastructure
  • Surface abuse patterns to our research teams to harden models at the training stage
  • Build robust and reliable multi-layered defenses for real-time improvement of safety mechanisms that work at scale
You May Be a Good Fit If You:
  • Bachelor's degree in Computer Science, Software Engineering or comparable experience
  • 5-10+ years of experience in a software engineering position, preferably with a focus on integrity, spam, fraud, or abuse detection and mitigation
  • Proficiency in Python and Typescript
  • Ability to work across the stack
  • Strong communication skills and ability to explain complex technical concepts to non-technical stakeholders
Strong Candidates May Also:
  • Have experience building trust and safety detection mechanisms and intervention for AI/ML systems
  • Have experience with prompt engineering, jailbreak attacks, and other adversarial inputs
  • Have worked closely with operational teams to build custom internal tooling

Deadline to apply: None. Applications will be reviewed on a rolling basis.

The annual compensation range for this role is listed below. For sales roles, the range provided is the role's On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role.

Annual Salary:

$320,000 - $485,000 USD

Logistics

Minimum education: Bachelor's degree or an equivalent combination of education, training, and/or experience

Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience

Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position

Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.

Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.

We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team. Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links—visit anthropic.com/careers directly for confirmed position openings.

How We're Different

We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills.

The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.

Come Work With Us!

Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Software Engineer, Safeguards in New York, NY vacancy
  •  ...chains. Addressing the global climate challenge and safeguarding both humanity and the planet requires a...  ...in close cooperation with other developers, data engineers and product managers. Promote agile and software best practices and develop effective solutions to... 
    Suggested
    Remote work

    GrabJobs

    New York, NY
    4 days ago
  •  ...security posture management platform is engineered specifically to address modern data...  ...it, and how it's being used, Symmetry safeguards your organization's data from misuse, insider...  ...and external use Contribute to software development tasks under the guidance of... 
    Suggested
    Summer work
    Internship
    Summer internship
    Remote work

    Hyperproof

    New York, NY
    4 days ago
  • $320k

     ...growing group of committed researchers, engineers, policy experts, and business leaders working...  ...AI systems. About the role Safeguards Labs is a new team operating at the intersection...  ...Safeguards teams. We're hiring software engineers to partner with our research... 
    Suggested
    Work at office
    Visa sponsorship
    Flexible hours

    anthropic

    New York, NY
    3 days ago
  • $200k

     ...Optiver is seeking Software Engineers specializing in the US cash equities and US ETF markets to help lead our continued rapid growth across...  .... With a focus on continuous improvement, we prioritize safeguarding the health and efficiency of the markets for all participants... 
    Suggested
    Work at office

    Optiver

    New York, NY
    2 days ago
  • $120k - $180k

     ...commitment to excellence. With a robust portfolio of products that safeguard thousands of businesses worldwide, we have been featured in...  ...Product Management, Design, and other stakeholders within Engineering to maintain a high bar for quality in a fast-paced, iterative... 
    Suggested
    Work at office
    Worldwide
    Flexible hours

    Secureframe

    New York, NY
    3 days ago
  •  ...Job Title: Sr Software Engineer Duration: 6-12 months Location: NYC or San Francisco, CA Job Type: Contract to hire...  ...They build and deploy software and biological AI systems to safeguard humanity. The same AI architectures that enable... 
    Contract work
    Work at office

    CoSourcing Partners

    New York, NY
    5 days ago
  •  ...commitment to excellence. With a robust portfolio of products that safeguard thousands of businesses worldwide, we have been featured in...  ...that powers trust on the internet. We're looking for talented engineers who are obsessed with building. This role sits at the... 
    Internship
    Remote work
    Worldwide
    Flexible hours

    Secureframe

    New York, NY
    3 days ago
  •  ...commitment to excellence. With a robust portfolio of products that safeguard thousands of businesses worldwide, we have been featured in...  ...Product Management, Design, and other stakeholders within Engineering to maintain a high bar for quality in a fast-paced, iterative... 
    Remote work
    Worldwide
    Flexible hours

    Secureframe

    New York, NY
    1 day ago
  • $200k

     ...Optiver is seeking Software Engineers specializing in US corporate bond and credit derivative markets to join the greenfield buildout of...  ...conditions. With a focus on continuous improvement, we prioritise safeguarding the health and efficiency of the markets for all... 
    Work at office

    Optiver

    New York, NY
    1 day ago
  •  ...s network. The next step is to speak to Jack. Full-Stack Software Engineer Company Description Fast-growing voice AI startup Job Description...  ...is paramount, and taking preventive measures is crucial to safeguarding your information from potential risks and unauthorized use.... 
    Immediate start
    Remote work

    Jack & Jill

    New York, NY
    1 day ago
  •  ...Job Title: Software Engineer IV Location: New York City, NY Onsite/ Remote: Day 1 Onsite at customer location...  .... • Plan, coordinate, and implement security measures to safeguard information in computer files against accidental or unauthorized... 
    Remote work

    Yochana

    New York, NY
    3 days ago
  • $142.32k - $213.48k

     ...transaction services, and wealth management. Our core activities are safeguarding assets, lending money, making payments and accessing the...  ...We are looking for a highly skilled and hands-on Senior Software Engineer to drive modern engineering practices and technology... 
    Full time
    Work at office

    Citigroup Inc

    Jersey City, NJ
    1 day ago
  • $181k - $213k

     ...more users is simple for an e-commerce or software company to do. But common web analytics...  ...experience. For marketing, Freshpaint safeguards health information while helping our customers...  ...optimize for them. Pair program with engineers in an iterative environment that gives... 
    Immediate start
    Remote work

    Fresh Paint Inc

    New York, NY
    4 days ago
  • $166.5k - $244.2k

     ...and AI workflows. Our mission is simple: safeguard the digital world and empower people to...  ...lifecycle. The primary focus is software development, with additional responsibility...  .... Collaborate closely with other engineering teams and threat intelligence stakeholders... 
    Flexible hours

    Proofpoint

    Jersey City, NJ
    3 days ago
  • $160k - $240k

     ...With a robust portfolio of products that safeguard thousands of businesses worldwide, we...  ...Product Management, Design, and other Engineering stakeholders to uphold a high quality standard...  ...knowledge. ~5+ years of professional software development experience. ~ Proficient... 
    Work at office
    Worldwide
    Flexible hours
    Shift work

    Secureframe

    New York, NY
    2 days ago
  • $100k - $140k

     ...commitment to excellence. With a robust portfolio of products that safeguard thousands of businesses worldwide, we have been featured in...  ...Product Management, Design, and other stakeholders within Engineering to solve real business problems Own projects end-to-end: scope... 
    Internship
    Work at office
    Worldwide
    Flexible hours

    Secureframe

    New York, NY
    3 days ago
  • $172k - $210k

     ...Senior Software Engineer, Fullstack (Risk Engineering) New York, NY; San Francisco, CA; USA Flex is a growth-stage, NYC headquartered...  ...is to enhance Flex app accessibility for customers while safeguarding against improper use and unauthorized access by identifying... 
    Full time
    Local area
    Relocation package
    Flexible hours
    2 days per week
    3 days per week

    FLEX Inc

    New York, NY
    3 days ago
  • $148.7k - $199.4k

    Job Summary As a Senior Software Engineer (P3), you will lead the design and implementation of intelligent, AI-driven systems that safeguard the reliability and performance of Disney’s global streaming ecosystem. You will be a technical leader in developing agentic systems... 

    Disney

    New York, NY
    3 days ago
  • $65 - $75 per hour

     ...75/Hr Job Description: You will operate as a hands-on engineering leader responsible for designing, building, and running production...  ...Data Collection Disclaimer: At ApTask, we prioritize safeguarding your privacy. As part of our recruitment process, certain... 

    ApTask

    Jersey City, NJ
    1 day ago
  • $172k - $215k

     ...part of the team? About The Role Flex is looking for a Senior Software Engineer to join our Risk Platform team. This role involves...  ...mission is to enhance Flex app accessibility for customers while safeguarding against improper use and unauthorized access by identifying... 
    Full time
    Flexible hours

    FLEX Inc

    New York, NY
    4 days ago
  • $200k

     ...performance compute infrastructure and the software platform layered on top. ORP enables...  ...including traders, researchers, and engineers to push their understanding of the market...  ...continuous improvement, we prioritize safeguarding the health and efficiency of the markets... 
    Work at office

    Optiver

    New York, NY
    1 day ago
  •  ...Job Title: Applied AI Engineer Duration: 6-12 months contract to hire Location: NYC or San Francisco...  ...intelligence company. They build and deploy software and biological AI systems to safeguard humanity. The same AI architectures that enable... 
    Contract work
    Work at office

    CoSourcing Partners

    New York, NY
    5 days ago
  •  ...Senior Staff Software Engineer (Backend/ Python) at Applied Systems Mid-Level, Senior-Level Posted: 1 week ago Regular Full-Time About the...  ...people and products enable millions of people worldwide to safeguard and protect what matters most. For 40 years, Applied Systems... 
    Full time
    Local area
    Remote work
    Worldwide
    Flexible hours

    Inclusively

    New York, NY
    4 days ago
  • $10k

     ...the oldest and largest health department in the U.S., dedicated to protecting and improving the health of NYC. Our mission is to safeguard the health of every resident and cultivate a city where everyone, regardless of age, background, or location, can achieve their optimal... 
    Work at office
    Work from home
    2 days per week

    City of New York

    Long Island City, NY
    3 days ago
  • $142.32k - $213.48k

     ...wealth management. Our core activities are safeguarding assets, lending money, making payments...  ...cross-functional business users, QA engineers and BAs collaborating to deliver best in...  ..., and solutions. Design and develop software applications using microservices cloud-... 
    Full time

    Citigroup Inc

    Jersey City, NJ
    6 days ago
  • $160k - $200k

    AI Solutions Engineer ($160K - $200K + Equity) at V7 Jack & Jill | Jack & Jill | Posted Mar 7 Full-time New York Negotiable Advanced...  ...is paramount, and taking preventive measures is crucial to safeguarding your information from potential risks and unauthorized use.... 
    Full time

    Jack & Jill

    New York, NY
    1 day ago
  • $65 - $70 per hour

     ...current state or government issued ID during each interview. Candidate Data Collection Disclaimer: At ApTask, we prioritize safeguarding your privacy. As part of our recruitment process, certain Personally Identifiable Information (PII) may be requested by our... 

    ApTask

    New York, NY
    5 days ago
  •  ...Framework Ventures in the United States is seeking a passionate Product Security Engineer to safeguard its core technologies. In this role, you will analyze and assess security issues, design and build security tools, and provide guidance for engineers on secure operational... 
    Flexible hours

    Framework Ventures

    New York, NY
    4 days ago
  • $70 - $72 per hour

     ...current state or government issued ID during each interview. Candidate Data Collection Disclaimer: At ApTask, we prioritize safeguarding your privacy. As part of our recruitment process, certain Personally Identifiable Information (PII) may be requested by our... 

    ApTask

    Union City, NJ
    2 days ago
  •  ...current state or government issued ID during each interview. Candidate Data Collection Disclaimer: At ApTask, we prioritize safeguarding your privacy. As part of our recruitment process, certain Personally Identifiable Information (PII) may be requested by our... 

    ApTask

    New York, NY
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Software Engineer, Safeguards. Be the first to apply!