Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

AI Evaluation TPM — Cross-Functional Impact

$300k - $320k

Anthropic

About the role: We are seeking a Technical Program Manager to lead our AI model evaluation initiatives across multiple workstreams. This role will be crucial in assessing the performance, capabilities, limitations, and potential risks of our AI models. Working closely with our Research, Trust & Safety, Frontier Redteaming, and Policy teams, you will drive high-priority evaluation projects to build new processes, align metrics with policy, and track measurable progress. You will help build and adapt the model evaluation program to ensure model deployments are rigorous and aligned with our commitment to responsible AI development. The ideal candidate will have a strong technical background and experience managing cross-functional programs in AI development, ML engineering, or related fields. You’ll be joining a team of Technical Program Managers who own and drive cross-functional programs that align to the company’s top priorities. In this role, you’ll have the opportunity to make a foundational impact as you contribute the scaling of a centralized TPM function for the company. Extremely strong soft skills are paramount, as our team is front and center in driving lots of company-wide changes and top priority initiatives that require generating buy-in, balancing various opinions, and competing for attention in our rapidly scaling environment. This role is a great fit for someone who has both seen excellence at scale and operated in rapidly scaling, high-ambiguity teams and scope. We are seeking candidates with deep TPM expertise but who are comfortable acting as adaptable generalists who add value fast. We excel at maintaining a broad view of our work but diving deep into the details when necessary. We understand business goals, translate and organize them into technical programs and projects, and drive execution. We are adept at engaging with both non-technical and technical stakeholders at all levels of the company, including executive leadership. In this role, you will have the opportunity to shape the development of advanced AI systems and contribute to Anthropic's mission of ensuring that AI benefits all of humanity. If you are passionate about responsible AI development, have a strong technical background, and thrive in a fast-paced, collaborative environment, we'd love to hear from you. Responsibilities: Partner with teams like Frontier Risk Evaluations, Security, and Trust & Safety to develop and implement comprehensive evaluation protocols for our latest frontier AI models Build a single source of truth for tracking all types of model evaluations as required by our Responsible Scaling Policy, AI safety institutes, the White House, and others Develop and maintain procedures for conducting evaluations, including designing test suites, coordinating red team exercises, and analyzing results Create and manage dashboards and reporting systems to track model performance, safety metrics, and evaluation outcomes across different AI systems and versions Lead cross-functional workshops to identify potential risks and edge cases for evaluation, ensuring thorough coverage of AI capabilities and limitations Coordinate with external partners and industry standards bodies to align our evaluation practices with emerging best practices in responsible AI development Provide detailed status reports, identifying technical risks, dependencies, and areas requiring additional support Facilitate communication and coordination between technical workstreams and stakeholders Continuously identify opportunities for technical process improvements and implement changes as needed Stay up-to-date with the latest developments in AI safety, ML engineering, and related fields to ensure the program remains at the forefront of responsible AI development You might be a good fit if you: Have several years of experience in technical program management, with a track record of successfully delivering complex technical programs, preferably in AI development, ML engineering, or related fields Have experience executing technical programs that require systems and engineering-level knowledge. Have exceptionally strong interpersonal and communication skills that enable you to influence without authority, build cross-organizational support, cooperation and action around initiatives and process adoption. Have experience prompt engineering on language models Have experience designing and/or running evaluations on Large Language Models Have knowledge of emerging AI governance frameworks and best practices Have a high threshold for navigating ambiguity and are able to balance setting strategic priorities with rapid, high-quality execution. Thrive in unstructured environments, and have a knack for bringing order to chaos. The expected salary range for this position is: Annual Salary:

$300,000—$320,000 USD

Logistics Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices. US visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate; operations roles are especially difficult to support. But if we make you an offer, we will make every effort to get you into the United States, and we retain an immigration lawyer to help with this. We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team. Compensation and Benefits* Anthropic’s compensation package consists of three elements: salary, equity, and benefits. We are committed to pay fairness and aim for these three elements collectively to be highly competitive with market rates. Equity - For eligible roles, equity will be a major component of the total compensation. We aim to offer higher-than-average equity compensation for a company of our size, and communicate equity amounts at the time of offer issuance. US Benefits - The following benefits are for our US-based employees: Optional equity donation matching. Comprehensive health, dental, and vision insurance for you and all your dependents. 401(k) plan with 4% matching. 22 weeks of paid parental leave. Unlimited PTO – most staff take between 4-6 weeks each year, sometimes more! Stipends for education, home office improvements, commuting, and wellness. Fertility benefits via Carrot. Daily lunches and snacks in our office. Relocation support for those moving to the Bay Area. UK Benefits - The following benefits are for our UK-based employees: Optional equity donation matching. Private health, dental, and vision insurance for you and your dependents. Pension contribution (matching 4% of your salary). 21 weeks of paid parental leave. Unlimited PTO – most staff take between 4-6 weeks each year, sometimes more! Health cash plan. Life insurance and income protection. Daily lunches and snacks in our office. #J-18808-Ljbffr Anthropic

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the AI Evaluation TPM — Cross-Functional Impact in Seattle, WA vacancy
  • $125k - $170k

     ...AI Data Scientist, Evaluation & Insights Join to apply for the AI Data Scientist, Evaluation & Insights...  ...datasets. Your work will directly impact the effectiveness of features like...  ...communicating insights clearly to cross-functional stakeholders. ~ Experience with prompt... 
    Suggested
    Full time
    Contract work
    Remote work

    IRONCLAD COMPANY

    Seattle, WA
    2 days ago
  • $139.5k - $258.1k

    Senior Applied Scientist - AI Evaluation & Quality Systems Seattle, Washington, United States...  ...solutions, working closely with cross‑functional teams to ensure the data powering our...  ...or machine learning with demonstrated impact on shipped systems Strong hands‑on experience... 
    Suggested
    Relocation
    Shift work

    Apple Inc.

    Seattle, WA
    1 day ago
  • $204k - $259k

     ...Senior Machine Learning Engineer – VLM/LLM Evaluation Waymo is an autonomous driving...  ...S. states. The mission of the Waymo AI Foundations team is to develop machine...  ...of embodied AI agents Partner with cross-functional teams within the organization to land innovative... 
    Suggested
    Full time
    Temporary work
    Remote work

    Waymo

    Kirkland, WA
    1 day ago
  • $171.6k - $302.2k

    Machine Learning Safety: Evaluation Research Engineer...  ...Machine Learning and AI This role supports the...  ...Description You will play an impactful role: shaping...  ...reproducibility, and enable rapid cross‑market safety...  ...recommendations to cross‑functional stakeholders including... 
    Suggested
    Relocation

    Apple Inc.

    Seattle, WA
    2 days ago
  • $148.7k - $201.2k

     ...products that directly impact millions of users....  ...you thrive on complex cross-functional problems where no one...  ...identifying dependencies, evaluating engineering approaches...  ...Technical Program Manager (TPM) or a related...  ...entertainment Experience with AI/ML systems, agentic... 
    Suggested
    Flexible hours

    Twitch

    Seattle, WA
    1 day ago
  • $139.5k - $258.1k

    ML Engineer - Evaluation Analysis, Metric and Data Strategy Seattle, Washington...  ...team ensures the quality of AI-powered features across a...  ...as the primary evaluation function, and its analysis directly informs...  ...‑informed decisions across cross‑functional teams Proficiency... 
    Relocation

    Apple Inc.

    Seattle, WA
    1 day ago
  • $175k - $245k

     ...Senior Software Engineer II - Applied AI and Evaluations (Remote Eligible) -REMOTE, USA- For...  ...next without losing the room Strong cross‑functional judgment, you know when to elevate, when...  ...; this work has immediate, visible impact You will be part of a fast‑moving team... 
    Full time
    Temporary work
    Local area
    Immediate start
    Remote work

    Smartsheet

    Bellevue, WA
    10 days ago
  • $166.8k

     ...scientific research or other function, with its own leadership team...  ...nation and the world. The AI and Data Analytics Division,...  ...s mission by contributing to impactful, mission focused applied R&D...  ...innovative training strategies) and evaluation (T&E, robustness) for key... 
    For contractors
    Work experience placement
    Work at office
    Local area
    Remote work
    Relocation package
    Flexible hours

    Pacific Northwest National Laboratory

    Seattle, WA
    17 hours ago
  • $140k - $180k

     ...opportunity to deliver high-impact work in a dynamic, fast-growing...  ...ready to shape the future of AI and Data Science, your next...  ...closely with stakeholders and cross-functional teams to define use cases, architect...  ...data for training and evaluation. Apply experimental design... 
    Temporary work

    Launch Consulting

    Bellevue, WA
    2 days ago
  •  ...Technical Program Manager (TPM) Bellevue Office, Sunset Corporate...  ...the edge, delivering modular AI infrastructure from first...  ...autonomy, and delivering impact. You'll tackle challenges that...  ...you will partner closely with cross-functional teams across engineering, construction... 
    Temporary work
    Work at office
    Flexible hours

    Armada

    Bellevue, WA
    17 hours ago
  • $172.8k - $259.2k

     ...exciting challenge to lead cross-functional teams, drive end-to-end programs...  ...build products enhanced by AI innovation. The ideal...  ...complex problems into actionable, impactful outcomes....  ...Improvement : Continuously evaluate and improve program management... 
    Local area

    F5

    Seattle, WA
    2 days ago
  • $290k - $365k

     ...interpretable, and steerable AI systems. We want AI to be...  ...which every model training run, evaluation, and inference workload...  ...You'll join a small, high-impact TPM team and take ownership of critical...  ..., and health Lead cross-functional coordination for compute transitions... 
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    Seattle, WA
    17 hours ago
  • $171.6k - $258.1k

     ...States Software and Services AI systems are only as trustworthy as the methods used to evaluate them. At Apple, where AI powers...  ...evaluation right is not a support function. It is a foundational. Join...  ...or a related highly technical, cross-functional role. Customer Obsession... 
    Relocation

    Apple Inc.

    Seattle, WA
    3 days ago
  • $200k - $250k

     ...power our next generation of AI. You will oversee 4 critical...  ...engineers to influence impactful business outcomes Develop...  ...Growth , Edge deployment) and cross-functional leaders in Hardware, Platform...  ...decision tool (AEDT) to assess or evaluate your candidacy for... 
    Temporary work
    Work at office
    Local area

    Metropolis Corp

    Seattle, WA
    1 day ago
  • $242k - $333k

     ...data, your work will directly impact how we validate software...  ...and relevant. Collaborate Cross-Functionally: You'll work closely with system...  ...deep learning models, evaluation, and optimization. Strong programming...  ...artificial intelligence (AI) tools to support parts of... 
    Temporary work
    Remote work
    Relocation package

    Zoox

    Seattle, WA
    17 hours ago
  •  ...Ensure pipelines meet non-functional requirements for...  ..., reporting, APIs, and AI/ML workloads. -...  ...Stakeholder & Cross-Functional Leadership...  ...complexity into business impact. Success Measures...  ...opportunity employer. We evaluate qualified applicants without... 
    Minimum wage
    Contract work
    Temporary work
    Work experience placement
    Local area
    Remote work

    MAXIMUS

    Seattle, WA
    2 days ago
  • $151.28k - $183.32k

     ...Bristol Myers Squibb's AI Venture Studio delivery...  ...domain experts to build and evaluate AI systems across R&D,...  ..., and Enabling Functions. The role sits at the...  ...performance, measure product impact, support sandboxed data...  ...: Cambridge Crossing: $151,280 - $183,319 Madison... 
    Hourly pay
    Full time
    Temporary work
    Part time
    For contractors
    Summer work
    Live in
    Work at office
    Local area
    Remote work
    Flexible hours
    Shift work

    Bristol-Myers Squibb

    Seattle, WA
    9 hours ago
  • $141.9k - $190.3k

     ...years to come. * Reach, Scale & Impact: More than ever, Disney's...  ...specialization in generative AI applications, including generative...  ..., strategic impact, and cross-functional collaboration, across both generative...  ...applications. * Create, evaluate, improve, optimize... 

    Disney Entertainment and ESPN Product & Technology Careers

    Seattle, WA
    2 days ago
  • $165k - $225k

     ...vision and strategy for Outreach AI Platform that powers...  ...model quality, and business impact. Engage with customers and...  ...prompt design, guardrails, and evaluation frameworks. ~ Proven ability...  ...paced environment. ~ Excellent cross‑functional communication and... 
    Full time
    Temporary work

    Outreach

    Seattle, WA
    2 days ago
  •  ...years to come. Reach, Scale & Impact: More than ever, Disney's...  ...specialization in generative AI applications, including generative...  ..., strategic impact, and cross-functional collaboration, across both generative...  ...applications. Create, evaluate, improve, optimize... 

    The Walt Disney Studios

    Seattle, WA
    17 hours ago
  • $129.96k - $228k

     ...metrics system, risk analytics, and cross-functional collaboration to monitor and...  ...models for effective tracking, evaluation, and continuous improvement -...  ...fraud initiatives, and scale high-impact solutions across global markets. - Leverage AI driven approaches to build and... 
    Temporary work
    Local area

    Tik Tok

    Seattle, WA
    3 days ago
  • $139.5k - $258.1k

    ML Engineer - Automated Evaluation and Adversarial Design Seattle, Washington...  ...team ensures the quality of AI-powered features across a...  ...serves as the primary evaluation function, providing critical quality...  ...and readiness assessments to cross-functional partners Minimum Qualifications... 
    Relocation
    Shift work

    Apple Inc.

    Seattle, WA
    17 hours ago
  • $171.6k - $302.2k

     ...platform and shape how AI fundamentally transforms...  ...unique opportunity to impact some of the most far-reaching...  ...agents, retrieval and evaluation, and shares our passion...  ...about embeddings, loss functions and statistical rigor,...  ...clearly across cross‑functional teams to influence... 
    Worldwide
    Relocation

    Apple Inc.

    Seattle, WA
    3 days ago
  • $99.4k - $150.3k

     ...Salesforce Salesforce is the #1 AI CRM, where humans with agents...  ...across the organization. - Cross-Functional Collaboration: Partner with...  ...AI agents accelerate your impact so you can do your best....  ...help our recruiters assess and evaluate candidates' resumes and qualifications... 

    Salesforce.Com Inc

    Seattle, WA
    1 day ago
  • $124.6k - $168.2k

     ...We are looking for an AI Solution Architect. The...  ...quantitative evidence such as evaluation metrics, load testing,...  ..., multi‑step tool/function calling, RAG pipelines,...  ..., global AI platforms. Cross‑Functional Leadership,...  ...design reviews for high‑impact initiatives. Accelerate... 

    Carnival Corporation & plc

    Seattle, WA
    2 days ago
  • $163k - $237k

     ...management. Experience with AI model training, testing, evaluation, and tuning processes,...  ...managing complex cross‑functional or cross‑team projects....  ...Technical Program Manager (TPM), you will guide the delivery...  ...leadership. Manage high‑impact customer engagements, advocate... 
    Full time
    Temporary work

    Google

    Kirkland, WA
    1 day ago
  • $131.2k - $196.8k

     ...we increasingly incorporate AI into our security workflows,...  ...person will play a key role in evaluating and implementing AI-driven...  ...tool configurations Work cross-functionally with IT, Engineering, and...  ...technology that delivers a positive impact on the world. Collaboration... 
    Work at office
    Remote work

    Impinj

    Seattle, WA
    8 days ago
  • $197.3k - $313.7k

     ...Salesforce Salesforce is the #1 AI CRM, where humans with agents...  ...expertise with strategic impact , helping to scale and...  ...ability to collaborate across cross-functional teams. Proven experience...  ...our recruiters assess and evaluate candidates' resumes and qualifications... 

    Salesforce.Com Inc

    Seattle, WA
    1 day ago
  • $188k - $282k

     ...Fortune 500 company and a leading AI platform for managing people,...  ...We're forming small, senior, cross-functional AI teams that bring together...  ...high expectations, and real impact. Engineering, but brighter....  ..., workflow orchestration, evaluation, and feedback loops, ensuring... 
    Work at office
    Remote work
    Home office
    Flexible hours

    Workday

    Seattle, WA
    3 days ago
  • $178.3k - $407k

     ...build a better working world. AI and Data – Data Scientist –...  ...EY delivers unparalleled cross-functional tech consulting services in artificial...  ...is a high-visibility, high-impact role at the intersection of...  ..., hybrid search, reranking, evaluation design, model selection, fine... 
    Summer holiday
    Flexible hours

    EY

    Seattle, WA
    17 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Evaluation TPM — Cross-Functional Impact. Be the first to apply!