Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Product Manager, Claude Code Model Performance

$305k

Anthropic

Product Manager, Claude Code Model Performance

San Francisco, CA | New York City, NY

About Anthropic

Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

About the Role

As a Product Manager on Claude Code's model performance team, you will drive model launches end-to-end, build evals that measure what matters, and partner directly with researchers and product engineers to translate model improvements into developer-facing outcomes.

Claude Code is the most capable coding agent in the world but there's much more we can do to extract the maximum performance from our models. We're looking for a PM who has personally built agentic evals, thinks in systems, uses Claude Code every day, and has refined model taste. You should be as comfortable influencing our research team as you are getting in the weeds of transcripts. You will be the connective tissue between frontier research and the millions of developers who depend on Claude Code to do their best work.

Responsibilities
  • Own model launch planning and execution for Claude Code: define readiness criteria, coordinate across research and product engineering, and ensure launches land cleanly with developers
  • Design and implement agentic evals that measure real-world coding performance
  • Drive the engineering team's eval roadmap
  • Partner with researchers working on coding capabilities to define target behaviors and influence model development with evidence from real usage
  • Talk with users and analyze transcripts to understand capability gaps and turn research progress into shipped improvements
  • Synthesize signal from internal users, external developers, and competitive benchmarks into clear priorities
You Might Be a Good Fit If You
  • Have personally built agentic evals (e.g. SWE-bench-style task suites)
  • Are a daily Claude Code user and can articulate what behaviors you'd want to change or add to the model
  • Have an engineering background and 2+ years in product management, or equivalent experience driving product direction as an engineer
  • Have a deep grasp of AI concepts and are comfortable going deep on model behavior, prompt engineering, and evaluation methodology
  • Are a systems thinker: when you find a problem, you build the infrastructure that prevents its whole class
  • Have launched products or capabilities in ambiguous, research-adjacent environments
  • Have a creative, hacker spirit and love solving puzzles

San Francisco and Seattle only

The annual compensation range for this role is listed below. For sales roles, the range provided is the role's On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role.

Annual Salary:

$305,000 - $460,000 USD

Logistics

Minimum education: Bachelor's degree or an equivalent combination of education, training, and/or experience

Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience

Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position

Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.

Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.

We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links—visit anthropic.com/careers directly for confirmed position openings.

How We're Different

We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills.

The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.

Come Work With Us!

Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.

Vacancy posted 19 hours ago
Similar jobs that could be interesting for youBased on the Product Manager, Claude Code Model Performance in New York, NY vacancy
  • $405k

     ...Model Performance Software Engineer, Claude Code San Francisco, CA | New York City, NY About Anthropic Anthropic's mission is to create reliable,...  ...completion Serve as a senior technical bridge between product and research, using strong product intuition to... 
    Performance
    Work at office
    Visa sponsorship
    Flexible hours

    anthropic

    New York, NY
    12 hours ago
  •  ...Mirantis, Inc. is seeking a full-time Product Manager - AI Inference to own the product strategy and lifecycle for AI inference and model serving. This role involves leading technical...  ...with engineering teams to optimize performance across GPU, network, and storage... 
    Performance
    Full time
    Remote work

    Mirantis

    New York, NY
    12 hours ago
  • $122.55k - $201k

     ...enjoy shaping the future of product innovation as a core leader,...  ...impact by delivering high-quality model serving infrastructure and...  ...alike. As a Product Manager in Model Serving, you are an...  ...decision-making, and product performance reporting, while escalating opportunities... 
    Performance

    JPMorgan Chase Bank, N.A.

    New York, NY
    12 hours ago
  • Job Overview As an AI Model Training Specialist with a focus on Bash coding, you will play a crucial role in enhancing the performance of AI models through precise data annotation and validation. You will be responsible for training AI systems by providing domain expertise... 
    Performance
    Hourly pay
    Flexible hours

    MERIT Beauty

    New York, NY
    3 days ago
  • $114.75k - $255k

     ...As the Product Manager for Navan Expense Policy, you'll own the vision,...  ...AI-powered engine that auto-codes and auto-audits millions of...  ...market research and define key performance metrics to measure success...  ...experience with AI-tooling (i.e. Claude Code, Braintrust, Cursor),... 
    Performance

    Navan

    New York, NY
    4 days ago
  • $140k - $175k

     ...and in any global market. Product Manager - Catapult A first-...  ...functionally to enhance automation, performance, and user experience. You...  ...ideas with AI tools (e.g., Claude, FigmaMake). Stand up clickable...  ...~ Active use of LLMs and AI coding tools in daily workflows ~... 
    Performance
    Full time
    Summer work
    Remote work

    Front Row Group

    New York, NY
    4 days ago
  • $90k - $110k

     ...The Role We're hiring a Product Manager to join our growing product team...  ...Excited to Learn) AI assistants : Claude, ChatGPT, or similar — for...  ...& design : Figma, Claude Code Emerging AI tools: We’re...  ...,000 Variable bonus tied to performance Benefits 401k matching Health... 
    Performance
    Local area
    Immediate start
    Remote work
    Flexible hours

    QBench

    New York, NY
    2 days ago
  • $230k - $300k

     ...and prioritize high-impact product opportunities. Shape greenfield...  ...years of relevant product management experience, ideally in...  ...proficiency with tools such as Claude Code, Cursor, or similar...  ...group, in connection with the performance of their duties of employment... 
    Performance
    Hourly pay

    D. E. Shaw & Co.

    New York, NY
    2 days ago
  •  ...intelligent. By combining deep product expertise with the...  ...platform. Improve performance and reliability by...  ..., and effective at managing ambiguity , prioritization...  ..., write, analyze, code, and craft better...  ...premium AI tools (ChatGPT, Claude, Granola & others) and... 
    Performance
    Temporary work
    Work from home
    Shift work

    Gorgias

    New York, NY
    3 days ago
  •  ...engineer, implementation manager, technical account...  ...pain and the product. You've seen where...  ...—you live in Claude, Cursor, v0, and often...  ...testing, performance monitoring that catches...  ...on case study—vibe code a small AI feature...  ...threads about which model handles a specific... 
    Performance
    Live in

    Siena AI

    New York, NY
    4 days ago
  •  ...engineer, implementation manager, technical account...  ...pain and the product. You've seen where...  ...—you live in Claude, Cursor, v0, and often...  ...testing, performance monitoring that catches...  ...on case study—vibe code a small AI feature...  ...threads about which model handles a specific... 
    Performance
    Live in
    Remote work

    Siena AI

    New York, NY
    1 day ago
  •  ...need We're looking for a Director, Product Manager, SDK to join our growing Product team,...  ...Analysing business health metrics and SDK performance data to make informed prioritisation...  ...like ChatGPT’s Codex or Anthropic’s Claude Code What we can offer ~ Bonus ~... 
    Performance
    Summer work
    Work at office

    LoopMe

    New York, NY
    4 days ago
  • $10k

     ...companies move and manage billions, Ramp is...  ...customer-facing agentic products and giving...  ...builders who can vibe-code a prototype before...  ..., Ramp Glass/Claude Code, CX Agents, Ramp...  ...AI agent and tool performance. Build dashboards,...  ...research or building models rather than... 
    Performance
    Full time
    Work at office
    Home office
    Relocation package
    Flexible hours
    Shift work

    RAMP

    New York, NY
    2 days ago
  •  ...Enterprise Data Senior Product Manager to lead execution of...  ...definitions, and data models that ensure consistency...  ...optimization — use AI tools (Claude, LLMs) to automate...  ...SQL (complex queries, performance optimization, data...  ...automation); able to review code and set technical... 
    Performance
    Local area
    Immediate start
    Remote work

    TradeStation

    New York, NY
    2 days ago
  • $130k - $180k

     ...marketers. As large language models (LLMs) become the go-...  ...to our SVP of Product and join a small, highly...  ...Role As a Product Manager at Evertune, you'll...  ...product, customer, and performance data - able to understand...  ..., developing with Claude Code, performing data analytics... 
    Performance
    Work at office
    Local area
    Visa sponsorship
    Shift work

    Evertune

    New York, NY
    4 days ago
  • $200.7k - $229.1k

     ...Senior Product Manager, Flights Search & Merchandising Senior Manager, Product Management...  ...powered development tools-specifically Claude Code, Claude Cowork, or similar platforms (e...  ...information is solely for candidates hired to perform work within one of these locations, and... 
    Performance
    Full time
    Part time
    Local area

    Capital One

    New York, NY
    3 days ago
  •  ...regulators. 2. Engineers and Product Teams They need low...  ..., auditability, and performance. That gap is where we...  ...flows How asset managers calculate staking...  ...in the weeds of data modeling decisions, API design...  ...have a top 0.1% user of claude code in the world at Allium... 
    Performance
    Full time
    Contract work
    Local area
    Flexible hours
    Day shift

    Allium

    New York, NY
    4 days ago
  • $155k - $190k

     ...As a Product Manager II for Model Lab , you will define and launch Datadog's experiment tracking platform built for teams training and fine-tuning...  ...Model Lab centralizes metrics, hyperparameters, datasets, code versions, artifacts, and lineage to help ML and AI teams... 
    Work at office

    Datadog

    New York, NY
    1 day ago
  •  ...Overview Great products come from people who refuse to compromise...  ...looking for a Senior Product Manager to lead Studio, our core...  ...ideas using Figma, v0, Cursor, Claude Code Contribute code directly to...  ...feedback Set metrics, track performance, and use data to make better... 
    Performance
    Shift work

    Restream, Inc.

    New York, NY
    2 days ago
  • Requirements 5+ years in product management, applied AI, product operations, or a hybrid...  ...Technical fluency with modern AI coding harnesses (Cursor, Claude Code, Codex) Working knowledge of...  ...frameworks to measure AI agent and tool performance. Build dashboards, eval systems,... 
    Performance
    Shift work

    Ramp

    New York, NY
    12 hours ago
  • We’re hiring a Senior Product Manager to join our Product team, perfect...  ...Use AI-powered tools (e.g., Claude Code) to explore product concepts...  ...experiences, from data inputs and model outputs to user-facing...  ...comfort using data to evaluate performance, prioritize work, and... 
    Performance
    Flexible hours
    Shift work

    ExaCare AI

    New York, NY
    1 day ago
  •  ...technology and services. The Product Manager Intern will work cross‑...  ...product metrics to assess feature performance and measure impact on the...  ...Generative AI tools (e.g., ChatGPT, Claude) in academic, professional,...  ...Experience reviewing/coding with Python and SQL Knowledge... 
    Performance
    Remote work

    Feedinkoo

    New York, NY
    1 day ago
  • $174.7k - $218.4k

     ...Marqeta is hiring for a Group Product Manager – Fraud role to join the...  ...Flexible First . This role can be performed remotely in the United...  ...concepts, build proof-of-concept models, and accelerate the path...  ...tools (e.g., Cursor, Replit, Claude Code) and the ability to use them... 
    Performance
    Work at office
    Remote work
    Flexible hours

    Marqeta

    New York, NY
    2 days ago
  • $164.8k - $188.1k

    Manager, Product Management - DevX, Source Code Management Product Management at Capital One is a booming, vibrant craft that requires reimagining the status...  ...salary information is solely for candidates hired to perform work within one of these locations, and refers to the... 
    Performance
    Full time
    Part time
    Local area

    Capital One

    New York, NY
    12 hours ago
  • $151.6k - $208.4k

     .... As an Applied AI Product Lead at Humana, you will...  ...frameworks for AI models and platforms, creating...  ...the growth of product managers and other cross-functional...  ...model and platform performance. ~ Familiarity with...  ...such as Figma, Lucid, Claude Code , along with knowledge... 
    Performance
    Bi-weekly pay
    Full time
    Temporary work
    Work at office
    Work from home
    Home office

    Humana

    New York, NY
    12 hours ago
  • $179.7k - $205.1k

     ...leading financial services company in New York is seeking a Manager of Product Management. This role focuses on driving innovative product...  ...range of $179,700 - $205,100, this position also offers performance-based incentives and a comprehensive benefits package. #J-1... 
    Performance

    Capital One

    New York, NY
    1 day ago
  • $168k - $258.75k

     ...seasoned professional with over 7 years in product management or technical partnerships. The role involves collaboration with community model builders, product marketing support, and...  ...a BS or MS in a technical field, Python code review skills, and excellent communication... 
    3 days per week

    NVIDIA

    New York, NY
    1 day ago
  • $175k - $250k

     ...AI Product Manager - Infrastructure The AI Product Manager role within...  ...and Trends Use Claude Code / Codex / Cursor / etc. as...  ...especially Large Language Models (LLMs). Understanding of their...  ...base salary, discretionary performance bonus, and a comprehensive... 
    Performance

    Millennium Management Corp

    New York, NY
    12 hours ago
  • Cursor is looking for a Product Manager to enhance their developer tools through deep customer understanding and technical execution. The...  ...should have a strong engineering background, experience with AI coding tools, and the ability to handle technical architecture... 

    Cursor

    New York, NY
    2 days ago
  • $196.35k - $292.6k

     ...AI-Native Product Management Role This role partners with leaders across NetApp to apply...  ...future-state design, hands-on, in code. Use Cursor, Claude Code, and the other AI development...  ...Paid Time Off, various Leave options, Performance-Based Incentives, employee stock... 
    Performance
    Local area
    Immediate start

    NetApp

    New York, NY
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Product Manager, Claude Code Model Performance. Be the first to apply!