Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Product Manager, Public Sector GenAI Test & Evaluation (T&E)

$205.6k - $257k

Scale AI

Product Manager, Public Sector GenAI Test & Evaluation (T&E)

At Scale, our mission is to develop reliable AI systems for the world's most important decisions. The Public Sector team is at the forefront of this mission, partnering with government agencies to deploy mission-critical agentic solutions.

Role Overview

The Public Sector GenAI T&E Product Manager will be a high-horsepower technical leader, defining the vision and owning the roadmap for our evaluation capabilities. This role requires thriving in unscripted, high-stakes environments, as you will be the primary owner for the T&E tech stack—the robust infrastructure required to continuously measure, improve, and prove the superiority and sustained performance of our agentic applications.

Traversing multiple engineering organizations across Scale, you will identify bottlenecks, distill technical friction into actionable plans, and drive execution. You will work across Scale's commercial and public sector teams to define requirements, ensuring our evaluation services are robust enough for the most demanding government use cases. Key objectives include refining the tech stack that allows ML teams to hillclimb, and surfacing critical performance information to stakeholders.

Minimum Qualifications (Quantifiable)
  • Engineering Depth: 3+ years of experience in software engineering, systems architecture, or highly technical program management. You must be able to read code, understand system architecture, and participate in technical design reviews alongside engineering teams.
  • Evaluation Systems Expertise: Proven experience designing, owning the roadmap for, or operating the infrastructure required to continuously measure, improve, and show the performance of AI applications.
  • Problem Distillation: Demonstrated experience taking a vaguely defined problem (e.g., "our evaluation cycles are too slow") and delivering a technical roadmap, resource requirements, and measurable success metrics within a narrow time window.
  • Ambiguity Management: Proven track record of taking a project from "stalled/undefined" to "shipped" in a high-pressure environment. You can point to at least two instances where you inherited a failing project and saw it through to production.
  • Cross-Functional Leadership: Led multiple projects that required direct alignment between at least three distinct engineering organizations (e.g., Infrastructure, ML Research, and Product).
  • Operational Execution: Experience using technical project management frameworks (e.g., Linear) to provide consistent weekly reporting on delivery velocity and blockers to executive stakeholders.
Preferred Qualifications (Nice to Haves)
  • Security Clearance: Active Secret, Top Secret, or TS/SCI clearance.
  • GenAI Implementation: Practical experience developing or evaluating features built specifically on LLMs, RAG, or autonomous agent workflows.
  • Technical Rigor: Advanced degree in Computer Science, Engineering, or a related field.
  • Public Sector Expertise: 2+ years of experience working with DoD, IC, or Civil agencies on mission-critical software deployments.

Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position and may be inclusive of several career levels at Scale; it will be determined during the interview process based on work location and additional factors, including job-related skills, experience, qualifications, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity based compensation, subject to Board of Director approval. Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for equity grant. You'll also receive benefits including, but not limited to: comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend.

Please reference the job posting's subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, Seattle is:

$205,600 - $257,000 USD

The base salary range for this full-time position in the locations of Hawaii, Washington DC, Texas, Colorado is:

$184,800 - $231,000 USD

The base salary range for this full-time position in the location of St. Louis is:

$154,400 - $193,000 USD

PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants.

At Scale, our mission is to develop reliable AI systems for the world's most important decisions. Our products provide the high-quality data and full-stack technologies that power the world's leading models, and help enterprises and governments build, deploy, and oversee AI applications that deliver real impact. We work closely with industry leaders like Meta, Ernst & Young, Mayo Clinic, Time Inc., the Government of Qatar, and U.S. government agencies including the Army and Air Force. We are expanding our team to accelerate the development of AI applications.

We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or Veteran status.

We are committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. If you need assistance and/or a reasonable accommodation in the application or recruiting process due to a disability, please contact us at View email address on click.appcast.io. Please see the United States Department of Labor's Know Your Rights poster for additional information.

We comply with the United States Department of Labor's Pay Transparency provision.

PLEASE NOTE: We collect, retain and use personal data for our professional business purposes, including notifying you of job opportunities that may be of interest and sharing with our affiliates. We limit the personal data we collect to that which we believe is appropriate and necessary to manage applicants' needs, provide our services, and comply with applicable laws. Any information we collect in connection with your application will be treated in accordance with our internal policies and programs designed to protect personal data. Please see our privacy policy for additional information.

Vacancy posted 5 days ago
Similar jobs that could be interesting for youBased on the Product Manager, Public Sector GenAI Test & Evaluation (T&E) in New York, NY vacancy
  •  ...Product Manager, Data Engine San Francisco, CA; St. Louis...  ...decisions. For the Public Sector, we translate this mission...  ..., and model evaluation for internal and external...  ...state-of-the-art model testing. You Will:...  ...evaluation frameworks (T&E). Operationalize Collaboration... 
    Suggested
    Immediate start

    Scale AI

    New York, NY
    5 days ago
  • $103.75k - $174.75k

     ...AI Product Manager - GenAI and Agentic Capabilities New York, NY, United States Job Description Amex Digital Labs’ mission is to build...  ...and continuous improvement using product analytics, model evaluation, and user feedback. Partner with engineering on the... 
    Suggested
    Full time
    Work at office
    Local area
    Immediate start
    Visa sponsorship
    Flexible hours

    American Express

    New York, NY
    3 days ago
  • $155.2k - $194k

     ...Product Marketing Lead, GenAI San Francisco, CA At Scale, we develop reliable AI systems for the...  ...allows us to ensure a fair and thorough evaluation of all applicants. At Scale, our...  ...believe is appropriate and necessary to manage applicants' needs, provide our... 
    Suggested
    Full time

    Scale AI

    New York, NY
    5 days ago
  •  ...redefining how people manage owning a car, one of their...  ...our Growth team as a Product Manager. This role is...  ...millions of users discover, evaluate, and purchase car...  ...trust Contribute to A/B testing, funnel analysis, and...  ...conversations about our use of GenAI, such as this Forbes... 
    Suggested
    Contract work
    Part time
    Freelance
    Local area
    Flexible hours

    Jerry Insurance Agency, LLC

    New York, NY
    2 days ago
  •  ...world safeguards data, enabling organizations to thrive in a GenAI era where data is the ultimate currency. If you're ready to shape...  ...while safeguarding privacy. We are seeking a Gen AI Product Manager to join the Product Team to design and deliver user-facing AI... 
    Suggested
    Local area

    Protegrity

    New York, NY
    4 days ago
  • $385k

     ...Product Manager, Developer Productivity San Francisco, CA | New York City, NY About Anthropic Anthropic...  ...and researchers at Anthropic develop, build, test, and ship code—the foundation on which every model, evaluation, and product feature depends: Partner... 
    Visa sponsorship
    Shift work

    Anthropic

    New York, NY
    1 day ago
  • $160k - $190k

     ...AI Visualization Platform Product Manager Hybrid NYC (3 days a week in office) $160k - $...  ...months. These are the metrics you will be evaluated against. # Lift scan-to-booking...  ...cadence of no fewer than 4 shipped A/B tests per month in GrowthBook. # Reduce scan... 
    Work at office
    Local area
    3 days per week

    LINQM

    New York, NY
    4 days ago
  •  ...intelligence solutions that help public and private sector agencies investigate and...  ...as possible Build and manage a conference and events...  ...partners to brief and pressure-test content for distribution...  ...and leverage You will be evaluated on applied AI fluency during... 
    Worldwide

    TRM

    New York, NY
    2 days ago
  • $97.2k - $150k

     ...Education is seeking a Sr. Product Owner, AI Authoring...  ...drives revenue growth. Lead evaluation, selection, and management of AI technology partners...  ...Lead business acceptance testing (BAT), product training, and...  ...concepts as applied to LLM/GenAI systems (e.g., embeddings,... 
    Remote work
    Worldwide

    McGraw-Hill Education

    New York, NY
    1 day ago
  • $58k - $115k

     ...Automation & Workflow Product Owner - Wealth Management Platforms Associate Location...  ...artificial intelligence (GenAI, Agentic AI), and...  ...potential. Business Analysis & Testing : Conduct comprehensive...  ...Process Improvement : Evaluate and optimize existing processes... 
    Temporary work
    Work at office
    Worldwide
    Shift work

    Morgan Stanley

    New York, NY
    3 days ago
  • $170k - $320k

     ...We are seeking a Director of Product Marketing, Growth & Platforms...  ...touchpoints. Portfolio Lifecycle Management (LCM): Develop our LCM system...  ...high-velocity multivariate testing frameworks to optimize...  ...Proficiency: Ability to translate GenAI capabilities (e.g., LLMs,... 
    Temporary work
    Local area
    Worldwide

    Adobe

    New York, NY
    3 days ago
  •  ...are looking for a dynamic Vice President Product Manager to lead the development and execution...  ...product vision and roadmap for innovative GenAI solutions that accelerate AI/ML use...  ...alignment and support. Support vendor evaluation and integration efforts related to... 

    JPMorgan Chase & Co.

    New York, NY
    13 days ago
  • $237.6k - $297k

     ...Staff Product Manager, Agentic Platform New York, NY;...  ...potential of generative AI (GenAI). We are seeking a...  ..., ensuring Scale's public sector AI solution aligns...  ...capability that can help evaluate thousands of pages of...  ...development, testing, and launches Lead... 
    Full time

    Scale AI

    New York, NY
    5 days ago
  •  ...Senior Product Manager – AI & Public Sector About the Role We are hiring a senior product leader to...  ...you should be comfortable building, testing, and iterating on software at a high...  ...-impact solutions Continuously evaluate emerging AI technologies and incorporate... 
    Permanent employment

    Next Ventures

    New York, NY
    3 days ago
  • $110.35k - $181.29k

     ...As a Senior Product Manager, Risk Evaluation & Delivery , you will be responsible for defining and driving the product roadmap in alignment with the product vision and Objectives & Key Results. You will collaborate with Product Owners, Digital & Technology leaders,... 
    Full time
    Work experience placement
    Visa sponsorship
    Work visa
    Flexible hours

    Guardian Life

    New York, NY
    3 days ago
  • $85k - $140k

     ...Automation & Workflow Product Owner - Wealth Management Platforms Assistant Vice President...  ...artificial intelligence (GenAI, Agentic AI), and...  ...potential. Business Analysis & Testing : Conduct comprehensive...  ...Improvement : Critically evaluate and optimize existing... 
    Temporary work
    Work at office
    Worldwide
    Shift work

    Morgan Stanley

    New York, NY
    3 days ago
  •  ...The Role We're looking for a Senior Product Manager, Personalization & AI to lead the product vision, strategy,...  ...personalization-aware UX. Drive experimentation, A/B testing, and offline/online model evaluation frameworks to measure lift rigorously. Build... 

    CookUnity

    New York, NY
    4 days ago
  • $185k - $200k

     ...growing, and high-performing network of public charter schools takes a village -...  ...seeking an experienced Senior AI Product Manager to drive the adoption and...  ...requirements gathering and vendor evaluation through implementation, testing, and rollout. Manage relationships... 
    Work at office
    Visa sponsorship

    Success Academy Charter Schools

    New York, NY
    5 days ago
  • $176k - $179.5k

    Tech & AI Product Manager I - Life Sciences Job ID: 106300 Boston...  ...in innovation-driven sectors such as Life Sciences, Specialty...  ...engineering best practices (e.g., test driven development,...  ...story-lining presentations and evaluating role ~ Exceptional time management... 
    Hourly pay
    Apprenticeship
    Work at office
    Local area
    Easy work
    Shift work

    McKinsey & Company

    New York, NY
    4 days ago
  •  ...Product Manager We are looking for an experienced Product Manager who will help us reinvent...  ...automation and preemptive healing). Will evaluate new opportunities to ensure that Amex...  ...ServiceNow Conversational AI GenAI LLM Chatbots Self-service Workflow... 
    Weekly pay
    Temporary work
    Worldwide
    Flexible hours

    Experis

    New York, NY
    2 days ago
  • $112k - $154k

     ...name on it We're looking for a Senior Product Manager to own a critical area of the FanDuel...  ...quantitative insights to identify opportunities, evaluate performance, and inform roadmap...  ...to require or administer a lie detector test as a condition of employment or continued... 
    Temporary work
    Local area
    Worldwide

    FanDuel

    New York, NY
    1 day ago
  •  ...eye on future growth. Define a product vision, strategy, and roadmap...  ...user interviews, usability tests, surveys, etc. Partner with design...  ...You have 5+ years of product management experience in a Fintech...  ...priorities, manage tradeoffs and evaluate opportunistic new ideas with... 

    NovumTech Partners

    New York, NY
    10 days ago
  • $124k

     ...expect Responsible for outlining the product roadmap, setting feature priorities,...  ...best practices; designing, running, and evaluating A/B tests to optimize key flows; partnering on...  ...looking for Bring 8+ years of product management experience focused on eCommerce... 
    Work at office
    Remote work

    Zoom Corporation

    New York, NY
    3 days ago
  • $64k - $80k

     ...SUMMARY In this dynamic role as our Product Manager, Pre and Post (PPM), you'll be at the...  ...bounds. Put your financial prowess to the test as you closely manage offerings to optimize...  ..., and industry-specific survey data. We evaluate external equity and the cost of labor/... 
    Contract work
    Work at office
    Local area
    Remote work
    Flexible hours
    Night shift
    Weekend work

    Lindblad Expeditions

    New York, NY
    1 day ago
  • $180k - $225k

     ...Product Manager Lazard is one of the world's preeminent financial advisory and asset management...  ...prioritization matrix. Identify and evaluate AI use cases where technology can...  ...balanced with grounded rigor Tried and tested successful methodologies that ensure... 
    Local area

    Lazard

    New York, NY
    5 days ago
  •  ...Senior Product Manager for Borrower Acquisition At January, we're transforming the lives of...  ...experimentation infrastructure to rapidly test across channels, templates, timing, and...  ...for it. Inputs beat outcomes. You evaluate decisions by the thinking behind them, not... 
    Odd job
    Currently hiring
    Work at office

    January Service Company

    New York, NY
    4 days ago
  •  ...fire. About the Role Our product team owns the core user...  ...platform. We're hiring a Product Manager to own our entire payments...  ...payment service providers and evaluate new vendor partnerships...  ...conversion rates through A/B testing and data analysis Implement... 
    Immediate start

    Polymarket

    New York, NY
    5 days ago
  • $200k - $250k

     ...value creation at scale. As a Senior Product Manager , you will help lead the next stage of...  ...design, and data teams to rapidly prototype, test, and ship products in real operating...  ..., positioning, and using data to evaluate pilots and quantify impact ~ Excellent... 
    Work at office
    Local area
    Remote work

    TriEdge Investments

    New York, NY
    4 days ago
  • $149k - $165.5k

     ...Apron is looking for a high-impact Sr. Product Manager (Forecasting) to own and scale the demand...  ...reduction Analyze performance: Evaluate forecast accuracy across SKU groups (e....  ...feature definition through prioritization, testing, and impact measurement Communicate... 
    Work at office
    Visa sponsorship
    Work visa
    3 days per week

    WonderGroup

    New York, NY
    1 day ago
  • $150k - $225k

     ...Sr. Product Manager Take your career to the next level! In the last few years our goal has...  ...while improving risk-adjusted returns. Evaluate product investments using ROI, RAROC,...  ...Performance Monitoring Design and launch A/B tests and champion/challenger experiments to... 

    Regional Finance Corp

    New York, NY
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Product Manager, Public Sector GenAI Test & Evaluation (T&E). Be the first to apply!