Product Manager, Public Sector GenAI Test & Evaluation (T&E)
Scale AI
$205.6k - $257k
At Scale, our mission is to develop reliable AI systems for the world's most important decisions. The Public Sector team is at the forefront of this mission, partnering with government agencies to deploy mission-critical agentic solutions.
Role Overview
The Public Sector GenAI T&E Product Manager will be a high-horsepower technical leader, defining the vision and owning the roadmap for our evaluation capabilities. This role requires thriving in unscripted, high-stakes environments, as you will be the primary owner for the T&E tech stack—the robust infrastructure required to continuously measure, improve, and prove the superiority and sustained performance of our agentic applications.
Traversing multiple engineering organizations across Scale, you will identify bottlenecks, distill technical friction into actionable plans, and drive execution. You will work across Scale's commercial and public sector teams to define requirements, ensuring our evaluation services are robust enough for the most demanding government use cases. Key objectives include refining the tech stack that allows ML teams to hillclimb, and surfacing critical performance information to stakeholders.
Minimum Qualifications (Quantifiable)
- Engineering Depth: 3+ years of experience in software engineering, systems architecture, or highly technical program management. You must be able to read code, understand system architecture, and participate in technical design reviews alongside engineering teams.
- Evaluation Systems Expertise: Proven experience designing, owning the roadmap for, or operating the infrastructure required to continuously measure, improve, and show the performance of AI applications.
- Problem Distillation: Demonstrated experience taking a vaguely defined problem (e.g., "our evaluation cycles are too slow") and delivering a technical roadmap, resource requirements, and measurable success metrics within a narrow time window.
- Ambiguity Management: Proven track record of taking a project from "stalled/undefined" to "shipped" in a high-pressure environment. You can point to at least two instances where you inherited a failing project and saw it through to production.
- Cross-Functional Leadership: Led multiple projects that required direct alignment between at least three distinct engineering organizations (e.g., Infrastructure, ML Research, and Product).
- Operational Execution: Experience using technical project management frameworks (e.g., Linear) to provide consistent weekly reporting on delivery velocity and blockers to executive stakeholders.
Preferred Qualifications (Nice to Haves)
- Security Clearance: Active Secret, Top Secret, or TS/SCI clearance.
- GenAI Implementation: Practical experience developing or evaluating features built specifically on LLMs, RAG, or autonomous agent workflows.
- Technical Rigor: Advanced degree in Computer Science, Engineering, or a related field.
- Public Sector Expertise: 2+ years of experience working with DoD, IC, or Civil agencies on mission-critical software deployments.
Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position and may span several career levels at Scale. The actual salary will be determined during the interview process based on work location and additional factors, including job-related skills, experience, qualifications, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity-based compensation, subject to Board of Directors approval. Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for an equity grant. You'll also receive benefits including, but not limited to: comprehensive health, dental, and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. This role may also be eligible for further benefits such as a commuter stipend.
Please refer to the job posting's subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in San Francisco, New York, and Seattle is:
$205,600 - $257,000 USD
The base salary range for this full-time position in Hawaii, Washington DC, Texas, and Colorado is:
$184,800 - $231,000 USD
The base salary range for this full-time position in the location of St. Louis is:
$154,400 - $193,000 USD
PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants.
At Scale, our mission is to develop reliable AI systems for the world's most important decisions. Our products provide the high-quality data and full-stack technologies that power the world's leading models, and help enterprises and governments build, deploy, and oversee AI applications that deliver real impact. We work closely with industry leaders like Meta, Ernst & Young, Mayo Clinic, Time Inc., the Government of Qatar, and U.S. government agencies including the Army and Air Force. We are expanding our team to accelerate the development of AI applications.
We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or Veteran status.
We are committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. If you need assistance and/or a reasonable accommodation in the application or recruiting process due to a disability, please contact us. Please see the United States Department of Labor's Know Your Rights poster for additional information.
We comply with the United States Department of Labor's Pay Transparency provision.
PLEASE NOTE: We collect, retain and use personal data for our professional business purposes, including notifying you of job opportunities that may be of interest and sharing with our affiliates. We limit the personal data we collect to that which we believe is appropriate and necessary to manage applicants' needs, provide our services, and comply with applicable laws. Any information we collect in connection with your application will be treated in accordance with our internal policies and programs designed to protect personal data. Please see our privacy policy for additional information.

