AI Evaluation Program Manager

$150k - $160k

Twelve Labs

Location: San Francisco
Employment Type: Full time
Location Type: Hybrid
Department: Tech ML Data
Compensation: $150K - $160K • Offers Equity

The base salary & equity offered for this position will depend on several factors, including location, experience, qualifications and business needs which are assessed during the interview process.

Who We Are:

At Twelve Labs, we are pioneering the development of cutting-edge multimodal foundation models that have the ability to comprehend videos just like humans do. Our models have redefined the standards in video-language modeling, empowering us with more intuitive and far-reaching capabilities, and fundamentally transforming the way we interact with and analyze various forms of media.

With a remarkable $107 million in Seed and Series A funding, our company is backed by top-tier venture capital firms such as NVIDIA’s NVentures, NEA, Radical Ventures, and Index Ventures, and prominent AI visionaries and founders such as Fei-Fei Li, Silvio Savarese, Alexandr Wang and more. Headquartered in San Francisco, with an influential APAC presence in Seoul, our global footprint underscores our commitment to driving worldwide innovation.

We are a global company that values the uniqueness of each person’s journey. It is the differences in our cultural, educational, and life experiences that allow us to constantly challenge the status quo. We are looking for individuals who are motivated by our mission and eager to make an impact as we push the bounds of technology to transform the world. Join us as we revolutionize video understanding and multimodal AI.

About the Role:

You will be a vital member of our ML Data Team, which leads the full spectrum of video-language data preparation and model evaluation. This role comes with high ownership and includes responsibilities such as defining dataset needs and requirements in consultation with our research and product teams; designing and building data pipelines; and driving our post-training model evaluation strategy. You will also be responsible for automating as much of the repetitive partnership, annotation, and quality evaluation work as possible. A desire to work cross-functionally and to build relationships is critical for success in this position.

You will:
  • Model Evaluation: Design and build robust model evaluation frameworks, automating repetitive processes and maintaining a balanced approach to efficiency and depth in obtaining evaluation metrics and feedback.
  • Portfolio Monitoring: Manage resource allocation and timelines, adjusting direction flexibly based on real-time information across all data streams in your product vertical.
  • External Partner Collaboration: Enhance dataset and process quality through seamless collaboration with vendors and outsourcing partners.
  • Data Quality & Tooling Advancement: Establish labeling guidelines, monitor data quality, and improve tools and infrastructure to build a sustainable data operations framework.
  • Internal Collaboration: Partner with Engineering and AI Model teams to align on top-priority data needs, design tools such as analytical reports and dashboards, and clearly communicate project progress.

You may be a good fit if you have:
  • 5+ years of experience working in an AI-focused data operations organization.
  • A proven track record designing and executing large-scale data or evaluation projects, including gathering, labeling, and post-processing data.
  • The ability to analyze messy and complex data, identify overarching patterns, and distill your findings into crisp annotation guidelines or model quality reports.
  • Proficiency with Python, LLMs, or other popular industry tools for automation.
  • Excellent communication and project management skills, and the ability to support several projects simultaneously.
  • A foundational understanding of and interest in LLMs/VLMs and multimodal AI.
  • Conviction that data is the key ingredient for the performance and assessment of AI models.

You’ll stand out if you have:
  • Experience in data collection and labeling for multimodal language models.
  • Experience in red teaming, localization testing, or other evaluation-focused fields.
  • Experience working with research scientists and engineers.
  • Expertise or interest in video-centric domains, such as sports, advertising, and content creation.

Tech Stack:
  • Development & Analysis: Python (primarily pandas, Jupyter, etc.)
  • Data Management & Visualization: Amazon S3, various data visualization tools (framework-agnostic)
  • Project Management Tools: Linear, Notion

Even if there are a few checkboxes that aren’t ticked through your prior experience, we still encourage you to apply! If you are a 0-1 achiever, a ferocious learner, and a kind and fun team player who motivates others, you will find a home at TwelveLabs.

Benefits and Perks:
  • An open and inclusive culture and work environment.
  • Work closely with a collaborative, mission-driven team on cutting-edge AI technology.
  • Full health, dental, and vision benefits.
  • Flexible PTO and parental leave policy.
  • Office closed the week of Christmas and New Year’s.

Vacancy posted 3 days ago