AI Evaluation Program Manager

$150k - $160k

Twelve Labs

Location San Francisco Employment Type Full time Location Type Hybrid Department Tech ML Data Compensation $150K – $160K • Offers Equity The base salary & equity offered for this position will depend on several factors, including location, experience, qualifications and business needs which are assessed during the interview process. Who We Are: At Twelve Labs, we are pioneering the development of cutting-edge multimodal foundation models that have the ability to comprehend videos just like humans do. Our models have redefined the standards in video-language modeling, empowering us with more intuitive and far-reaching capabilities, and fundamentally transforming the way we interact with and analyze various forms of media. With a remarkable $107 million in Seed and Series A funding, our company is backed by top-tier venture capital firms such as NVIDIA’s NVentures, NEA, Radical Ventures, and Index Ventures, and prominent AI visionaries and founders such as Fei-Fei Li, Silvio Savarese, Alexandr Wang and more. Headquartered in San Francisco, with an influential APAC presence in Seoul, our global footprint underscores our commitment to driving worldwide innovation. We are a global company that values the uniqueness of each person’s journey. It is the differences in our cultural, educational, and life experiences that allow us to constantly challenge the status quo. We are looking for individuals who are motivated by our mission and eager to make an impact as we push the bounds of technology to transform the world. Join us as we revolutionize video understanding and multimodal AI. About the Role: You will be a vital member of our ML Data Team – which leads the full spectrum of video-language data preparation and model evaluation. This role comes with high ownership and includes responsibilities such as defining dataset needs and requirements in consultation with our research and product teams; designing and building data pipelines; and driving our post-training model evaluation strategy. You will also be responsible for automating as much of the repetitive partnership, annotation, and quality evaluation work as possible. A desire to work cross functionally and to build relationships is critical for success in this position. You will: Model Evaluation: Design and build robust model evaluation frameworks, automating repetitive processes and maintaining a balanced approach to efficiency and depth in obtaining evaluation metrics and feedback. Portfolio Monitoring : Manage resource allocation and timelines, adjusting direction flexibly based on real-time information across all data streams in your product vertical. External Partner Collaboration : Enhance dataset and process quality through seamless collaboration with vendors and outsourcing partners. Data Quality & Tooling Advancement : Establish labeling guidelines, monitor data quality, and improve tools and infrastructure to build a sustainable data operations framework. Internal Collaboration : Partner with Engineering and AI Model teams to align on top priority data needs, design tools such as analytical reports and dashboards, and clearly communicate project progress. You may be a good fit if you have: 5+ years of experience working in an AI focused data operations organization. A proven track record designing and executing large scale data or evaluation projects, including gathering, labeling, and post-processing data. The ability to analyze messy and complex data, identify overarching patterns, and distill your findings into crisp annotation guidelines or model quality reports. Proficiency with Python, LLMs, or other popular industry tools for automation. Excellent communication and project management skills, and the ability to support several projects simultaneously. A foundational understanding of and interest in LLMs/VLMs and multimodal AI. Conviction that data is the key ingredient for the performance and assessment of AI models. You’ll stand out if you have: Experience in data collection and labeling for multimodal language models. Experience in red teaming, localization testing, or other evaluation focused fields. Experience working with research scientists and engineers. Expertise or interest in video-centric domains, such as sports, advertising, and content creation. Tech Stack: Development & Analysis : Python (primarily pandas, Jupyter, etc.) Data Management & Visualization : Amazon S3, Various data visualization tools (framework-agnostic) Project Management Tools : Linear, Notion Even if there are a few checkboxes that aren’t ticked through your prior experience, we still encourage you to apply! If you are a 0-1 achiever, a ferocious learner, and a kind and fun team player who motivates others, you will find a home at TwelveLabs. We are a global company that values the uniqueness of each person’s journey. It is the differences in our cultural, educational, and life experiences that allow us to constantly challenge the status quo. We are looking for individuals who are motivated by our mission and eager to make an impact as we push the bounds of technology to transform the world. Join us as we revolutionize video understanding and multimodal AI. Benefits and Perks: An open and inclusive culture and work environment. Work closely with a collaborative, mission-driven team on cutting-edge AI technology. Full health, dental, and vision benefits. ✈️ Flexible PTO and parental leave policy. Office closed the week of Christmas and New Years. Compensation Range: $150K - $160K #J-18808-Ljbffr Twelve Labs

Apply

Vacancy posted 3 days ago

Similar jobs that could be interesting for youBased on the AI Evaluation Program Manager in San Francisco, CA vacancy

AI Evaluation Program Manager
...Ventures, and Index Ventures, and prominent AI visionaries and founders such as Fei-Fei... ...-language data preparation and model evaluation. This role comes with high ownership and... ...and feedback. Portfolio Monitoring : Manage resource allocation and timelines, adjusting...
Suggested
Work at office
Worldwide
Flexible hours
Twelve Labs
San Francisco, CA
2 days ago
AI Benchmarks & Evaluations Program Manager
A leading AI development firm situated in San Francisco is looking for a Research Program Manager to coordinate the development and evaluation of AI benchmarks. You will collaborate across teams to ensure rigorous evaluation processes are upheld and results shared with...
Suggested
Relocation package
Mercor
San Francisco, CA
5 days ago
Technical Program Manager, Gen AI Operations Planning
$151.2k - $189k
...the intersection of Engagement Management (EM), GenAI Delivery, GTM,... ...TPM on Planning you will own program-level planning and systems workstreams... ...ensure a fair and thorough evaluation of all applicants. About... ...is to develop reliable AI systems for the world's most...
Suggested
Full time
Scale AI
San Francisco, CA
6 days ago
AI Evangelist - Program Manager
$147.68k - $236.28k
...mission that matters at a company where you matter. AI/Technology Evangelist - Program Manager (Corporate AI Team) Team & Role Overview Axon's Corporate... ...into actionable guidance for the organization. Evaluate new AI tools and vendors; make recommendations about what...
Suggested
Work experience placement
Work at office
Axon
San Francisco, CA
5 days ago
Sr. Program Manager, Generative AI
$173k - $200k
...Senior Program Manager, Generative AI At Early Warning, we've powered and protected the U.S. financial system for over thirty years with cutting... ...to ensure responsible AI deployment. Oversees vendor evaluation, procurement, and onboarding of GenAI technologies....
Suggested
Hourly pay
Work at office
Immediate start
Visa sponsorship
Work visa
Flexible hours
Early Warning Services
San Francisco, CA
2 days ago
Program Manager, AI & Innovation
$132.7k - $206.8k
.... With intelligent agreement management, Docusign unleashes business-... ...pivotal moment-transitioning to an AI-first enterprise. This isn't... ...experience value. As a Program Manager, AI & Innovation, you... ...problems like model drift, evaluation uncertainty, data governance,...
Permanent employment
Full time
Contract work
Work at office
Local area
Remote work
2 days per week
DocuSign
San Francisco, CA
1 day ago
Program Manager, AI & Innovation
$126.9k - $197.8k
What you'll do The Program Manager for Enterprise Transformation is a highly driven individual with demonstrated... ..., and customer success teams Identify, evaluate, and champion opportunities to integrate Artificial Intelligence (AI) into Program Management processes and...
Permanent employment
Full time
Work at office
Local area
Remote work
2 days per week
DocuSign, Inc.
San Francisco, CA
5 days ago
AI-Native Platform Program Manager
Requirements AI-first execution mindset: demonstrated ability... ...GenAI to accelerate planning, program operations, and stakeholder... ...controls , 3+ years of experience managing complex, cross-functional... ...called out across both kits’ evaluation criteria and interview plans...
Flexible hours
Pinterest
San Francisco, CA
3 days ago
AI Enablement Program Manager
...deeper understanding in healthcare. Our AI‑powered platform was purpose‑built for medical... .... The Role As the AI Enablement Program Manager, you will own Abridge’s company‑wide strategy... ...& Approval: Own the AI tool intake and evaluation process, the single front door for any...
Hourly pay
Full time
Flexible hours
Abridge
San Francisco, CA
3 days ago
Program Manager, AI & Innovation
$132.7k - $206.8k
.... With intelligent agreement management, Docusign unleashes business-... ...pivotal moment—transitioning to an AI-first enterprise. This isn’t... ...experience value. As a Program Manager, AI & Innovation, you... ...solving problems like model drift, evaluation uncertainty, data governance,...
Permanent employment
Full time
Contract work
Work at office
Local area
Remote work
2 days per week
DocuSign, Inc.
San Francisco, CA
2 days ago
Director, Technical Program Manager
$241.6k - $302k
...Director, Technical Program Manager San Francisco, CA About This Role As the Director... ...efficiency, and the seamless integration of AI capabilities across all business units.... ...including LLM fine-tuning, performance evaluation, and the infrastructure requirements for...
Full time
Shift work
Scale AI
San Francisco, CA
1 day ago
Technical Program Manager
...Technical Program Manager The future of AI — whether in training or evaluation, classical ML or agentic workflows — starts with high-quality data. At HumanSignal, we're building the platform that powers the creation, curation, and evaluation of that data. From fine...
HumanSignal
San Francisco, CA
1 day ago
Senior Technical Program Manager
$150.1k - $227k
...not duplicating efforts. Job Category Program & Project Management Job Details About Salesforce Salesforce is the #1 AI CRM, where humans with agents drive... ...) tools to help our recruiters assess and evaluate candidates' resumes and qualifications throughout...
Salesforce.Com Inc
San Francisco, CA
5 days ago
Senior Technical Program Manager
$148.7k - $201.2k
...experience on Twitch. As a Senior Technical Program Manager in this org, you'll own programs that... ...decisions, identifying dependencies, evaluating engineering approaches, and influencing... ...or online entertainment Experience with AI/ML systems, agentic architectures, or applied...
Flexible hours
Twitch
San Francisco, CA
3 days ago
Program Manager, Human Data
$162k - $240k
...reliable signals for training and evaluation. We design and run end-to-end programs that capture the depth of human intent... ...About the Role As a Program Manager (PGM) in the Human Data team you... ...between our external vendors and AI trainers, ensuring human data campaigns...
Flexible hours
Shift work
OpenAI
San Francisco, CA
4 days ago
Technical Program Manager, Platform
$211.2k - $264k
As a Technical Program Manager for the Platform team, you will partner with engineering teams to... ...development and maturity of the Scale Generative AI Platform (SGP). We are looking for a TPM... ...cluster utilization, or model training/evaluation setups. Masterful Communication: Proven...
Full time
Scale AI
San Francisco, CA
16 hours ago
Program Data and Evaluation Manager
$90k - $110k
...POSITION TITLE: Program Data and Evaluation Manager REPORTS TO: Development Director SALARY: $90,000 - $110,000 SCHEDULE: Full-time position. Onsite/hybrid. BENEFITS: Health, Dental, and Vision insurance; Commuter stipend HOMELESS CHILDREN'S NETWORK MISSION...
Full time
Homeless Children S Network
San Francisco, CA
3 days ago
Founding Technical Program Manager
...Valence has built the only AI native coaching platform for enterprise, offering personalized... ...Role Valence is hiring a Technical Program Manager to sit at the intersection of our... ...you submit will be used for recruiting, evaluation, legal compliance, and recordkeeping...
Work at office
Remote work
3 days per week
Valence
San Francisco, CA
2 days ago
Technical Program Manager, Partnership Site Deployment - Stargate
$125.6k - $228k
...journey to build the world's most advanced AI infrastructure ecosystem. The Stargate... ...Stargate is hiring a Technical Program Manager to coordinate datacenter deployments in... ...maintaining and updating risk registers to evaluate contingencies and drive resolution....
Contract work
For contractors
Work at office
3 days per week
OpenAI
San Francisco, CA
2 days ago
Hardware Technical Program Manager
...Hardware Technical Program Manager Sesame believes in a future where computers are lifelike... ...functional teams (including world-class AI builders) to align on marketing & technical... ...selection, prototyping, materials evaluation, and manufacturing, ensuring adherence to...
Full time
Contract work
Overseas
Flexible hours
SESAME
San Francisco, CA
2 days ago
Senior Technical Program Manager
$148.7k - $201.2k
...experience on Twitch. As a Senior Technical Program Manager in this org, you'll own programs that... ...decisions, identifying dependencies, evaluating engineering approaches, and influencing... ...online entertainment - Experience with AI/ML systems, agentic architectures, or applied...
Local area
Flexible hours
Amazon
San Francisco, CA
2 days ago
Technical Program Manager, Compute
$290k - $365k
...Technical Program Manager, Compute San Francisco, CA | New York City, NY | Seattle, WA About... ...reliable, interpretable, and steerable AI systems. We want AI to be safe and... ...foundation on which every model training run, evaluation, and inference workload depends. You'...
Work at office
Visa sponsorship
Flexible hours
Anthropic
San Francisco, CA
2 days ago
Sr. Program Manager, Generative AI
$173k - $200k
...employment Visa sponsorship. Overall Purpose The Senior Program Manager, Generative AI is responsible for driving Early Warning’s rollout and adoption... ...to ensure responsible AI deployment. Oversees vendor evaluation, procurement, and onboarding of GenAI technologies....
Hourly pay
Full time
Work at office
Immediate start
Visa sponsorship
Work visa
Flexible hours
Early Warning®
San Francisco, CA
2 days ago
Technical Program Manager, Enterprise
$211.2k - $264k
As a Technical Program Manager, you will partner with our Frontier Agent Engineering teams on enterprise... ...Operating in a hyper-growth, demanding AI environment, you will translate... ...tasks, model fine-tuning, and performance evaluation. Masterful Communication: Proven track...
Full time
Scale AI
San Francisco, CA
4 days ago
Senior Technical Program Manager, Product
...Senior Technical Program Manager, Product New York, New York; San Francisco, California We... ...from engineering. Expectations of AI Use in this role (required): ~ Expected... ...shipping probabilistic systems: model evaluation, data quality gates, and iterative...
Work at office
Local area
Komodo Health
San Francisco, CA
4 days ago
AI Data Program Manager
$140k - $185k
Overview Tiki AI builds the infrastructure that teaches frontier... ...AI models are trained and evaluated. If you want to work at the infrastructure... ...through multiple concurrent programs running across RLHF, coding... ...a steady-state operations management role. The deliverable is a...
Local area
Overseas
Tiki AI
San Francisco, CA
4 days ago
Design Program Manager, AI
$150.9k - $226.3k
...motion designers, and design program specialists. Our mission is to... ...consistent systems that scale. As AI becomes central to how product... ...Design, as a Design Program Manager you will collaborate closely... ...and performance. Source and evaluate vendors when needed. Interface...
Work at office
Local area
Immediate start
Remote work
Work from home
Relocation
Shift work
Stripe
San Francisco, CA
1 day ago
Technical Program Manager - Onboard AI & Vision Systems
...field , 5+ years of experience as a Technical Program Manager in a software engineering environment , A... ...projects involving machine learning and/or AI , Experience with machine learning models, training pipelines, or evaluation frameworks , Excellent communication,...
Waymo
San Francisco, CA
3 days ago
Technical Program Manager
...freight with groundbreaking vision‑based AI, designed for today’s global logistics... ...present. Position Overview The Technical Program Manager (TPM) is responsible for driving the... ...decisions by working with technical leads to evaluate schedule, cost, and scope impacts....
Local area
Humble Robotics
San Francisco, CA
1 day ago
Lead Technical Program Manager, Trust & Safety
$180.8k - $226k
...Scale is at the frontier of GenAI and human-AI collaboration. The Gen AI Ops Trust and... ...for a highly analytical Technical Program Manager (TPM) who leans heavily into fraud analytics... ...tracking dashboards, and define offline evaluation frameworks (e.g., false positive...
Full time
Shift work
Scale AI
San Francisco, CA
2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Evaluation Program Manager. Be the first to apply!