Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

AI Data & Model Evaluation Lead

Twelve-Labs

Twelve-Labs in San Francisco is seeking an experienced data operations professional for their ML Data Team. This role focuses on video-language data preparation, model evaluation, and requires strong skills in Python and project management. Ideal candidates should have over 5 years in AI data operations, the ability to manage large datasets, and a commitment to ensuring high-quality data. The position includes benefits like full health coverage and a flexible PTO policy. #J-18808-Ljbffr Twelve-Labs

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the AI Data & Model Evaluation Lead in San Francisco, CA vacancy
  •  ...Francisco is seeking a dedicated member for our ML Data Team to lead video data preparation and evaluation. This role includes defining dataset needs, automating...  ...should have over 5 years of experience in AI data operations, proficiency in Python, and strong communication... 
    Data
    Flexible hours

    Twelve-Labs

    San Francisco, CA
    2 days ago
  •  ...YO IT Consulting is seeking a Senior Data Architect to contribute to how AI systems reason about complex enterprise data. This remote...  ...with cloud platforms. Responsibilities include evaluating AI responses, refining models, and providing structured feedback on data architecture... 
    Data
    Remote work

    YO IT Consulting

    San Francisco, CA
    9 days ago
  • $200k - $365k

     ...building the world's most trusted AI work companion for professionals...  ...committed to the highest standards of data security and privacy protection....  ...systems, data pipelines, or evaluation harnesses that can run at scale against live model checkpoints. Can deeply partner... 
    Data
    Full time
    Work at office
    Worldwide

    Plaud

    San Francisco, CA
    17 hours ago
  • $240.45k - $300.3k

     ...Machine Learning Engineer - Model Evaluations, Public Sector San Francisco...  ...at Scale deploys advanced AI systems—including LLMs, agentic...  ...Background in algorithms, data structures, and object-oriented...  ...technologies that power the world's leading models, and help enterprises... 
    Data
    Full time

    Scale AI

    San Francisco, CA
    2 days ago
  • A leading AI solutions company in San Francisco is seeking an ML Eval Engineer to design evaluation benchmarks and improve model performance. This role involves working with unstructured enterprise data and collaborating closely with the ML and engineering teams. You will... 
    Data

    Reducto

    San Francisco, CA
    3 days ago
  • A cutting-edge AI company located in San Francisco is seeking an ML Eval Engineer to enhance model evaluations and ensure quality metrics. This role involves designing benchmarks,...  ...complex problems, and a background in AI or data infrastructure. The position is in-person... 
    Data

    Reducto, Inc.

    San Francisco, CA
    4 days ago
  •  ...-edge multimodal foundation models that have the ability to comprehend...  ...Ventures, and prominent AI visionaries and founders...  ...be a vital member of our ML Data Team - which leads the full spectrum of video-language...  ...data preparation and model evaluation. This role comes with high... 
    Data
    Work at office
    Worldwide
    Flexible hours

    Twelve Labs, Inc

    San Francisco, CA
    17 hours ago
  • $25 per hour

    Prolific is seeking AI Training Experts to assist in training and evaluating cutting-edge AI models. The role involves completing tasks such as analyzing and writing annotations...  ...creates a global pool for quality human data, connecting researchers with quality participants... 
    Data
    Remote job
    Hourly pay
    Work from home
    Flexible hours

    Prolific

    San Francisco, CA
    4 days ago
  • Software Engineer (Model Evaluation & Benchmarking) About the Role We are hiring Engineers focused on AI Model Evaluation to build the systems that ensure multimodal AI behaves...  ...(C++, Java, Python, or similar). Strong data structures and algorithms fundamentals. Understanding... 
    Data

    SpreeAI

    San Francisco, CA
    17 hours ago
  •  ...innovative Quality Engineer for their AI products. This role blends ops,...  ...AI engineering team, you will use data to shape how AI behaves, work with partners in leading labs, and ensure user satisfaction through effective evaluation baselines. Competitive salary and benefits... 
    Data

    Notion

    San Francisco, CA
    2 days ago
  • TwelveLabs is seeking a key member for its ML Data Team in San Francisco. This role involves designing evaluation frameworks, managing data operations, and collaborating...  ...should have over 5 years of experience in AI data operations, proficiency in Python and a strong... 
    Data
    Flexible hours

    TwelveLabs

    San Francisco, CA
    3 days ago
  • Twelve Labs in San Francisco is seeking a vital ML Data Team member to lead video-language data preparation and model evaluation. You will define dataset needs, automate...  ...collaborate cross-functionally with engineering and AI model teams. Ideal candidates have over 5 years... 
    Data
    Flexible hours

    Twelve-Labs

    San Francisco, CA
    4 days ago
  •  ...contribute directly to how the next generation of AI systems understand construction work. You'll challenge and evaluate advanced language models on construction engineering topics to...  ...not required.Previous experience with AI data training, annotation, or evaluating AI-... 
    Data
    Remote work

    YO IT Consulting

    San Francisco, CA
    1 day ago
  •  ...candidate with a PhD in chemistry to design tasks and workflows evaluating scientific reasoning. Ideal candidates will have strong...  ...is a plus. This role is crucial for improving data quality and model evaluation in a collaborative environment. #J-18808-Ljbffr... 
    Data

    Cypress HCM

    San Francisco, CA
    2 days ago
  • $180k - $260k

    Perplexity is looking for a Model Behavior Architect to help shape...  ...through well-designed research and evaluation projects. These projects may...  ...Demonstrated passion for AI and can share specific, related...  ...philosophy, psychology, linguistics, data science, or related fields.... 
    Data

    Perplexity

    San Francisco, CA
    3 days ago
  •  ...Science Professionals to join their Expert Network. In this role, you'll evaluate AI-generated scientific responses, fact-check technical claims, and ensure ethical alignment in biological data. Ideal candidates will have a BS, MS, or PhD in relevant fields and experience... 
    Data
    Remote job
    Work from home
    Flexible hours

    Prolific Academic Ltd

    San Francisco, CA
    3 days ago
  • $172.43k - $230.95k

     ...Senior Software Engineer For The Ai Model Lifecycle Team Crusoe is...  ...energy, manufacturing, data center construction, and cloud...  ...management: versioning, lineage, evaluation, and reproducible fine-tuning...  ...years of industry experience leading and driving impactful... 
    Data
    Temporary work

    Crusoe

    San Francisco, CA
    3 days ago
  • Build the AI infrastructure layer of the physical world At Meter...  ...team to build and train models that understand these systems,...  ...latency really matter. Unmatched data advantage, control over the full...  ...all decisions on a network. Evaluate model performance over real‑... 
    Data

    Meter

    San Francisco, CA
    17 hours ago
  • $281k - $356k

     ...Staff Software Engineer, Model Post Training Waymo...  ...engineers to join our team to lead the post-training LLM...  ...generation of frontier AI models. You will:...  ...researchers across ML, infra, and data teams. Raise the...  ...for how Waymo trains, evaluates, and deploys LLM models... 
    Data
    Full time
    Remote work

    Waymo

    San Francisco, CA
    4 days ago
  •  ...Research Engineer - Language Model Pre-Training , you'll shape our...  ...collection, processing, and evaluation Architecture and methodology...  ...training pipelines – including model/data parallelism, distributed...  ...we do and love discussing AI Benefits and Perks: Comprehensive... 
    Data
    Work at office
    Relocation package

    Zyphra

    San Francisco, CA
    26 days ago
  • $320k

     ...interpretable, and steerable AI systems. We want AI to be safe...  ...Research Engineers to build the evaluations that tell us — and the world —...  ...and leadership use to monitor model health during training,...  ...operating distributed systems, data pipelines, or other infrastructure... 
    Data
    Remote job
    Work at office
    Visa sponsorship
    Flexible hours
    San Francisco, CA
    a month ago
  •  ...for the world's most dynamic AI companies, like Cursor, Notion...  ...frontier of AI to bring cutting-edge models into production. We're growing...  ...helping developers discover, evaluate, and select the right models...  ...low‑code, API‑first, or model/data platform company. BENEFITS... 
    Data
    Flexible hours

    Baseten

    San Francisco, CA
    3 days ago
  • $40 per hour

    A leading AI data firm is looking for Python Developers to join as Domain Expert participants. The role involves training and evaluating AI models through Python tasks. Candidates should have verifiable Python experience, strong attention to detail, and a reliable internet... 
    Data
    Hourly pay
    Remote work
    Work from home
    Flexible hours

    Prolific

    San Francisco, CA
    17 hours ago
  • Refresh AI is seeking a Research Engineer in San Francisco to push the boundaries of benchmarking technology. You will build benchmarks that labs use for evaluating coding abilities and computer-use capability. Your role will require expertise in reinforcement learning... 
    Full time

    Refresh AI

    San Francisco, CA
    3 days ago
  • $208k - $300k

    A leading AI company is seeking a Machine Learning Engineer in the Public Sector to develop automated evaluation pipelines for AI models. You will work on advanced AI systems and ensure they perform reliably in mission-critical environments. Ideal candidates have a strong... 

    Scale AI, Inc.

    San Francisco, CA
    3 days ago
  • $148.5k - $266.2k

     ...Engineering Manager on the Model Delivery team within...  ...Autodesk Research, you will lead production ML...  ...deployment, monitoring, evaluation, reliability, and operational...  ...generative models and other AI capabilities used across...  ...) Experience with 3D data (geometry/CAD/BIM) and/or... 
    Data
    For contractors
    Remote work

    Autodesk

    San Francisco, CA
    3 days ago
  •  ...organization in San Francisco is seeking a Research & Evaluation Senior Lead to lead research efforts and manage impact...  ...requires over 8 years of experience in data analysis leadership, with strong skills in survey design and using AI tools for analysis. Responsibilities... 
    Data
    Remote work
    Flexible hours

    aiEDU

    San Francisco, CA
    17 hours ago
  • $300 per month

     ...create ambitiously with AI — without sacrificing...  ...Software Engineer for the Model LifeCycle team will...  ...: versioning, lineage, evaluation, and reproducible fine-...  ...of consistent success leading a varied portfolio of initiatives...  ...alignment with market data. Equal Opportunity... 
    Data
    Temporary work

    Crusoe Energy Systems LLC

    San Francisco, CA
    3 days ago
  •  ...Research Engineering Manager to lead the team of all-star AI researchers and engineers...  ...for developing the models that drive our products. Our...  ...technical contributions. Own the data, training, and eval...  ...iteration velocity. Design evaluations and improve the production... 
    Data

    Perplexity AI Inc.

    San Francisco, CA
    2 days ago
  • $125k - $135k

     ...The AI Education Project (aiEDU) is a non‑profit devoted to...  ...About the Role The Research & Evaluation Senior Lead role is responsible for...  ...impact and managing our impact data and organizational dashboards...  ...agenda aligned with a logic model for our organization’s impact... 
    Data
    Temporary work
    Local area
    Immediate start
    Remote work
    Home office
    Flexible hours

    aiEDU

    San Francisco, CA
    17 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Data & Model Evaluation Lead. Be the first to apply!