
ML Data Engineer - Pipelines, Datasets & Quality

Sesame

Sesame is seeking a Data Engineer in San Francisco to build and maintain the data pipelines that its AI models depend on. You will collaborate with machine learning engineers to ensure access to structured data for model training and evaluation. The role focuses on developing production pipelines for complex data, including voice and conversational data, and calls for strong SQL and Python skills. At Sesame, you will contribute significantly to data workflows and governance.

Vacancy posted 3 days ago
Similar jobs that could be interesting for you
Based on the ML Data Engineer - Pipelines, Datasets & Quality in San Francisco, CA vacancy
  • Gravity Engineering Services Pvt Ltd. in San Francisco seeks a Machine Learning Engineer to enhance our data processing and generation systems. The ideal candidate has strong...  ...build and maintain critical data pipelines and ensure the quality and performance of our models.... 
    Pipeline
    Quality

    Gravity Engineering Services Pvt Ltd.

    San Francisco, CA
    5 days ago
  •  ...across deployment targets, from data center accelerators to on-...  ...depends on purpose-built datasets. We need ML-minded engineers who can collect, filter, and synthesize high-quality data at scale. We treat...  ..., filtering, and selection pipelines at scale Create pipelines... 
    Pipeline
    Quality

    Gravity Engineering Services Pvt Ltd.

    San Francisco, CA
    1 day ago
  •  ...community of world-class engineers, researchers, and...  ..., and generating high-quality text data for pretraining, midtraining...  ...1 year of experience Dataset Engineering: Expertise...  ...models in popular ML frameworks, and experience...  ...filtering, selection pipeline that can handle >100TB... 
    Pipeline
    Quality

    Liquid AI

    San Francisco, CA
    1 day ago
  •  ...AWS Data Engineer Location: San Francisco and Jersey City (...  ...Spark, AWS, data lake, data pipelining, Python Job Description...  ..., and ensuring data quality and governance. Your expertise...  ...to process and analyze large datasets. • Orchestration: Use Airflow... 
    Pipeline
    Quality
    Remote work

    Apex Informatics

    San Francisco, CA
    2 days ago
  • $210k - $300k

    Gerra Group, located in San Francisco, is seeking an experienced ML Engineer to build systems that enhance training data for physical AI. You'll develop ML pipelines, computer vision systems, and collaborate with customers to optimize their ML training requirements. The... 
    Pipeline
    Quality

    Gerra Group

    San Francisco, CA
    1 day ago
  •  ...Francisco is seeking a Machine Learning Engineer to ensure the quality and coverage of data across diverse languages. You will design large-scale datasets, evaluate models, and implement...  ...datasets and a strong background in applied ML. This full-time role offers... 
    Quality
    Full time
    Work at office

    Cartesia

    San Francisco, CA
    3 days ago
  •  ...Staff Machine Learning Engineer, Data & Eval United States AI and ML are at the heart of the...  ...will define how we measure quality, how we turn feedback...  ...checks, labeling strategy, dataset versioning, and...  ...Develop and productionize pipelines for dataset creation, model... 
    Pipeline
    Quality
    Work experience placement
    Casual work
    Live in
    Work at office
    Remote work

    airbnb, Inc.

    San Francisco, CA
    2 days ago
  •  ...payout, which means the data infrastructure...  ...first dedicated data engineering hire, you'll own the full...  ...warehouse architecture, pipeline reliability, and the systems...  ...into clean, trusted datasets. Pipeline...  ...real decisions. Data quality and reliability. Build... 
    Pipeline
    Quality

    Triumph

    San Francisco, CA
    12 hours ago
  • $180k - $250k

     ...build the platform engineers turn to to ship AI...  ...We’re hiring a Data Engineer to build...  ...data into reliable datasets that power decision...  ...the data models, pipelines, and analytics infrastructure...  ...reliability and quality through testing,...  ...to a variety of ML startups, offering... 
    Pipeline
    Quality
    Flexible hours

    Baseten

    San Francisco, CA
    2 days ago
  • $160k - $260k

     ...leading FinTech unicorn in San Francisco is looking for a Data Engineer. In this hybrid role, you'll collaborate with data...  ...data engineering. You will be responsible for optimizing datasets and ensuring data quality management. This position offers a competitive salary between... 
    Pipeline
    Quality

    Kikoff Inc.

    San Francisco, CA
    1 day ago
  •  ...new Machine Learning Engineer opportunities posted on...  ...learning systems including data ingestion,...  ...and optimize end-to-end ML pipelines encompassing data collection...  .... Ensure data quality, observability, and performance...  .... Analyze large datasets to derive insights and... 
    Pipeline
    Quality
    Flexible hours

    AI Chopping Block, Inc.

    San Francisco, CA
    4 days ago
  • Zyphra, an AI company based in San Francisco, is looking for a Data Engineer specialized in multimodal systems. You'll contribute to the creation and improvement of datasets and data pipelines across various modalities. Ideal candidates have experience in large dataset... 
    Pipeline

    Energy Jobline ZR

    San Francisco, CA
    4 days ago
  • $150k - $300k

    An early-stage AI data company that went...  ...You will own the ML systems that turn...  ...code with production quality. Work with a founder...  ..., LLM inference pipelines, distributed training...  ...ML research and engineering cycle, from...  ...hard problems on a dataset of hundreds of millions... 
    Pipeline
    Quality

    Open Select

    San Francisco, CA
    3 days ago
  • $225k - $285k

     ...in California is seeking a Senior Data Engineer to build the enrichment platform for contact datasets. This role requires over 5 years of...  ...include designing pipelines to handle millions of records, ensuring data quality, and optimizing vendor costs. The... 
    Pipeline
    Quality

    Unify

    San Francisco, CA
    2 days ago
  • Machine Learning Engineer - Perception Models At Mach9, ML Engineers build the perception...  ...extraction pipeline—image and 3D...  ...: problem framing, data strategy, training,...  ...Python and a production‑quality ML library like PyTorch...  ...large unstructured datasets—imagery and 3D... 
    Pipeline
    Quality

    Mach9

    San Francisco, CA
    2 days ago
  • $190k - $320k

     ...Responsibilities: Own evaluation pipelines — design, build, and...  .... Harness the data — create tooling for...  ..., privacy-aware dataset curation and discovery...  ...live evals — surface quality regressions before users...  .... Proven software engineer who loves ML; comfortable writing... 
    Pipeline
    Quality
    Full time
    Contract work
    Flexible hours
    Shift work

    Dormont Manufacturing Co

    San Francisco, CA
    1 day ago
  •  ...believe culture can be engineered - but when it falls...  ...We're looking for an ML infrastructure engineer...  ...vehicle compute to data collection to dataset curation to large-scale...  ...batch compute pipelines for cataloging, exploring...  ...curating raw data into high-quality training sets Design... 
    Pipeline
    Quality
    Local area

    Humble Robotics

    San Francisco, CA
    1 day ago
  • $185k - $225k

    Data Engineer, You.com. Posted by Mariane Bekker...  ...Our team includes engineers, researchers,...  ...-performance data pipelines and systems. You’ll...  ...helping ensure data quality, accessibility,...  ...and manage curated datasets to support analytics...  ...Support AI/ML and agent-based applications... 
    Pipeline
    Quality
    Full time
    Immediate start
    Remote work
    Work from home

    Founders Bay Inc.

    San Francisco, CA
    4 days ago
  • $200k - $300k

     ...s largest proprietary dataset for deformable food manipulation...  ...Model. As a Senior ML Engineer, Foundation Models,...  ...Model, building the data infrastructure that...  ...tuning, and alignment pipelines that improve the model...  ...reliable, production‑quality training and evaluation... 
    Pipeline
    Quality
    Flexible hours

    Chef Robotics

    San Francisco, CA
    1 day ago
  • Strava is seeking an Engineering Manager to lead its Data Products team focused on AI strategies...  ...'s data into enriched datasets and driving technical...  ...experience in managing AI/ML teams, developing complex...  ...and building robust data pipelines. The position follows a hybrid... 
    Pipeline
    Work at office
    3 days per week

    Strava

    San Francisco, CA
    4 days ago
  • $200k - $300k

     ...s largest proprietary dataset for deformable food manipulation...  .... As a Senior ML Engineer, Manipulation, you...  ...to-end: from defining data collection strategies...  ...build data collection pipelines using teleoperation, kinesthetic...  ...reliable, production-quality training and... 
    Pipeline
    Quality
    Flexible hours

    Alumni Ventures

    San Francisco, CA
    4 days ago
  •  ...the Role We are seeking a Data Infrastructure Engineer to build and operate the infrastructure...  ...data into production datasets, models, and customer-...  ...reproducibility, and raise the quality bar for production systems....  ...operate scalable data and ML infrastructure on AWS,... 
    Quality
    Permanent employment
    Full time

    Matter Intelligence

    San Francisco, CA
    1 day ago
  •  ...bringing autonomy to software engineering, and we’re hiring a Data Engineer to own the...  ...data stack, designing the pipelines, models, and integrations...  ...surface at Factory, from core datasets and dashboards to the...  ...the company. Drive data quality, observability,... 
    Pipeline
    Quality
    Work at office

    Factory

    San Francisco, CA
    1 day ago
  • $160k - $260k

     ...Responsibilities We are looking for a Data Engineer or Analytics Engineer to join...  ...data models, foundational datasets and scalable infrastructure...  ...: Build and optimize high-quality ergonomic foundational datasets and the relevant data pipelines Establish data quality... 
    Pipeline
    Quality
    Work experience placement
    Local area

    Kikoff

    San Francisco, CA
    4 days ago
  • $170k - $220k

    Job Title Data Engineer Salary $170K - $220K + Equity Company Description Duckbill...  .... You will design scalable ETL pipelines, tackle massive semistructured datasets at scale, and own the data...  ...Develop automated data validation and quality control systems using Python and... 
    Pipeline
    Quality

    Jack & Jill/External ATS

    San Francisco, CA
    2 days ago
  •  ...Analytics team serves as the data backbone for this...  ...the data models, pipelines, metrics, and reporting...  ...We are seeking a Data Engineer to help build and scale...  ...functions. Develop trusted datasets and reporting systems...  ...systems. Improve data quality, lineage,... 
    Pipeline
    Quality

    The Consulting Solutions

    San Francisco, CA
    1 day ago
  • $170k - $220k

    Windfall is seeking a Sr. Data Engineer to join our data team. As...  ...personally design and build the pipelines for massive datasets, taking them all the way...  ...data science team to run ML models on top of billions...  ...making trade-offs between quality, complexity, and speed-of-... 
    Pipeline
    Quality

    Dormont Manufacturing Co

    San Francisco, CA
    2 days ago
  •  ...the first AI software engineer, and Windsurf, the AI-native...  ...to own our full data stack - from database architecture and pipelines to integrations and reporting...  ...business reporting: datasets, dashboards, metrics, and...  ...analytics Ensure data quality, observability,... 
    Pipeline
    Quality

    Dormont Manufacturing Co

    San Francisco, CA
    5 days ago
  • The Data Engineer will be responsible for collecting, parsing, managing...  ..., and visualizing large datasets to transform information into...  ...repeatable, and secure data pipelines across various platforms. The...  ...deliver curated, trusted, and quality data into the Common Data... 
    Pipeline
    Quality

    Compunnel

    San Francisco, CA
    5 days ago
  • $225k - $285k

     ...a machine learning research engineer at Scale AI. The rest of our...  ...generation of go-to-market. As Senior Data Engineer for Enrichment, you'...  ...the largest, highest-quality first-party contact dataset in the market. You'll design the pipelines that ingest data from multiple... 
    Pipeline
    Quality

    Unify

    San Francisco, CA
    2 days ago
