ML Data Engineer - Pipelines, Datasets & Quality
Sesame
Sesame is seeking a Data Engineer to build and maintain data pipelines crucial for AI models in San Francisco. You will collaborate with machine learning engineers to ensure access to structured data for model training and evaluation. This role focuses on developing production pipelines for complex data including voice and conversational data, underlining strong SQL and Python skills. At Sesame, you will contribute significantly to data workflows and governance. #J-18808-Ljbffr Sesame
- Gravity Engineering Services Pvt Ltd. in San Francisco seeks a Machine Learning Engineer to enhance our data processing and generation systems. The ideal candidate has strong... ...build and maintain critical data pipelines and ensure the quality and performance of our models....PipelineQuality
- ...across deployment targets, from data center accelerators to on-... ...depends on purpose-built datasets. We need ML-minded engineers who can collect, filter, and synthesize high-quality data at scale. We treat... ..., filtering, and selection pipelines at scale Create pipelines...PipelineQuality
- ...community of world-class engineers, researchers, and... ..., and generating high-quality text data for pretraining, midtraining... ...1 year of experience Dataset Engineering: Expertise... ...models in popular ML frameworks, and experience... ...filtering, selection pipeline than can handle >100TB...PipelineQuality
- ...AWS Data Engineer Location: San Francisco and jersey city (... ...Spark, AWS, data lake, data pipelining python Job Description... ..., and ensuring data quality and governance. Your expertise... ...to process and analyze large datasets. • Orchestration: Use Airflow...PipelineQualityRemote work
$210k - $300k
Gerra Group, located in San Francisco, is seeking an experienced ML Engineer to build systems that enhance training data for physical AI. You'll develop ML pipelines, computer vision systems, and collaborate with customers to optimize their ML training requirements. The...PipelineQuality- ...Francisco is seeking a Machine Learning Engineer to ensure the quality and coverage of data across diverse languages. You will design large-scale datasets, evaluate models, and implement... ...datasets and a strong background in applied ML. This full-time role offers...QualityFull timeWork at office
- ...Staff Machine Learning Engineer, Data & Eval United States AI and ML are at the heart of the... ...will define how we measure quality, how we turn feedback... ...checks, labeling strategy, dataset versioning, and... ...Develop and productionize pipelines for dataset creation, model...PipelineQualityWork experience placementCasual workLive inWork at officeRemote work
- ...payout, which means the data infrastructure... ...first dedicated data engineering hire, you'll own the full... ...warehouse architecture, pipeline reliability, and the systems... ...into clean, trusted datasets. Pipeline... ...real decisions. Data quality and reliability. Build...PipelineQuality
$180k - $250k
...build the platform engineers turn to to ship AI... ...We’re hiring a Data Engineer to build... ...data into reliable datasets that power decision... ...the data models, pipelines, and analytics infrastructure... ...reliability and quality through testing,... ...to a variety of ML startups, offering...PipelineQualityFlexible hours$160k - $260k
...leading FinTech unicorn in San Francisco is looking for a Data Engineer. In this hybrid role, you'll collaborate with data... ...data engineering. You will be responsible for optimizing datasets and ensuring data quality management. This position offers a competitive salary between...PipelineQuality- ...new Machine Learning Engineer opportunities posted on... ...learning systems including data ingestion,... ...and optimize end-to-end ML pipelines encompassing data collection... .... Ensure data quality, observability, and performance... .... Analyze large datasets to derive insights and...PipelineQualityFlexible hours
- Zyphra, an AI company based in San Francisco, is looking for a Data Engineer specialized in multimodal systems. You'll contribute to the creation and improvement of datasets and data pipelines across various modalities. Ideal candidates have experience in large dataset...Pipeline
$150k - $300k
An early-stage AI data company that went... ...You will own the ML systems that turn... ...code with production quality. Work with a founder... ..., LLM inference pipelines, distributed training... ...ML research and engineering cycle, from... ...hard problems on a dataset of hundreds of millions...PipelineQuality$225k - $285k
...in California is seeking a Senior Data Engineer to build the enrichment platform for contact datasets. This role requires over 5 years of... ...include designing pipelines to handle millions of records, ensuring data quality, and optimizing vendor costs. The...PipelineQuality- Machine Learning Engineer - Perception Models At Mach9, ML Engineers build the perception... ...extraction pipeline—image and 3D... ...: problem framing, data strategy, training,... ...Python and a production‑quality ML library like PyTorch... ...large unstructured datasets—imagery and 3D...PipelineQuality
$190k - $320k
...Responsibilities: Own evaluation pipelines — design, build, and... .... Harness the data — create tooling for... ..., privacy-aware dataset curation and discovery... ...live evals — surface quality regressions before users... .... Proven software engineer who loves ML; comfortable writing...PipelineQualityFull timeContract workFlexible hoursShift work- ...believe culture can be engineered - but when it falls... ...We're looking for an ML infrastructure engineer... ...vehicle compute to data collection to dataset curation to large-scale... ...batch compute pipelines for cataloging, exploring... ...curating raw data into high-quality training sets Design...PipelineQualityLocal area
$185k - $225k
# Data EngineerYou.comPosted by Mariane Bekker... ...Our team includes engineers, researchers,... ...-performance data pipelines and systems.You’ll... ...helping ensure data quality, accessibility,... ...and manage curated datasets to support analytics... ...* Support AI/ML and agent-based applications...PipelineQualityFull timeImmediate startRemote workWork from home$200k - $300k
...s largest proprietary dataset for deformable food manipulation... ...Model. As a Senior ML Engineer, Foundation Models,... ...Model, building the data infrastructure that... ...tuning, and alignment pipelines that improve the model... ...reliable, production‑quality training and evaluation...PipelineQualityFlexible hours- Strava is seeking an Engineering Manager to lead its Data Products team focused on AI strategies... ...'s data into enriched datasets and driving technical... ...experience in managing AI/ML teams, developing complex... ...and building robust data pipelines. The position follows a hybrid...PipelineWork at office3 days per week
$200k - $300k
...s largest proprietary dataset for deformable food manipulation... .... As a Senior ML Engineer, Manipulation, you... ...to-end: from defining data collection strategies... ...build data collection pipelines using teleoperation, kinesthetic... ...reliable, production-quality training and...PipelineQualityFlexible hours- ...the Role We are seeking a Data Infrastructure Engineer to build and operate the infrastructure... ...data into production datasets, models, and customer-... ...reproducibility, and raise the quality bar for production systems.... ...operate scalable data and ML infrastructure on AWS,...QualityPermanent employmentFull time
- ...bringing autonomy to software engineering, and we’re hiring a Data Engineer to own the... ...data stack, designing the pipelines, models, and integrations... ...surface at Factory, from core datasets and dashboards to the... ...the company. Drive data quality, observability,...PipelineQualityWork at office
$160k - $260k
...Responsibilities We are looking for a Data Engineer or Analytics Engineer to join... ...data models, foundational datasets and scalable infrastructure... ...: Build and optimize high-quality ergonomic foundational datasets and the relevant data pipelines Establish data quality...PipelineQualityWork experience placementLocal area$170k - $220k
Job Title Data Engineer Salary $170K - $220K + Equity Company Description Duckbill... .... You will design scalable ETL pipelines, tackle massive semistructured datasets at scale, and own the data... ...Develop automated data validation and quality control systems using Python and...PipelineQuality- ...Analytics team serves as the data backbone for this... ...the data models, pipelines, metrics, and reporting... ...We are seeking a Data Engineer to help build and scale... ...functions. Develop trusted datasets and reporting systems... ...systems. Improve data quality, lineage,...PipelineQuality
$170k - $220k
Windfall is seeking a Sr. Data Engineer to join our data team. As... ...personally design and build the pipelines for massive datasets, taking them all the way... ...data science team to run ML models on top of billions... ...making trade-offs between quality, complexity, and speed-of-...PipelineQuality- ...the first AI software engineer, and Windsurf, the AI-native... ...to own our full data stack - from database architecture and pipelines to integrations and reporting... ...business reporting: datasets, dashboards, metrics, and... ...analytics Ensure data quality, observability,...PipelineQuality
- The Data Engineer will be responsible for collecting, parsing, managing... ..., and visualizing large datasets to transform information into... ...repeatable, and secure data pipelines across various platforms. The... ...deliver curated, trusted, and quality data into the Common Data...PipelineQuality
$225k - $285k
...a machine learning research engineer at Scale AI . The rest of our... ...generation of go-to-market. As Senior Data Engineer for Enrichment, you'... ...the largest, highest-quality first-party contact dataset in the market. You'll design the pipelines that ingest data from multiple...PipelineQuality
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to ML Data Engineer - Pipelines, Datasets & Quality. Be the first to apply!
- machine learning ai engineer San Francisco, CA
- junior machine learning research engineer San Francisco, CA
- ai ml engineer San Francisco, CA
- senior ml engineer San Francisco, CA
- machine learning engineer San Francisco, CA
- graduate machine learning engineer San Francisco, CA
- data scientist machine learning engineer San Francisco, CA
- computer vision machine learning engineer San Francisco, CA
- machine learning software engineer San Francisco, CA
- senior cloud data engineer San Francisco, CA

