ML Data Infrastructure Engineer — Pipelines & Multimodal
Cartesia
Cartesia is looking for a Software Engineer to build the data infrastructure for its AI models in San Francisco. In this hands-on role, you will design and implement scalable data pipelines for multimodal data, particularly audio. Candidates should have experience with ML data systems and demonstrate modern engineering execution. Attractive compensation includes a competitive salary and equity. The position operates in a collaborative office culture with supportive benefits.
$140k - $180k
...Data Infrastructure Engineer Alljoined is creating a future where humans are fully understood... ...data lifecycle, from building pipelines that process massive multimodal datasets (video, audio, text,... ...-speed networking for intensive ML workloads. Have a background... Pipeline, Local area, Visa sponsorship
- ...telemetry, and demonstration data every time they move -... ...to learning. Data infrastructure sits at the center of... ...As a Staff Software Engineer on data infrastructure... ...Droyd, you'll own the pipelines that carry data from robot... ...team across robotics, ML, and hardware. Your... Pipeline, Immediate start
- ...part and supported the Regular Toilet is seeking a Software Engineer to build large-scale models that support our mission of creating... ...-house while utilizing cloud technology to create reliable data infrastructure. The ideal candidate has 5+ years of software engineering... Pipeline
- Judgment Labs builds infrastructure for Agent Behavior Monitoring (ABM).... ...We are looking for a Senior Data Infrastructure Engineer to build and scale the real-time data pipelines that power agent behavior analysis... .... Familiarity with ML workflow orchestration (Airflow... Pipeline
- ...call recording, enrichment, pipeline management) with one... ...’s possible when all the data lives under one roof in the... ...for a Senior Data Platform Engineer to help build Monaco’s data and ML platform – the pipelines, context systems, and infrastructure that power our AI‑driven... Pipeline, Work at office, Shift work
$140k - $200k
...foundational member of our engineering team: a highly... ...and evolution of our data platform. You will be... ...ingestion and management infrastructure that powers Crustdata’... ...AWS, GCP, or Azure). Pipeline Development: Develop and... ...Enable Data Science & ML: Create the... Pipeline, Full time
$235k - $376k
...Data Platform Engineer Figma is growing our team of passionate creatives... ...early engineer shaping Figma's ML and data platform, enabling... ...the intersection of data, infrastructure, and machine learning,... ...building prompt-processing pipelines, instrumenting interactions... Pipeline, Full time, Remote work, Work from home
$140k - $225.08k
...unleashes business‑critical data that is trapped inside... ...Data and AI Platform Engineer will design, build,... ...data science, and AI/ML capabilities at scale.... ...on Snowflake and AI infrastructure, with a strong focus on... ...into robust platforms, pipelines, and services. This... Pipeline, Contract work, Work at office, Local area, Remote work, 2 days per week
- A data solutions firm in San Francisco is seeking an experienced Data Engineer to design and maintain scalable data pipelines supporting AI workflows. The role requires a strong foundation in programming, big data technologies, and cloud services. Ideal candidates will... Pipeline
$142.6k - $176k
Octave is hiring a Sr. Data Engineer in San Francisco to evolve their data platform for AI and ML applications. The role requires extensive experience in data engineering... ...with data scientists to ensure reliable data pipelines. The compensation ranges from $142,600 to $176... Pipeline
- Plaid, headquartered in San Francisco, seeks a Software Engineer in Data Infrastructure to lead projects that enhance machine learning capabilities and data systems. Ideal candidates will have over 5 years of software engineering experience, specifically in data infrastructure... Pipeline
- ...intelligence company based in San Francisco, California. The Role: As a Data Engineer - Multimodal Systems, you will be a core contributor to creating, collecting, and improving Zyphra’s datasets and data pipelines across a variety of modalities. Your work will intersect with... Pipeline, Work at office, Relocation package
$120k - $150k
The Opportunity As a Data Engineer, you are passionate and drive excellence across data ingestion... ...as we enhance our data platform and infrastructure Required to work on-site 3 days a week... ...Data Science algorithms and/or ML ops Experience with a variety of data serialization... Pipeline, Full time, 3 days per week
- ...radar, and sensor data. But today's data... ...analytics, not the multimodal corpora that power... ...generation. Storage and pipelines haven't. The gap... .... Our open‑source engine, Daft, is the... ...labs and public AI infrastructure companies today. We... ...calling APIs). ML/AI research background... Pipeline, Hourly pay, Work at office, Flexible hours, Night shift, 1 day per week
$150k - $170k
...for an experienced Senior Data Platform Engineer, with significant experience... ...platform to support analytics, ML Ops, and business... ...implement end-to-end ML Ops pipelines Architect and manage data... ...Design and implement scalable infrastructure for Large Language Model (LLM... Pipeline, Full time, Remote work, Flexible hours
$135.3k - $178.35k
...Data Infrastructure Engineer Berkeley, CA About Glyphic: At Glyphic Biotechnologies, we plan to... ...Latch, Google Sheets, Confluence), our pipelines are functional but fragile, and scientists... ...work alongside a Staff Scientist, an ML Scientist, and wet-lab teams to... Pipeline, Work at office
$320k
Principal Engineer, AI And Data Platform Engineering (r4941) Own the AI data... ...organization responsible for the infrastructure that underpins autonomy... ...Feedback: Own the pipeline from training to deployment... ...Experience building and operating ML infrastructure at scale (10... Pipeline, Full time, Temporary work, Part time
$157.58k - $262.63k
Purpose of Onyx The Onyx Research Data Tech organization is GSK’s... ...portfolio leadership, data engineering, infrastructure and DevOps, data / metadata... ...knowledge platforms, and AI/ML and analysis platforms, all... ...building data pipelines with ETL/ELT tools and orchestration... Pipeline, Local area
$180k - $250k
...behaviour directly from data. MetaVoice is founded by: Sid, founding engineer at Wayve.ai ( $2B+... ...Experience Experience building infrastructure & distributed data pipelines to process 10s of TBs... ...working with multimodal data in the context of AI/ML products or systems Demonstrated... Pipeline
$150k - $250k
...If you're ready to own data strategy at a high-growth... ...build. As Senior Data Engineer, you won’t just clean datasets and maintain pipelines. You’ll own the entire... ..., and optimize multimodal datasets (text, video,... ...training – Work closely with ML engineers to curate and... Pipeline, Full time, Remote work, Flexible hours
- ...training and inference infrastructure that powers... ...are looking for an engineer to design and implement... ..., scaling pipelines across thousands of... ...closely with the multimodal researchers, and other... ...for multimodal (MM) data that cannot fit in... ...) part of the ML stack. Bonus points... Pipeline
- Droyd in San Francisco is seeking a Staff Software Engineer focused on data infrastructure. You will own data pipelines that convert robot telemetry into valuable training signals. Collaborate directly with a small, senior team across robotics and machine learning to improve... Pipeline
- ...Hinge Health is building the data and ML backbone that powers... ...for our customers. As a Data Engineering Manager leading our Data &... ...world. This is not a pure infrastructure or ML engineering role. We’... ...platform: batch and streaming pipelines, data models, orchestration... Pipeline, Work at office, Local area, Remote work, Worldwide, Flexible hours, 3 days per week
$160k - $230k
Open role Founding Data Infrastructure Engineer San Francisco (On-site) About Us Constellation is creating... ...human. We are generating the richest multimodal dataset ever collected to build a new... ...with the data requirements of modern ML frameworks (PyTorch/TensorFlow). You... Work at office, Local area, Relocation package
- Zyphra, an AI company based in San Francisco, is looking for a Data Engineer specialized in multimodal systems. You'll contribute to the creation and improvement of datasets and data pipelines across various modalities. Ideal candidates have experience in large dataset... Pipeline
$180k - $250k
...Francisco seeks a talented engineer to build infrastructure for voice AI conversations.... ...involves creating distributed data pipelines and optimizing systems for large... ...have experience with AI/ML products, particularly in handling multimodal data, and possess strong problem... Pipeline
- Dormont Manufacturing Co is seeking a Staff Machine-Learning Infrastructure Engineer to drive the development of our computer-vision platform for workplace safety. You will manage data pipelines and build large-scale training infrastructure using Kubernetes. The ideal candidate... Pipeline
$50 - $100 per hour
...Francisco is seeking a network engineer for a contract role. This... ...blends network engineering with data science to structure and annotate data for autonomous infrastructure. The ideal candidate will... ...define schemas for large data pipelines. Compensation ranges from $50... Pipeline, Hourly pay, Contract work
- ...Computer Science, Information Systems, or a related field. Experience: Minimum of 7-8 years of experience in data engineering, with a focus on data architecture and pipeline development. Proven experience with cloud platforms (GCP) and big data technologies (e.g., Airflow,... Pipeline
$157.58k - $262.63k
...Purpose The Onyx Research Data Tech organization is... ...for scientists, engineers, and decision makers,... ...mechanics. We also provide AI/ML and data analysis... ...CLI, Azure, AWS) and infrastructure‑as‑code for delivery.... ...Experience building data pipelines with ETL/ELT tools and... Pipeline, Work experience placement, Local area

