ML Infra Architect: Build Scalable Data & Training Pipelines

Humble Robotics

A leading robotics company in San Francisco is looking for an ML Infrastructure Engineer to design and scale foundational systems for their autonomous vehicles. The ideal candidate should have experience in building high-availability web services, working with cloud infrastructure, and developing efficient data collection and training pipelines. Additionally, familiarity with tools like Terraform and Ansible is essential. This is an opportunity to make a significant impact in a fast-paced, innovative environment.

Vacancy posted 5 days ago
Similar jobs that could be interesting for you
Based on the ML Infra Architect: Build Scalable Data & Training Pipelines in San Francisco, CA vacancy
  •  ...and optimize our training systems and core...  ...to building reusable and efficient...  ...efficient JAX training pipelines. You’ll work...  ...intersection of ML, software engineering, and scalable infrastructure....  ...with research, data, and platform engineers...  ...needs into infra capabilities and... 
    Pipeline
    Data
    Training

    Physical Intelligence

    San Francisco, CA
    4 days ago
  •  ...re in stealth, building an Autonomous Detection...  ...; we are architecting a constellation...  ...in live data. Live in the Signal...  ...0-minute CI/CD pipelines. No "Change Management...  ...traces) and ML/AI systems....  ...As a Detection Infra lead, you’ll build...  ...fed back into training sets to harden... 
    Pipeline
    Data
    Training
    Live in

    Cerebras

    San Francisco, CA
    7 hours ago
  •  ...Staff Software Engineer, ML Infra & Distributed Systems...  ...and Product teams to build world-class machine...  ...Responsibilities: Design and build scalable, high throughput, and...  ...of the art. Take a data driven approach to...  ...of ML model training pipelines and model internals. Experience... 
    Pipeline
    Data
    Training

    Tubi TV

    San Francisco, CA
    4 days ago
  • $230k - $287.5k

     ...the Director, AI Architect at Headspace:...  ...applied research, ML science, and...  ...thinks about, builds with, and deploys...  ..., responsibly, scalably, and in ways...  ...explainability, data governance, and...  ...preserving data pipelines and continuously...  ...-adapted model training for specialized... 
    Pipeline
    Data
    Training
    Full time
    Work at office
    Local area
    3 days per week

    Headspace

    San Francisco, CA
    5 days ago
  •  ...interact with the web by building AI agents that can...  ...be agent‑first, from training our own models to generative...  ...: Scale infra for post-training of multimodal...  ...: Experience with ML infrastructure (GPU...  ...performance of multimodal LLMs (data/tensor/pipeline/context/expert... 
    Pipeline
    Data
    Training
    Work at office
    Relocation
    Visa sponsorship

    Yutori

    San Francisco, CA
    3 days ago
  •  ...ML Systems Engineer – Robotics & AI We are building the full-stack foundation for the next...  ...unseen in training. We work at the intersection...  ...robotic data including vision,...  ...model changes into scalable implementations. Provide...  ...training, tensor/pipeline parallelism, gradient... 
    Pipeline
    Data
    Training

    Maxwell Bond

    San Francisco, CA
    7 hours ago
  •  ...Reflection’s mission is to build open...  ...scale distributed training systems that power...  ...experimental ideas into scalable, production-ready training...  ...maintain training pipelines that support large-...  ...strategies (data, tensor, pipeline,...  ...working closely with ML researchers to productionize... 
    Pipeline
    Data
    Training
    Relocation package

    Reflection

    San Francisco, CA
    7 hours ago
  • Ensure that ML models can be effectively developed...  ...ML models - integrate trained ML models with Production systems Build and manage ML pipelines - design, build, and...  ...pipelines including data ingestion, data preprocessing...  ..., efficiency, and scalability of ML models and their... 
    Pipeline
    Data
    Training
    Permanent employment
    Contract work
    Local area

    Cloud Hybrid Technologies, LLC

    San Francisco, CA
    3 days ago
  • $218.4k - $365.2k

     ...Software Engineering Architect focusing on Spiff,...  ...optimization, and data cloud connectivity...  ...management, and building developer‑facing...  ...architecting analytics pipelines, data integration...  .... Experience with ML/AI model...  ...promotion, benefits, training, assessment of job... 
    Pipeline
    Data
    Training
    Contract work

    Centaur Labs

    San Francisco, CA
    5 days ago
  •  ...cloud infrastructure and data orchestration to...  ...the founding team (AI/ML, product, and design)...  ...Terraform). Establish CI/CD pipelines and best practices...  ..., Kubernetes, etc.). Build and maintain scalable data ingestion and...  .../AI workflows (model training, inference,... 
    Pipeline
    Data
    Training

    Pear VC

    San Francisco, CA
    4 days ago
  •  ...growing startup that is building the AI backbone...  ...AI models"-not data or raw compute, but...  ...Engineer (ML Infrastructure) to...  ...large-scale model training and deployment....  ...distributed training pipelines, cloud-native infrastructure...  ...requirements into scalable backend and... 
    Pipeline
    Data
    Training

    Rockstar

    San Francisco, CA
    2 days ago
  •  ...contributing to the design of scalable architecture by building distributed systems...  ...including training pipelines and real-time...  ...industries. Lead – AI/ML Stack Infrastructure...  ...machine learning, data science, product engineering...  ...deployment. Architect compute and storage... 
    Pipeline
    Data
    Training
    Temporary work

    AI Chopping Block, Inc.

    San Francisco, CA
    4 days ago
  • $320k - $405k

     ...working together to build beneficial AI...  ...Infrastructure Engineer, Node Infra About the role...  ...how quickly we can train new models, how...  ...long-term compute, data, and infrastructure...  ...node provisioning pipelines Low-level systems...  ...InfiniBand) for distributed ML workloads.... 
    Pipeline
    Data
    Training
    Visa sponsorship

    Menlo Ventures

    San Francisco, CA
    6 days ago
  •  ...infrastructure for monetization and ads systems. The role involves building data pipelines, creating training platforms, and collaborating with various teams to...  ...engineering, particularly in distributed systems and ML workflows. Join us in shaping the future of AI at OpenAI... 
    Pipeline
    Data
    Training

    AI Chopping Block, Inc.

    San Francisco, CA
    7 hours ago
  •  ...Slope is seeking an experienced Software Engineer in San Francisco to build machine learning infrastructure for monetization systems. You will design large-scale data pipelines, work on model training platforms, and enhance system performance. The ideal candidate will... 
    Pipeline
    Data
    Training

    Slope

    San Francisco, CA
    4 days ago
  • $405k

     ...working together to build beneficial AI...  ...protects user data across our AI...  ...for AI training and inference,...  ...platform and infra frameworks underpinning...  ..., and ML‑based PII detection...  ...risks and architecting scalable mitigations Enable...  ..., and data pipelines Communicate and... 
    Pipeline
    Data
    Training

    Menlo Ventures

    San Francisco, CA
    7 hours ago
  • $250k - $350k

    Most AI roles build on top of models. This one builds what...  ...work. We’re hiring ML Infrastructure Engineers...  ...This isn’t clean benchmark data. It’s messy, continuous,...  ...across: High-throughput video pipelines handling millions of hours of data Training and inference systems for... 
    Pipeline
    Data
    Training

    Trades Workforce Solutions

    San Francisco, CA
    3 days ago
  •  ...in business analytics, data science, and software engineering...  ...Familiarity with CI/CD pipelines, orchestration tools (...  ..., develop, and deploy scalable and secure Snowflake...  ...and Reporting: Build monitoring dashboards to...  ...adoption, and compliance. Training and Enablement: Lead... 
    Pipeline
    Data
    Training

    Tredence

    San Francisco, CA
    4 days ago
  •  ...Overview The Principal Architect leads HPC AI...  ...an enterprise to build its first private...  ...centered on the NVIDIA data center ecosystem....  ...optimized for AI training, inference, and HPC...  ...performance, scalability, resiliency, and cost...  ...Optimize end‑to‑end data pipelines, including GPU... 
    Pipeline
    Data
    Training
    For contractors
    Remote work
    Work from home

    SupportFinity

    San Francisco, CA
    4 days ago
  •  ...About the Role ML Ops Engineer —...  ...by 8VC, we're building a world-class team...  ..., agent-native pipelines, retrieval-...  ...automating the model training, deployment,...  ...agents and AI data fabric. You’ll...  ...maintain secure, scalable, and automated...  ...engineering, or infra-focused ML roles... 
    Pipeline
    Data
    Training
    Full time

    Fabrion

    San Francisco, CA
    3 days ago
  •  ...Principal Architect, Data Knowledge Platform The Knowledge...  ...AI strategy. We are building the foundational infrastructure...  ...data, AI, and ML into business impact....  ...callable tools, automated pipelines, and transformation...  ...environment. Design scalable patterns that satisfy... 
    Pipeline
    Data
    Local area
    Remote work

    Airwallex

    San Francisco, CA
    3 days ago
  • $176k - $220k

     ...The GTM Architect, Enterprise will report...  ...opportunity to build and lead a GTM...  ...tools) to support scalable growth...  ...metrics across pipeline health, deal velocity...  ...setting Ensure data integrity,...  ...familiarity with AI, ML, or data...  ...relevant education or training. Scale... 
    Pipeline
    Data
    Training
    Full time

    Scale AI

    San Francisco, CA
    21 days ago
  • $217k - $325k

     ...platforms to enable scalable and secure access to...  ...solutions. Design, build, and optimize scalable model training, fine tuning, and inference pipelines, ensuring robust integration...  ...to streamline ML and AI workflows, improve...  ..., including data and applied scientists... 
    Pipeline
    Data
    Training
    Full time
    Work at office
    Flexible hours

    GoFundMe

    San Francisco, CA
    more than 2 months ago
  • Building Open Superintelligence Infrastructure...  ...models to the infra that enables...  ...to create, train, and deploy them...  ...with synthetic data and teaching LLMs...  ...generation pipeline and orchestration...  ...advancements in AI/ML infrastructure...  ...and scalability of AI workloads... 
    Pipeline
    Data
    Training
    Remote work
    Worldwide
    Visa sponsorship
    Relocation package
    Flexible hours

    Prime-Intellect

    San Francisco, CA
    2 days ago
  •  ...an MLOps Engineer to design and maintain scalable ML infrastructure. Your role will involve...  ...performance, and ensuring robust deployment pipelines. Strong experience in Python and...  ...You will work alongside ML engineers and data scientists to enhance production systems... 
    Pipeline
    Data

    Sierracorp

    San Francisco, CA
    6 days ago
  • $200k - $250k

     ...Overview Build and operate the ML platform that powers AppFolio’s AI-native...  ...Estate platform, ensuring scalable training, inference, and cost‑...  ...language models, including data pipelines, GPU orchestration, and evaluation...  ...with a significant AI infra footprint. Experience... 
    Pipeline
    Data
    Training
    Remote work

    AppFolio

    San Francisco, CA
    7 hours ago
  • $218.4k - $365.2k

     ...Architect, Data Platform — AgentExchange...  ...infrastructure, telemetry pipelines, marketplace...  ...platform, and the ML-powered signals...  ...contracts every team builds to, and architect...  .... Feature store, training and serving infrastructure...  ..., performant, and scalable in everything we... 
    Pipeline
    Data
    Training
    Contract work
    Live in
    Work at office

    Salesforce

    San Francisco, CA
    4 days ago
  • $150k - $300k

    Building Open Superintelligence Infrastructure...  ...models to the infra that enables...  ...to create, train, and deploy them...  ...advancements in AI/ML infrastructure...  ...end-to-end pipelines for training and...  ...performance and scalability of AI workloads...  ...such as data, tensor & pipeline... 
    Pipeline
    Data
    Training
    Remote work
    Worldwide
    Visa sponsorship
    Relocation package
    Flexible hours

    Prime-Intellect

    San Francisco, CA
    6 days ago
  • $250k - $380k

     ...running OpenAI's LLM training and inference...  ...for building standardized dataset...  ...interfaces, scaling pipelines across...  ...researchers, and other infra groups to...  ...multimodal (MM) data that cannot fit...  ...modular code, and scalable abstractions, while...  ...) part of the ML stack.... 
    Pipeline
    Data
    Training

    OpenAI

    San Francisco, CA
    1 day ago
  •  ...Nexxa.ai is building artificial super intelligence...  ...Title: Senior AI Architect Location:...  ...models into reliable, scalable, and user‑facing systems...  ...in company data, using large‑scale...  ...Define data models and pipelines that power AI use...  ...in production ML frameworks (e.g.,... 
    Pipeline
    Data

    Nexxa.AI

    San Francisco, CA
    4 days ago
