Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

ML Platform Engineer

Foxglove Technologies, Inc

Build the data infrastructure for robots operating in the real world.

Robotics is moving from research labs into production across factories, warehouses, vehicles, and field deployments. When robots fail, behave unexpectedly, or need to be improved, engineers rely on data to understand what actually happened.

At Foxglove, we build the observability, visualization, and data infrastructure that makes that possible. Our tools are used by robotics and autonomous systems teams to ingest, store, query, replay, and analyze massive volumes of multimodal sensor data from live systems and from production fleets.

About the Role

We're looking for a ML Platform Engineer with deep infrastructure instincts to help design, deploy, and scale the systems that power Foxglove's data platform. This is a platform-first role: you'll own the infrastructure layer that makes ML possible in production, not just the models that run on top of it.

You'll be responsible for the reliability, scalability, and performance of the ML platform itself, from inference serving and pipeline orchestration to training infrastructure and evaluation frameworks. The problems are real and urgent: petabyte-scale multimodal robotics data, high-throughput retrieval and embedding pipelines, and the internal ML flywheel that lets our team ship fast. This is a hands-on infrastructure role, not research.

Key Responsibilities
  • Design, deploy, and operate production inference infrastructure - including model serving, autoscaling, load balancing, and cost optimization across cloud environments
  • Own the platform architecture for embedding and retrieval pipelines that power semantic search over multimodal robotics data (image, video, point cloud, and timeseries)
  • Build and maintain the training and evaluation infrastructure that enables rapid iteration on model performance - including job orchestration, experiment tracking, and dataset versioning
  • Drive cloud infrastructure decisions (AWS/GCP) that directly impact latency, throughput, reliability, and cost at scale
  • Define platform abstractions and internal tooling that let product engineers ship ML-powered features without needing to manage infrastructure themselves
  • Evaluate, integrate, and operationalize third-party ML infrastructure components; establish clear build vs. buy frameworks for the team
What We're Looking For
  • Deep, hands-on experience owning production ML infrastructure: inference serving, model optimization (e.g., vLLM, Triton, TorchServe), orchestration, and cloud cost management
  • Strong foundation in distributed systems and cloud infrastructure (AWS/GCP) - you think in terms of system reliability, failure modes, and operational burden, not just model accuracy
  • Experience architecting and operating retrieval systems at scale, including vector databases (e.g., Pinecone, Lance, turbopuffer, pgvector) and embedding pipelines over large, heterogeneous datasets
  • A platform engineer's mindset: you build systems that other engineers depend on, and you take that responsibility seriously
  • Proven ability to operate with high ownership - you can make hard infrastructure tradeoffs independently and move fast without breaking things
  • Strong communication skills; you can explain infrastructure tradeoffs clearly to both ML and non-ML engineers
Bonus Points
  • Familiarity with fine-tuning and domain adaptation techniques for LLMs or embedding models (i.e. SFT, PEFT)
  • Familiarity with data mining or hybrid search workflows, especially as applied in robotics autonomous vehicles, or physical AI workflows
  • Prior experience building ML platforms, evaluation frameworks, or data management tooling from the ground up
What We Offer
  • $300 monthly budget towards commuter benefits or building your personal workspace (remote only)
  • Competitive equity grant in a Series B company
  • Medical, Dental, Vision, and Term Life insurance coverage at 100% for employees and 75% for dependents
  • 401(k) matching up to 4%
  • 4 weeks vacation, plus holidays and winter break
  • All expenses paid company off-sites 2× per year
Why Join Us
  • Impact: Own growth at a fast-growing, high-leverage moment for the company.
  • Mission: Accelerate the development of the next generation of robotics and embodied AI.
  • Team: Work with world-class engineers, designers, and researchers passionate about open-source and developer tools.
  • Ownership: Drive initiatives end-to-end, with high autonomy and visibility.
Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the ML Platform Engineer in San Francisco, CA vacancy
  • A leading livestream shopping platform is looking for an AI/ML Platform Engineer to shape the future of AI and ML systems. This role involves designing the infrastructure that powers machine learning applications, working alongside experts to deploy models at scale. Candidates... 
    Suggested
    Remote work
    Flexible hours

    Whatnot

    San Francisco, CA
    4 days ago
  •  ...reliability. Minimum Qualifications ~ Bachelor's degree in Computer Science, Engineering, or equivalent practical experience ~3+ years in Software Engineering, MLOps, or ML Infrastructure ~ Strong Python proficiency ~ Experience building internal developer... 
    Suggested
    Immediate start
    Relocation package
    Night shift

    AGI

    San Francisco, CA
    1 day ago
  • A decentralized AI platform company in the United States is seeking an experienced ML Training Platform Engineer to design and build robust infrastructure for ML training. The ideal candidate has over 5 years in infrastructure and platform engineering, with expertise in... 
    Suggested

    Pluralis Research

    San Francisco, CA
    4 days ago
  •  ...Staff ML Platform Engineer – Large Scale Training (LLMOps/MLOps) We're TrueFoundry, and we're building the foundational infrastructure for production AI systems. We're looking for a Staff ML Platform Engineer – Large Scale Training (LLMOps/MLOps) to join the team.... 
    Suggested
    Flexible hours

    TrueFoundry

    San Francisco, CA
    15 hours ago
  • $200k

     ...Glocomms is looking for a hands-on Software Engineer to join its early-stage team in San Francisco or London. This role focuses on building and scaling core systems for data, research, and machine learning. The ideal candidate will design data pipelines, develop Kubernetes... 
    Suggested
    Remote work

    Glocomms

    San Francisco, CA
    8 days ago
  • A cutting-edge AI firm in San Francisco is seeking a talented engineer to design and implement robust CI/CD pipelines for machine learning workflows. The ideal candidate will have a bachelor's degree in Computer Science or a related field, with at least 3 years of experience... 

    Pantera Capital

    San Francisco, CA
    2 days ago
  • A dynamic tech company in San Francisco is seeking a seasoned ML Infrastructure Engineer to lead the development of innovative AI product systems. This role entails scaling ML product development infrastructure, collaborating with cross-functional teams, and mentoring... 

    Lightfield

    San Francisco, CA
    4 days ago
  • Icehouseventures is seeking an Infrastructure Engineer in San Francisco, CA, to build and maintain the foundational Kubernetes platform across AWS, GCP, and Azure. The role...  ...controls, and collaborate closely with SRE and ML teams. Ideal candidates will thrive in a startup... 

    Icehouseventures

    San Francisco, CA
    15 hours ago
  •  ...performance and reliability. Minimum Qualifications Bachelor’s degree in Computer Science, Engineering, or equivalent practical experience 3+ years in Software Engineering, MLOps, or ML Infrastructure Experience building internal developer tools, CLIs, or dashboards... 
    Full time
    Immediate start
    Relocation package
    Night shift

    AGI Inc

    San Francisco, CA
    15 hours ago
  • A technology company in San Francisco is seeking a Software Engineer focused on MLOps to design and implement CI/CD pipelines for machine learning workflows. The role involves building scalable evaluation systems, ensuring model performance tracking, and managing cloud... 
    Relocation package

    AGI Inc

    San Francisco, CA
    15 hours ago
  • A cutting-edge AI firm in San Francisco seeks a Software Engineer specializing in machine learning infrastructure. This role requires designing robust CI/CD pipelines and developing internal tools that streamline research processes. The ideal candidate has a Bachelor's... 
    Relocation package

    AGI, Inc.

    San Francisco, CA
    4 days ago
  • Saviynt, located in San Francisco, is seeking an AI Platform Engineer to manage and optimize the training and inference of AI models. You will...  ...advanced GPU clusters. The ideal candidate has a solid foundation in ML engineering, particularly with Ray, LLMs, and experience in... 

    Medium

    San Francisco, CA
    2 days ago
  • $212k - $318.4k

    A leading technology company in San Francisco is seeking a Software Engineer to join its Applied Machine Learning team. This role focuses on designing and building a robust ML platform and infrastructure to support enterprise-level initiatives. Candidates should have at... 

    Apple Inc.

    San Francisco, CA
    1 day ago
  • $245k - $345k

    Whatnot is seeking an AI/ML Engineer in San Francisco who will design and scale infrastructure for machine learning applications. The role requires over 4 years of experience in developing machine learning systems and strong proficiency in Python. Benefits include a competitive... 
    Remote work
    Work from home
    Flexible hours

    Whatnot

    San Francisco, CA
    15 hours ago
  • Strava is seeking a GenAI/ML Platform Engineer to join their AI team in San Francisco. This role involves driving AI/ML platform projects from inception to implementation, utilizing Strava's extensive datasets. You'll work closely with engineers and data scientists to... 
    Work at office
    Flexible hours
    3 days per week

    Strava

    San Francisco, CA
    2 days ago
  • $205k - $235k

     ...Detroit, Houston, Los Angeles, McLean, New York, Hoboken, Philadelphia, San Francisco, Seattle EY-Parthenon - EY Growth Platforms - AI ML Engineering - Director At EY-Parthenon, our unique blend of strategy, transactions and corporate finance, combined with cutting‑edge... 
    Full time
    For contractors
    Work experience placement
    Summer holiday
    Flexible hours

    Ernst & Young Oman

    San Francisco, CA
    2 days ago
  • $205k - $316k

    Quizlet is looking for a Data Platform Engineer to design and build the infrastructure that supports large-scale data processing and machine learning...  .... The role entails building and maintaining the data and ML infrastructure, improving platform usability, and partnering... 
    Work at office
    3 days per week

    Quizlet

    San Francisco, CA
    3 days ago
  • $205k - $235k

    A leading professional services firm is seeking a Director for AI ML Engineering to co-lead the engineering team and build a scalable analytics platform. This role involves delivering high-visibility solutions for Fortune 500 clients by translating business needs into... 

    Ernst & Young Oman

    San Francisco, CA
    2 days ago
  • $160k - $250k

     ...Machine Learning, Platform Engineer San Francisco About the Role Our team focuses on enabling custom models and dedicated inference...  ...familiar with multi-cluster scheduling and have some sense of ML bottlenecks. Responsibilities New hires may work on multi... 
    Full time

    Together AI

    San Francisco, CA
    2 days ago
  • Job Title Disabled veteran A veteran who served on active duty in the U.S. military and is entitled to disability compensation (or who but for the receipt of military retired pay would be entitled to disability compensation) under laws administered by the Secretary ...

    Veho

    San Francisco, CA
    3 days ago
  • $250k - $300k

     ...scribe. We're building the AI intelligence platform that restores humanity to healthcare and...  ...every product team, every quarter. Our engineering roles are hybrid in our SF office (3x/week...  ...years in software engineering, 3+ focused on ML infrastructure, platform engineering, or... 
    Work at office
    Immediate start
    Remote work
    Flexible hours

    Ambience Healthcare

    San Francisco, CA
    1 day ago
  • $246.5k - $339k

     ...About Faire Faire is a technology wholesale platform built on the belief that the future is local....  ...role As a Staff Machine Learning Platform Engineer, you will help design, improve, and operate a scalable ML platform to accelerate model training, deployment... 
    Work experience placement
    Work at office
    Local area
    Remote work
    Monday to Friday
    Flexible hours
    3 days per week

    Faire Inc

    San Francisco, CA
    15 hours ago
  • $180k - $250k

     ...ML Platform / MLOps Engineer Emeryville, California, United States; Hybrid (2-3 days on-site) Profluent is an AI-first protein design company. Founded in 2022, we develop deep generative models to design and validate novel, functional proteins to revolutionize biomedicine... 

    Profluent

    Emeryville, CA
    1 day ago
  •  ...Francisco seeks a Staff Machine Learning Engineer. This pivotal role drives the productization...  ...machine learning models for partner platforms. Responsibilities include designing integrations...  ...of experience, a strong background in ML infrastructure, and hands-on cloud skills... 
    Flexible hours

    ChatGPT Jobs

    San Francisco, CA
    4 days ago
  • $181.1k - $318.4k

     ...AIML - Staff ML Infrastructure Engineer, ML Platform & Technology - Pre-training Infrastructure Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or Apple Store experience... 
    Relocation

    Apple

    San Francisco, CA
    4 days ago
  • A technology company in San Francisco is seeking an experienced ML Infrastructure Engineer to develop platforms for machine learning jobs and to lead cross-functional initiatives. The ideal candidate will have experience with continuous integration and deployment models... 

    Delphina

    San Francisco, CA
    15 hours ago
  • Wherobots, Inc. is seeking a Senior Machine Learning Engineer in San Francisco, California to lead the development of a scalable geospatial ML platform. The ideal candidate will have a strong background in distributed systems and extensive experience with GPU-based workflows... 
    Remote job

    Wherobots, Inc

    San Francisco, CA
    4 days ago
  •  ...dramatically accelerate the invention of new materials. Our platform helps scientists and engineers build structured data foundations, digitize formulation...  ...smarter, and at scale. About the role As our AI/ML Platform Engineer, you will build the foundational... 
    Remote work
    Work from home

    Albert Invent

    Oakland, CA
    1 day ago
  • $197.3k - $225.1k

     ...Lead AI/ML Engineer (Platform, kubeflow) Overview At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in using machine learning to create real-time, personalized customer... 
    Full time
    Part time
    Local area

    Capital One Financial Corp

    San Francisco, CA
    3 days ago
  •  ...The Community You Will Join: The Growth Platform team’s vision is to drive long term sustainable...  ...You Will Make: As a machine learning engineer or scientist, your expertise will be...  ...variant testing, and faster iteration cycles. ML/AI Orchestration for Decisioning -... 
    Work experience placement
    Remote work
    Shift work

    airbnb, Inc.

    San Francisco, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to ML Platform Engineer. Be the first to apply!