ML Platform Engineer

Foxglove Technologies, Inc

Build the data infrastructure for robots operating in the real world.

Robotics is moving from research labs into production across factories, warehouses, vehicles, and field deployments. When robots fail, behave unexpectedly, or need to be improved, engineers rely on data to understand what actually happened.

At Foxglove, we build the observability, visualization, and data infrastructure that makes that possible. Our tools are used by robotics and autonomous systems teams to ingest, store, query, replay, and analyze massive volumes of multimodal sensor data from live systems and from production fleets.

About the Role

We're looking for a ML Platform Engineer with deep infrastructure instincts to help design, deploy, and scale the systems that power Foxglove's data platform. This is a platform-first role: you'll own the infrastructure layer that makes ML possible in production, not just the models that run on top of it.

You'll be responsible for the reliability, scalability, and performance of the ML platform itself, from inference serving and pipeline orchestration to training infrastructure and evaluation frameworks. The problems are real and urgent: petabyte-scale multimodal robotics data, high-throughput retrieval and embedding pipelines, and the internal ML flywheel that lets our team ship fast. This is a hands-on infrastructure role, not research.

Key Responsibilities

Design, deploy, and operate production inference infrastructure - including model serving, autoscaling, load balancing, and cost optimization across cloud environments
Own the platform architecture for embedding and retrieval pipelines that power semantic search over multimodal robotics data (image, video, point cloud, and timeseries)
Build and maintain the training and evaluation infrastructure that enables rapid iteration on model performance - including job orchestration, experiment tracking, and dataset versioning
Drive cloud infrastructure decisions (AWS/GCP) that directly impact latency, throughput, reliability, and cost at scale
Define platform abstractions and internal tooling that let product engineers ship ML-powered features without needing to manage infrastructure themselves
Evaluate, integrate, and operationalize third-party ML infrastructure components; establish clear build vs. buy frameworks for the team

What We're Looking For

Deep, hands-on experience owning production ML infrastructure: inference serving, model optimization (e.g., vLLM, Triton, TorchServe), orchestration, and cloud cost management
Strong foundation in distributed systems and cloud infrastructure (AWS/GCP) - you think in terms of system reliability, failure modes, and operational burden, not just model accuracy
Experience architecting and operating retrieval systems at scale, including vector databases (e.g., Pinecone, Lance, turbopuffer, pgvector) and embedding pipelines over large, heterogeneous datasets
A platform engineer's mindset: you build systems that other engineers depend on, and you take that responsibility seriously
Proven ability to operate with high ownership - you can make hard infrastructure tradeoffs independently and move fast without breaking things
Strong communication skills; you can explain infrastructure tradeoffs clearly to both ML and non-ML engineers

Bonus Points

Familiarity with fine-tuning and domain adaptation techniques for LLMs or embedding models (i.e. SFT, PEFT)
Familiarity with data mining or hybrid search workflows, especially as applied in robotics autonomous vehicles, or physical AI workflows
Prior experience building ML platforms, evaluation frameworks, or data management tooling from the ground up

What We Offer

$300 monthly budget towards commuter benefits or building your personal workspace (remote only)
Competitive equity grant in a Series B company
Medical, Dental, Vision, and Term Life insurance coverage at 100% for employees and 75% for dependents
401(k) matching up to 4%
4 weeks vacation, plus holidays and winter break
All expenses paid company off-sites 2× per year

Why Join Us

Impact: Own growth at a fast-growing, high-leverage moment for the company.
Mission: Accelerate the development of the next generation of robotics and embodied AI.
Team: Work with world-class engineers, designers, and researchers passionate about open-source and developer tools.
Ownership: Drive initiatives end-to-end, with high autonomy and visibility.

Apply

Vacancy posted 2 days ago

Similar jobs that could be interesting for youBased on the ML Platform Engineer in San Francisco, CA vacancy

EY-Parthenon - Strategy and Execution - Growth Platforms - AI ML Engineering - Director
$205k - $235k
...Detroit, Houston, Los Angeles, McLean, New York, Hoboken, Philadelphia, San Francisco, Seattle EY-Parthenon – EY Growth Platforms - AI ML Engineering – Director At EY-Parthenon, our unique blend of strategy, transactions and corporate finance, combined with cutting‑...
Suggested
Full time
For contractors
Work experience placement
Summer holiday
Flexible hours
Ernst & Young Oman
San Francisco, CA
1 day ago
ML Platform & Infrastructure Engineer
...reliability. Minimum Qualifications ~ Bachelor's degree in Computer Science, Engineering, or equivalent practical experience ~3+ years in Software Engineering, MLOps, or ML Infrastructure ~ Strong Python proficiency ~ Experience building internal developer...
Suggested
Immediate start
Relocation package
Night shift
AGI
San Francisco, CA
2 days ago
Remote ML Platform Engineer - Scale AI Infrastructure
A leading livestream shopping platform is looking for an AI/ML Platform Engineer to shape the future of AI and ML systems. This role involves designing the infrastructure that powers machine learning applications, working alongside experts to deploy models at scale. Candidates...
Suggested
Remote work
Flexible hours
Whatnot
San Francisco, CA
4 days ago
Monetization ML Platform Engineer
$293k
...OpenAI is seeking a Software Engineer for Monetization ML Infrastructure in San Francisco. This role involves designing the machine learning infrastructure... ...'ll work on large-scale data pipelines, model training platforms, and real-time serving infrastructure, all while ensuring...
Suggested
OpenAI
San Francisco, CA
4 days ago
ML Platform Engineer: Scalable Pipelines & Tooling
...Agi,-Inc. is seeking a skilled Software Engineer to design and implement CI/CD pipelines for machine learning workflows in San Francisco, California. In this role, you will automate training runs and build scalable evaluation harnesses to optimize model performance. The...
Suggested
Relocation package
AGI
San Francisco, CA
4 days ago
ML Training Platform Engineer | Multi-Cloud & Decentralized
A decentralized AI platform company in the United States is seeking an experienced ML Training Platform Engineer to design and build robust infrastructure for ML training. The ideal candidate has over 5 years in infrastructure and platform engineering, with expertise in...
Pluralis Research
San Francisco, CA
4 days ago
Founding ML Platform Engineer - Full-Stack DevTools
$150k - $250k
...David Joseph & Company is seeking a Founding Engineer for their San Francisco office. This pivotal role involves developing cutting-edge ML DevTools and systems in a high-ownership environment. You will engage across various disciplines, designing backend systems and...
Work at office
David Joseph & Company
San Francisco, CA
4 days ago
Senior Cloud & ML Ops Platform Engineer
...Icehouseventures is seeking an Infrastructure Engineer in San Francisco, CA, to build and maintain the foundational Kubernetes platform across AWS, GCP, and Azure. The role... ...controls, and collaborate closely with SRE and ML teams. Ideal candidates will thrive in a startup...
Icehouseventures
San Francisco, CA
3 days ago
Senior ML Platform Engineer, GenAI & LLM Infra
...A dynamic tech company in San Francisco is seeking a seasoned ML Infrastructure Engineer to lead the development of innovative AI product systems. This role entails scaling ML product development infrastructure, collaborating with cross-functional teams, and mentoring...
LIGHTFIELD INC
San Francisco, CA
4 days ago
ML Platform & Infra Engineer - Scale AI Pipelines
...A cutting-edge AI firm in San Francisco is seeking a talented engineer to design and implement robust CI/CD pipelines for machine learning workflows. The ideal candidate will have a bachelor's degree in Computer Science or a related field, with at least 3 years of experience...
Pantera Capital
San Francisco, CA
4 days ago
GenAI/ML Platform Engineer End-to-End AI at Scale
$155k - $175k
...Alumni Ventures is seeking a GenAI/ML Platform Engineer to join its AI team. This hybrid role in San Francisco focuses on leveraging machine learning and generative AI to enhance features for Strava athletes. The ideal candidate will lead projects from inception to deployment...
Alumni Ventures
San Francisco, CA
4 days ago
Staff ML Platform Engineer Real-Time ML at Scale, Equity
...Grow Therapy in San Francisco is hiring a Staff ML Platform Engineer to drive the technical vision and execution of its Machine Learning Platform. This role involves designing and building scalable real-time ML systems, particularly for patient-provider matching. The...
Grow Therapy
San Francisco, CA
15 hours ago
Remote Data & ML Platform Engineer Equity, Kubernetes
$200k
...Glocomms is looking for a hands-on Software Engineer to join its early-stage team in San Francisco or London. This role focuses on building and scaling core systems for data, research, and machine learning. The ideal candidate will design data pipelines, develop Kubernetes...
Remote work
Glocomms
San Francisco, CA
4 days ago
GenAI/ML Platform Engineer Scale AI at Speed
...Strava is seeking a GenAI/ML Platform Engineer to join their AI team in San Francisco. This role involves driving AI/ML platform projects from inception to implementation, utilizing Strava's extensive datasets. You'll work closely with engineers and data scientists to...
Work at office
Flexible hours
3 days per week
Strava
San Francisco, CA
3 days ago
ML Platform Engineer: Scalable AI & RAG Pipelines
Docusign is looking for a Machine Learning Engineer to develop the foundational infrastructure for intelligent systems. The role requires... ...distributed systems, developing pipelines, and deploying robust ML models. The ideal candidate has over 5 years in machine learning...
UNAVAILABLE
San Francisco, CA
20 hours ago
ML Platform Engineer: Scalable Cloud Infrastructure
$212k - $318.4k
A leading technology company in San Francisco is seeking a Software Engineer to join its Applied Machine Learning team. This role focuses on designing and building a robust ML platform and infrastructure to support enterprise-level initiatives. Candidates should have at...
Apple Inc.
San Francisco, CA
1 day ago
ML Platform Engineer Build Scalable ML Systems
...CVFine by Instrovate Technologies is seeking a Machine Learning Platform Engineer to help build scalable systems that support model training for... ...tools that enhance the productivity of data scientists and ML engineers. The ideal candidate should have a strong grasp of ML...
CVFine by Instrovate Technologies
San Francisco, CA
13 hours ago
Director of AI/ML Platform Engineering
$205k - $235k
...A leading professional services firm is seeking a Director for AI ML Engineering to co-lead the engineering team and build a scalable analytics platform. This role involves delivering high-visibility solutions for Fortune 500 clients by translating business needs into...
Ernst & Young Oman
San Francisco, CA
3 days ago
AI & ML Data Platform Engineer
$205k - $316k
...Quizlet is looking for a Data Platform Engineer to design and build the infrastructure that supports large-scale data processing and machine... ...workflows. The role entails building and maintaining the data and ML infrastructure, improving platform usability, and partnering...
Work at office
3 days per week
Quizlet
San Francisco, CA
4 days ago
AI/ML Platform Adoption Engineer
Anyscale in San Francisco is seeking a Customer Engineer to assist customers in onboarding and using the Anyscale platform effectively. The role involves troubleshooting technical issues, providing best practices, and maintaining relationships with technical stakeholders...
Anyscale
San Francisco, CA
20 hours ago
Senior Machine Learning Platform Engineer
Job Title Disabled veteran A veteran who served on active duty in the U.S. military and is entitled to disability compensation (or who but for the receipt of military retired pay would be entitled to disability compensation) under laws administered by the Secretary of...
Veho
San Francisco, CA
4 days ago
Senior ML Platform Engineer MLOps & Vertex AI (Equity)
$171k - $231.5k
...A leading financial technology company is seeking a Machine Learning Engineer to enhance their infrastructure for data science teams. The candidate should have over 7 years of experience in machine learning, strong programming skills in Python and Java, and familiarity...
Intuit
Oakland, CA
4 days ago
Senior ML Engineer, GeoAI Platform Remote/Hybrid
...Wherobots, Inc. is seeking a Senior Machine Learning Engineer in San Francisco, California to lead the development of a scalable geospatial ML platform. The ideal candidate will have a strong background in distributed systems and extensive experience with GPU-based workflows...
Remote work
Wherobots, Inc
San Francisco, CA
3 days ago
Machine Learning, Platform Engineer
$160k - $250k
...Machine Learning, Platform Engineer San Francisco About the Role Our team focuses on enabling custom models and dedicated inference... ...familiar with multi-cluster scheduling and have some sense of ML bottlenecks. Responsibilities New hires may work on multi...
Full time
Together AI
San Francisco, CA
3 days ago
ML Infrastructure Engineer - Build Scalable AI Platforms
...A technology company in San Francisco is seeking an experienced ML Infrastructure Engineer to develop platforms for machine learning jobs and to lead cross-functional initiatives. The ideal candidate will have experience with continuous integration and deployment models...
Delphina
San Francisco, CA
3 days ago
ML Engineer, AI Platform & RAG (Hybrid)
$164.7k - $266k
...DocuSign, Inc. is seeking a Machine Learning Engineer to design and build scalable infrastructure for intelligent systems. You will work closely with AI research and engineering teams, ensuring the development of robust models and distributed systems. The role requires...
DocuSign
San Francisco, CA
4 days ago
Realtime ML Engineer - Ad Ranking & Serving Platform
...Ersilia is seeking a passionate ML engineer in San Francisco to own the recommendation engine, balancing user experience and advertiser ROAS. This role involves building low-latency ad ranking systems and designing data pipelines for ML training. The ideal candidate should...
Ersilia
San Francisco, CA
13 hours ago
ML Platform / MLOps Engineer
$180k - $250k
...ML Platform / MLOps Engineer Emeryville, California, United States; Hybrid (2-3 days on-site) Profluent is an AI-first protein design company. Founded in 2022, we develop deep generative models to design and validate novel, functional proteins to revolutionize biomedicine...
Profluent
Emeryville, CA
1 day ago
ML Engineer - AI Agent Platform for Productivity
...Superhuman is seeking a Machine Learning Engineer to develop AI-driven products in San Francisco. The role is pivotal... ...productivity suite. Key responsibilities include building an AI platform, developing sophisticated ML models, and thriving in a fast-paced environment....
I did my part and supported the Regular Toilet
San Francisco, CA
3 days ago
Staff ML Engineer, AI Platform
$250k - $300k
...Us: Ambience Healthcare is the leading AI platform for documentation, coding, and clinical... ...across every product team, every quarter. Our engineering roles are hybrid in our SF office (3x/... ...in software engineering, 3+ focused on ML infrastructure, platform engineering, or...
Work at office
Immediate start
Ambience
San Francisco, CA
4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to ML Platform Engineer. Be the first to apply!