Founding ML Infra Engineer: Scale Real-Time Inference
URun
URun in San Francisco is searching for an ML Infrastructure and Platform Engineer. In this role, you will lead the architecture and scaling of our GPU compute platform from the ground up, ensuring high availability and low-latency inference. This is a founding technical hire position, requiring end-to-end ownership across the infrastructure stack, with promising growth and significant responsibilities. At URun, competitive salary and equity, along with full health coverage and flexibility, are offered. #J-18808-Ljbffr URun
- A cutting-edge technology company in San Francisco is seeking an ML Infrastructure Engineer to build and scale machine learning systems for real-time perception and inference. This role involves designing scalable training pipelines for computer vision models, optimizing...Suggested
$225k - $400k
A pioneering AI research firm is seeking a Founding Machine Learning Research Engineer in San Francisco to develop innovative AI systems for real-time voice agents. This high-impact role requires a strong ML research background and proficiency in PyTorch. Responsibilities...Suggested- ...ML Ops Engineer — Agentic AI Lab (Founding Team) Location: San Francisco... ...Area Type: Full-Time Compensation:... ..., and inference rollout Manage... ...engineering, or infra-focused ML roles... ...(spot instance scaling, batch prioritization... ...some really hard real world problems –...SuggestedFull time
- .... is hiring a Machine Learning Infra Engineer in San Francisco to build and maintain ML training and inference frameworks. The role focuses on high performance and scaling across multiple nodes and GPUs.... ...along with a willingness to tackle real-world challenges. This position...Suggested
- Reducto, a fast-growing AI company in San Francisco, is hiring a Machine Learning Infra Engineer. This role involves building and maintaining the training and inference frameworks necessary for optimal performance. Ideal candidates should possess strong Python skills,...Suggested
- ...media technology company in San Francisco is seeking a Founding Engineer specializing in ML Inference. This highly technical role requires expertise in the... .... The ideal candidate will drive innovations in real-time model performance, design in-house inference runtimes...Relocation package
- Arena Intelligence, Inc. in San Francisco, CA, is seeking a Senior Software Engineer (Infrastructure) to lead the design of scalable data and API systems. The role involves architecting real-time data pipelines, ensuring performance and reliability, and mentoring...
$180k - $220k
...seeking a Senior Machine Learning Engineer for their Applied Data Science... ...collaboration with data scientists on real-time optimization solutions. Required... ...in AdTech, and expertise in AI/ML technologies like Java, Python, and large-scale frameworks. The position offers a...- A leading tech startup in San Francisco seeks founding Machine Learning Engineers (MLEs) to enhance core action models for their proactive automation... ...improving model accuracy and speed. This role demands strong ML skills and experience with LLMs, emphasizing...
- ...to deliver that at scale doesn't really exist... ...fix it uRun is the inference cloud for... ...compute layer that makes real‑time, stateful inference... ...investors, and are founded by Keegan McCallum,... ...infrastructure. As our ML Infrastructure and Platform Engineer, you will own the...Flexible hoursShift work
- Voiceflow is seeking a Software Engineer (Distributed Systems) in San Francisco. As a founding engineer, you will focus on building a real-time database replication solution leveraging Kafka and CDC while interacting directly with customers. The ideal candidate has strong...
$150k - $300k
...Founding ML Engineer Location: San Francisco, CA Company Stage: Early-Stage... ..., understand, and act on real-time internet data. Instead of... ...achieved strong early traction—scaling to millions in ARR within... ...Experience scaling LLM inference pipelines in production...Visa sponsorship- ...Founding Ml Engineer Skills: Python, PyTorch, NLP, LLMs, Information Retrieval... ..., or data generation at scale ~ Strong Python and PyTorch... ...records Given raw people data, infer the org chart — who reports... ...ontology systems over messy real-world data Background in multilingual...
- ...helps AI teams ingest real world enterprise... ...processes at scale. We've grown incredibly... ...Machine Learning Engineer to help us train... ...As an ML Infra Engineer , you'll... ...role in building the inference and training frameworks... ...heuristics we built over time. Put simply, we...Work at officeLocal area
- ...Debrief Intelligence Engineer Navi captures... ...Role This is a founding AI/ML role. You'll own... ...accuracy against real-world CFI assessments... ..., audio ML, time-series analysis, or... ...training, evaluation, inference, and monitoring in... ...Force A role that scales into technical...
- Judgment Labs, based in San Francisco, is seeking a Senior Data Infrastructure Engineer to design and scale real-time data pipelines critical for agent behavior analysis. The ideal candidate should have over 6 years of experience managing high-throughput, petabyte-scale...
- ...Employment Type Full time Location Type On-... ...role you will help scale and optimize our... ...researchers and model engineers to translate ideas... ...the intersection of ML, software... ...Will Own training/inference infrastructure: Design... ...research needs into infra capabilities and guide...Full time
- ...We're looking for founding Machine Learning Engineers (MLEs) to own and... ...of LLM inference, browser understanding... ...instant response times with zero migration... ...architecture creates unique ML challenges. This... ...that run in real time,... ...model quality at scale Experiment with...Sleeping nights
$200k
...ultra-low-latency inference engines for large language... ..., throughput, and Time-To-First-Token (or... ...To-First-Audio) in real-time streaming... ...between the core ML training team and... ...accuracy. Large-Scale Distributed Systems... ...What We Offer Founding Team Initiative: Opportunity...Full timeWork at officeWorldwide- A leading AI solutions company seeks a Machine Learning Engineer to develop and optimize machine learning models in a remote-first... ...involve collaboration across teams and managing scalable ML models for real-time decision-making. Ideal candidates have 3+ years of...Remote work
- A leading technology company is looking for an ML Infrastructure Engineer in San Francisco. The successful candidate will build and maintain ML training pipelines and ensure low-latency model serving. Candidates should have over 4 years of experience in ML engineering,...Work at office
- ...company based in San Francisco is seeking a specialist to design and operate large-scale GPU infrastructure. This role requires expertise in deploying GPU systems for high-throughput inference and model performance optimization. The ideal candidate will have hands-on...
- uRun is seeking an ML Performance Engineer to build high-performance infrastructure for interactive... ...CUDA kernels and optimize model inference for speed and efficiency. This foundational... ...involves working closely with the founding team on critical performance challenges...
$180k - $270k
...infrastructure roles in San Francisco, focusing on building high-performance inference engines for speech AI. Ideal candidates will have substantial experience in GPU architecture and real-time systems. This position offers a competitive salary range of $180K - $270K,...$100k - $200k
...technology company in San Francisco is seeking a Founding Engineer to develop innovative voice-first technologies. In this full-time, on-site role, you will shape the technical foundation by designing, developing, and scaling systems that enhance Human-Computer...Full time- ...interaction remain coherent in real time. Our team sits... ...exceptional research engineers and applied researchers... ...Staff - Data & ML Infrastructure Engineer... ...Moonlake's model training and inference infrastructure. This... ..., and large-scale orchestration systems....
$250k
Alldus International Consulting Ltd is looking for a talented ML/AI Research Engineer to join their San Francisco team. You will be responsible... ...infrastructure that powers training, deployment, and governance of large-scale AI systems. The ideal candidate has a strong background in...- ...Technical Staff focused on building and optimizing ML inference systems in San Francisco. The role involves... ...pipelines and enhancing performance under real-world workloads. Candidates should have strong software engineering skills, experience with ML inference systems,...
- ...Abridge Abridge was founded in 2018 with the... ...clinical notes in real-time, with deep EMR integrations... ...technologists, and engineers working together to... .... The Role As an ML Infrastructure Engineer, Model Inference at Abridge, you’ll... ...product teams to scale backend...Hourly payFull timeFlexible hours
- ...partner with research and infra to prototype, train, and deploy... ...that power Sesame's real-time companion experience. Squeeze silicon — scale training and inference for LLM-class workloads; chase... .... Proven software engineer who loves ML; comfortable writing production...Full timeContract workFlexible hoursShift work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Founding ML Infra Engineer: Scale Real-Time Inference. Be the first to apply!
- machine learning ai engineer San Francisco, CA
- machine learning engineer San Francisco, CA
- entry level machine learning engineer San Francisco, CA
- junior machine learning research engineer San Francisco, CA
- machine learning software engineer San Francisco, CA
- ai ml engineer San Francisco, CA
- senior ml engineer San Francisco, CA
- graduate machine learning engineer San Francisco, CA
- computer vision machine learning engineer San Francisco, CA
- data scientist machine learning engineer San Francisco, CA

