Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Founding ML Infra Engineer: Scale Real-Time Inference

URun

URun in San Francisco is searching for an ML Infrastructure and Platform Engineer. In this role, you will lead the architecture and scaling of our GPU compute platform from the ground up, ensuring high availability and low-latency inference. This is a founding technical hire position, requiring end-to-end ownership across the infrastructure stack, with promising growth and significant responsibilities. At URun, competitive salary and equity, along with full health coverage and flexibility, are offered. #J-18808-Ljbffr URun

Vacancy posted 13 hours ago
Similar jobs that could be interesting for youBased on the Founding ML Infra Engineer: Scale Real-Time Inference in San Francisco, CA vacancy
  • A cutting-edge technology company in San Francisco is seeking an ML Infrastructure Engineer to build and scale machine learning systems for real-time perception and inference. This role involves designing scalable training pipelines for computer vision models, optimizing... 
    Suggested

    Specter

    San Francisco, CA
    2 days ago
  • $225k - $400k

    A pioneering AI research firm is seeking a Founding Machine Learning Research Engineer in San Francisco to develop innovative AI systems for real-time voice agents. This high-impact role requires a strong ML research background and proficiency in PyTorch. Responsibilities... 
    Suggested

    Retell AI

    San Francisco, CA
    4 days ago
  •  ...ML Ops Engineer — Agentic AI Lab (Founding Team) Location: San Francisco...  ...Area Type: Full-Time Compensation:...  ..., and inference rollout Manage...  ...engineering, or infra-focused ML roles...  ...(spot instance scaling, batch prioritization...  ...some really hard real world problems –... 
    Suggested
    Full time

    Fabrion

    San Francisco, CA
    2 days ago
  •  .... is hiring a Machine Learning Infra Engineer in San Francisco to build and maintain ML training and inference frameworks. The role focuses on high performance and scaling across multiple nodes and GPUs....  ...along with a willingness to tackle real-world challenges. This position... 
    Suggested

    Reducto, Inc.

    San Francisco, CA
    4 days ago
  • Reducto, a fast-growing AI company in San Francisco, is hiring a Machine Learning Infra Engineer. This role involves building and maintaining the training and inference frameworks necessary for optimal performance. Ideal candidates should possess strong Python skills,... 
    Suggested

    Reducto

    San Francisco, CA
    4 days ago
  •  ...media technology company in San Francisco is seeking a Founding Engineer specializing in ML Inference. This highly technical role requires expertise in the...  .... The ideal candidate will drive innovations in real-time model performance, design in-house inference runtimes... 
    Relocation package

    Reactor

    San Francisco, CA
    1 day ago
  • Arena Intelligence, Inc. in San Francisco, CA, is seeking a Senior Software Engineer (Infrastructure) to lead the design of scalable data and API systems. The role involves architecting real-time data pipelines, ensuring performance and reliability, and mentoring... 

    Arena Intelligence, Inc.

    San Francisco, CA
    4 days ago
  • $180k - $220k

     ...seeking a Senior Machine Learning Engineer for their Applied Data Science...  ...collaboration with data scientists on real-time optimization solutions. Required...  ...in AdTech, and expertise in AI/ML technologies like Java, Python, and large-scale frameworks. The position offers a... 

    Nexxen

    San Francisco, CA
    1 day ago
  • A leading tech startup in San Francisco seeks founding Machine Learning Engineers (MLEs) to enhance core action models for their proactive automation...  ...improving model accuracy and speed. This role demands strong ML skills and experience with LLMs, emphasizing... 

    Composite

    San Francisco, CA
    13 hours ago
  •  ...to deliver that at scale doesn't really exist...  ...fix it uRun is the inference cloud for...  ...compute layer that makes real‑time, stateful inference...  ...investors, and are founded by Keegan McCallum,...  ...infrastructure. As our ML Infrastructure and Platform Engineer, you will own the... 
    Flexible hours
    Shift work

    URun

    San Francisco, CA
    13 hours ago
  • Voiceflow is seeking a Software Engineer (Distributed Systems) in San Francisco. As a founding engineer, you will focus on building a real-time database replication solution leveraging Kafka and CDC while interacting directly with customers. The ideal candidate has strong... 

    Voiceflow

    San Francisco, CA
    4 days ago
  • $150k - $300k

     ...Founding ML Engineer Location: San Francisco, CA Company Stage: Early-Stage...  ..., understand, and act on real-time internet data. Instead of...  ...achieved strong early traction—scaling to millions in ARR within...  ...Experience scaling LLM inference pipelines in production... 
    Visa sponsorship

    Recruiting from Scratch

    San Francisco, CA
    2 days ago
  •  ...Founding Ml Engineer Skills: Python, PyTorch, NLP, LLMs, Information Retrieval...  ..., or data generation at scale ~ Strong Python and PyTorch...  ...records Given raw people data, infer the org chart — who reports...  ...ontology systems over messy real-world data Background in multilingual... 

    Crustdata (YC F24)

    San Francisco, CA
    4 days ago
  •  ...helps AI teams ingest real world enterprise...  ...processes at scale. We've grown incredibly...  ...Machine Learning Engineer to help us train...  ...As an ML Infra Engineer , you'll...  ...role in building the inference and training frameworks...  ...heuristics we built over time. Put simply, we... 
    Work at office
    Local area

    Reducto

    San Francisco, CA
    1 day ago
  •  ...Debrief Intelligence Engineer Navi captures...  ...Role This is a founding AI/ML role. You'll own...  ...accuracy against real-world CFI assessments...  ..., audio ML, time-series analysis, or...  ...training, evaluation, inference, and monitoring in...  ...Force A role that scales into technical... 

    Navi AI

    San Francisco, CA
    4 days ago
  • Judgment Labs, based in San Francisco, is seeking a Senior Data Infrastructure Engineer to design and scale real-time data pipelines critical for agent behavior analysis. The ideal candidate should have over 6 years of experience managing high-throughput, petabyte-scale... 

    Judgment Labs

    San Francisco, CA
    2 days ago
  •  ...Employment Type Full time Location Type On-...  ...role you will help scale and optimize our...  ...researchers and model engineers to translate ideas...  ...the intersection of ML, software...  ...Will Own training/inference infrastructure: Design...  ...research needs into infra capabilities and guide... 
    Full time

    Monograph

    San Francisco, CA
    2 days ago
  •  ...We're looking for founding Machine Learning Engineers (MLEs) to own and...  ...of LLM inference, browser understanding...  ...instant response times with zero migration...  ...architecture creates unique ML challenges. This...  ...that run in real time,...  ...model quality at scale Experiment with... 
    Sleeping nights

    Composite

    San Francisco, CA
    13 hours ago
  • $200k

     ...ultra-low-latency inference engines for large language...  ..., throughput, and Time-To-First-Token (or...  ...To-First-Audio) in real-time streaming...  ...between the core ML training team and...  ...accuracy. Large-Scale Distributed Systems...  ...What We Offer Founding Team Initiative: Opportunity... 
    Full time
    Work at office
    Worldwide

    Plaud

    San Francisco, CA
    13 hours ago
  • A leading AI solutions company seeks a Machine Learning Engineer to develop and optimize machine learning models in a remote-first...  ...involve collaboration across teams and managing scalable ML models for real-time decision-making. Ideal candidates have 3+ years of... 
    Remote work

    Geminus

    San Francisco, CA
    13 hours ago
  • A leading technology company is looking for an ML Infrastructure Engineer in San Francisco. The successful candidate will build and maintain ML training pipelines and ensure low-latency model serving. Candidates should have over 4 years of experience in ML engineering,... 
    Work at office

    Lattice, Inc.

    San Francisco, CA
    4 days ago
  •  ...company based in San Francisco is seeking a specialist to design and operate large-scale GPU infrastructure. This role requires expertise in deploying GPU systems for high-throughput inference and model performance optimization. The ideal candidate will have hands-on... 

    Reflection AI

    San Francisco, CA
    13 hours ago
  • uRun is seeking an ML Performance Engineer to build high-performance infrastructure for interactive...  ...CUDA kernels and optimize model inference for speed and efficiency. This foundational...  ...involves working closely with the founding team on critical performance challenges... 

    URun

    San Francisco, CA
    13 hours ago
  • $180k - $270k

     ...infrastructure roles in San Francisco, focusing on building high-performance inference engines for speech AI. Ideal candidates will have substantial experience in GPU architecture and real-time systems. This position offers a competitive salary range of $180K - $270K,... 

    Plaud

    San Francisco, CA
    3 days ago
  • $100k - $200k

     ...technology company in San Francisco is seeking a Founding Engineer to develop innovative voice-first technologies. In this full-time, on-site role, you will shape the technical foundation by designing, developing, and scaling systems that enhance Human-Computer... 
    Full time

    Voice Cursor

    San Francisco, CA
    2 days ago
  •  ...interaction remain coherent in real time. Our team sits...  ...exceptional research engineers and applied researchers...  ...Staff - Data & ML Infrastructure Engineer...  ...Moonlake's model training and inference infrastructure. This...  ..., and large-scale orchestration systems.... 

    Moonlake AI

    San Francisco, CA
    13 hours ago
  • $250k

    Alldus International Consulting Ltd is looking for a talented ML/AI Research Engineer to join their San Francisco team. You will be responsible...  ...infrastructure that powers training, deployment, and governance of large-scale AI systems. The ideal candidate has a strong background in... 

    Alldus International Consulting Ltd

    San Francisco, CA
    3 days ago
  •  ...Technical Staff focused on building and optimizing ML inference systems in San Francisco. The role involves...  ...pipelines and enhancing performance under real-world workloads. Candidates should have strong software engineering skills, experience with ML inference systems,... 

    Acceler8 Talent

    San Francisco, CA
    4 days ago
  •  ...Abridge Abridge was founded in 2018 with the...  ...clinical notes in real-time, with deep EMR integrations...  ...technologists, and engineers working together to...  .... The Role As an ML Infrastructure Engineer, Model Inference at Abridge, you’ll...  ...product teams to scale backend... 
    Hourly pay
    Full time
    Flexible hours

    Abridge

    San Francisco, CA
    3 days ago
  •  ...partner with research and infra to prototype, train, and deploy...  ...that power Sesame's real-time companion experience. Squeeze silicon — scale training and inference for LLM-class workloads; chase...  .... Proven software engineer who loves ML; comfortable writing production... 
    Full time
    Contract work
    Flexible hours
    Shift work

    SESAME

    San Francisco, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Founding ML Infra Engineer: Scale Real-Time Inference. Be the first to apply!