Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Founding ML Inference Engineer Ultra-Low Latency AI

Reactor.am

A media technology company in San Francisco is seeking a Founding Engineer specializing in ML Inference. This highly technical role requires expertise in the ML infrastructure stack and aims to optimize generative media performance. The ideal candidate will drive innovations in real-time model performance, design in-house inference runtimes, and optimize models through advanced techniques. Competitive salary and relocation support are offered, along with generous health coverage. #J-18808-Ljbffr

Vacancy posted 12 hours ago
Similar jobs that could be interesting for youBased on the Founding ML Inference Engineer Ultra-Low Latency AI in San Francisco, CA vacancy
  •  ...URun in San Francisco is searching for an ML Infrastructure and Platform Engineer. In this role, you will lead the architecture and scaling...  ...the ground up, ensuring high availability and low-latency inference. This is a founding technical hire position, requiring end-to-end... 
    Suggested

    U-Run

    San Francisco, CA
    4 days ago
  •  ...focused company in San Francisco seeks candidates with expertise in AI simulation development. The role emphasizes optimizing training efficiency, enhancing GPU performance, and ensuring low-latency inference. Applicants should be proficient in methodologies for gradient... 
    Suggested

    Embedding VC

    San Francisco, CA
    4 days ago
  • $200k

     ...deploying high‑throughput, ultra‑low‑latency inference engines for large language models or...  ...real‑time conversational AI. Possess a deep understanding...  ...between the core ML training team and the backend...  ...Kubernetes. What We Offer Founding Team Initiative: Opportunity... 
    Suggested
    Full time
    Work at office

    Plaud

    San Francisco, CA
    12 hours ago
  •  ...problem we saw Most AI infrastructure is...  ...fix it uRun is the inference cloud for interactive...  ...investors, and are founded by Keegan McCallum,...  .... As our ML Infrastructure and Platform Engineer, you will own the architecture...  ...availability and low‑latency inference across the... 
    Suggested
    Flexible hours
    Shift work

    U-Run

    San Francisco, CA
    12 hours ago
  • $150k - $200k

     ...United States Digital Space LLC is seeking a Founding ML Engineer for both general and audio/speech roles in San Francisco, CA. This position involves building AI players for gaming, focusing on low-latency performance and interactive capabilities. Applicants should have... 
    Suggested

    United States Digital Space LLC

    San Francisco, CA
    3 days ago
  •  ...strong foundation in low-level operating...  ...experienced with modern inference systems like TGI ,...  ...state-of-the-art AI models Optimizing model...  ...throughput and low latency at scale Developing...  ...current with ML infrastructure developments...  ...requires a large engineering effort dedicated to... 
    Work at office

    Gravity Engineering Services Pvt Ltd.

    San Francisco, CA
    12 hours ago
  • $150k - $220k

     ...Founding Machine Learning Engineer San Francisco Compensation ~ Estimated...  ...intersection of LLM inference, browser understanding, and low-latency systems, shipping...  ...or consumer-focused "AI browsers," we run AI directly...  ...creates unique ML challenges. This is... 
    H1b
    Work at office
    Visa sponsorship
    Sleeping nights

    Composite.ai

    San Francisco, CA
    2 days ago
  •  ...We're looking for an ML Inference Engineer with deep expertise in high-performance ML engineering. This is a highly technical, high-impact...  ...performance, and shaping Reactor's competitive edge in ultra-low-latency, high-throughput environments. What You'll Do Drive our frontier... 
    Full time
    Visa sponsorship
    Relocation package

    Reactor.am

    San Francisco, CA
    4 days ago
  •  ...About the Role ML Ops Engineer — Agentic AI Lab (Founding Team) — Location: San Francisco Bay Area — Type: Full...  ...Model conversion, quantization, and inference rollout Manage hybrid compute...  ...Instrument observability for model latency, token usage, performance metrics,... 
    Full time

    Fabrion

    San Francisco, CA
    3 days ago
  •  ...Composite is seeking founding Machine Learning Engineers to enhance our proactive automation platform. In this role, you will improve model accuracy and latency across web applications, and design inference pipelines that deliver instant user experiences. We're looking... 

    Composite.ai

    San Francisco, CA
    1 day ago
  •  ...tech startup in San Francisco seeks founding Machine Learning Engineers (MLEs) to enhance core action models...  ...automation platform. You will work on low-latency AI solutions in browser environments,...  ...and speed. This role demands strong ML skills and experience with LLMs, emphasizing... 

    Composite.ai

    San Francisco, CA
    12 hours ago
  •  ...About the Role Our company is hiring a Founding AI / ML Engineer to help architect and ship the next...  ...orchestration layers Improve model quality, latency, reliability, and cost efficiency;...  ...AI‑native services thesis adjacency) Inference optimization or evaluation systems... 
    H1b
    Visa sponsorship

    Ersilia

    San Francisco, CA
    1 day ago
  •  ...Founding Ml Engineer Weave (YC W25) is building the definitive platform for understanding and improving...  ...is fundamentally broken and that modern AI can give teams a far more accurate and...  ...through the inevitable highs and lows. You must be an excellent communicator... 

    Weave, Inc.

    San Francisco, CA
    2 days ago
  •  ...Icehouseventures is looking for founding Machine Learning Engineers (MLEs) to enhance their core action models for a proactive automation platform. This role offers the chance to work closely on LLM inference, aiming for instant response times without IT friction. Join... 

    Icehouseventures

    San Francisco, CA
    1 day ago
  •  ...new Machine Learning Engineer opportunities posted on AI Chopping Block...  ...optimize end-to-end ML pipelines encompassing...  ...high performance and low latency. Machine Learning Enginer...  ...team as the founding member, and leading...  ...tuning training and inference end-to-end for high... 
    Flexible hours

    AI Chopping Block, Inc.

    San Francisco, CA
    12 hours ago
  •  ...building the gateway to the internet for AI agents. Our APIs already power...  ...push the boundaries of what our ML systems can do. We're hiring a Founding ML Engineer to own the research and...  ...of records Given raw people data, infer the org chart — who reports to whom... 

    Crustdata (YC F24)

    San Francisco, CA
    12 hours ago
  • $150k - $300k

     ...An early-stage AI data company that went from zero...  ...web. You will own the ML systems that turn that...  ...from multiple sources. Infer organisational structures...  ...full ML research and engineering cycle, from prototype to...  ...with a direct path toward founding something of your own.... 

    Open Select

    San Francisco, CA
    1 day ago
  •  ...ML/AI Research Engineer — Agentic AI Lab (Founding Team) Location: San Francisco Bay Area Type: Full-Time Compensation: Competitive salary + meaningful...  ...detection, error classification, and alignment Optimize inference latency and GPU resource utilization across cloud and on‑... 
    Full time

    Fabrion

    San Francisco, CA
    4 days ago
  •  ...Founding Applied ML Engineer Title of Role: Founding Applied ML Engineer Location: San Francisco, CA...  ...Description We're representing an early-stage AI company that operates at the...  ...environment. Familiarity with multilingual or low-resource language modeling challenges.... 
    Work at office

    Recruiting from Scratch

    San Francisco, CA
    7 days ago
  • $250k

     ...client, a venture-backed AI Startup, is hiring a talented ML/AI Research Engineer to join their team in...  ...quantization and high‑performance inference deployment. Manage GPU‑...  ...observability for latency, token usage, drift...  ..., or prior startup/founding team experience is a bonus... 

    Alldus International Consulting Ltd

    San Francisco, CA
    12 hours ago
  •  ...leading streaming platform in San Francisco is seeking a Software Engineer to design and build scalable distributed systems. The ideal...  ..., particularly AWS. Responsibilities include building low-latency online microservices and collaborating with machine learning engineers... 
    Flexible hours

    Tubi TV

    San Francisco, CA
    4 days ago
  • $341k - $422k

     ...Partner 20, Applied ML, Engineer, ASG San Francisco,...  ...California, United States Founded in Silicon Valley in 2...  ...companies, across AI, bio + healthcare,...  ...training to large-scale, low-latency serving and robust MLOps...  ...(data, training, and inference) Collaborate with... 
    Work at office
    2 days per week
    3 days per week

    Andreessen Horowitz

    San Francisco, CA
    2 days ago
  •  ...intelligence. You're building the AI that makes sense of it all — the...  ...catch. About the role This is a founding AI/ML role. You'll own the...  ...ingestion, model training, evaluation, inference, and monitoring in production Strong engineering fundamentals — you can build the... 

    Flynavi

    San Francisco, CA
    1 day ago
  • $300k - $400k

     ...Global (NYSE: ZETA) is the AI-Powered Marketing...  ...marketing programs. Zeta was founded in 2007 by David A....  ...As a Principal AI/ML Engineer in our AdTech team, you...  ...at large scale and low latency to handle billions of...  ...training to real-time inference, for our real-time bidding... 

    Zeta Global

    San Francisco, CA
    2 days ago
  • $227.2k - $324.5k

     ...Headquartered in San Francisco and founded in ვინც 2014, Tubi is...  ...Role: This Software Engineering team works closely with...  ...The team’s efforts take inference systems to the next level of low‑latency serving by exploring new...  ...latency. Work with ML engineers to understand... 
    Full time
    Flexible hours

    Tubi Tv

    San Francisco, CA
    4 days ago
  •  ...Gravity Engineering Services Pvt Ltd. is looking for a Founding ML Engineer to lead the development of core ML intelligence systems. You will dive into the complexities...  ..., making significant contributions to the way AI agents access real-time information. Experience in NLP... 

    Gravity Engineering Services Pvt Ltd.

    San Francisco, CA
    1 day ago
  •  ...About Poesis Poesis is the AI-native investment manager pioneering a new foundation...  ...trading decisions are made. We’re hiring our Founding ML Engineer, the first full-time machine learning...  .... You thrive in high‑autonomy, low‑process environments and like being close... 
    Full time
    Immediate start
    Relocation
    Visa sponsorship
    Relocation package

    Poesis LLC

    San Francisco, CA
    4 days ago
  •  ...A cutting-edge AI technology company based in San Francisco is seeking a specialist to design and operate large-scale GPU infrastructure...  ...requires expertise in deploying GPU systems for high-throughput inference and model performance optimization. The ideal candidate will... 

    Reflection AI

    San Francisco, CA
    12 hours ago
  • $200k

     ...Founding ML Engineer San Francisco, on-site, full-time - $200,000 - $500,000 per year. $10k referral bonus for successful hires (half equity,...  ...Yesterday's mouse data has already been crunched. The house AI parsed the PK curves, flagged the outliers, and ranked the candidates... 
    Full time
    Night shift
    Day shift
    Afternoon shift

    Stealth Deep Tech

    San Francisco, CA
    4 days ago
  •  ...A pioneering hedge fund in San Francisco is seeking a Founding ML Engineer to architect and build machine learning systems for investment decisions. This hands-on role requires 5–10+ years of experience and proficiency in Python and ML frameworks like PyTorch and TensorFlow... 

    Poesis LLC

    San Francisco, CA
    12 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Founding ML Inference Engineer Ultra-Low Latency AI. Be the first to apply!