Founding ML Inference Engineer Ultra-Low Latency AI

Reactor.am

A media technology company in San Francisco is seeking a Founding Engineer specializing in ML Inference. This highly technical role requires expertise in the ML infrastructure stack and aims to optimize generative media performance. The ideal candidate will drive innovations in real-time model performance, design in-house inference runtimes, and optimize models through advanced techniques. Competitive salary and relocation support are offered, along with generous health coverage. #J-18808-Ljbffr

Apply

Vacancy posted 12 hours ago

Similar jobs that could be interesting for youBased on the Founding ML Inference Engineer Ultra-Low Latency AI in San Francisco, CA vacancy

Founding ML Infra Engineer: Scale Real-Time Inference
...URun in San Francisco is searching for an ML Infrastructure and Platform Engineer. In this role, you will lead the architecture and scaling... ...the ground up, ensuring high availability and low-latency inference. This is a founding technical hire position, requiring end-to-end...
Suggested
U-Run
San Francisco, CA
4 days ago
Staff ML Engineer: Efficient ML & Low-Latency AI
...focused company in San Francisco seeks candidates with expertise in AI simulation development. The role emphasizes optimizing training efficiency, enhancing GPU performance, and ensuring low-latency inference. Applicants should be proficient in methodologies for gradient...
Suggested
Embedding VC
San Francisco, CA
4 days ago
Machine Learning Engineer, Inference & Serving (Speech LLM) - San Francisco
$200k
...deploying high‑throughput, ultra‑low‑latency inference engines for large language models or... ...real‑time conversational AI. Possess a deep understanding... ...between the core ML training team and the backend... ...Kubernetes. What We Offer Founding Team Initiative: Opportunity...
Suggested
Full time
Work at office
Plaud
San Francisco, CA
12 hours ago
Founding ML infrastructure Engineer
...problem we saw Most AI infrastructure is... ...fix it uRun is the inference cloud for interactive... ...investors, and are founded by Keegan McCallum,... .... As our ML Infrastructure and Platform Engineer, you will own the architecture... ...availability and low‑latency inference across the...
Suggested
Flexible hours
Shift work
U-Run
San Francisco, CA
12 hours ago
Founding ML Engineer: Real-Time Game AI
$150k - $200k
...United States Digital Space LLC is seeking a Founding ML Engineer for both general and audio/speech roles in San Francisco, CA. This position involves building AI players for gaming, focusing on low-latency performance and interactive capabilities. Applicants should have...
Suggested
United States Digital Space LLC
San Francisco, CA
3 days ago
LLM/ML Engineer (Inference)
...strong foundation in low-level operating... ...experienced with modern inference systems like TGI ,... ...state-of-the-art AI models Optimizing model... ...throughput and low latency at scale Developing... ...current with ML infrastructure developments... ...requires a large engineering effort dedicated to...
Work at office
Gravity Engineering Services Pvt Ltd.
San Francisco, CA
12 hours ago
Founding Machine Learning Engineer
$150k - $220k
...Founding Machine Learning Engineer San Francisco Compensation ~ Estimated... ...intersection of LLM inference, browser understanding, and low-latency systems, shipping... ...or consumer-focused "AI browsers," we run AI directly... ...creates unique ML challenges. This is...
H1b
Work at office
Visa sponsorship
Sleeping nights
Composite.ai
San Francisco, CA
2 days ago
ML Inference Engineer San Francisco Engineering Full Time
...We're looking for an ML Inference Engineer with deep expertise in high-performance ML engineering. This is a highly technical, high-impact... ...performance, and shaping Reactor's competitive edge in ultra-low-latency, high-throughput environments. What You'll Do Drive our frontier...
Full time
Visa sponsorship
Relocation package
Reactor.am
San Francisco, CA
4 days ago
ML Ops Engineer Agentic AI Lab (Founding Team)
...About the Role ML Ops Engineer — Agentic AI Lab (Founding Team) — Location: San Francisco Bay Area — Type: Full... ...Model conversion, quantization, and inference rollout Manage hybrid compute... ...Instrument observability for model latency, token usage, performance metrics,...
Full time
Fabrion
San Francisco, CA
3 days ago
Founding ML Engineer Instant, Browser-Driven AI Actions
...Composite is seeking founding Machine Learning Engineers to enhance our proactive automation platform. In this role, you will improve model accuracy and latency across web applications, and design inference pipelines that deliver instant user experiences. We're looking...
Composite.ai
San Francisco, CA
1 day ago
Founding ML Engineer Real-Time In-Browser AI
...tech startup in San Francisco seeks founding Machine Learning Engineers (MLEs) to enhance core action models... ...automation platform. You will work on low-latency AI solutions in browser environments,... ...and speed. This role demands strong ML skills and experience with LLMs, emphasizing...
Composite.ai
San Francisco, CA
12 hours ago
Founding AI/ML Engineer
...About the Role Our company is hiring a Founding AI / ML Engineer to help architect and ship the next... ...orchestration layers Improve model quality, latency, reliability, and cost efficiency;... ...AI‑native services thesis adjacency) Inference optimization or evaluation systems...
H1b
Visa sponsorship
Ersilia
San Francisco, CA
1 day ago
Founding ML Engineer
...Founding Ml Engineer Weave (YC W25) is building the definitive platform for understanding and improving... ...is fundamentally broken and that modern AI can give teams a far more accurate and... ...through the inevitable highs and lows. You must be an excellent communicator...
Weave, Inc.
San Francisco, CA
2 days ago
Founding ML Engineer Instant, Browser-Driven AI Actions
...Icehouseventures is looking for founding Machine Learning Engineers (MLEs) to enhance their core action models for a proactive automation platform. This role offers the chance to work closely on LLM inference, aiming for instant response times without IT friction. Join...
Icehouseventures
San Francisco, CA
1 day ago
ML Engineer
...new Machine Learning Engineer opportunities posted on AI Chopping Block... ...optimize end-to-end ML pipelines encompassing... ...high performance and low latency. Machine Learning Enginer... ...team as the founding member, and leading... ...tuning training and inference end-to-end for high...
Flexible hours
AI Chopping Block, Inc.
San Francisco, CA
12 hours ago
Founding ML Engineer
...building the gateway to the internet for AI agents. Our APIs already power... ...push the boundaries of what our ML systems can do. We're hiring a Founding ML Engineer to own the research and... ...of records Given raw people data, infer the org chart — who reports to whom...
Crustdata (YC F24)
San Francisco, CA
12 hours ago
Founding ML Engineer
$150k - $300k
...An early-stage AI data company that went from zero... ...web. You will own the ML systems that turn that... ...from multiple sources. Infer organisational structures... ...full ML research and engineering cycle, from prototype to... ...with a direct path toward founding something of your own....
Open Select
San Francisco, CA
1 day ago
ML/AI Research Engineer Agentic AI Lab (Founding Team)
...ML/AI Research Engineer — Agentic AI Lab (Founding Team) Location: San Francisco Bay Area Type: Full-Time Compensation: Competitive salary + meaningful... ...detection, error classification, and alignment Optimize inference latency and GPU resource utilization across cloud and on‑...
Full time
Fabrion
San Francisco, CA
4 days ago
Founding Applied ML Engineer
...Founding Applied ML Engineer Title of Role: Founding Applied ML Engineer Location: San Francisco, CA... ...Description We're representing an early-stage AI company that operates at the... ...environment. Familiarity with multilingual or low-resource language modeling challenges....
Work at office
Recruiting from Scratch
San Francisco, CA
7 days ago
Founding MLOps Engineer
$250k
...client, a venture-backed AI Startup, is hiring a talented ML/AI Research Engineer to join their team in... ...quantization and high‑performance inference deployment. Manage GPU‑... ...observability for latency, token usage, drift... ..., or prior startup/founding team experience is a bonus...
Alldus International Consulting Ltd
San Francisco, CA
12 hours ago
Staff ML Infra Engineer: Low-Latency Cloud Systems
...leading streaming platform in San Francisco is seeking a Software Engineer to design and build scalable distributed systems. The ideal... ..., particularly AWS. Responsibilities include building low-latency online microservices and collaborating with machine learning engineers...
Flexible hours
Tubi TV
San Francisco, CA
4 days ago
Partner 20, Applied ML, Engineer, ASG
$341k - $422k
...Partner 20, Applied ML, Engineer, ASG San Francisco,... ...California, United States Founded in Silicon Valley in 2... ...companies, across AI, bio + healthcare,... ...training to large-scale, low-latency serving and robust MLOps... ...(data, training, and inference) Collaborate with...
Work at office
2 days per week
3 days per week
Andreessen Horowitz
San Francisco, CA
2 days ago
ML/AI Founding Engineer
...intelligence. You're building the AI that makes sense of it all — the... ...catch. About the role This is a founding AI/ML role. You'll own the... ...ingestion, model training, evaluation, inference, and monitoring in production Strong engineering fundamentals — you can build the...
Flynavi
San Francisco, CA
1 day ago
Principal AI/ML Engineer - AdTech
$300k - $400k
...Global (NYSE: ZETA) is the AI-Powered Marketing... ...marketing programs. Zeta was founded in 2007 by David A.... ...As a Principal AI/ML Engineer in our AdTech team, you... ...at large scale and low latency to handle billions of... ...training to real-time inference, for our real-time bidding...
Zeta Global
San Francisco, CA
2 days ago
Staff, ML Infrastructure Engineer
$227.2k - $324.5k
...Headquartered in San Francisco and founded in ვინც 2014, Tubi is... ...Role: This Software Engineering team works closely with... ...The team’s efforts take inference systems to the next level of low‑latency serving by exploring new... ...latency. Work with ML engineers to understand...
Full time
Flexible hours
Tubi Tv
San Francisco, CA
4 days ago
Founding ML Engineer Build Real-Time AI Data Intelligence
...Gravity Engineering Services Pvt Ltd. is looking for a Founding ML Engineer to lead the development of core ML intelligence systems. You will dive into the complexities... ..., making significant contributions to the way AI agents access real-time information. Experience in NLP...
Gravity Engineering Services Pvt Ltd.
San Francisco, CA
1 day ago
Founding Machine Learning Engineer
...About Poesis Poesis is the AI-native investment manager pioneering a new foundation... ...trading decisions are made. We’re hiring our Founding ML Engineer, the first full-time machine learning... .... You thrive in high‑autonomy, low‑process environments and like being close...
Full time
Immediate start
Relocation
Visa sponsorship
Relocation package
Poesis LLC
San Francisco, CA
4 days ago
Senior GPU ML Infra Engineer Mid-Training & Inference
...A cutting-edge AI technology company based in San Francisco is seeking a specialist to design and operate large-scale GPU infrastructure... ...requires expertise in deploying GPU systems for high-throughput inference and model performance optimization. The ideal candidate will...
Reflection AI
San Francisco, CA
12 hours ago
Founding ML Engineer Peptide Drug Discovery AI
$200k
...Founding ML Engineer San Francisco, on-site, full-time - $200,000 - $500,000 per year. $10k referral bonus for successful hires (half equity,... ...Yesterday's mouse data has already been crunched. The house AI parsed the PK curves, flagged the outliers, and ranked the candidates...
Full time
Night shift
Day shift
Afternoon shift
Stealth Deep Tech
San Francisco, CA
4 days ago
Founding ML Engineer for AI-Driven Hedge Fund
...A pioneering hedge fund in San Francisco is seeking a Founding ML Engineer to architect and build machine learning systems for investment decisions. This hands-on role requires 5–10+ years of experience and proficiency in Python and ML frameworks like PyTorch and TensorFlow...
Poesis LLC
San Francisco, CA
12 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Founding ML Inference Engineer Ultra-Low Latency AI. Be the first to apply!