Founding ML Inference Engineer Ultra-Low Latency AI
Reactor.am
A media technology company in San Francisco is seeking a Founding Engineer specializing in ML Inference. This highly technical role requires expertise in the ML infrastructure stack and aims to optimize generative media performance. The ideal candidate will drive innovations in real-time model performance, design in-house inference runtimes, and optimize models through advanced techniques. Competitive salary and relocation support are offered, along with generous health coverage. #J-18808-Ljbffr
- ...URun in San Francisco is searching for an ML Infrastructure and Platform Engineer. In this role, you will lead the architecture and scaling... ...the ground up, ensuring high availability and low-latency inference. This is a founding technical hire position, requiring end-to-end...Suggested
- ...focused company in San Francisco seeks candidates with expertise in AI simulation development. The role emphasizes optimizing training efficiency, enhancing GPU performance, and ensuring low-latency inference. Applicants should be proficient in methodologies for gradient...Suggested
$200k
...deploying high‑throughput, ultra‑low‑latency inference engines for large language models or... ...real‑time conversational AI. Possess a deep understanding... ...between the core ML training team and the backend... ...Kubernetes. What We Offer Founding Team Initiative: Opportunity...SuggestedFull timeWork at office- ...problem we saw Most AI infrastructure is... ...fix it uRun is the inference cloud for interactive... ...investors, and are founded by Keegan McCallum,... .... As our ML Infrastructure and Platform Engineer, you will own the architecture... ...availability and low‑latency inference across the...SuggestedFlexible hoursShift work
$150k - $200k
...United States Digital Space LLC is seeking a Founding ML Engineer for both general and audio/speech roles in San Francisco, CA. This position involves building AI players for gaming, focusing on low-latency performance and interactive capabilities. Applicants should have...Suggested- ...strong foundation in low-level operating... ...experienced with modern inference systems like TGI ,... ...state-of-the-art AI models Optimizing model... ...throughput and low latency at scale Developing... ...current with ML infrastructure developments... ...requires a large engineering effort dedicated to...Work at office
$150k - $220k
...Founding Machine Learning Engineer San Francisco Compensation ~ Estimated... ...intersection of LLM inference, browser understanding, and low-latency systems, shipping... ...or consumer-focused "AI browsers," we run AI directly... ...creates unique ML challenges. This is...H1bWork at officeVisa sponsorshipSleeping nights- ...We're looking for an ML Inference Engineer with deep expertise in high-performance ML engineering. This is a highly technical, high-impact... ...performance, and shaping Reactor's competitive edge in ultra-low-latency, high-throughput environments. What You'll Do Drive our frontier...Full timeVisa sponsorshipRelocation package
- ...About the Role ML Ops Engineer — Agentic AI Lab (Founding Team) — Location: San Francisco Bay Area — Type: Full... ...Model conversion, quantization, and inference rollout Manage hybrid compute... ...Instrument observability for model latency, token usage, performance metrics,...Full time
- ...Composite is seeking founding Machine Learning Engineers to enhance our proactive automation platform. In this role, you will improve model accuracy and latency across web applications, and design inference pipelines that deliver instant user experiences. We're looking...
- ...tech startup in San Francisco seeks founding Machine Learning Engineers (MLEs) to enhance core action models... ...automation platform. You will work on low-latency AI solutions in browser environments,... ...and speed. This role demands strong ML skills and experience with LLMs, emphasizing...
- ...About the Role Our company is hiring a Founding AI / ML Engineer to help architect and ship the next... ...orchestration layers Improve model quality, latency, reliability, and cost efficiency;... ...AI‑native services thesis adjacency) Inference optimization or evaluation systems...H1bVisa sponsorship
- ...Founding Ml Engineer Weave (YC W25) is building the definitive platform for understanding and improving... ...is fundamentally broken and that modern AI can give teams a far more accurate and... ...through the inevitable highs and lows. You must be an excellent communicator...
- ...Icehouseventures is looking for founding Machine Learning Engineers (MLEs) to enhance their core action models for a proactive automation platform. This role offers the chance to work closely on LLM inference, aiming for instant response times without IT friction. Join...
- ...new Machine Learning Engineer opportunities posted on AI Chopping Block... ...optimize end-to-end ML pipelines encompassing... ...high performance and low latency. Machine Learning Enginer... ...team as the founding member, and leading... ...tuning training and inference end-to-end for high...Flexible hours
- ...building the gateway to the internet for AI agents. Our APIs already power... ...push the boundaries of what our ML systems can do. We're hiring a Founding ML Engineer to own the research and... ...of records Given raw people data, infer the org chart — who reports to whom...
$150k - $300k
...An early-stage AI data company that went from zero... ...web. You will own the ML systems that turn that... ...from multiple sources. Infer organisational structures... ...full ML research and engineering cycle, from prototype to... ...with a direct path toward founding something of your own....- ...ML/AI Research Engineer — Agentic AI Lab (Founding Team) Location: San Francisco Bay Area Type: Full-Time Compensation: Competitive salary + meaningful... ...detection, error classification, and alignment Optimize inference latency and GPU resource utilization across cloud and on‑...Full time
- ...Founding Applied ML Engineer Title of Role: Founding Applied ML Engineer Location: San Francisco, CA... ...Description We're representing an early-stage AI company that operates at the... ...environment. Familiarity with multilingual or low-resource language modeling challenges....Work at office
$250k
...client, a venture-backed AI Startup, is hiring a talented ML/AI Research Engineer to join their team in... ...quantization and high‑performance inference deployment. Manage GPU‑... ...observability for latency, token usage, drift... ..., or prior startup/founding team experience is a bonus...- ...leading streaming platform in San Francisco is seeking a Software Engineer to design and build scalable distributed systems. The ideal... ..., particularly AWS. Responsibilities include building low-latency online microservices and collaborating with machine learning engineers...Flexible hours
$341k - $422k
...Partner 20, Applied ML, Engineer, ASG San Francisco,... ...California, United States Founded in Silicon Valley in 2... ...companies, across AI, bio + healthcare,... ...training to large-scale, low-latency serving and robust MLOps... ...(data, training, and inference) Collaborate with...Work at office2 days per week3 days per week- ...intelligence. You're building the AI that makes sense of it all — the... ...catch. About the role This is a founding AI/ML role. You'll own the... ...ingestion, model training, evaluation, inference, and monitoring in production Strong engineering fundamentals — you can build the...
$300k - $400k
...Global (NYSE: ZETA) is the AI-Powered Marketing... ...marketing programs. Zeta was founded in 2007 by David A.... ...As a Principal AI/ML Engineer in our AdTech team, you... ...at large scale and low latency to handle billions of... ...training to real-time inference, for our real-time bidding...$227.2k - $324.5k
...Headquartered in San Francisco and founded in ვინც 2014, Tubi is... ...Role: This Software Engineering team works closely with... ...The team’s efforts take inference systems to the next level of low‑latency serving by exploring new... ...latency. Work with ML engineers to understand...Full timeFlexible hours- ...Gravity Engineering Services Pvt Ltd. is looking for a Founding ML Engineer to lead the development of core ML intelligence systems. You will dive into the complexities... ..., making significant contributions to the way AI agents access real-time information. Experience in NLP...
- ...About Poesis Poesis is the AI-native investment manager pioneering a new foundation... ...trading decisions are made. We’re hiring our Founding ML Engineer, the first full-time machine learning... .... You thrive in high‑autonomy, low‑process environments and like being close...Full timeImmediate startRelocationVisa sponsorshipRelocation package
- ...A cutting-edge AI technology company based in San Francisco is seeking a specialist to design and operate large-scale GPU infrastructure... ...requires expertise in deploying GPU systems for high-throughput inference and model performance optimization. The ideal candidate will...
$200k
...Founding ML Engineer San Francisco, on-site, full-time - $200,000 - $500,000 per year. $10k referral bonus for successful hires (half equity,... ...Yesterday's mouse data has already been crunched. The house AI parsed the PK curves, flagged the outliers, and ranked the candidates...Full timeNight shiftDay shiftAfternoon shift- ...A pioneering hedge fund in San Francisco is seeking a Founding ML Engineer to architect and build machine learning systems for investment decisions. This hands-on role requires 5–10+ years of experience and proficiency in Python and ML frameworks like PyTorch and TensorFlow...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Founding ML Inference Engineer Ultra-Low Latency AI. Be the first to apply!
- graduate machine learning engineer San Francisco, CA
- machine learning engineer San Francisco, CA
- data scientist machine learning engineer San Francisco, CA
- junior machine learning research engineer San Francisco, CA
- senior ml engineer San Francisco, CA
- computer vision machine learning engineer San Francisco, CA
- ai ml engineer San Francisco, CA
- machine learning software engineer San Francisco, CA
- machine learning ai engineer San Francisco, CA
- intern - quantum machine learning for quantum computing San Francisco, CA


