Staff ML Systems Engineer: Low-Latency Inference
Nuance Labs
Nuance Labs in Seattle is seeking an early-career engineer focused on optimizing AI models for real-time interactions. With a strong foundation in ML systems and experience with inference frameworks, you will work to significantly improve response times of AI avatars by applying innovative techniques. The role requires expertise in Python and PyTorch, with opportunities to contribute to high-impact projects. Competitive salary and equity options are offered, along with a collaborative in-office environment. #J-18808-Ljbffr Nuance Labs
$139.5k - $258.1k
Apple Inc. is seeking a Software Development Engineer for Siri Runtime Systems in Seattle, Washington. This role focuses on designing and integrating next-generation Siri features, emphasizing low-latency interaction and battery optimization. Candidates should possess strong...Suggested$164k - $313.3k
...seeking a Senior Machine Learning (ML) Systems & Efficiency Engineer to join our R&D team focused on... ...production-ready improvements in inference performance, latency, and cost efficiency across image... ...Design and optimize high-throughput, low-latency inference systems....SuggestedTemporary workLocal areaWorldwide$171.6k - $302.2k
Staff/Sr. Machine Learning Engineer, Foundation Models - AI, Search & Knowledge... ...with incredible low latencies, drawing every... ...Research team to optimize inference for cutting edge... ...one of the popular ML Frameworks like... ...building and maintaining systems written in modern...SuggestedRelocation$233.4k - $339.65k
...We are seeking a highly skilled and experienced Principal ML Systems Engineer to join our Autonomous Vehicles team. In this role, you will... ..., Spark) and serving layers optimized for high-throughput, low-latency delivery. ~ Experience optimizing services for cost efficiency...SuggestedH1bLocal areaWork from homeRelocation packageFlexible hours- ...driven technology company in Seattle is seeking a Senior or Staff Software Engineer for the ML Infrastructure team. The role involves designing and operating systems for large-scale model training and inference, focusing on reliability and performance. Candidates should...Suggested
- ...is seeking a skilled Machine Learning Engineer in Seattle, WA. The role involves developing production-level ML solutions crucial for multiple teams, collaborating... ...machine learning, expertise in causal inference and recommendation systems, and a strong command of Python....
- ...Machine Learning Engineer Menlo Park, California... ...most successful ad systems at Google, including... ...of planet-scale ML systems. At Moloco... ...ingestion to online inference - on top of planet-scale... ...teams to reduce latency and improve throughput for low-cost, high-scale decisioning...Temporary workImmediate startShift work
$186.1k - $300.55k
...disconnected from business systems of record, costing... ...Senior Machine Learning Engineer to redefine how we operate... ...series data Optimize inference pipelines to run with low latency on streaming telemetry data... ..., C++, or Go), CI/CD for ML, and experience deploying...Contract workWork at officeLocal areaRemote work2 days per week$106.9k - $160.4k
...ML Engineer At Weyerhaeuser, we sustainably manage forests and manufacture... ...machine learning systems across Weyerhaeuser's AI portfolio... ...and deploy batch and real-time inference solutions using cloud-native... ...drift, prediction accuracy, latency, and implement retraining strategies...Full timeTemporary work$200k - $250k
...Senior Machine Learning Engineer (Mandarin Speaking)... ...the most successful ad systems at Google, including YouTube... ...deploy large-scale ML systems for ad ranking... ...requests per day with low-latency requirements... ...experience optimizing inference performance on GPUs, TPUs...Temporary workWork at officeFlexible hours- ...Performance Engineer, Inference Systems San Francisco, CA | New York City, NY |... ...four dimensions: throughput, latency, reliability, and correctness... ...Qualifications Experience with ML systems, especially training... ...: Currently, we expect all staff to be in one of our offices...Work at officeVisa sponsorshipFlexible hours
$99.6k - $234.6k
...The Principal AI Agent / ML Software Engineer is a Senior Staff-level, hands-on technical... ...operating next-generation AI systems on Oracle Cloud... ...autonomous workflows, scalable inference infrastructure, and enterprise... ...services optimized for low latency, high throughput, GPU...Temporary workFlexible hours- A tech giant in Seattle seeks a Senior Systems Software Engineer for the Video Applications team. This role focuses on developing core application... ...layers that enhance high-performance video workflows, utilizing low-level technologies. Candidates should have significant...
$202k
...Senior Machine Learning Engineer on the Uber Direct... ...and build intelligent systems that improve operational... ...Do Develop High-Impact ML Solutions: Design,... ...both real-time and batch inference at scale. Drive Business... ...in a high‑throughput, low‑latency production environment...Full timeWork at officeLocal areaRemote workWorldwide$320k - $405k
A technology company in Seattle is seeking an experienced Machine Learning Systems Engineer to join their Encodings and Tokenization team. The role involves developing and optimizing tokenization systems, collaborating with research teams, and building critical infrastructure...- ...experienced Machine Learning (ML) Engineer to design, build, and... ...modern, production‑grade ML systems. The ideal candidate will bring... ...performance for accuracy, drift, and latency; manage retraining cycles... ...and maintain training and inference pipelines. Cloud & Platform...Full timeFlexible hours
$139.5k - $258.1k
...pioneering the decentralized data systems to prove it. Description This is not a standard Data Engineering or ML role. We are looking for a... ...of data. Ensure that inference systems can seamlessly access... ...Spark and Flink, and building low-latency, high-throughput data serving...Relocation$197.3k - $313.7k
...Staff Machine Learning Engineer To get the best candidate experience,... ...finetuning to join our ML team. You'll design,... ...on: you'll work at a low level with training frameworks... ...with recommendation systems or search.... ...model optimization for inference (quantization, pruning...$200k - $250k
...Manager of Machine Learning Engineering within the Advanced Technologies... ...of our foundational systems that power our next generation... ...engineering, annotation pipelines, ML Infrastructure and... ...infrastructure, model registries, and low-latency inference services. Ensure high...Temporary workWork at officeLocal area- ...AI startup is seeking a talented Machine Learning Engineer to play a key role in building their core AI inference platform in Seattle. Responsibilities include designing... ...components, researching and implementing advanced ML techniques, and collaborating with a multi-...
- ...orchestration tools; Develop and enhance ML models including time series forecasting... ..., and computer vision; Perform feature engineering and data preparation for ML tasks;... ...statistical reasoning, especially in causal inference; What Do We Offer The global benefits package...Remote workFlexible hours
$140.1k - $210.1k
...reliable on-demand, logistics engine for last-mile retail delivery!... ...engineer to help us develop the AI/ML to power DoorDash expand... ...infrastructure to build recommendation system, and implementing new AI... ...in applied ML for Causal Inference and Recommendation Systems - both...Full timeTemporary workWork at officeLocal areaRemote work$197.3k - $313.7k
Staff ML Engineer, Fine Tuning - SlackSkip to main content#Staff ML Engineer... ...is hands-on: you'll work at a low level with training... ...Expertise with recommendation systems or search.* Familiarity with model optimization for inference (quantization, pruning, speculative...Work at office- ...throughput AI training and inference demands, and the... ...handling high-bandwidth, low-latency sensor data at scale.... .../ AI Infrastructure Engineer, you will own all of it... ...directly with our AI/ML engineers, the Lead Architect... ...topology for client system security plans. •...Remote work
- ...aware, reliable, field-ready AI systems that solve the hardest... ...architectures, combining rigorous engineering with learning systems proven... ...field. We are seeking a Staff ML Systems Engineer to... ...machine learning training and inference systems. Familiarity with...Local area
- ...Tech Lead, Data & Inference Engineer Seattle, Washington, United States About the Job Tech... ...in the evolving world of intelligent systems. Location: San Francisco Work type... ...interfaces into trusted and low latency systems. Take full ownership of reliability...Full time
$171.6k - $230.1k
...Lead Machine Learning Engineer Technology is at... ...machine learning systems while guiding... ...operate production ML systems at scale, mentor... ...balance model quality, latency, throughput,... ...supporting both real-time inference and offline... ...operating ML systems in low-latency, high-...$171.6k - $302.2k
...Description As a Senior/Staff Engineer on the Foundation... ...scheduling and orchestration systems for large-scale TPU... ...-scale training and inference jobs. This role spans... ...utilization, fairness, startup latency, and reliability... ...for distributed ML workloads running on Kubernetes...Relocation- ...Sesame Engineer Position Sesame believes in a future where computers... ...the intersection of embedded systems and ML to enable rich, reliable... ...for gesture detection on ultra-low-power embedded hardware.... ...optimizing algorithms for power, latency, and memory footprint. Sesame...Full timeContract workFlexible hours
$147k - $211k
Google Inc. is seeking a Software Engineer specializing in Parallel File Systems and AI/ML Storage in Seattle, WA. In this role, you will develop the next-generation cloud storage tailored for extreme-scale, data-intensive workloads. The ideal candidate will possess a...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Staff ML Systems Engineer: Low-Latency Inference. Be the first to apply!
- staff data engineer Seattle, WA
- assistant engineer Seattle, WA
- staff engineer Seattle, WA
- software engineer staff Seattle, WA
- senior staff systems engineer Seattle, WA
- assistant civil engineer Seattle, WA
- senior staff engineer Seattle, WA
- project engineer assistant project manager Seattle, WA
- technology administrator Seattle, WA
- engineering aide Seattle, WA

