Lead ML Inference Engineer, Advertising

$246.5k

Roku, Building C

Teamwork Makes the Stream Work

Roku is changing how the world watches TV. Roku is the #1 TV streaming platform in the U.S., Canada, and Mexico, and we've set our sights on powering every television in the world. Roku pioneered streaming to the TV. Our mission is to be the TV streaming platform that connects the entire TV ecosystem. We connect consumers to the content they love, enable content publishers to build and monetize large audiences, and provide advertisers unique capabilities to engage consumers.

From your first day at Roku, you'll make a valuable - and valued - contribution. We're a fast-growing public company where no one is a bystander. We offer you the opportunity to delight millions of TV streamers around the world while gaining meaningful experience across a variety of disciplines.

About the Team

The Advertising Performance group focuses on performance for all participants in the Advertising ecosystem - Advertisers, Publishers, and Roku. The systems and solutions span multiple disciplines and technologies to perform real-time multi-objective optimization across distributed systems at large scale and with low latency. We use Machine Learning, Reinforcement Learning, AI, Control and Optimization Systems, and Auction Dynamics to solve a large set of complex problems. At the core of this is our Machine Learning and Inference Platform that powers the entire landscape.

About the Role

In this role, you will architect, design, and lead the development of a state-of-the-art Inference platform that can handle Advertising-level low latencies, scale, throughput, and availability with optimizations that span across hardware, software, and models. We're looking for a strong technical leader with deep experience in ML serving, high-performance computing, and industry standard frameworks - someone excited to mentor engineers, innovate at scale, and shape the future of machine learning at Roku.

For California Only - The estimated annual salary for this position is between $246,500 - $486,100 annually. Compensation packages are based on factors unique to each candidate, including but not limited to skill set, certifications, and specific geographical location. This role is eligible for health insurance, equity awards, life insurance, disability benefits, parental leave, wellness benefits, and paid time off.

What You'll Be Doing

Lead the design and development of a state-of-the-art Inference platform
Oversee the development of monitoring, observability, and other tooling to ensure system and model performance, reliability, and scalability of online inference services
Identify and resolve system inefficiencies, performance bottlenecks, and reliability issues, ensuring optimized end-to-end performance
Stay at the forefront of advancements in inference frameworks, ML hardware acceleration, and distributed systems, and incorporate innovations where and when they are impactful

We're Excited If You Have

M.S. or above in CS, ECE, or a related field
10+ years of experience in developing and deploying large-scale, distributed systems, with at least 5 years in a leadership or technical lead role
Strong programming skills in high-performance languages
Deep understanding of inference frameworks and ML system deployment
Proven experience optimizing performance for large-scale machine learning systems, including a deep knowledge of state-of-the-art model optimizations, hardware-software co-design, GPU acceleration, and HPC techniques
Excellent communication and collaboration skills
Experience leading teams working on high-throughput, low-latency ML serving systems
Experience collaborating with and leading global, cross-functional teams
Contributions to open-source ML or systems projects

Our Hybrid Work Approach

Roku fosters an inclusive and collaborative environment where teams work in the office Monday through Thursday. Fridays are flexible for remote work except for employees whose roles are required to be in the office five days a week or employees who are in offices with a five day in office policy.

Benefits

Roku is committed to offering a diverse range of benefits as part of our compensation package to support our employees and their families. Our comprehensive benefits include global access to mental health and financial wellness support and resources. Local benefits include statutory and voluntary benefits which may include healthcare (medical, dental, and vision), life, accident, disability, commuter, and retirement options (401(k)/pension). Employees are supported in taking time off, in accordance with local leave policies and other personal needs to support their evolving work and life needs. It's important to note that not every benefit is available in all locations or for every role. For details specific to your location, please consult with your recruiter.

Accommodations

Roku welcomes applicants of all backgrounds and provides reasonable accommodations and adjustments in accordance with applicable law. If you require reasonable accommodation at any point in the hiring process, please direct your inquiries to View email address on click.appcast.io.

The Roku Culture

Roku is a great place for people who want to work in a fast-paced environment where everyone is focused on the company's success rather than their own. We try to surround ourselves with people who are great at their jobs, who are easy to work with, and who keep their egos in check. We appreciate a sense of humor. We believe a fewer number of very talented folks can do more for less cost than a larger number of less talented teams. We're independent thinkers with big ideas who act boldly, move fast and accomplish extraordinary things through collaboration and trust. In short, at Roku you'll be part of a company that's changing how the world watches TV.

We have a unique culture that we are proud of. We think of ourselves primarily as problem-solvers, which itself is a two-part idea. We come up with the solution, but the solution isn't real until it is built and delivered to the customer. That penchant for action gives us a pragmatic approach to innovation, one that has served us well since 2002.

To learn more about Roku, our global footprint, and how we've grown, visit

By providing your information, you acknowledge that you want Roku to contact you about job roles, that you have read Roku's Applicant Privacy Notice, and understand that Roku will use your information as described in that notice. If you do not wish to receive any communications from Roku regarding this role or similar roles in the future, you may unsubscribe at any time by emailing View email address on click.appcast.io.

Apply

Vacancy posted 4 days ago

Similar jobs that could be interesting for youBased on the Lead ML Inference Engineer, Advertising in San Jose, CA vacancy

Staff Inference ML Runtime Engineer
...allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning... ...users to effortlessly run large-scale ML applications, without the hassle of managing... ...About The Role The Inference ML Engineering team at Cerebras Systems is dedicated...
Suggested
CEREBRAS SYSTEMS INC.
Sunnyvale, CA
1 day ago
Staff ML Infra Engineer: Scalable Inference Platform (Hybrid)
A leading automotive company is seeking a Staff ML Infrastructure Engineer to build robust compute platforms for machine learning workflows in Sunnyvale, CA. The role... ...in Go, Python or C++, and expertise in ML inference. The position offers a hybrid work model and competitive...
Suggested
General Motors
Sunnyvale, CA
16 hours ago
ML Engineer - Inference & Model Deployment
...HiringCafe is building a 100x better job search engine: fast, comprehensive, honest, and... ...placement. We are looking for a founding ML engineer who can help us turn powerful AI... ...infrastructure: deploying models, optimizing inference latency and throughput, scaling serving...
Suggested
Relocation package
HiringCafe
Cupertino, CA
1 day ago
Senior ML Compiler & Inference Systems Engineer
$152k - $287.5k
NVIDIA Gruppe is seeking a Senior Machine Learning Applications and Compiler Engineer in Santa Clara, California. This role involves developing algorithms for their LPX inference and compiler stack, optimizing the performance of neural network workloads on NVIDIA platforms...
Suggested
NVIDIA Gruppe
Santa Clara, CA
4 days ago
ML Engineer, Ads Experimentation & Intuitive UIs
Apple Inc. in Cupertino, California, is seeking a full-stack ML Engineer to enhance its advertising systems. The ideal candidate will design intuitive user interfaces, partner with cross-functional teams, and build production-ready RAG Machine Learning models. Required...
Suggested
Apple Inc.
Cupertino, CA
3 days ago
ML Engineer - Experimentation, Portal
$147.4k - $272.1k
...build is designed to empower people, including our advertising platform. We deliver ads in a way that benefits... ...is done right, it benefits everyone. The Ads ML Experimentation team is looking for a full‑stack ML Engineer to help shape the future of how Apple's advertising...
Relocation
Apple Inc.
Cupertino, CA
3 days ago
Founding ML Engineer: Production Inference & Deployment
HiringCafe is seeking a Founding ML Engineer in Cupertino to transform AI and ML models into reliable production systems. You'll be responsible for deploying models, optimizing their performance, and ensuring they run efficiently in production. Success in this role requires...
HiringCafe
Cupertino, CA
4 days ago
Remote Senior ML Inference Platform Engineer
General Motors is seeking a Senior ML Infrastructure Engineer to build and scale a robust platform for machine learning inference workflows. You will design backend software components, collaborate with ML engineers, and lead initiatives across GM's ML ecosystem. With...
Remote job
General Motors
Sunnyvale, CA
1 day ago
Senior ML Inference Engineer - Platform
$128.7k - $261.3k
The Model Deployment & Inference Solutions team in GM AV deploys machine... ...is two-fold: build the ML deployment platform that makes... ...or equivalent) as part of your engineering workflow. Experience designing... ...mobility.We are determined to lead change for the world through technology...
Flexible hours
General Motors
Sunnyvale, CA
2 days ago
Machine Learning Engineer (PhD Intern)
...areas including e-commerce, advertising, and fulfillment. We use machine... ...open across our various ML teams. You can find a blurb on... ...modeling, and general causal inference. Search & Discovery ML :... ...works alongside world-class engineers, data scientists, and product...
Remote job
Permanent employment
Work experience placement
Internship
Work at office
Work from home
Flexible hours
Instacart
San Jose, CA
3 days ago
Staff ML Engineer, Inference Platform
$195k - $298k
...relocation assistance. About the Team The ML Inference Platform is part of the AI Compute... ...We are seeking a Staff ML Infrastructure engineer to help build and scale robust Compute platforms... ..., and deliver incremental value. Lead technical decision-making on model serving...
Relocation package
Flexible hours
General Motors
Sunnyvale, CA
1 day ago
Senior ML Test Engineer - AI Compute & CI/CD Lead
$180k - $300k
MixMode is seeking a Principal Software ML Test Engineer to lead testing for the d-Matrix AI compute engine in Santa Clara, California. This role involves overseeing test planning, automation, and execution, while collaborating closely with software development teams....
MixMode
Santa Clara, CA
3 days ago
Remote Lead ML & 3D Vision R&D Engineer
A leading real estate technology company is looking for a Lead Machine Learning R&D Engineer to innovate in spatial computing and advance their platform. This role, based in California, focuses on developing machine learning models that enhance how users interact with...
Remote job
CoStar
Sunnyvale, CA
16 hours ago
Cupertino, USA On-site Apple ML Engineer - Experimentation, Portal
Why Here? Apple’s advertising platform delivers relevant content across App Store, Apple News,... ...TV while protecting user privacy. The Ads ML Experimentation team shapes advertising... ...delight customers. What Will You Do? As an ML Engineer - Experimentation, Portal at Apple, you...
Experimentation Jobs
Cupertino, CA
16 hours ago
Tech Lead, Data & Inference Engineer
...Tech Lead, Data & Inference Engineer San Jose, California, United States About the Job Tech Lead, Data & Inference Engineer Our client is a fast moving and venture backed advertising technology startup based in San Francisco. They have raised twelve million dollars...
Full time
Catalyst Labs, LLC
San Jose, CA
1 day ago
Machine Learning Engineer, App Ads and Gaming
$156k - $316.8k
...boundaries of large-scale ads delivery systems and lead the innovations of TikTok's personalized online advertising. Hence, you'll have a chance to get deeply... ...machine learning algorithms. As a Machine Learning Engineer on the App Ads & Gaming team, you will make efforts...
Temporary work
Local area
Tik Tok
San Jose, CA
1 day ago
Machine Learning Engineer Intern (App Ads) - 2026 Fall (BS/MS)
$45 - $60 per hour
...Machine Learning Engineer Intern (App Ads) - 2026 Fall (BS/MS) Location: San Jose Employment Type: Intern... ...of large-scale ad delivery systems and lead the innovations of TikTok's personalized online advertising. We are looking for talented individuals to join...
Hourly pay
Full time
Summer work
Internship
Local area
Tik Tok
San Jose, CA
4 days ago
ML Runtime Optimization Engineer
...Software Engineer Applied Intuition, Inc. is powering the future of physical AI. Founded... ...engineer with deep experience in optimizing ML models and deploying them on production-... ...optimize efficiency and latency of model inference for compute boards selected by our...
For contractors
For subcontractor
Casual work
Work at office
Remote work
Day shift
Applied Intuition
Sunnyvale, CA
4 days ago
ML Engineers - with LLM GenAI
...Job Title: ML Engineers - with LLM GenAI (3 Resources) Responsibilities Write efficient machine learning workflows and pipelines... ...pipeline - Ingest/Preprocess/Vectorize and index data Inference pipeline - AI Guided workflow to respond to user requests or...
Work experience placement
Sparktek
San Jose, CA
3 days ago
ML Engineer
...ML Engineer Santa Clara, California, United States About the Job Our client is a rapidly... .... About Us Catalyst Labs is a leading talent agency with a specialized... ...preprocessing, model training, deployment, inference, and monitoring in production environments...
Full time
Catalyst Labs, LLC
Santa Clara, CA
6 days ago
Senior ML Accelerator Engineer - GPU
$128.7k - $261.3k
...export, kernel development, and performance engineering so that every cycle on our accelerators... ...that sit at the heart of our on‑vehicle ML inference for ADAS and autonomous driving . We... ...and we embrace the responsibility to lead the change that will make our world better...
Local area
Work from home
Relocation package
Flexible hours
General Motors
Sunnyvale, CA
3 days ago
Matterport - Senior ML Ops Engineer
$173k - $253k
...Senior MLOps Engineer Matterport is leading the digital transformation of the built world. Our groundbreaking... .... You will work closely with ML R&D Engineers and other engineering teams... ...analyze model performance, optimize inference speed and resource utilization, and ensure...
Work at office
Work from home
CoStar Group
Sunnyvale, CA
3 days ago
Senior ML Compiler Engineer
$128.7k - $261.3k
...development, and performance engineering so that every cycle on our accelerators... ...models into fast, reliable inference across GPUs powering GM's next... ...reliable, and effortless for ML engineers across the AV... ...embrace the responsibility to lead the change that will make our...
Local area
Work from home
Relocation package
Flexible hours
General Motors
Sunnyvale, CA
3 days ago
Remote ML Engineer — Production AI/ML, 12-Month Contract
$50 - $175 per hour
Title: Machine Learning Engineer / AI Engineer Job Type: Contract Contract Length: 12 Months... ...performance bottlenecks in the model inference path to minimize latency for end-users. Required... ...with Docker, Kubernetes, and cloud-based ML platforms (SageMaker, Vertex AI). Strong...
Remote job
Contract work
Immediate start
DeWinter Group
Campbell, CA
1 day ago
Senior ML Performance Engineer
...supercomputer — feel like one seamless engine. Developers can write once, run... ...Role We're looking for a Senior ML Performance Engineer to architect and lead our Performance Testing Platform from... ...platform for evaluating LLM inference workloads across GPU clusters Define...
Lemurian Labs
Santa Clara, CA
6 days ago
Director ML Engineering
$206.4k - $384.68k
...verticals. We are hiring a Director, ML Engineering to own the engineering function behind... ...Own the unit economics of Firefly Foundry inference. Cost-to-serve, GPU utilization , and... ...remit. What you will do ** Build and lead the engineering organization ** Lead a...
Temporary work
Local area
Worldwide
Adobe
San Jose, CA
16 hours ago
Senior ML Engineer - Model Compression
$128.7k - $261.3k
...developers and deployment and infra engineers to ship numerically robust,... ..., Mathematics, Data Science / ML, or a closely related... ...model compression / efficient inference or relevant experience ~ StrongproficiencyinPyTorchandexperience... ...embrace the responsibility to lead the change that will make our...
Local area
Remote work
Work from home
Relocation package
Flexible hours
General Motors
Sunnyvale, CA
3 days ago
Staff ML Infrastructure Engineer - Embodied AI Offboard Perception
...strategies. Implement CI/CD pipelines for ML systems, including automated testing,... ...frameworks, including training metrics, inference performance metrics, data‑quality checks,... ...performance on hard‑edge cases. Partner with ML engineers, researchers, and software teams to...
Local area
Remote work
Relocation package
Flexible hours
General Motors
Sunnyvale, CA
4 days ago
Senior ML Systems Engineer
...allows Cerebras to deliver industry‑leading training and inference speeds and empowers machine learning... ...users to effortlessly run large‑scale ML applications, without the hassle of managing... ...seeking a versatile and experienced engineer to join our SOTA Training Platform...
Internship
Cerebras
Sunnyvale, CA
2 days ago
Distributed Machine Learning Engineer
$150k
...researchers, data scientists, and engineers, tackling the most fundamental... ...The Role The Distributed ML Engineer will play a role at... ...stacks, especially at training and inference, and support the team to... ...algorithms Participate in, or lead design reviews with peers and...
Work experience placement
Visa sponsorship
Institute of Foundation Models
Sunnyvale, CA
4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Lead ML Inference Engineer, Advertising. Be the first to apply!