Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Lead ML Inference Engineer, Advertising

$246.5k

Roku, Building C

Teamwork Makes the Stream Work

Roku is changing how the world watches TV. Roku is the #1 TV streaming platform in the U.S., Canada, and Mexico, and we've set our sights on powering every television in the world. Roku pioneered streaming to the TV. Our mission is to be the TV streaming platform that connects the entire TV ecosystem. We connect consumers to the content they love, enable content publishers to build and monetize large audiences, and provide advertisers unique capabilities to engage consumers.

From your first day at Roku, you'll make a valuable - and valued - contribution. We're a fast-growing public company where no one is a bystander. We offer you the opportunity to delight millions of TV streamers around the world while gaining meaningful experience across a variety of disciplines.

About the Team

The Advertising Performance group focuses on performance for all participants in the Advertising ecosystem - Advertisers, Publishers, and Roku. The systems and solutions span multiple disciplines and technologies to perform real-time multi-objective optimization across distributed systems at large scale and with low latency. We use Machine Learning, Reinforcement Learning, AI, Control and Optimization Systems, and Auction Dynamics to solve a large set of complex problems. At the core of this is our Machine Learning and Inference Platform that powers the entire landscape.

About the Role

In this role, you will architect, design, and lead the development of a state-of-the-art Inference platform that can handle Advertising-level low latencies, scale, throughput, and availability with optimizations that span across hardware, software, and models. We're looking for a strong technical leader with deep experience in ML serving, high-performance computing, and industry standard frameworks - someone excited to mentor engineers, innovate at scale, and shape the future of machine learning at Roku.

For California Only - The estimated annual salary for this position is between $246,500 - $486,100 annually. Compensation packages are based on factors unique to each candidate, including but not limited to skill set, certifications, and specific geographical location. This role is eligible for health insurance, equity awards, life insurance, disability benefits, parental leave, wellness benefits, and paid time off.

What You'll Be Doing
  • Lead the design and development of a state-of-the-art Inference platform
  • Oversee the development of monitoring, observability, and other tooling to ensure system and model performance, reliability, and scalability of online inference services
  • Identify and resolve system inefficiencies, performance bottlenecks, and reliability issues, ensuring optimized end-to-end performance
  • Stay at the forefront of advancements in inference frameworks, ML hardware acceleration, and distributed systems, and incorporate innovations where and when they are impactful
We're Excited If You Have
  • M.S. or above in CS, ECE, or a related field
  • 10+ years of experience in developing and deploying large-scale, distributed systems, with at least 5 years in a leadership or technical lead role
  • Strong programming skills in high-performance languages
  • Deep understanding of inference frameworks and ML system deployment
  • Proven experience optimizing performance for large-scale machine learning systems, including a deep knowledge of state-of-the-art model optimizations, hardware-software co-design, GPU acceleration, and HPC techniques
  • Excellent communication and collaboration skills
  • Experience leading teams working on high-throughput, low-latency ML serving systems
  • Experience collaborating with and leading global, cross-functional teams
  • Contributions to open-source ML or systems projects
Our Hybrid Work Approach

Roku fosters an inclusive and collaborative environment where teams work in the office Monday through Thursday. Fridays are flexible for remote work except for employees whose roles are required to be in the office five days a week or employees who are in offices with a five day in office policy.

Benefits

Roku is committed to offering a diverse range of benefits as part of our compensation package to support our employees and their families. Our comprehensive benefits include global access to mental health and financial wellness support and resources. Local benefits include statutory and voluntary benefits which may include healthcare (medical, dental, and vision), life, accident, disability, commuter, and retirement options (401(k)/pension). Employees are supported in taking time off, in accordance with local leave policies and other personal needs to support their evolving work and life needs. It's important to note that not every benefit is available in all locations or for every role. For details specific to your location, please consult with your recruiter.

Accommodations

Roku welcomes applicants of all backgrounds and provides reasonable accommodations and adjustments in accordance with applicable law. If you require reasonable accommodation at any point in the hiring process, please direct your inquiries to View email address on click.appcast.io.

The Roku Culture

Roku is a great place for people who want to work in a fast-paced environment where everyone is focused on the company's success rather than their own. We try to surround ourselves with people who are great at their jobs, who are easy to work with, and who keep their egos in check. We appreciate a sense of humor. We believe a fewer number of very talented folks can do more for less cost than a larger number of less talented teams. We're independent thinkers with big ideas who act boldly, move fast and accomplish extraordinary things through collaboration and trust. In short, at Roku you'll be part of a company that's changing how the world watches TV.

We have a unique culture that we are proud of. We think of ourselves primarily as problem-solvers, which itself is a two-part idea. We come up with the solution, but the solution isn't real until it is built and delivered to the customer. That penchant for action gives us a pragmatic approach to innovation, one that has served us well since 2002.

To learn more about Roku, our global footprint, and how we've grown, visit

By providing your information, you acknowledge that you want Roku to contact you about job roles, that you have read Roku's Applicant Privacy Notice, and understand that Roku will use your information as described in that notice. If you do not wish to receive any communications from Roku regarding this role or similar roles in the future, you may unsubscribe at any time by emailing View email address on click.appcast.io.

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Lead ML Inference Engineer, Advertising in San Jose, CA vacancy
  • Roku, Inc. in San Jose is looking for a strong technical leader to architect and develop a state-of-the-art inference platform for advertising systems. The ideal candidate will have over 10 years of experience in distributed systems and at least 5 years in a leadership... 
    Suggested

    Roku, Inc.

    San Jose, CA
    5 days ago
  • $100k

     ...Netflix is one of the world's leading entertainment services with 27...  ...of this innovation. It offers ML/AI practitioners across Netflix...  ...hiring for a Machine Learning Engineer to join our team to contribute...  ...for efficient and scalable inference. -Develop and maintain online... 
    Suggested
    Hourly pay
    Full time
    Immediate start
    Flexible hours

    Netflix

    Los Gatos, CA
    7 hours ago
  •  ...allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning...  ...users to effortlessly run large-scale ML applications, without the hassle of managing...  ...About The Role The Inference ML Engineering team at Cerebras Systems is dedicated... 
    Suggested

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    1 day ago
  •  ...everyday photography. We're a small team of researchers, engineers, and designers who have always been at the forefront of...  .... We're just getting started! The role: As our first ML Engineer specializing in inference and optimization, you'll bridge the gap between cutting... 
    Suggested
    Relocation
    Visa sponsorship
    Relocation package
    Shift work

    Photalabs

    San Jose, CA
    2 days ago
  • $124k - $195.5k

     ...Machine Learning Applications and Compiler Engineer for New College Grad 2026 in Santa Clara,...  ...will focus on developing algorithms for inference and compiler stack optimizations, working...  ...development, and experience with ML frameworks like TensorFlow and PyTorch. A... 
    Suggested

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  • A leading automotive company is seeking a Staff ML Infrastructure Engineer to build robust compute platforms for machine learning workflows in Sunnyvale, CA. The role...  ...in Go, Python or C++, and expertise in ML inference. The position offers a hybrid work model and competitive... 

    General Motors

    Sunnyvale, CA
    5 days ago
  • A leading technology company is hiring a Machine Learning Systems Engineer in Cupertino, California. You will collaborate with Siri modeling...  ...optimize model training and inference on Apple's custom Silicon. The...  ...has strong experience in ML models, with proficiency in Python... 

    Apple Inc.

    Cupertino, CA
    1 day ago
  • $156k - $387.6k

     ...Machine Learning Engineer - Inference Location: San Jose Team: Technology Employment Type:...  ...Co-Design, High Performance Computing, ML Hardware Acceleration (e.g., GPU/RDMA) or...  ...to do great things with great people. We lead with curiosity, humility, and a desire to... 
    Temporary work
    Local area

    ByteDance

    San Jose, CA
    1 day ago
  • $147.4k - $272.1k

     ...ML Engineer - Experimentation, Portal At Apple, we believe in the power of technology to enrich people's lives. Everything we build is designed to empower people, including our advertising platform. We deliver ads in a way that benefits both customers and advertisers... 
    Relocation

    Apple

    Cupertino, CA
    3 days ago
  •  ...creating a compelling path for advertisers to reach audiences that are...  ...Our Team The Ads Platform Engineering teams build advertising systems...  ...solution leveraging ML models and high performance ad...  ...-end ML model deployment and inference infra for low-latency real-time... 
    Hourly pay
    Full time
    Immediate start
    Flexible hours

    Netflix

    Los Gatos, CA
    3 days ago
  •  ...About Us Mintegral is a leading programmatic and interactive mobile advertising platform. Focused on the APAC region...  ...algorithmic innovation to optimize model inference performance, enhance monitoring...  ...in Computer Science, Software Engineering, AI, Mathematics, Physics, or a... 
    Work experience placement

    Mintegral

    Sunnyvale, CA
    2 days ago
  • $147.4k - $272.1k

     ...Machine Learning Engineer, Proactive - Large Language Models & Generative AI Inference The Intelligence Platform team empowers clients across Apple's operating systems...  ...of responsibilities. Your tasks will include: Leading the exploration and application of Large... 
    Relocation

    Apple

    Cupertino, CA
    3 hours ago
  • $184.5k

     ...Senior Machine Learning Engineer Expedia Technology...  ...join our high-performing Advertising Technology team, where...  ...scale batch and real-time ML systems that power...  ...use cases Propose, lead, and deliver high-impact...  ...and validation, scalable inference, monitoring, drift detection... 
    Local area
    Flexible hours

    Expedia Group

    San Jose, CA
    2 days ago
  • $128.7k - $261.3k

     ...Team The Model Deployment & Inference Solutions team in GM AV...  ...mission is two-fold: build the ML deployment platform that makes...  ...currently performed manually by engineers. Build the developer experience...  ...embrace the responsibility to lead the change that will make our... 
    Local area
    Remote work
    Work from home
    Relocation package
    Flexible hours
    Shift work

    General Motors

    Sunnyvale, CA
    3 days ago
  • $155.42k - $205.9k

     ...Job Description About the Team: The ML Inference Platform is part of the AV ML...  ...are seeking a Senior ML Infrastructure engineer to help build and scale robust platforms...  ...requirements, and deliver incremental value. Lead technical decision-making on model serving... 
    Local area
    Remote work
    Work from home
    Relocation
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    1 day ago
  • $185.5k - $270k

     ...relocation assistance. About the Team: The ML Inference Platform is part of the AI Compute...  ...are seeking a Staff ML Infrastructure engineer to help build and scale robust Compute...  ...requirements, and deliver incremental value. Lead technical decision-making on model... 
    Local area
    Work from home
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    7 hours ago
  •  ...areas including e-commerce, advertising, and fulfillment. We use machine...  ...open across our various ML teams. You can find a blurb on...  ...modeling, and general causal inference. Search & Discovery ML :...  ...works alongside world-class engineers, data scientists, and product... 
    Remote job
    Permanent employment
    Work experience placement
    Internship
    Work at office
    Work from home
    Flexible hours

    Instacart

    San Jose, CA
    3 days ago
  • Apple Inc. in Cupertino, California, is seeking a full-stack ML Engineer to enhance its advertising systems. The ideal candidate will design intuitive user interfaces, partner with cross-functional teams, and build production-ready RAG Machine Learning models. Required... 

    Apple Inc.

    Cupertino, CA
    3 days ago
  • A global travel technology company based in San Jose is seeking a Senior Machine Learning Engineer to join their Advertising Technology team. The role focuses on building and operating large-scale machine learning systems that enhance pricing and inventory optimization... 

    Expedia, Inc.

    San Jose, CA
    1 day ago
  • $147.4k - $272.1k

    A leading technology company is searching for a Machine Learning Engineer in Cupertino, California. The role involves working with Large Language Models and Generative AI to enhance user experiences across Apple's platforms. Candidates should have extensive experience in... 

    Apple Inc.

    Cupertino, CA
    5 days ago
  • $199.7k - $254.6k

     ...the Team Join Cisco’s CX AI Incubation Team as a Senior AI/ML DevOps Engineer and help productionize LLM/SLM capabilities for Intelligent...  ...reliable, secure, and observable AI services, optimizing inference performance from CPU and small GPUs to large multi-GPU servers... 
    Full time
    Temporary work
    Local area
    Flexible hours

    Cisco

    San Jose, CA
    3 days ago
  • A leading tech company is seeking a Machine Learning Engineer in Cupertino, California. In this role, you will design, implement, and optimize machine learning frameworks, develop text input features, and collaborate with data scientists and software developers. Required... 

    Apple Inc.

    Cupertino, CA
    3 days ago
  • An automotive leader seeks a Senior ML Infrastructure Engineer to build and enhance platforms for ML Inference workflows. The role involves collaborating with engineers to ensure optimal model serving and overseeing system performance. Candidates should have substantial... 

    General Motors

    Sunnyvale, CA
    1 day ago
  •  ...Learning Specialist in San Jose, California. This role involves leading the design, development, and implementation of advanced machine...  ...scientists and product teams to enhance services through innovative AI/ML solutions. Key responsibilities include building scalable ML... 

    PayPal

    San Jose, CA
    5 days ago
  •  ...Tech Lead, Data & Inference Engineer Santa Clara, California, United States About the Job Tech Lead, Data & Inference Engineer Our client is a fast moving and venture backed advertising technology startup based in San Francisco. They have raised twelve million... 
    Full time

    Catalyst Labs, LLC

    Santa Clara, CA
    2 days ago
  • $180k - $300k

    MixMode is seeking a Principal Software ML Test Engineer to lead testing for the d-Matrix AI compute engine in Santa Clara, California. This role involves overseeing test planning, automation, and execution, while collaborating closely with software development teams.... 

    MixMode

    Santa Clara, CA
    3 days ago
  • A leading technology company is looking for a Principal GenAI Inference Optimization Engineer in San Jose, CA. This role will focus on optimizing performance and efficiency of generative AI on AMD GPU platforms. The ideal candidate will have significant expertise in GPU... 

    Advanced Micro Devices

    San Jose, CA
    1 day ago
  • A leading material engineering firm located in Santa Clara, CA is seeking a skilled individual ready to lead the research, design, and implementation of advanced algorithms for image processing and machine learning. Ideal candidates possess a strong background in computer... 

    Applied Materials, Inc.

    Santa Clara, CA
    2 days ago
  • A leading real estate technology company is looking for a Lead Machine Learning R&D Engineer to innovate in spatial computing and advance their platform. This role, based in California, focuses on developing machine learning models that enhance how users interact with... 
    Remote job

    CoStar

    Sunnyvale, CA
    5 days ago
  • Why Here? Apple’s advertising platform delivers relevant content across App Store, Apple News,...  ...TV while protecting user privacy. The Ads ML Experimentation team shapes advertising...  ...delight customers. What Will You Do? As an ML Engineer - Experimentation, Portal at Apple, you... 

    Experimentation Jobs

    Cupertino, CA
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Lead ML Inference Engineer, Advertising. Be the first to apply!