Lead ML Inference Engineer, Advertising

Roku, Building C

Teamwork makes the stream work.

Roku is changing how the world watches TV

Roku is the #1 TV streaming platform in the U.S., Canada, and Mexico, and we've set our sights on powering every television in the world. Roku pioneered streaming to the TV. Our mission is to be the TV streaming platform that connects the entire TV ecosystem. We connect consumers to the content they love, enable content publishers to build and monetize large audiences, and provide advertisers unique capabilities to engage consumers.

From your first day at Roku, you'll make a valuable - and valued - contribution. We're a fast-growing public company where no one is a bystander. We offer you the opportunity to delight millions of TV streamers around the world while gaining meaningful experience across a variety of disciplines.

About the team

The Advertising Performance group focuses on performance for all participants in the Advertising ecosystem - Advertisers, Publishers, and Roku. The systems and solutions span multiple disciplines and technologies to perform real-time multi-objective optimization across distributed systems at large scale and with low latency. We use Machine Learning, Reinforcement Learning, AI, Control and Optimization Systems, and Auction Dynamics to solve a large set of complex problems. At the core of this is our Machine Learning and Inference Platform that powers the entire landscape.


About the role

In this role, you will architect, design, and lead the development of a state-of-the-art (SOTA) Inference platform that can handle Advertising-level low latencies, scale, throughput, and availability, with optimizations that span hardware, software, and models. We're looking for a strong technical leader with deep experience in ML serving, high-performance computing, and industry-standard frameworks - someone excited to mentor engineers, innovate at scale, and shape the future of machine learning at Roku.

What you'll be doing
  • Lead the design and development of a SOTA Inference platform
  • Oversee the development of monitoring, observability, and other tooling to ensure system and model performance, reliability, and scalability of online inference services
  • Identify and resolve system inefficiencies, performance bottlenecks, and reliability issues, ensuring optimized end-to-end performance
  • Stay at the forefront of advancements in inference frameworks, ML hardware acceleration, and distributed systems, and incorporate innovations where and when they are impactful

We're excited if you have
  • M.S. or above in CS, ECE, or a related field
  • 10+ years of experience in developing and deploying large-scale, distributed systems, with at least 5 years in a leadership or technical lead role
  • Strong programming skills in high-performance languages
  • Deep understanding of inference frameworks and ML system deployment
  • Proven experience optimizing performance for large-scale machine learning systems, including a deep knowledge of SOTA model optimizations, hardware-software co-design, GPU acceleration, and HPC techniques
  • Excellent communication and collaboration skills
  • Experience leading teams working on high-throughput, low-latency ML serving systems
  • Experience collaborating with and leading global, cross-functional teams
  • Contributions to open-source ML or systems projects

Our Hybrid Work Approach

Roku fosters an inclusive and collaborative environment where teams work in the office Monday through Thursday. Fridays are flexible for remote work, except for employees whose roles require them to be in the office five days a week or who work in offices with a five-day in-office policy.

Benefits

Roku is committed to offering a diverse range of benefits as part of our compensation package to support our employees and their families. Our comprehensive benefits include global access to mental health and financial wellness support and resources. Local benefits include statutory and voluntary benefits which may include healthcare (medical, dental, and vision), life, accident, disability, commuter, and retirement options (401(k)/pension). Employees are supported in taking time off, in accordance with local leave policies and other personal needs to support their evolving work and life needs. It's important to note that not every benefit is available in all locations or for every role. For details specific to your location, please consult with your recruiter.

Accommodations

Roku welcomes applicants of all backgrounds and provides reasonable accommodations and adjustments in accordance with applicable law. If you require reasonable accommodation at any point in the hiring process, please direct your inquiries to View email address on click.appcast.io.

The Roku Culture

Roku is a great place for people who want to work in a fast-paced environment where everyone is focused on the company's success rather than their own. We try to surround ourselves with people who are great at their jobs, who are easy to work with, and who keep their egos in check. We appreciate a sense of humor. We believe a smaller number of very talented people can do more, at lower cost, than a larger number of less talented ones. We're independent thinkers with big ideas who act boldly, move fast, and accomplish extraordinary things through collaboration and trust. In short, at Roku you'll be part of a company that's changing how the world watches TV.


We have a unique culture that we are proud of. We think of ourselves primarily as problem-solvers, which itself is a two-part idea. We come up with the solution, but the solution isn't real until it is built and delivered to the customer. That penchant for action gives us a pragmatic approach to innovation, one that has served us well since 2002.


To learn more about Roku, our global footprint, and how we've grown, visit

By providing your information, you acknowledge that you want Roku to contact you about job roles, that you have read Roku's Applicant Privacy Notice, and understand that Roku will use your information as described in that notice. If you do not wish to receive any communications from Roku regarding this role or similar roles in the future, you may unsubscribe at any time by emailing View email address on click.appcast.io.
Vacancy posted 1 day ago