Software Engineer L5, LLM Compute & Serving Systems

$100k

Netflix

Netflix is one of the world's leading entertainment services with 278 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can change their plans at any time.

The Role
Netflix is the world's leading streaming entertainment service, with over 300 million paid members in over 190 countries, enjoying TV series, feature films, and games across numerous genres and languages. Members can watch or play as much as they want, anytime, anywhere, on any internet-connected screen.

Machine Learning/Artificial Intelligence powers all of our consumer experience, including content discovery and personalization, identifying and attracting new members to our product, optimizing our payment processing, and much more.

More recently, fast-paced innovation in large language models (LLMs) has greatly helped advance state-of-the-art technology in many areas of personalization, including search and recommendation experiences.

The Opportunity

The Consumer ML Serving team provides the computational platform on which we build nearly all our consumer-facing ML/AI applications. If you've seen it, we probably served it! We provide all the building blocks to serve ML at scale, including a real-time model serving platform, an event-driven model and feature compute framework, a distributed compute orchestration engine, and more. Additionally, as we expand to enable LLM innovation in numerous areas of personalization, we're building model serving infrastructure for LLMs and other large foundation models.

We are looking for a strong senior engineer to own and develop our long-term vision. Our systems power some of Netflix's most business-critical models, and we need you to take our ML/AI initiatives to the next level. You will play a highly cross-functional role, partnering with other engineers, product managers, machine learning engineers, and data/research scientists.

If you have a passion for building scalable, robust systems, are interested in pushing the envelope in applied ML algorithms, and enjoy seeing a direct line between your work and what our customers see on their screens, we want to talk to you.

You may enjoy working with us if you are:

Self-driven and highly motivated to deliver top-tier ML infrastructure, and able to navigate highly ambiguous environments and execute 0-to-1 projects.
Eager to learn about new domains and ship high-quality, well-tested code.
Able to produce generic and optimal solutions while balancing near-term needs.
Excited to work in a multidisciplinary environment (engineering, algorithms, data engineering/science, and product experimentation).
Comfortable working in a hybrid team with partners distributed across (US) geographies & time zones.
Willing to take broad ownership of team responsibilities (building roadmaps, scoping, task breakdowns, etc.).

We would especially love to work with you if you have experience with:

Building and operating high-traffic, real-time distributed systems and ML serving infrastructure for LLMs and other large foundation models.
Supporting large-scale ML models with a direct impact on what customers see.
Translating the requirements of research scientists into generic platform offerings.
Delivering systems requiring high availability, throughput, and performance.
Navigating highly ambiguous environments.
Taking on and executing zero-to-one projects.
Leading projects with 3-4 other engineers.
Building applications in an object-oriented programming language. (We work primarily with Java, and while prior Java experience is not required to interview, you will be expected to become proficient on the job.)
DevOps for large applications, including performance tuning, optimization, deployment management, and capacity planning.
Public cloud platforms such as AWS, Azure, or GCP.

...and if:

You are a proactive, effective communicator and have a strong bias towards action.
You have a BS/MS in Computer Science, Applied Math, Engineering, or a related field.

Our compensation structure consists solely of an annual salary; we do not have bonuses. You choose each year how much of your compensation you want in salary versus stock options. To determine your personal top of market compensation, we rely on market indicators and consider your specific job family, background, skills, and experience to determine your compensation in the market range. The range for this role is $100,000 - $720,000.

Netflix provides comprehensive benefits including Health Plans, Mental Health support, a 401(k) Retirement Plan with employer match, Stock Option Program, Disability Programs, Health Savings and Flexible Spending Accounts, Family-forming benefits, and Life and Serious Injury Benefits. We also offer paid leave of absence programs. Full-time hourly employees accrue 35 days annually for paid time off to be used for vacation, holidays, and sick paid time off. Full-time salaried employees are immediately entitled to flexible time off. See more detail about our Benefits here.

Netflix is a unique culture and environment. Learn more here.

We are an equal-opportunity employer and celebrate diversity, recognizing that diversity of thought and background builds stronger teams. We approach diversity and inclusion seriously and thoughtfully. We do not discriminate on the basis of race, religion, color, ancestry, national origin, caste, sex, sexual orientation, gender, gender identity or expression, age, disability, medical condition, pregnancy, genetic makeup, marital status, or military service.

Job is open for no less than 7 days and will be removed when the position is filled.


Vacancy posted 1 day ago
