Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Software Engineer L5, LLM Compute & Serving Systems

$100k

Netflix

Netflix is one of the world's leading entertainment services with 278 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can change their plans at any time.

The Role
Netflix is the world's leading streaming entertainment service, with over 300 million paid members in over 190 countries, enjoying TV series, feature films, and games across numerous genres and languages. Members can watch or play as much as they want, anytime, anywhere, on any internet-connected screen.


Machine Learning/Artificial Intelligence powers all of our consumer experience, including content discovery and personalization, identifying and attracting new members to our product, optimizing our payment processing, and much more.


More recently, fast-paced innovation in large language models (LLMs) has greatly helped advance state-of-the-art technology in many areas of personalization, including search and recommendation experiences.


The Opportunity

The Consumer ML Serving team provides the computational platform on which we build nearly all our consumer-facing ML/AI applications. If you've seen it, we probably served it! We provide all the building blocks to serve ML at scale, including a real-time model serving platform, an event-driven model and feature compute framework, a distributed compute orchestration engine, and more. Additionally, as we expand to enable LLM innovation in numerous areas of personalization, we're building model serving infrastructure for LLMs and other large foundation models.


We are looking for a strong senior engineer to own and develop our long-term vision. Our systems power some of Netflix's most business-critical models, and we need you to take our ML/AI initiatives to the next level. You will play a highly cross-functional role, partnering with other engineers, product managers, machine learning engineers, and data/research scientists.


If you have a passion for building scalable, robust systems, are interested in pushing the envelope in applied ML algorithms, and enjoy seeing a direct line between your work and what our customers see on their screens, we want to talk to you.


You may enjoy working with us if you are:
  • Self-driven and highly motivated to deliver top-tier ML infrastructure while navigating highly ambiguous environments and can execute 0-to-1 projects.
  • Eager to learn about new domains and ship high-quality, well-tested code.
  • Able to produce generic and optimal solutions while balancing near-term needs.
  • Excited to work in a multidisciplinary environment (engineering, algorithms, data engineering/science, and product experimentation).
  • Comfortable working in a hybrid team with partners distributed across (US) geographies & time zones.
  • Willing to take broad ownership of team responsibilities (building roadmaps, scoping, task breakdowns, etc.)
We would especially love to work with you if have experience with:
  • Building and operating high-traffic, real-time distributed systems and ML serving infrastructure for LLMs and other large foundation models.
  • Supporting large-scale ML models with a direct impact on what customers see.
  • Translating the requirements of research scientists into generic platform offerings.
  • Delivering systems requiring high availability, throughput, and performance.
  • Navigating highly ambiguous environments.
  • Taking on and executing zero-to-one projects.
  • Leading projects with 3-4 other engineers.
  • Building applications in an object-oriented programming language. (We work primarily with Java, and while prior Java experience is not required to interview, you will be expected to become proficient on the job.)
  • DevOps for large applications, including performance tuning, optimization, deployment management, and capacity planning.
  • Public cloud like AWS, Azure, or GCP.
...and if:
  • You are a proactive, effective communicator and have a strong bias towards action.
  • You have a BS/MS in Computer Science, Applied Math, Engineering, or a related field.
Our compensation structure consists solely of an annual salary; we do not have bonuses. You choose each year how much of your compensation you want in salary versus stock options. To determine your personal top of market compensation, we rely on market indicators and consider your specific job family, background, skills, and experience to determine your compensation in the market range. The range for this role is $100,000 - $720,000.

Netflix provides comprehensive benefits including Health Plans, Mental Health support, a 401(k) Retirement Plan with employer match, Stock Option Program, Disability Programs, Health Savings and Flexible Spending Accounts, Family-forming benefits, and Life and Serious Injury Benefits. We also offer paid leave of absence programs. Full-time hourly employees accrue 35 days annually for paid time off to be used for vacation, holidays, and sick paid time off. Full-time salaried employees are immediately entitled to flexible time off. See more detail about our Benefits here.

Netflix is a unique culture and environment. Learn more here.

We are an equal-opportunity employer and celebrate diversity, recognizing that diversity of thought and background builds stronger teams. We approach diversity and inclusion seriously and thoughtfully. We do not discriminate on the basis of race, religion, color, ancestry, national origin, caste, sex, sexual orientation, gender, gender identity or expression, age, disability, medical condition, pregnancy, genetic makeup, marital status, or military service.

Job is open for no less than 7 days and will be removed when the position is filled.
Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Software Engineer L5, LLM Compute & Serving Systems in United States vacancy
  • $272k - $431.25k

     ...inference framework for serving generative AI and...  ...accelerators feel like a single system at datacenter scale. As...  ...outgrow the memory and compute budget of any single...  ...of cutting-edge LLM workloads. We are seeking...  ...seeking a Principal Systems Engineer to define the vision... 
    Suggested
    Local area
    Remote work

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $180k - $220k

     ...industry. Now we're building AI systems to make that quality...  ...this role, you'll own the core LLM infrastructure powering two products...  ...for B2B research. Build self‑serve features that compress weeks into...  ...solutions while maintaining engineering best practices Nice to Have RAG... 
    Suggested
    Immediate start
    Remote work
    Flexible hours

    NewtonX

    New York, NY
    5 days ago
  • $202.16k - $368.22k

     ...column stores, search engines, and multi-model...  ...-native database systems-intelligent,...  ...agents. As a Senior Software Engineer or Researcher...  ...and LLM agent memory backends...  ...Master's, or Ph.D. in Computer Science or related...  ...AI infra or model-serving infrastructure (especially... 
    Suggested
    Temporary work
    Local area

    ByteDance

    Seattle, WA
    5 days ago
  •  ...accelerate next-generation computing experiences—from...  ...and embedded systems. Grounded in a...  ...senior member of the LLM inference...  ...platform for LLM serving. This role sits...  ...intersection of inference engines, distributed systems...  ...Software Engineering ~... 
    Suggested

    Advanced Micro Devices , Inc.

    Santa Clara, CA
    15 hours ago
  • $152k - $241.5k

     ...NVIDIA's TensorRT Edge-LLM team and help shape...  .... We build the software stack that enables...  ...autoregressive model serving capabilities,...  ...equivalent experience in Computer Science, Electrical/Computer Engineering, or a closely...  ...autoregressive LLM serving systems, including... 
    Suggested
    Remote work

    NVIDIA

    Santa Clara, CA
    5 days ago
  • $147k - $211k

     ...core GenAI concepts (LLM, Multi-Modal, Large Vision...  ...'s degree or PhD in Computer Science or related...  ...About the job Google's software engineers develop the next-generation...  ..., large-scale system design, networking and...  ...representative of the users we serve, creating a culture of... 
    Full time

    Google Inc.

    Sunnyvale, CA
    2 days ago
  • $207k - $300k

    Software Engineer, GDC LLM Serving and GPU Performance Google Sunnyvale, CA, USA Qualifications Bachelor’...  ...Master’s degree or PhD in Engineering, Computer Science, or a related technical...  ...distributed computing, large‑scale system design, networking and data storage,... 
    Full time

    Google Inc.

    Sunnyvale, CA
    4 days ago
  •  ...Innovating in AI systems technologies, the full-time Software Engineer, AI Systems will develop libraries and GPU kernel...  ...extensible abstractions for LLM serving engines and contributing to open...  ...qualifications: Master's degree in Computer Science, Electrical Engineering,... 
    Full time
    Remote work

    Virtual Vocations Inc

    United States
    1 day ago
  • $212.8k

     ...Software Development Engineer-AI/LLM Network Location: San Jose Team: Technology...  ...as Douyin and TikTok which serve hundreds of millions of users...  ...development of platforms/systems for monitoring, analysis and...  ...in Software Development, Computer Science, Computer... 
    Temporary work
    Local area

    ByteDance

    San Jose, CA
    4 days ago
  • $184k - $287.5k

     ...highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme...  ...push the frontier of accelerated computing for AI. What you’ll be...  ...Experience building and optimizing LLM inference engines (e.g., vLLM,... 

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $149.75k - $211.42k

     ...AI Software Engineer – Agentic AI System About the role As an AI Software Engineer...  ...agent frameworks and model-serving systems Design and automate...  ...degree (B.S. or B.A.) in Computer Science, Electrical...  ...with prompt engineering and LLM API integrations (such as... 
    Local area
    Immediate start
    Remote work
    Shift work

    Intel

    Hillsboro, OR
    2 days ago
  • $2,000 per month

     ...chain-of-thought reasoning. Software Engineer, LLM Infrastructure Transformer...  ...nothing if the rest of the serving stack takes 100+ ms, and...  ...out of KV cache space and re-compute them later? Can we cache common...  ...and physical Etched systems You may be a good fit if you... 
    Work at office
    Relocation package

    OpenReq

    Cupertino, CA
    2 days ago
  •  ...Software Engineer Opportunity Baseten powers mission-critical...  ...intersection of high-performance computing (HPC) and Large Language Model (LLM) engineering. You will...  ...that ensure our systems are production-ready....  ...decoding, disaggregated serving, and kernel-level... 
    Remote work
    Flexible hours

    Baseten

    United States
    1 day ago
  • $160k - $225k

     ...human life on Mars. SR. SOFTWARE ENGINEER, COMPUTER VISION We are looking for...  ...standard tools like Ray Train/Serve, Kubeflow, Airflow, or...  ...AI-driven process monitoring systems Build and optimize data pipelines...  ...in machine learning/LLM operations including model versioning... 
    Permanent employment
    Full time
    Temporary work
    Weekend work

    Spacex

    Remote
    1 hour ago
  • $146.5k

     .... For us, it also serves as a practical framework...  ...The ML Data Engineering team powers metadata...  ...worldwide. Our systems operate at massive...  ...scalable ML and LLM-powered solutions...  ...seeking a Senior Software Engineer with deep...  ...Bachelor's degree in Computer Science or... 
    Local area
    Worldwide
    Home office
    Flexible hours

    Scribd

    San Francisco, CA
    5 days ago
  •  ...build the platform engineers turn to to ship AI...  ...global operating system for distributed, heterogeneous...  ...believe that as LLM and multi-modal...  ...network is the computer. We are looking...  ...to architect the software fabric that unifies...  ...for Disaggregated Serving, Wide Expert... 
    Flexible hours

    Baseten

    New York, NY
    3 days ago
  • $260k - $340k

     ...our time. The demand for AI compute is boundless, and power is a...  ...us at Crusoe. Principal Systems Software Engineer San Francisco, Sunnyvale...  ...Systems Architect , you will serve as the visionary lead for...  ...infrastructure for Large Language Model (LLM) training and inference at... 
    Full time
    Temporary work

    Crusoe

    San Francisco, CA
    2 days ago
  • $130k - $175k

     ...interest in Uncountable Engineering! Description...  ...AI. As an LLM Applications Engineer...  ...the retrieval systems, agentic workflows...  ...~2+ years of software development experience...  ...Qualifications B.S. in Computer Science or a...  ...queries that serve as the context for... 

    Uncountable Inc

    New York, NY
    5 days ago
  •  ...re seeking skilled a Senior Software Engineer to design and develop the core API services and backend systems that power InfiniteChoice's...  ...native software solutions that serve millions of users, process...  ...~ Bachelor's degree in Computer Science, Engineering, or equivalent... 
    Contract work
    Remote work

    Infinite Choice

    Dallas, TX
    5 days ago
  • $181.1k - $318.4k

     ...Full Stack Software Engineer - ML Compute Capacity Scaling machine learning workloads across thousands...  ...bringing together expertise in distributed systems, machine learning infrastructure, and...  ...systems that ingest, normalize, and serve fleet-wide utilization and cost data... 
    Relocation

    Apple

    Santa Clara, CA
    2 days ago
  • $100k - $120k

     ...We are looking for an IT Systems Engineer to maintain, support, and enhance...  ...AI Research & Advocacy: Serve as the primary resource for evaluating...  ...: Bachelor's degree in Computer Science, Information Technology...  ...(Workato, n8n), or LLM API integrations (OpenAI, Anthropic... 
    Work at office
    Remote work
    Work from home
    Home office
    Flexible hours
    Shift work

    Flock Safety

    United States
    5 days ago
  • $184k - $287.5k

     ...NVIDIA has been transforming computer graphics, PC gaming, and...  ...We are looking for a Senior Systems Software Engineer to help build the next generation...  ...in AI coding agents, LLM-powered tools, or modern AI...  ...or Kubernetes! Experience serving as the technical lead for a... 
    Remote work

    NVIDIA

    United States
    5 days ago
  •  ...Senior Software Engineer, Go - LLM Team AssemblyAI builds the best-in-class Voice AI models powering...  ...generation of voice applications. Our models serve 600M+ inference calls monthly, process...  ..., and building highly reliable systems in service of solving real customer problems... 
    Remote work
    Easy work

    AssemblyAI

    United States
    8 hours ago
  •  ...Software Engineer Persona is the configurable identity platform built...  ...— that's why we're able to serve a wide range of leading companies...  ...! About the Role The Compute team's mission: any engineer...  ...code, can debug distributed system failures across layers you... 
    Full time
    For contractors
    Internship
    Remote work

    Persona

    United States
    2 days ago
  • $148.2k - $300.96k

     ...ByteDance's KV caching and storage systems team, where we build and own...  ...-generation shared-storage engines, and performance/cost optimization...  ...and recovery capabilities. We serve ByteDance's core business...  ...on business workloads. - Drive compute/storage efficiency improvements... 
    Temporary work
    Local area
    Remote work

    ByteDance

    United States
    3 days ago
  •  ...The Cloud Infrastructure Engineering organization exists to manage...  ...build, operate, and maintain Compute, Network, and Storage services...  ...need, including things like serving the largest internet Live stream...  ...on. Drive the evolution of systems and tooling used to manage hundreds... 
    Hourly pay
    Full time
    Immediate start
    Remote work
    Flexible hours

    Netflix

    United States
    1 day ago
  • $224k - $356.5k

     ...NVIDIA Systems Software Engineer DGX Station (Galaxy) is NVIDIA's workstation-class AI computer—built on GB300 Blackwell GPUs with NVLink interconnect...  ...like NemoClaw, LLM inference via NIM, Hermes agents...  ...simultaneous training jobs, inference serving alongside development, and... 
    Local area

    NVIDIA

    Santa Clara, CA
    45 minutes ago
  • $149k - $204.6k

     ...supply chain. Intelligent software orchestrates advanced...  ...-density, end-to-end system - reinventing...  ...looking for Senior Software Engineer to join our Perception...  ...other team members by serving as a technical mentor...  ...related discipline (i.e. Computer Science, Electrical... 

    Symbotic

    Austin, TX
    8 hours ago
  •  ...lives. Our end-to-end suite of software solutions helps our customers...  ...campuses, transportation systems, healthcare centers, public venues...  ...team of scientists and engineers (located in Chicago, Boston,...  ...Natural Language Understanding and Computer Vision. Our AI team is... 
    Live in
    Work at office
    Relocation

    Motorola Solutions

    Waltham, MA
    8 hours ago
  • $160k - $225k

     ...enabling human life on Mars. SR. SOFTWARE ENGINEER, HIGH PERFORMANCE COMPUTING (STARLINK) At SpaceX we’re...  ...s most advanced broadband internet system. Starlink is the world’s largest satellite...  ...experience, often providing under-served communities with affordable, life-changing... 
    Permanent employment
    Temporary work
    Worldwide
    Weekend work

    SpaceX

    Sunnyvale, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Software Engineer L5, LLM Compute & Serving Systems. Be the first to apply!