Software Engineer L5, LLM Compute & Serving Systems
$100k
Netflix
Netflix is one of the world's leading entertainment services with 278 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can change their plans at any time.
The Role
Netflix is the world's leading streaming entertainment service, with over 300 million paid members in over 190 countries, enjoying TV series, feature films, and games across numerous genres and languages. Members can watch or play as much as they want, anytime, anywhere, on any internet-connected screen.
Machine Learning/Artificial Intelligence powers all of our consumer experience, including content discovery and personalization, identifying and attracting new members to our product, optimizing our payment processing, and much more.
More recently, fast-paced innovation in large language models (LLMs) has greatly helped advance state-of-the-art technology in many areas of personalization, including search and recommendation experiences.
The Opportunity
The Consumer ML Serving team provides the computational platform on which we build nearly all our consumer-facing ML/AI applications. If you've seen it, we probably served it! We provide all the building blocks to serve ML at scale, including a real-time model serving platform, an event-driven model and feature compute framework, a distributed compute orchestration engine, and more. Additionally, as we expand to enable LLM innovation in numerous areas of personalization, we're building model serving infrastructure for LLMs and other large foundation models.
We are looking for a strong senior engineer to own and develop our long-term vision. Our systems power some of Netflix's most business-critical models, and we need you to take our ML/AI initiatives to the next level. You will play a highly cross-functional role, partnering with other engineers, product managers, machine learning engineers, and data/research scientists.
If you have a passion for building scalable, robust systems, are interested in pushing the envelope in applied ML algorithms, and enjoy seeing a direct line between your work and what our customers see on their screens, we want to talk to you.
You may enjoy working with us if you are:
- Self-driven, highly motivated to deliver top-tier ML infrastructure while navigating highly ambiguous environments, and able to execute 0-to-1 projects.
- Eager to learn about new domains and ship high-quality, well-tested code.
- Able to produce generic and optimal solutions while balancing near-term needs.
- Excited to work in a multidisciplinary environment (engineering, algorithms, data engineering/science, and product experimentation).
- Comfortable working in a hybrid team with partners distributed across US geographies and time zones.
- Willing to take broad ownership of team responsibilities (building roadmaps, scoping, task breakdowns, etc.).
You have experience with:
- Building and operating high-traffic, real-time distributed systems and ML serving infrastructure for LLMs and other large foundation models.
- Supporting large-scale ML models with a direct impact on what customers see.
- Translating the requirements of research scientists into generic platform offerings.
- Delivering systems requiring high availability, throughput, and performance.
- Navigating highly ambiguous environments.
- Taking on and executing zero-to-one projects.
- Leading projects with 3-4 other engineers.
- Building applications in an object-oriented programming language. (We work primarily with Java, and while prior Java experience is not required to interview, you will be expected to become proficient on the job.)
- DevOps for large applications, including performance tuning, optimization, deployment management, and capacity planning.
- Public cloud platforms such as AWS, Azure, or GCP.
- You are a proactive, effective communicator and have a strong bias towards action.
- You have a BS/MS in Computer Science, Applied Math, Engineering, or a related field.


