Software Engineer - AI Compute Infrastructure
$156k - $387.6k
ByteDance
About the Team
The Inference Infrastructure team is the creator and open-source maintainer of AIBrix, a Kubernetes-native control plane for large-scale LLM inference. We are part of ByteDance's Core Compute Infrastructure organization, responsible for designing and operating the platforms that power microservices, big data, distributed storage, machine learning training and inference, and edge computing across multi-cloud and global datacenters. With ByteDance's rapidly growing businesses and a global fleet of machines running hundreds of millions of containers daily, we are building the next generation of cloud-native, GPU-optimized orchestration systems. Our mission is to deliver infrastructure that is highly performant, massively scalable, cost-efficient, and easy to use, enabling both internal and external developers to bring AI workloads from research to production at scale.
We are expanding our focus on LLM inference infrastructure to support new AI workloads, and are looking for engineers passionate about cloud-native systems, scheduling, and GPU acceleration. You'll work in a hyper-scale environment, collaborate with world-class engineers, contribute to the open-source community, and help shape the future of AI inference infrastructure globally.
Responsibilities
- Design and build large-scale, container-based cluster management and orchestration systems with extreme performance, scalability, and resilience.
- Architect next-generation cloud-native GPU and AI accelerator infrastructure to deliver cost-efficient and secure ML platforms.
- Collaborate across teams to deliver world-class inference solutions using vLLM, SGLang, TensorRT-LLM, and other LLM engines.
- Stay current with the latest advances in open source (Kubernetes, Ray, etc.), AI/ML and LLM infrastructure, and systems research; integrate best practices into production systems.
- Write high-quality, production-ready code that is maintainable, testable, and scalable.
Qualifications
Minimum Qualifications
- B.S./M.S. in Computer Science, Computer Engineering, or related fields with 2+ years of relevant experience (Ph.D. with strong systems/ML publications also considered).
- Strong understanding of large model inference, distributed and parallel systems, and/or high-performance networking systems.
- Hands-on experience building cloud or ML infrastructure in areas such as resource management, scheduling, request routing, monitoring, or orchestration.
- Solid knowledge of container and orchestration technologies (Docker, Kubernetes).
- Proficiency in at least one major programming language (Go, Rust, Python, or C++).
Preferred Qualifications
- Experience contributing to or operating large-scale cluster management systems (e.g., Kubernetes, Ray).
- Experience with workload scheduling, GPU orchestration, scaling, and isolation in production environments.
- Hands-on experience with GPU programming (CUDA) or inference engines (vLLM, SGLang, TensorRT-LLM).
- Familiarity with public cloud providers (AWS, Azure, GCP) and their ML platforms (SageMaker, Azure ML, Vertex AI).
- Strong knowledge of ML systems (Ray, DeepSpeed, PyTorch) and distributed training/inference platforms.
- Excellent communication skills and ability to collaborate across global, cross-functional teams.
- Passion for system efficiency, performance optimization, and open-source innovation.
Job Information
[For Pay Transparency] Compensation Description (Annually)
The base salary range for this position in the selected city is $156000 - $387600 annually. Compensation may vary outside of this range depending on a number of factors, including a candidate's qualifications, skills, competencies and experience, and location. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units.
Benefits may vary depending on the nature of employment and the country work location. Employees have day one access to medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short-term and long-term disability coverage, life insurance, wellbeing benefits, among others. Employees also receive 10 paid holidays per year, 10 paid sick days per year and 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure).
The Company reserves the right to modify or change these benefits programs at any time, with or without notice.
For Los Angeles County (unincorporated) Candidates:
Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state, and local laws including the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act. Our company believes that criminal history may have a direct, adverse and negative relationship on the following job duties, potentially resulting in the withdrawal of the conditional offer of employment:
1. Interacting and occasionally having unsupervised contact with internal/external clients and/or colleagues;
2. Appropriately handling and managing confidential information including proprietary and trade secret information and access to information technology systems; and
3. Exercising sound judgment.
About Us
Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Lemon8, CapCut and Pico as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.
Why Join ByteDance
Inspiring creativity is at the core of ByteDance's mission. Our innovative products are built to help people authentically express themselves, discover and connect - and our global, diverse teams make that possible. Together, we create value for our communities, inspire creativity and enrich life - a mission we work towards every day.
As ByteDancers, we strive to do great things with great people. We lead with curiosity, humility, and a desire to make impact in a rapidly growing tech company. By constantly iterating and fostering an "Always Day 1" mindset, we achieve meaningful breakthroughs for ourselves, our Company, and our users. When we create and grow together, the possibilities are limitless. Join us.
Diversity & Inclusion
ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.
Reasonable Accommodation
ByteDance is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at
Vacancy posted 3 days ago

