Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Software Engineer Graduate (Inference Infrastructure) - 2026 Start (PHD)

$136.8k - $259.2k

Pangleglobal

Software Engineer Graduate (Inference Infrastructure) - 2026 Start (PHD)

Location: San Jose

Team: Technology

Employment Type: Regular

The Inference Infrastructure team is the creator and open-source maintainer of AIBrix, a Kubernetes-native control plane for large-scale LLM inference. We are part of ByteDance’s Core Compute Infrastructure organization, responsible for designing and operating the platforms that power microservices, big data, distributed storage, machine learning training and inference, and edge computing across multi-cloud and global datacenters.

Our mission is to deliver infrastructure that is highly performant, massively scalable, cost-efficient, and easy to use—enabling both internal and external developers to bring AI workloads from research to production at scale. We are expanding our focus on LLM inference infrastructure to support new AI workloads, and are looking for engineers passionate about cloud-native systems, scheduling, and GPU acceleration.

Responsibilities
  • Design and build large-scale, container-based cluster management and orchestration systems with extreme performance, scalability, and resilience.
  • Architect next-generation cloud-native GPU and AI accelerator infrastructure to deliver cost-efficient and secure ML platforms.
  • Collaborate across teams to deliver world-class inference solutions using vLLM, SGLang, TensorRT-LLM, and other LLM engines.
  • Stay current with the latest advances in open source (Kubernetes, Ray, etc.), AI/ML and LLM infrastructure, and systems research; integrate best practices into production systems.
  • Write high-quality, production-ready code that is maintainable, testable, and scalable.
Qualifications

Minimum Qualifications:

  • B.S./M.S. in Computer Science, Computer Engineering, or related fields with 2+ years of relevant experience (Ph.D. with strong systems/ML publications also considered).
  • Strong understanding of large model inference, distributed and parallel systems, and/or high-performance networking systems.
  • Hands-on experience building cloud or ML infrastructure in areas such as resource management, scheduling, request routing, monitoring, or orchestration.
  • Solid knowledge of container and orchestration technologies (Docker, Kubernetes).
  • Proficiency in at least one major programming language (Go, Rust, Python, or C++).

Preferred Qualifications:

  • Experience contributing to or operating large-scale cluster management systems (e.g., Kubernetes, Ray).
  • Experience with workload scheduling, GPU orchestration, scaling, and isolation in production environments.
  • Hands-on experience with GPU programming (CUDA) or inference engines (vLLM, SGLang, TensorRT-LLM).
  • Familiarity with public cloud providers (AWS, Azure, GCP) and their ML platforms (SageMaker, Azure ML, Vertex AI).
  • Strong knowledge of ML systems (Ray, DeepSpeed, PyTorch) and distributed training/inference platforms.
  • Excellent communication skills and ability to collaborate across global, cross-functional teams.
  • Passion for system efficiency, performance optimization, and open-source innovation.

ByteDance is an equal opportunities employer and welcomes applications from all qualified candidates. We are committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives.

We offer a competitive salary range of $136,800 - $259,200 annually, as well as a comprehensive benefits package, including medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short-term and long-term disability coverage, life insurance, wellbeing benefits, and 10 paid holidays per year.

#J-18808-Ljbffr
Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Software Engineer Graduate (Inference Infrastructure) - 2026 Start (PHD) in San Jose, CA vacancy
  • $156k - $316.8k

     ...Responsibilitie About the Team The Inference Infrastructure team is the creator and...  ..., and are looking for engineers passionate about cloud-...  ...to join our team in 2026. As a graduate, you will get...  ...have recently completed a PhD degree in Software Development, Computer Science... 
    For graduates
    For phd
    Temporary work
    Local area

    ByteDance

    San Jose, CA
    4 days ago
  • $145k - $250k

     ...Software Engineer Graduate (Data Arch - AI/ML Infrastructure) - 2026 Start (PhD) Join to apply for the Software Engineer Graduate (Data Arch - AI/ML Infrastructure)...  ...Research background in ML/AI, with a focus on inference optimization, model acceleration, or efficient... 
    For graduates
    For phd
    Full time
    Temporary work
    Summer work
    Internship
    Local area

    Tik Tok

    San Jose, CA
    2 days ago
  •  ...Bytedance's business, Bytedance's system infrastructure is currently at a massive scale and...  ...talented individuals to join our team in 2026. As a graduate, you will get opportunities to pursue...  ...Currently pursuing a PhD degree in Computer Science or related... 
    For graduates
    For phd

    Pangleglobal

    San Jose, CA
    2 days ago
  • $122.57k - $316.8k

     ...for this future position in 2026. As a graduate, you will get unparalleled opportunities...  ...clearly in your resume (Start date, End date). Candidates...  ...have recently completed a PhD degree in Software Development, Computer Science, Computer Engineering, or a related technical... 
    For graduates
    For phd
    Temporary work
    Local area

    ByteDance

    San Jose, CA
    4 days ago
  • $60 per hour

     ...team combines system engineering and machine...  ...seeking talented Software Engineers specializing...  ...an internship in 2026. Internships at ByteDance...  ...at ByteDance. PhD internships at...  ...clearly in your resume (Start date, End date)....  ..., online inference, model management,... 
    For phd
    Hourly pay
    Internship
    Local area

    ByteDance

    San Jose, CA
    3 days ago
  • $60 per hour

     ...INTRODUCTION The Vision Engineering Team at TikTok is at...  ...our proprietary AI infrastructures, we streamline the creation...  ...well as large model inference and multi-machine...  ...for an internship in 2026. PhD Internships at TikTok...  ...in your resume (Start date, End date). Responsibilities... 
    For phd
    Hourly pay
    Internship
    Local area
    Worldwide

    Tik Tok

    San Jose, CA
    2 days ago
  • $136.8k - $359.72k

     ...Overview Research Scientist Graduate (TikTok Recommendation-Agentic AI) - 2026 Start (PhD) Applications reviewed on a rolling basis. Narratives about onboarding...  ...algorithms, and develop scalable training and inference frameworks. Conduct research on agentic... 
    For graduates
    For phd
    Internship

    Tik Tok

    San Jose, CA
    2 days ago
  • $136.8k - $259.2k

     ...Research Scientist Graduate (3D/4D Generation) - 2026 Start (PhD) Location: San Jose Team: Technology Employment...  ...upper limits, accelerate inference speed, reduce costs, increase resolution...  ...PhD degree in Computer Science, Engineering or quantitative field (completed... 
    For graduates
    For phd
    Temporary work

    ByteDance

    San Jose, CA
    2 days ago
  • $118.66k - $187.2k

     ...Mobile Software Engineer Graduate (Mobile Reliability, Android) - 2026 Start (BS/MS) Join to apply for the Mobile Software Engineer Graduate (Mobile Reliability...  ...on optimizing existing codebase, building infrastructure and automation tools. On the Mobile Reliability... 
    For graduates
    Temporary work
    Work experience placement
    Internship
    Remote work

    Tik Tok

    San Jose, CA
    2 days ago
  • $136.8k - $259.2k

     ...Research Scientist Graduate (3D/4D Reconstruction/...  ...Generation/Relighting) - 2026 Start (PHD) Location: San Jose...  ...and optimization, inference acceleration, and deployment...  ...to promote hardware‐software co‐innovation;...  ...in Computer Science, engineering or quantitative field... 
    For graduates
    For phd
    Temporary work
    Local area

    Pangleglobal

    San Jose, CA
    2 days ago
  • $30 per hour

     ...Fall 2026 Software Engineering Internship/Co-op Flexible - Any SpaceX...  ...a bachelor's degree or graduate program by the start of employment ~3+ months...  ...Engineering Intern/Completed PhD: $40.00/hour Your...  ...spacecraft, as well as the infrastructure and tools to enable... 
    For graduates
    For phd
    Permanent employment
    Full time
    Internship
    Relocation
    Flexible hours

    SpaceX

    Sunnyvale, CA
    5 days ago
  • $145k - $335k

     ...Research Scientist Graduate (TikTok Recommendation-Large Recommender Models) - 2026 Start (PhD) Join to apply for the Research...  ...techniques and large-scale systems engineering, collaborating with cross-...  ...cross-functional teams (infrastructure, product, research, etc.)... 
    For graduates
    For phd
    Full time
    Temporary work
    Fixed term contract
    Internship
    Local area
    Flexible hours

    Tik Tok

    San Jose, CA
    2 days ago
  • $136.8k - $259.2k

     ...A leading technology company is looking for a Software Engineer Graduate to join the Inference Infrastructure team in San Jose. This role involves designing and building large-scale cluster management systems and collaborating across teams for LLM inference solutions.... 
    For graduates

    Pangleglobal

    San Jose, CA
    2 days ago
  • $124k - $195.5k

    ## AI Inference Performance Engineer - New College Grad 2026Applylocations...  ...See:*** BS, MS, or PhD in Computer Science,...  ...2+ years of relevant software development experience...  ...human language. The GPU started out as the engine for...  ...least until June 7, 2026.This posting is for... 
    For phd

    NVIDIA Corporation

    Santa Clara, CA
    5 days ago
  • $122.57k - $256k

     ...from network architecture, software defined networking (SDN),...  ...global, intelligent network infrastructure to meet the requirements...  ...individuals to join our team in 2026. As a graduate, you will get...  ...whole lifecycle of network engineering. Qualification Minimum... 
    For graduates
    Temporary work
    Local area

    ByteDance

    San Jose, CA
    3 days ago
  • $122.57k - $316.8k

     ...from network architecture, software defined networking (SDN), network...  ..., intelligent network infrastructure to meet the requirements of...  ...a Software Development Engineer, Network Monitoring & Alerts...  ...individuals to join our team in 2026. As a graduate, you will get opportunities... 
    For graduates
    Temporary work
    Local area

    ByteDance

    San Jose, CA
    3 days ago
  • $122.57k - $256k

     ...individuals to join our team in 2026. As a graduate, you will get unparalleled...  ...who are able to commit to these start dates. Please state your availability...  ...'s or Master's degree in Software Development, Computer Science, Computer Engineering, or a related technical... 
    For graduates
    Temporary work
    Local area

    Tik Tok

    San Jose, CA
    1 day ago
  • $150k - $316.8k

     ...Machine Learning Graduate (E-Commerce Governance-CV/NLP/Multimodal/LLM) - 2026 Start (PhD) Location: San Jose Team: Technology Employment...  ...completed a PhD degree in Software Development, Computer Science, Computer Engineering, or a related technical discipline... 
    For graduates
    For phd
    Temporary work
    Local area
    Overseas

    ByteDance

    San Jose, CA
    3 days ago
  • $152k - $241.5k

     ...Senior Software Engineer, Quantized Inference page is loaded## Senior Software Engineer, Quantized...  ..., build systems, training infrastructure, pipeline friction*...  ...open-source codebase* MS/PhD in Computer Science or related...  ...at least until March 1, 2026.This posting is for an... 
    For phd

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $124k - $195.5k

     ...looking for a Deep Learning Software Engineer, TensorRT Performance!...  ...of NVIDIA’s inference ecosystem! NVIDIA is rapidly...  ...~ Bachelors, Masters, PhD, or equivalent...  ...human language. The GPU started out as the engine for...  ...least until April 7, 2026. This posting is for... 
    For phd
    Remote work

    NVIDIA

    Santa Clara, CA
    7 days ago
  • $212.8k

     ...our team in 2027. As a graduate, you will get...  ...network architecture, software defined networking (SDN...  ..., intelligent network infrastructure to meet the requirements...  ...recently completed a PhD in Software Development...  ...Computer Science, Computer Engineering, or a related technical... 
    For graduates
    For phd
    Temporary work
    Local area

    ByteDance

    San Jose, CA
    2 days ago
  • $184k - $287.5k

     ...highly skilled and motivated software engineers to join us and build AI inference systems that serve large...  ...level DSLs and compiler infrastructure to boost kernel...  ...years of experience; or PhD degree with the thesis and...  ...accepted at least until May2,2026. This posting is for an... 
    For phd

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  •  ...industry-leading training and inference speeds and empowers...  ...About The Role As a New Graduate Software Engineer, you will collaborate with...  ...usability of next-generation AI infrastructure. This role is ideal for...  ...discipline (graduating in 2026). Proficiency in C/C++... 
    For graduates
    Internship

    Cerebras

    Sunnyvale, CA
    9 hours ago
  • $45 - $60 per hour

     ...services that enable engineers to deliver high-quality...  ...provide systems enabling software development streamline...  ...for an internship in 2026. Internships at our...  ...clearly in your resume (Start date, End date)....  ..., CI/CD system, build infrastructure and big data. Preferred... 
    Hourly pay
    Summer work
    Internship
    Local area

    ByteDance

    San Jose, CA
    9 hours ago
  • $45 - $60 per hour

     ...Software Engineer Intern (Ads Infrastructure) - 2026 Summer (BS/MS) Location: San Jose Employment Type: Intern Job Code: A114384 Share this listing: Responsibilities...  ...product and business teams on product vision Summer Start Dates May 11th, 2026; May 18th, 2026; May 26th, 2026;... 
    Hourly pay
    Full time
    Summer work
    Internship
    Local area

    Ellis Technologies, Inc.

    San Jose, CA
    2 days ago
  • $122.57k - $316.8k

     ...talented individuals to join our team in 2026. As a graduate, you will get opportunities to pursue...  .... • Partner with PMs, designers, engineers from different teams on building backend...  ...Qualifications: • Demonstrated software engineering or big data experience from... 
    For graduates
    Temporary work
    Work experience placement
    Internship
    Local area

    Tik Tok

    San Jose, CA
    4 days ago
  • $60 per hour

     ...network architecture, software defined networking...  ...network infrastructure to meet the requirements...  ...an internship in 2026. Internships at ByteDance...  ...at ByteDance. PhD internships at...  ...clearly in your resume (Start date, End date)....  ...with AI training/inference systems and... 
    For phd
    Hourly pay
    Internship
    Local area

    ByteDance

    San Jose, CA
    3 days ago
  • $45 per hour

     ...system architectures in areas such as training, inference, and application. Collaborating with the Content...  ...- Currently pursuing an Undergraduate/Master in Software Development, Computer Science, Computer Engineering, or a related technical discipline. - Familiar with... 
    Hourly pay
    Temporary work
    Internship
    Local area

    Tik Tok

    San Jose, CA
    9 hours ago
  •  ...A leading tech company is seeking a Machine Learning Scientist Graduate to enhance its recommendation system. Candidates must hold a PhD in computer science or a related field and have strong skills in deep learning frameworks. The position involves building solutions... 
    For graduates
    For phd

    Tik Tok

    San Jose, CA
    2 days ago
  • $152k - $241.5k

     ...seeking talented and motivated engineers to join our TensorRT team in...  ...-leading deep learning inference software for NVIDIA AI accelerators....  ...we need to see: ~ BS, MS, PhD or equivalent experience in Computer...  ...accepted at least until April 14, 2026. This posting is for an... 
    For phd

    NVIDIA

    Santa Clara, CA
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Software Engineer Graduate (Inference Infrastructure) - 2026 Start (PHD). Be the first to apply!