Machine Learning Engineer - Inference

$156k - $387.6k

ByteDance

Location: San Jose

Team: Technology

Employment Type: Regular

Job Code: A143983

Responsibilities

The mission of our AML team is to advance the next-generation AI infrastructure and recommendation platform for ads ranking, search ranking, and live and e-commerce ranking across the company. We also drive substantial impact on the company's core businesses. We are currently looking for a Machine Learning Engineer - Inference to join our team to support and advance that mission. Responsibilities:

  • Responsible for the design and implementation of distributed inference infrastructure for feeds, ads and search ranking models.
  • Responsible for building monitoring and management tools to oversee the reliability and scalability of online inference servers.
  • Responsible for triaging system inefficiencies and bottlenecks and improving system performance.
  • Responsible for building tools to analyze bottlenecks and sources of instability, and then designing and implementing solutions.
  • Responsible for collaborating with product teams and providing general solutions to meet their requirements.

Qualifications

Minimum Qualifications:

  • At least 3 years of experience in developing and deploying large-scale systems.
  • Experience contributing to an open-source machine learning framework (TensorFlow / JAX / PyTorch / TorchScript / MXNet / TensorRT).
  • Strong background in one of the following fields: Hardware-Software Co-Design, High Performance Computing, ML Hardware Acceleration (e.g., GPU/RDMA) or ML for Systems.

For Pay Transparency

Compensation Description (Annually)

The base salary range for this position in the selected city is $156,000 - $387,600 annually. Compensation may vary outside of this range depending on a number of factors, including a candidate's qualifications, skills, competencies and experience, and location. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units. Benefits may vary depending on the nature of employment and the country work location. Employees have day one access to medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short-term and long-term disability coverage, life insurance, wellbeing benefits, among others. Employees also receive 10 paid holidays per year, 10 paid sick days per year and 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure). The Company reserves the right to modify or change these benefits programs at any time, with or without notice.

For Los Angeles County (unincorporated) Candidates: Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state, and local laws including the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act. Our company believes that criminal history may have a direct, adverse and negative relationship on the following job duties, potentially resulting in the withdrawal of the conditional offer of employment: Interacting and occasionally having unsupervised contact with internal/external clients and/or colleagues; Appropriately handling and managing confidential information including proprietary and trade secret information and access to information technology systems, and Exercising sound judgment.

About Us

Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Lemon8, CapCut and Pico as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.

Why Join ByteDance

Inspiring creativity is at the core of ByteDance's mission. Our innovative products are built to help people authentically express themselves, discover and connect – and our global, diverse teams make that possible. Together, we create value for our communities, inspire creativity and enrich life - a mission we work towards every day. As ByteDancers, we strive to do great things with great people. We lead with curiosity, humility, and a desire to make impact in a rapidly growing tech company. By constantly iterating and fostering an "Always Day 1" mindset, we achieve meaningful breakthroughs for ourselves, our Company, and our users. When we create and grow together, the possibilities are limitless. Join us.

Diversity & Inclusion

ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

Reasonable Accommodation

ByteDance is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at

Vacancy posted 2 days ago
