Machine Learning Engineer - Inference
$156k - $387.6kByteDance
Machine Learning Engineer - Inference
Location: San Jose
Team: Technology
Employment Type: Regular
Job Code: A143983
Responsibilities
The mission of our AML team is to push the next-generation AI infrastructure and recommendation platform for the ads ranking, search ranking, live & ecom ranking in our company. We also drive substantial impact on core businesses of the company. Currently, we are looking for Machine Learning Engineer - Inference to join our team to support and advance that mission. Responsibilities:
- Responsible for the design and implementation of distributed inference infrastructure for feeds, ads and search ranking models.
- Responsible for building monitoring/managing tools to oversee the reliability and scalability of online inference servers
- Responsible for triaging system inefficiency and bottlenecks and improving system performance
- Responsible for building tools to analyze bottlenecks and sources of instability and then design and implement solutions
- Responsible for collaboration with product teams and providing general solutions to meet their requirements
Qualifications
Minimum Qualifications:
- At least 3 years of experience in developing and deploying large-scale systems.
- Experience contributing to an open sourced machine learning framework (tensorflow / jax / pytorch / torchscript / mxnet / tensorrt).
- Strong background in one of the following fields: Hardware-Software Co-Design, High Performance Computing, ML Hardware Acceleration (e.g., GPU/RDMA) or ML for Systems.
For Pay TransparencyCompensation Description (Annually)
The base salary range for this position in the selected city is $156000 - $387600 annually. Compensation may vary outside of this range depending on a number of factors, including a candidate's qualifications, skills, competencies and experience, and location. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units. Benefits may vary depending on the nature of employment and the country work location. Employees have day one access to medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short-term and long-term disability coverage, life insurance, wellbeing benefits, among others. Employees also receive 10 paid holidays per year, 10 paid sick days per year and 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure). The Company reserves the right to modify or change these benefits programs at any time, with or without notice.
For Los Angeles County (unincorporated) Candidates: Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state, and local laws including the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act. Our company believes that criminal history may have a direct, adverse and negative relationship on the following job duties, potentially resulting in the withdrawal of the conditional offer of employment: Interacting and occasionally having unsupervised contact with internal/external clients and/or colleagues; Appropriately handling and managing confidential information including proprietary and trade secret information and access to information technology systems, and Exercising sound judgment.
About Us
Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Lemon8, CapCut and Pico as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.
Why Join ByteDance
Inspiring creativity is at the core of ByteDance's mission. Our innovative products are built to help people authentically express themselves, discover and connect – and our global, diverse teams make that possible. Together, we create value for our communities, inspire creativity and enrich life - a mission we work towards every day. As ByteDancers, we strive to do great things with great people. We lead with curiosity, humility, and a desire to make impact in a rapidly growing tech company. By constantly iterating and fostering an "Always Day 1" mindset, we achieve meaningful breakthroughs for ourselves, our Company, and our users. When we create and grow together, the possibilities are limitless. Join us.
Diversity & Inclusion
ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.
Reasonable Accommodation
ByteDance is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at
$147.4k - $272.1k
...Machine Learning Engineer, Proactive - Large Language Models & Generative AI Inference The Intelligence Platform team empowers clients across Apple's operating systems with a high quality user-centric search and data platform, and the primary inference platform that...SuggestedRelocation$100k
...future of entertainment around the world. Machine Learning/Artificial Intelligence is powering... ..., we are hiring for a Machine Learning Engineer to join our team to contribute to the team... ...models for efficient and scalable inference. -Develop and maintain online inference...SuggestedHourly payFull timeImmediate startFlexible hours- ...allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML... ...computation. About The Role The Inference ML Engineering team at Cerebras Systems is dedicated to enabling our...Suggested
$124k - $195.5k
NVIDIA Corporation is seeking a Machine Learning Applications and Compiler Engineer for New College Grad 2026 in Santa Clara, CA. You will focus on developing algorithms for inference and compiler stack optimizations, working at the intersection of deep learning and large...Suggested- A leading technology company is hiring a Machine Learning Systems Engineer in Cupertino, California. You will collaborate with Siri modeling teams to optimize model training and inference on Apple's custom Silicon. The ideal candidate has strong experience in ML models...Suggested
- ...automotive company is seeking a Staff ML Infrastructure Engineer to build robust compute platforms for machine learning workflows in Sunnyvale, CA. The role involves... ...skills in Go, Python or C++, and expertise in ML inference. The position offers a hybrid work model and...
$120k - $235k
...most innovative companies to build strong engineering teams ready for what’s next. Software... ...out many otherwise viable architectures. Inference runs during a live assessment, which... ...salary, target bonus, and equity. Want to learn more about HackerRank? Check out...Shift work$172.5k - $306.63k
...Machine Learning Engineer - Brand Intelligence Predict The Opportunity Join us at Adobe as a Machine Learning Engineer (MLE 50) on the... ...from proof-of-concept to production. Develop complex inference and reasoning harnesses on top of frontier LLMs, agentic flows...Temporary workLocal areaWorldwide$147.4k - $272.1k
...Machine Learning Engineer- GenAI Imagine what you could do here. At Apple, we believe new insights have a way of becoming excellent products... ...(DAG), Docker, Conductor, Ray for LLM training and inference at scale is a plus. Hands-on experience with LangChain...Relocation- ...We are seeking a highly skilled Machine Learning Engineer to design and build a low-latency query understanding and intelligent routing system... ...-grade ML systems , with a focus on sub-second inference, CPU-based execution, and scalable domain evolution ....Local area
$181.1k - $272.1k
...Machine Learning Engineer - LLM Imagine what you could do here. At Apple, new ideas have a way of becoming extraordinary products, services... ...Airflow (DAG), Docker, Conductor, Ray for LLM training and inference at scale. Hands-on experience with LangChain and...Relocation- ...Insight Global is seeking a team of experienced, driven Machine Learning Engineer to join an established health technology company sitting in... ...workflow-from data collection and model training to deployment, inference, optimization, and evaluation. We are a company...Permanent employmentFull time
$165.2k - $223.6k
...Description The Product: Amazon's Machine Learning accelerators are at the forefront of our... ...chip delivers best-in-class ML inference performance at the lowest cost in cloud... ...multiple disciplines including silicon engineering, hardware design and verification, software...InternshipLocal areaFlexible hours$147.4k - $272.1k
...Machine Learning Engineer: Multimodal Sensor Fusion At Apple, individual creativity converges around shared values that drive innovation.... ...problems, architecting model efficiency strategies for on-device inference, and ensuring algorithms perform flawlessly in production...Relocation- ...Machine Learning Engineer UnitX builds the world's leading physical AI systems to automate repetitive visual tasks in factories. UnitX is... ..., or a relevant technical field. Experience with model inference optimization Experience with non-ML CV algorithms...
$181.1k - $318.4k
...Machine Learning Compiler Engineer At Apple, we're on the cutting edge of delivering transformative experiences through Artificial Intelligence... ...Neural Engine Accelerator, optimizing it for deep learning inference with a focus on performance, scalability, and power...Relocation$156k - $316.8k
...Machine Learning Engineer, Multimodal - Intelligent Integrity Location: San Jose Employment Type: Regular Job Code: A193136A Responsibilities... ...-on experience in large model training, fine-tuning and inference deployment; excellent engineering capabilities, master...Temporary workLocal area$172.5k - $306.63k
...Machine Learning Engineer & Architect Adobe Journey Optimizer B2B is redefining how enterprises engage buying groups through AI-powered customer... ...(RAG), semantic embeddings, agentic AI workflows, and ML inference systems for personalization or recommendation use cases...Temporary workLocal areaWorldwide$181.1k - $318.4k
...On-Device Machine Learning Engineer We're starting to see the incredible potential of multimodal foundation and large language models, and... ...of models running on Apple devices. Optimize on-device inference latency and efficiency of CV/ML models. Minimum Qualifications...Relocation$246.5k
...large scale and with low latency. We use Machine Learning, Reinforcement Learning, AI, Control... ...of this is our Machine Learning and Inference Platform that powers the entire landscape... ...- someone excited to mentor engineers, innovate at scale, and shape the future...Work at officeLocal areaRemote workMonday to ThursdayFlexible hours$150k
...researchers, data scientists, and engineers, tackling the most fundamental... ...performance computing in deep learning, driving impactful discoveries... ...performance for the machine learning software stacks, especially at training and inference, and support the team to develop...Work experience placementVisa sponsorship$187.74k - $272.1k
...in Cupertino, California and various unanticipated locations throughout the USA. Design, implement, extend, and refactor machine learning inference system frameworks to streamline the deployment, execution, and evaluation of ML models. Design and implement lowlatency inference...Relocation$126.8k - $190.9k
Sunnyvale, California, United States Machine Learning and AI At Apple, we're on the cutting edge... ...team! As a Machine Learning Compiler Engineer on the Apple Neural Engine (ANE) team,... ...Accelerator, optimizing it for deep learning inference with a focus on performance,...Relocation package- A leading tech company is seeking a Machine Learning Engineer in Cupertino, California. In this role, you will design, implement, and optimize machine learning frameworks, develop text input features, and collaborate with data scientists and software developers. Required...
$151.8k - $265.35k
.... We're seeking an outstanding ML infra engineer with deep expertise in building large scale... ...You will profile GPU utilization, trace inference and training runs and help craft... ...AI like LLMs. ~ Strong Python and deep learning engineering skills, paired with experience...Temporary workLocal areaWorldwide- ...and customer experiences. Join the Ai Data Platform Applied Machine Learning team to pioneer enterprise solutions where generative AI meets... ...in AI Safety, privacy-preserving generations, efficient inference, and multimodal integration, while enabling teams to build on...
$120.7k - $228.6k
...Adobe Firefly’s Generative AI Services team is seeking a Machine Learning Engineer for our GenAI Services area. In this high-impact role,... ...Stock, and Premiere. You will design and develop efficient inference pipelines, optimize models for latency and through at inference...Temporary workLocal area- ...organizations that keep the world running. Our Team's Vision: Our Engineering team is shaping the future of cybersecurity. We thrive on... ...(e.g., vLLM, TensorRT-LLM) or managing proprietary model inference endpoints. This position involves access to software/technology...Immediate start
$156k - $387.6k
...Machine Learning Engineer, AI Coding Tools Location: San Jose Team: Technology Employment Type: Regular Job Code: A100294 Responsibilities... ...cutting-edge techniques in large model optimization and inference acceleration. Qualifications Minimum...Temporary workLocal area$224k - $356.5k
...We are looking for outstanding Machine Learning Engineers to join our Physical AI teams! As the pioneers of the GPU—the visual cortex of modern... ...computer/GPU architecture to improve the performance during inference/training. Familiarity with simulation platforms and...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Machine Learning Engineer - Inference. Be the first to apply!
- machine learning ai engineer San Jose, CA
- machine learning engineer San Jose, CA
- machine learning software engineer San Jose, CA
- ai ml engineer San Jose, CA
- senior ml engineer San Jose, CA
- graduate machine learning engineer San Jose, CA
- computer vision machine learning engineer San Jose, CA
- machine learning research scientist San Jose, CA
- machine learning part time San Jose, CA
- artificial intelligence - machine learning intern San Jose, CA

