LLM Algorithmic Optimization Engineer

$143.2k - $186k

NIO

JOB DESCRIPTION

About NIO

NIO is a pioneer and a leading company in the premium smart electric vehicle market. Founded in November 2014, NIO's mission is to shape a joyful lifestyle. NIO aims to build a community starting with smart electric vehicles to share joy and grow together with users.

NIO designs, develops, jointly manufactures and sells premium smart electric vehicles, driving innovations in next-generation technologies in autonomous driving, digital technologies, electric powertrains and batteries. NIO differentiates itself through its continuous technological breakthroughs and innovations, such as its industry-leading battery swapping technologies, Battery as a Service, or BaaS, as well as its proprietary autonomous driving technologies and Autonomous Driving as a Service, or ADaaS.

NIO's product portfolio consists of the ES8, a six-seater smart electric flagship SUV, the ES7 (or the EL7), a mid-large five-seater smart electric SUV, the ES6, a five-seater all-round smart electric SUV, the EC7, a five-seater smart electric flagship coupe SUV, the EC6, a five-seater smart electric coupe SUV, the ET7, a smart electric flagship sedan, and the ET5, a mid-size smart electric sedan.

About NIO

Roles and Responsibilities:

Conduct research and apply cutting-edge technologies to optimize Large Language Models (LLMs) and multimodal models, exploration and implementation of the core algorithmic optimization on heterogeneous architectures, for highly efficient LLM inference as well as deployment across distributed and heterogeneous hardware environments.
Focus on model optimization from a systems perspective, ensuring efficient deployment in the vehicle's digital cockpit and advanced driving (AD) domain.
Collaborate with cross-functional teams to ensure the integration of optimized models into real-world automotive applications.
Contribute to the entire pipeline from research, development, and testing, through to deployment on hardware, including GPUs and other distributed systems.

Qualifications:

Currently pursuing or completed a PhD or Master's degree in Computer Science, Computer Engineering, Applied Mathematics, Communications, Electronics, or a related field with relevant research projects and publications.
Strong understanding of GPU/NPU architecture and optimization techniques to identify and address bottlenecks.
Proficient in LLM and VLM architectures and algorithms, familiar with transformer based NLP / Audio / CV algorithms and technologies.
Proficiency in Python and experience with AI-related training and inference tools such as PyTorch.
Proficiency in C/C++ programming, familiar with at least one commonly used LLM inference engines.
Hands-on experience with model-serving frameworks such as Open Neural Network Exchange (ONNX).
Familiarity with debugging code in distributed computing environments.Experience in LLM inference optimization on resource constrained edge devices is a plus.

Preferred Qualification:

Ph.D. in computer science, artificial intelligence, or related fields; or Masters degree + 3 years of relevant industry experience
Experience in inference optimization techniques of deep learning models or libraries on hardware architectures;
Familiar with microkernel architecture, Linux kernel, hypervisor, middleware, and application framework
Those who have good publication records and have published high impact, innovative papers are preferred

Compensation:

The US base salary range for this full-time position is $143,200.00 - $186,000.00.

Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training.
Please note that the compensation details listed in US role postings reflect the base salary only. It does not include discretionary bonus, equity, or benefits.

Benefits:

Along with competitive pay, as a full-time NIO employee, you are eligible for the following benefits on the first day you join NIO:

CIGNA EPO, HSA, and Kaiser HMO medical plans with $0 for Employee Only Coverage.
Dental (including orthodontic coverage) and vision plan. Both provide options with a $0 paycheck contribution covering you and your eligible dependents.
Company Paid HSA (Health Savings Account) Contribution when enrolled in the High Deductible CIGNA medical plan
Healthcare and Dependent Care Flexible Spending Accounts (FSA)
401(k) with Brokerage Link option
Company paid Basic Life, AD&D, short-term and long-term disability insurance
Employee Assistance Program
Sick and Vacation time
13 Paid Holidays a year
Paid Parental Leave for first 8 weeks at full pay (eligible after 90 days of employment with NIO)
Paid Disability Leave for first 6 weeks at full pay (eligible after 90 days of employment with NIO)
Voluntary benefits including: Voluntary Life and AD&D options for you, your spouse/domestic partner and dependent child(ren), pet insurance
Commuter benefits
Mobile Cell Phone Credit
Healthjoy mobile benefit app supporting you and your dependents with benefit questions on the go & support with benefit billing questions
Free lunch and snacks
Onsite gym
Employee discounts and perks program

Apply

Vacancy posted 2 days ago

Similar jobs that could be interesting for youBased on the LLM Algorithmic Optimization Engineer in San Jose, CA vacancy

Senior DL Algorithms Engineer - Inference Performance
$152k - $241.5k
We are looking for a Senior DL Algorithms Engineer for LLM/Omni model optimizations! Seeking senior engineers who are mindful of performance analysis and optimization to help us squeeze every last clock cycle out of Deep Learning workloads. If you are unafraid to work...
Suggested
NVIDIA
Santa Clara, CA
1 day ago
Senior DL Inference Engineer - GPU Optimization Equity
NVIDIA is seeking a Senior DL Algorithms Engineer to optimize LLM/Omni models and enhance performance across its software stack. The ideal candidate will have a PhD and 3+ years of experience in deep learning, specifically in inference. This role involves profiling, analyzing...
Suggested
NVIDIA
Santa Clara, CA
1 day ago
Distributed Systems Engineer 5 - Decisioning & Optimization
$388k
...Distributed Systems Engineer 5 - Decisioning & Optimization New York, New York, United States of America • Seattle, Washington, United States of... ...and Platform teams to productionize models and deploy algorithms into the serving stack Build simulation and testing...
Suggested
Hourly pay
Full time
Immediate start
Flexible hours
Netflix
Los Gatos, CA
3 days ago
Senior LLM Performance Engineer - GPU Inference
$184k - $356.5k
...California is seeking a Senior Deep Learning Software Engineer focused on performance optimization of LLM models. You will analyze and enhance LLM inference... ...-collaborative teams to implement cutting-edge algorithms. The ideal candidate has extensive software development...
Suggested
Full time
NVIDIA Corporation
Santa Clara, CA
4 days ago
Senior Deep Learning Engineer - LLM Performance & Optimization
$184k - $287.5k
NVIDIA is looking for a Senior Deep Learning Software Engineer in Santa Clara, California. This role involves analyzing and improving LLM inference performance using NVIDIA GPUs. Candidates should have extensive software development experience, strong skills in Python/C++...
Suggested
NVIDIA
Santa Clara, CA
1 day ago
Software Engineer Project Intern(Video-on-Demand Algorithm) - 2026 Start (BS/MS)
$45 per hour
...Responsibilities The Video-on-Demand (VoD) Algorithm team is responsible for optimizing the app experience related to performance for TikTok users.... ...in Software Development, Computer Science, Computer Engineering, or a related technical discipline - Able to commit to...
Hourly pay
Temporary work
Summer work
Internship
Local area
Tik Tok
San Jose, CA
1 day ago
Staff Research Engineer, LLM - TikTok Ads Core ML, Ranking
$244.8k
...Staff Research Engineer, LLM - TikTok Ads Core ML, Ranking Location: San Jose Employment... ..., LLM-based Ranking Application, and Optimization of System Resource Allocation with ROI... ...with basic data structure and algorithms. Familiar with Linux development environment...
Temporary work
Local area
Tik Tok
San Jose, CA
1 day ago
Machine Learning Video Processing Algorithm Engineer
$147.4k - $272.1k
Machine Learning Video Processing Algorithm Engineer Imagine the impact you can make. A billion users will use the technologies you helped... ...and compression technologies. Responsibilities Develop and optimize machine learning based video processing algorithms that work...
Relocation
Apple Inc.
Sunnyvale, CA
1 day ago
Cellular RF Receiver Algorithms Systems Engineer
$181.1k - $318.4k
Cellular RF Receiver Algorithms Systems Engineer Apple’s RF System Engineering team is seeking a motivated and dedicated expert to define the next... ..., calibrate, and compensate for RF impairments to ensure optimal system performance. Conduct conceptual work on novel...
Immediate start
Worldwide
Relocation
Apple Inc.
Sunnyvale, CA
11 hours ago
Senior Edge-LLM Real-Time Inference Engineer
NVIDIA Gruppe is looking for a skilled engineer to join their TensorRT Edge-LLM team in Santa Clara, California. The role involves developing a state... ...art inference framework for large language models and optimizing it for real-time performance on embedded platforms. Candidates...
NVIDIA Gruppe
Santa Clara, CA
1 day ago
Senior Algorithms Engineer (all levels)
$180k - $250k
...industrial automation through advanced vision systems. Senior Algorithm Engineer You’ll be a cornerstone of our core team, building robot... ...the development lifecycle, from prototyping to deployment Optimize real-time performance for embedded systems Qualifications Technical...
Summer work
Work at office
Flexible hours
Summer Robotics
Campbell, CA
1 day ago
Video Codec Algorithm Modeling Engineer - Multimedia Lab
$212.8k
...billions of users. We are looking for strong video codec algorithm modeling engineers to design algorithms and C-model for advanced video encoding... ...Collaborate with the HW architecture and design team for optimal algorithm and architecture co-design Qualification Minimum...
Temporary work
Local area
ByteDance
San Jose, CA
1 day ago
Robotic Algorithm Engineer
...indoor service and industrial environments. As a Robotics Algorithm Engineer focused on Locomotion, you will work across simulation,... ...actuators Perform real-robot tuning, debugging, and performance optimization Work closely with firmware, motor control, and hardware...
Temporary work
Scylla Solutions LLC
Milpitas, CA
8 days ago
Systems Engineer I, Medical Devices & Algorithms
$73.9k - $116k
Abbott Laboratories in Sunnyvale is looking for a specialized systems engineer to work on medical devices. The role involves analyzing requirements, conducting tests, and facilitating algorithm transitions while complying with FDA regulations. The ideal candidate has a...
Abbott Laboratories
Sunnyvale, CA
2 days ago
Algorithm Engineer, Home and Audio Devices
$147.4k - $272.1k
Algorithm Engineer, Home and Audio Devices Cupertino, California, United States Hardware Apple is where individual imaginations gather together... ...metrics. Responsibilities Design, implement, and optimize sensing algorithms. Prototype and evaluate novel techniques...
Relocation
Apple Inc.
Cupertino, CA
2 days ago
Software Engineer, Embedded Systems
$197.9k - $270k
...experienced embedded software engineers like you joining the Roku OS... ...implementing new features and designing algorithms that deliver flawless video... ...all our users. This includes optimizing network interactions between... ...A familiarity with AI/ML and LLM technologies. ~ Experience...
Work at office
Local area
Remote work
Monday to Thursday
Flexible hours
Roku
San Jose, CA
5 days ago
Algorithm Engineer - Deep Learning
$136.3k - $231.7k
...into R&D. Our expert teams of physicists, engineers, data scientists and problem‑solvers... ...are looking for a full‑time Deep Learning Algorithm Engineer who is passionate about pioneering... ...or components to improve performance. Optimize DL or GenAI model throughput and cost....
Minimum wage
Full time
Work experience placement
Flexible hours
KLA
Milpitas, CA
4 days ago
HID Algorithms Engineer
$147.4k - $272.1k
...Join Apple's HID sensing and interaction algorithms team that develops advanced sensing algorithms... .... We are looking for an algorithm engineer who is passionate about bridging algorithm... ...deploying AI-powered developer tools (e.g., LLM-assisted coding, AI-based test generation...
Relocation
Apple Inc.
Cupertino, CA
2 days ago
Medical Imaging Engineer - QC Tool & Algorithms
...tool for a healthcare project. The role requires researching and evaluating algorithms to optimize performance and develop a unified testing platform. Candidates should possess an engineering degree and have entry-level to 3 years of experience, with familiarity in the...
Collabera
Santa Clara, CA
2 days ago
Senior High-Performance LLM Training Engineer
$184k - $287.5k
We are now looking for a Senior High-Performance LLM Training Engineer! NVIDIA is seeking experienced engineers specializing in performance analysis and optimization to improve the efficiency of LLM training workloads, which are shaping the world's most advanced computing...
Work experience placement
NVIDIA
Santa Clara, CA
1 day ago
System Analyst - Robotic Algorithms & Control Engineer
...delivered for millions of patients worldwide. We're a team of engineers, clinicians, and innovators united by one purpose: to make... ...are primarily responsible for the tele-robotic motion control algorithms, robotic manipulator control and safety algorithms for new surgical...
Work experience placement
Local area
Worldwide
Flexible hours
Intuitive
Sunnyvale, CA
5 days ago
Quantum-Inspired HPC Optimization Engineer
Schlumberger is seeking a High Performance Computing (HPC) Engineer in Sunnyvale, CA, to tackle complex discrete optimization problems. The ideal candidate will have a strong... .... Key responsibilities include designing algorithms for large-scale optimization, collaborating with...
Schlumberger
Sunnyvale, CA
3 days ago
Senior LLM Performance Engineer - GPU-Accel & Equity
NVIDIA Gruppe is seeking a Senior Deep Learning Software Engineer focused on LLM performance in Santa Clara. You will optimize GPU-accelerated software for large language model deployment, working on performance tuning for various models. The ideal candidate has over 8...
NVIDIA Gruppe
Santa Clara, CA
1 day ago
Principal High-Performance LLM Training Engineer
$272k - $431.25k
NVIDIA is seeking a Principal Engineer to drive the performance of large‑scale AI training and post‑training workloads across... ...frameworks, and performance engineering. You will analyze and optimize frontier‑scale LLM workloads running on thousands of GPUs, drive improvements...
NVIDIA Gruppe
Santa Clara, CA
1 day ago
Senior Software Engineer, AI Inference Systems
$184k - $287.5k
...skilled and motivated software engineers to join us and build AI... ...performance inference stacks, optimize GPU kernels and compilers, drive... ...plus; solid CS fundamentals: algorithms & data structures, operating... ...Experience building and optimizing LLM inference engines (e.g., vLLM...
NVIDIA Gruppe
Santa Clara, CA
1 day ago
Staff Machine Learning Engineer - Agentic Models, LLM, RAG, GenAI
...in complex distributed environments and optimize system performance. Contribute to the team... ...Knowledge and passion in machine learning algorithms, Gen AI, LLMs, and natural language... ...and inference optimization (vLLM, TensorRT‑LLM). Research experience in agentic AI or related...
Work experience placement
Nutanix
Santa Clara, CA
11 hours ago
Staff R&D Engineer-Algorithm
...Description Description We are looking for a Staff R&D Engineer - Algorithm with proven expertise in wearable IoT technologies and physiological... ...~ Lead the research, design, development, and optimization of algorithms for health and wellness monitoring using data...
Hourly pay
Full time
Temporary work
Internship
Local area
Flexible hours
Align Technology
San Jose, CA
18 days ago
Battery Management Systems Algorithm Engineer
...located in Cupertino, California seeks a Battery Management System Engineer to contribute to the development and qualification of advanced... ...-quality customer experiences through battery firmware and algorithm innovation. The ideal candidate has a BS degree in a related...
Apple Inc.
Cupertino, CA
11 hours ago
Senior Deep Learning Software Engineer, LLM Performance
...looking for a Senior Deep Learning Software Engineer, LLM Performance! NVIDIA is seeking an... ...inference. This role focuses on designing and optimizing GPU-accelerated software for large... ...compiler in inference, deployment, algorithms, or implementation. Prior experience with...
NVIDIA Gruppe
Santa Clara, CA
1 day ago
Software Engineer, GDC LLM Serving and GPU Performance
$207k - $300k
Software Engineer, GDC LLM Serving and GPU Performance Google Sunnyvale, CA, USA Qualifications... ...experience with data structures and algorithms. 3 years of experience in a technical... ...performance and flexibility. You could be optimizing KV cache transfer mechanisms,...
Full time
Google Inc.
Sunnyvale, CA
2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to LLM Algorithmic Optimization Engineer. Be the first to apply!