Sr. Multimodal Model Training and Inference Optimization Engineer

$244.8k

ByteDance

Responsibilitie

About the team The Vision-Applied Research team focuses on applied research in Generative AI and CV/Multimodal Understanding, and delivering intelligent solutions to TikTok, and Lemon8, enabling users to make and share creative content in a much easier way. The team has research groups dedicated to generative models for content creation, image generation, video synthesis, intelligent image/video editing, and virtual humans. We are seeking an experienced Multimodal Model Training and Inference Optimization Engineer with expertise in optimizing AI model training and inference, including distributed training/inference and acceleration. The ideal candidate will work at the cutting edge of AI efficiency, enhancing the performance, scalability, and deployment of large-scale generative AI models. Responsibilities - Optimize large model training pipelines to improve efficiency, speed, and scalability. - Develop and improve distributed training strategies such as data parallelism, model parallelism, pipeline parallelism and communication to accelerate model training. - Benchmark and profile deep learning models to identify performance bottlenecks and optimize computational resources.

Qualification

Minimum Qualifications: - M.S or PhD in Computer Science, Electrical Engineering, Artificial Intelligence, or a related field. - 3 years+ experience in AI model training optimization. - Strong software engineering skills, including proficiency in Python, C++, and CUDA. - Strong proficiency in deep learning frameworks such as PyTorch, Megatron and Deepspeed. - Experience with distributed training techniques such as data parallelism, model parallelism, and pipeline parallelism. - Knowledge of transformers and diffusion models. Preferred Qualifications: - Candidates with publications at conferences such as MLSys, NeurIPS, ICLR, or ICML are preferred. - Strong communication and teamwork skills. - Self-motivated and strong problem-solving skills. - Ability to work collaboratively in multi-functional teams. - Experienced in implementing and optimizing complex and performance-critical systems.

Job Information

[For Pay Transparency]Compensation Description (Annually)

The base salary range for this position in the selected city is $244800 - $450000 annually.

Compensation may vary outside of this range depending on a number of factors, including a candidate's qualifications, skills, competencies and experience, and location. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units.

Benefits may vary depending on the nature of employment and the country work location. Employees have day one access to medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short-term and long-term disability coverage, life insurance, wellbeing benefits, among others. Employees also receive 10 paid holidays per year, 10 paid sick days per year and 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure).

The Company reserves the right to modify or change these benefits programs at any time, with or without notice.

For Los Angeles County (unincorporated) Candidates:

Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state, and local laws including the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act. Our company believes that criminal history may have a direct, adverse and negative relationship on the following job duties, potentially resulting in the withdrawal of the conditional offer of employment:

1. Interacting and occasionally having unsupervised contact with internal/external clients and/or colleagues;

2. Appropriately handling and managing confidential information including proprietary and trade secret information and access to information technology systems; and

3. Exercising sound judgment.

About U

Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Lemon8, CapCut and Pico as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.

Why Join ByteDance

Inspiring creativity is at the core of ByteDance's mission. Our innovative products are built to help people authentically express themselves, discover and connect - and our global, diverse teams make that possible. Together, we create value for our communities, inspire creativity and enrich life - a mission we work towards every day.

As ByteDancers, we strive to do great things with great people. We lead with curiosity, humility, and a desire to make impact in a rapidly growing tech company. By constantly iterating and fostering an "Always Day 1" mindset, we achieve meaningful breakthroughs for ourselves, our Company, and our users. When we create and grow together, the possibilities are limitless. Join us.

Diversity & Inclusion

ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

Reasonable Accommodation

ByteDance is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at

Apply

Vacancy posted 4 days ago

Similar jobs that could be interesting for youBased on the Sr. Multimodal Model Training and Inference Optimization Engineer in San Jose, CA vacancy

Senior DL Engineer: Edge Model Optimization & Inference
...a skilled professional to enhance the performance of large-scale models through advanced optimization techniques in Santa Clara, California. Candidates should have a strong background in DL model training and deployment, ideally with a PhD or equivalent experience in Computer...
Senior
Training
NVIDIA Gruppe
Santa Clara, CA
1 day ago
Senior Deep Learning Engineer - Model Evaluation & AI Systems
$224k - $356.5k
.../ Principal Deep Learning Engineer — Model Evaluation & AI Systems, you... ..., agents, and vision/multimodal models. Build and expand NeMo... ...practices. Work alongside model training, inference, and product divisions to... ...that inform release and optimization decisions. What we need to...
Senior
Training
NVIDIA Gruppe
Santa Clara, CA
1 day ago
Senior DL Software Engineer, Model Optimization and Edge Deployment - Autonomous Vehicles
$184k - $287.5k
Responsibilities Develop state‑of‑the‑art model optimization techniques—speculative... ...strategies for inference, such as automated model sharding... ...Computer Science, Computer Engineering, or a related technical... .... A proven track record of training, deploying, or optimizing large...
Senior
Training
NVIDIA Gruppe
Santa Clara, CA
1 day ago
Senior Datacenter Performance Model Engineer
This software engineering role involves developing datacenter‑scale performance‑modeling and prediction tools for AI researchers running AI workloads in GPU clusters... ...as PyTorch and TensorFlow, distributed training and inference. Knowledge of GPU cluster job scheduling (Slurm...
Senior
Training
NVIDIA Corporation
Santa Clara, CA
3 days ago
Senior Performance Engineer, Inference
...to deliver industry-leading training and inference speeds and empowers machine... ...customers include top model labs, global enterprises, and... ...hiring a Senior Performance Engineer to join our Product team. You... ...-LLM), GPU kernel-level optimization toolchains (CUDA, Triton),...
Senior
Training
Contract work
Shift work
CEREBRAS SYSTEMS INC.
Sunnyvale, CA
3 days ago
Senior DL Inference Engineer - GPU Optimization Equity
NVIDIA is seeking a Senior DL Algorithms Engineer to optimize LLM/Omni models and enhance performance across its software stack. The ideal candidate... ...years of experience in deep learning, specifically in inference. This role involves profiling, analyzing bottlenecks, and...
Senior
NVIDIA
Santa Clara, CA
1 day ago
Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform)
$229.9k - $262.4k
...Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform) Overview: At Capital One, we are... ...to millions of customers. Our AI models and platforms empower teams across... ...components including foundation model training, large language model inference,...
Senior
Training
Full time
Part time
Local area
Capital One Financial Corp
San Jose, CA
1 day ago
LLM Algorithmic Optimization Engineer
$143.2k - $186k
...cutting-edge technologies to optimize Large Language Models (LLMs) and multimodal models, exploration and... ...highly efficient LLM inference as well as deployment... ...Science, Computer Engineering, Applied Mathematics, Communications... ...with AI-related training and inference tools...
Training
Full time
Temporary work
Flexible hours
NIO
San Jose, CA
2 days ago
Senior AI Inference Systems Engineer: GPU-Optimized, Cloud
$184k - $356.5k
NVIDIA Gruppe is looking for skilled software engineers to develop AI inference systems that operate with high efficiency. The role involves architecting high-performance inference frameworks and optimizing GPU processes. Ideal candidates should have extensive programming...
Senior
NVIDIA Gruppe
Santa Clara, CA
1 day ago
Senior Software Engineer, AI Model Lifecycle
$172.43k - $230.95k
...Senior Software Engineer For The Ai Model Lifecycle Team Crusoe... ...maintain end-to-end training pipelines for Large... ...pipelines (e.g., preference optimization, policy optimization... ...Language Models, Multimodal). ~ Hands-on... ...on GPU systems and inference frameworks. Benefits...
Senior
Training
Temporary work
Crusoe
Sunnyvale, CA
6 days ago
ML Engineer - Inference & Model Deployment
...building a 100x better job search engine: fast, comprehensive, honest... ...us turn powerful AI and ML models into fast, reliable... ...infrastructure: deploying models, optimizing inference latency and throughput,... ...Deploy and integrate researcher-trained model checkpoints into our...
Training
Relocation package
HiringCafe
Cupertino, CA
3 days ago
Sr. Thermal Engineer, Liquid Cooling Systems
$2,000 per month
...the world’s first AI inference system purpose-built for... ...time video generation models and extremely deep &... ...and staffed by leading engineers, Etched is redefining... ...and lead our efforts in optimizing thermal management for... ...using more FLOPs to train and run models, and the...
Senior
Training
Work at office
Relocation package
Etched
San Jose, CA
9 days ago
Senior DL Algorithms Engineer - Inference Performance
$184k - $287.5k
Senior DL Algorithms Engineer - Inference Performance page is loaded## Senior DL Algorithms Engineer... ...mindful of performance analysis and optimization to help us squeeze every last clock... ...be doing:*** Implement language and multimodal model inference as part of NVIDIA...
Senior
NVIDIA Corporation
Santa Clara, CA
4 days ago
Senior Data Systems Engineer for Foundation Model Training
A leading technology company located in Cupertino, California, is seeking a Senior Research Engineer focused on training data infrastructure for advanced AI models. The ideal candidate will possess strong skills in distributed systems and a deep understanding of Machine...
Senior
Training
Apple Inc.
Cupertino, CA
2 days ago
AIML Researcher/Engineer - Foundation Model Post-Training
...knit group of researchers and engineers responsible for building... ...large scale frontier foundation models at Apple. We believe the... ...products. You will tackle core training challenges in instruction... ...novel algorithms for preference optimization, model steering, and safety....
Training
Apple Inc.
Cupertino, CA
1 day ago
Research Scientist - Vision Language Model
...Institute of Foundation Models We are a dedicated... ...foundation model training, alongside world-class... ...data scientists, and engineers, tackling the most fundamental... ...state-of-the-art multimodal foundation models... ...model modularity, and inference optimization. Build and improve...
Training
Institute of Foundation Models
Sunnyvale, CA
23 days ago
Sr. System Engineer - Network
$137k - $156k
...passionate, and committed engineers, technologists, and... ...Summary: As a Sr. System Engineer, you'... ...delivering impactful training sessions to customers... ...and testing, providing optimized benchmarks for HPC/AI... ...with MLPerf Training/Inference benchmark, LLM, HPL-AI...
Senior
Training
Worldwide
Supermicro
San Jose, CA
2 days ago
AI/ML Technical Leader - Language Model Inference & AI Ops
$212.3k - $275.8k
...collaborate with product and engineering teams to deploy reliable,... ...and observable AI services, optimizing inference performance from CPU and... ...deployment automation, and model/service observability. This... ...triage workflows. Support training and fine-tuning workflows for...
Training
Full time
Temporary work
Local area
Flexible hours
3 days per week
Cisco
San Jose, CA
4 days ago
Senior ML Engineer - Model Compression
$128.7k - $261.3k
...repeatable, high-velocity model deployments through... ...deployment and infra engineers to ship numerically robust... ...focused onmodel optimization and deployment, with significant... .../ efficient inference or relevant experience... ...toolingintegrated into training or evaluation pipelines...
Senior
Training
Local area
Remote work
Work from home
Relocation package
Flexible hours
General Motors
Sunnyvale, CA
5 days ago
Sr. Business Model Strategy Product Manager
...Assistance, Inc., (SGA), is searching for a Sr. Business Model Strategy Product Manager for a... ...We do this by driving initiatives that optimize new and existing offerings via product... ...sales, and develop sales plays, and training as appropriate. Complete post-mortems...
Senior
Training
Contract work
2 days per week
3 days per week
SGA
San Jose, CA
1 day ago
Senior AI Systems Performance Engineer
...AI platform, from chip to model, optimized for enterprise and government... ...and driven ML performance engineer to optimize and scale... ...performance for large-scale AI inference. Responsibilities... ...on experience with LLM or multimodal model training and inference. Background...
Senior
Training
Full time
Temporary work
Local area
Flexible hours
SambaNova Systems
San Jose, CA
3 days ago
Senior Research Engineer, Foundation Model Training Infrastructure
$224k - $356.5k
...searching for a senior or principal engineer who specializes in building... ...for large‑scale foundation model training in the Generalist Embodied... ...influential works on multimodal foundation models, large-scale... ...foundation models for robotics. Optimize GPU and cluster utilization...
Senior
Training
Full time
NVIDIA Gruppe
Santa Clara, CA
1 day ago
Sr. Product Manager, Business Model Strategy / Monetization
$148.1k - $282.1k
...business-case for applying a SaaS business model to our perpetual products - a move that... ...We do this by driving initiatives that optimize new and existing offerings via product configuration... ...direct sales, develop sales plays and training as appropriate. Complete post-mortems...
Senior
Training
Temporary work
Local area
Worldwide
Adobe
San Jose, CA
2 days ago
Sr. Engineer, Ethernet IP
$100k
...unify innovations in software models, compilers, platforms,... ...designs to architect, enable, and optimize the firmware that powers next... ...stack. You'd rather debug a link-training state machine than a high-... ...software teams to solve frontier engineering challenges. How to help...
Senior
Training
Permanent employment
Tenstorrent
Santa Clara, CA
1 day ago
Principal AI Inference Systems Engineer
...for a Senior Staff AI Infra Engineer who is passionate about... ...of hardware and software to optimize performance for next-generation... ..., including Large Language Models (LLMs) and Agentic AI... ...Optimize and accelerate LLM training and inference on AMD GPUs, improving kernel...
Training
Advanced Micro Devices , Inc.
Santa Clara, CA
2 days ago
Sr. Technologist - AI Systems & Performance Lab
...an experienced and visionary Sr. Technologist to join our... ...success metrics, and execution model aligned with organizational... ...Drive end‑to‑end analysis of AI training and inference workloads, spanning models,... ..., highly skilled team of engineers and researchers. Drive thought...
Senior
Training
Temporary work
Remote work
Flexible hours
Shift work
Sandisk
Milpitas, CA
17 days ago
Sr. Engineer, MEMS Characterization and Metro
$138.8k - $190.85k
...Senior MEMS Characterization and Metrology Engineer , you will use your technical and... ...qualification and release-to-production. Optimization of system performance focusing on... ...as location, experience, education, and training. In addition to base salary, this...
Senior
Training
SiTime Corporation
Santa Clara, CA
more than 2 months ago
Senior Perception Learning Engineer
$190k - $235k
...Senior Perception Learning Engineer Sunnyvale, CA... ...object detection, world modeling, and multi-sensor... ...You will design and optimize deep learning models for... ...scalable pipelines for training, evaluation, and deployment... ...infrastructure, and inference frameworks to accelerate...
Senior
Training
Local area
Apptronik
Sunnyvale, CA
20 days ago
Senior Perception Learning Engineer - SLAM
$190k - $235k
...Senior Learning Perception Engineer - Slam Sunnyvale, C... ...odometry, world modeling, and learning-based perception... ...You will design and optimize deep learning models... ...pipelines for training, evaluation, and deployment... ...training infrastructure, and inference frameworks to...
Senior
Training
Local area
Apptronik
Sunnyvale, CA
1 day ago
Senior Software Development Engineer - SGLang and Inference Stack
...will play a pivotal role in optimizing and developing deep learning... ..., accelerating deep learning models, and enabling RL training and SOTA LLM and Multimodal inference at scale across multi-GPU and... ...THE PERSON: Skilled engineer with strong technical and analytical...
Senior
Training
Advanced Micro Devices , Inc.
Santa Clara, CA
4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Sr. Multimodal Model Training and Inference Optimization Engineer. Be the first to apply!