Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Sr. Multimodal Model Training and Inference Optimization Engineer

$244.8k

ByteDance

Responsibilitie

About the team The Vision-Applied Research team focuses on applied research in Generative AI and CV/Multimodal Understanding, and delivering intelligent solutions to TikTok, and Lemon8, enabling users to make and share creative content in a much easier way. The team has research groups dedicated to generative models for content creation, image generation, video synthesis, intelligent image/video editing, and virtual humans. We are seeking an experienced Multimodal Model Training and Inference Optimization Engineer with expertise in optimizing AI model training and inference, including distributed training/inference and acceleration. The ideal candidate will work at the cutting edge of AI efficiency, enhancing the performance, scalability, and deployment of large-scale generative AI models. Responsibilities - Optimize large model training pipelines to improve efficiency, speed, and scalability. - Develop and improve distributed training strategies such as data parallelism, model parallelism, pipeline parallelism and communication to accelerate model training. - Benchmark and profile deep learning models to identify performance bottlenecks and optimize computational resources.

Qualification

Minimum Qualifications: - M.S or PhD in Computer Science, Electrical Engineering, Artificial Intelligence, or a related field. - 3 years+ experience in AI model training optimization. - Strong software engineering skills, including proficiency in Python, C++, and CUDA. - Strong proficiency in deep learning frameworks such as PyTorch, Megatron and Deepspeed. - Experience with distributed training techniques such as data parallelism, model parallelism, and pipeline parallelism. - Knowledge of transformers and diffusion models. Preferred Qualifications: - Candidates with publications at conferences such as MLSys, NeurIPS, ICLR, or ICML are preferred. - Strong communication and teamwork skills. - Self-motivated and strong problem-solving skills. - Ability to work collaboratively in multi-functional teams. - Experienced in implementing and optimizing complex and performance-critical systems.

Job Information

[For Pay Transparency]Compensation Description (Annually)

The base salary range for this position in the selected city is $244800 - $450000 annually.


Compensation may vary outside of this range depending on a number of factors, including a candidate's qualifications, skills, competencies and experience, and location. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units.


Benefits may vary depending on the nature of employment and the country work location. Employees have day one access to medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short-term and long-term disability coverage, life insurance, wellbeing benefits, among others. Employees also receive 10 paid holidays per year, 10 paid sick days per year and 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure).


The Company reserves the right to modify or change these benefits programs at any time, with or without notice.


For Los Angeles County (unincorporated) Candidates:


Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state, and local laws including the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act. Our company believes that criminal history may have a direct, adverse and negative relationship on the following job duties, potentially resulting in the withdrawal of the conditional offer of employment:


1. Interacting and occasionally having unsupervised contact with internal/external clients and/or colleagues;


2. Appropriately handling and managing confidential information including proprietary and trade secret information and access to information technology systems; and


3. Exercising sound judgment.

About U

Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Lemon8, CapCut and Pico as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.

Why Join ByteDance

Inspiring creativity is at the core of ByteDance's mission. Our innovative products are built to help people authentically express themselves, discover and connect - and our global, diverse teams make that possible. Together, we create value for our communities, inspire creativity and enrich life - a mission we work towards every day.


As ByteDancers, we strive to do great things with great people. We lead with curiosity, humility, and a desire to make impact in a rapidly growing tech company. By constantly iterating and fostering an "Always Day 1" mindset, we achieve meaningful breakthroughs for ourselves, our Company, and our users. When we create and grow together, the possibilities are limitless. Join us.


Diversity & Inclusion


ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

Reasonable Accommodation

ByteDance is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at
Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Sr. Multimodal Model Training and Inference Optimization Engineer in San Jose, CA vacancy
  • $224k - $356.5k

     .../ Principal Deep Learning Engineer — Model Evaluation & AI Systems, you...  ..., agents, and vision/multimodal models. Build and expand...  .... Work alongside model training, inference, and product divisions to...  ...signals that inform release and optimization decisions. What we need... 
    Senior
    Training

    NVIDIA

    Santa Clara, CA
    8 hours ago
  • $184k - $287.5k

    Senior DL Software Engineer, Model Optimization and Edge Deployment - Autonomous...  ...gap between cutting-edge multimodal architectures and real-time...  ...optimization strategies for inference, including automated model...  ...A proven track record of training, deploying, or optimizing... 
    Senior
    Training

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  • $193.3k - $261.5k

     ...JAX enabling unparalleled ML inference and training performance. The...  ...of running a wide range of models and supporting novel architecture...  ...hardware-software boundary, our engineers build systematic...  ...compute unit is fine tuned for optimal performance for our customers... 
    Senior
    Training
    Work experience placement
    Internship
    Local area
    Flexible hours

    Amazon

    Cupertino, CA
    1 day ago
  • $45 per hour

     ...& audio, music processing, and multimodal deep learning. We are looking for...  ...and efficiency of large-scale AI models across training, inference, and deployment. This is an...  ...Responsibilities: - Support research and engineering efforts to optimize deep learning models for speed,... 
    Training
    Hourly pay
    Full time
    Summer work
    Internship
    Local area

    Tik Tok

    San Jose, CA
    1 day ago
  •  ...to deliver industry‑leading training and inference speeds and empowers machine...  ...customers include top model labs, global enterprises, and...  ...hiring a Senior Performance Engineer to join our Product team. You...  ...‑LLM), GPU kernel‑level optimization toolchains (CUDA, Triton),... 
    Senior
    Training
    Contract work
    Shift work

    Cerebras

    Sunnyvale, CA
    3 days ago
  • $184k - $287.5k

     ...is built. We are seeking a senior vision language model engineer to design and build agentic data and training workflows for Autonomous Vehicles, Robotics, and...  ...velocity. Build, curate, and maintain high‑quality multimodal datasets (e.g., video, sensor, language/action... 
    Senior
    Training

    NVIDIA

    Santa Clara, CA
    2 days ago
  • NVIDIA is seeking a Senior DL Algorithms Engineer to optimize LLM/Omni models and enhance performance across its software stack. The ideal candidate...  ...years of experience in deep learning, specifically in inference. This role involves profiling, analyzing bottlenecks, and... 
    Senior

    NVIDIA

    Santa Clara, CA
    5 days ago
  • $143.2k - $186k

     ...cutting-edge technologies to optimize Large Language Models (LLMs) and multimodal models, exploration and...  ...highly efficient LLM inference as well as deployment...  ...Science, Computer Engineering, Applied Mathematics, Communications...  ...with AI-related training and inference tools... 
    Training
    Full time
    Temporary work
    Flexible hours

    NIO

    San Jose, CA
    1 day ago
  • $152k - $241.5k

     ...multifaceted software team! This software engineering role involves developing datacenter scale performance modeling and predictions tools for AI researchers running...  ...like PyTorch and TensorFlow, distributed training and inference. Knowledge of GPU cluster job scheduling (... 
    Senior
    Training
    Full time

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $172.43k - $230.95k

     ...Senior Software Engineer For The Ai Model Lifecycle Team Crusoe...  ...maintain end-to-end training pipelines for Large...  ...pipelines (e.g., preference optimization, policy optimization...  ...Language Models, Multimodal). ~ Hands-on...  ...on GPU systems and inference frameworks. Benefits... 
    Senior
    Training
    Temporary work

    Crusoe

    Sunnyvale, CA
    8 hours ago
  • $181.1k - $318.4k

     ...Sr./Staff ML Infrastructure Engineer, Compute (TPU Scheduling) - Foundation Model Work Locations (2) Submit Resume Apple is where...  ...execution of large-scale training and inference jobs. This role spans...  ...engineering, and performance optimization. Responsibilities... 
    Senior
    Training
    Relocation

    Apple

    Santa Clara, CA
    1 day ago
  • $181.1k - $318.4k

     ...! Description As a Senior/Staff Engineer on the Foundation Model Compute Infrastructure team, you...  ...efficient execution of large‑scale training and inference jobs. This role spans scheduling...  ...reliability engineering, and performance optimization. Responsibilities Design and... 
    Senior
    Training
    Relocation

    Apple Inc.

    Santa Clara, CA
    5 days ago
  • $184k - $287.5k

    Senior DL Algorithms Engineer - Inference Performance page is loaded## Senior DL Algorithms Engineer...  ...mindful of performance analysis and optimization to help us squeeze every last clock...  ...be doing:*** Implement language and multimodal model inference as part of NVIDIA... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  • $152k - $241.5k

     ...seeking top-tier AI Compiler Engineers to drive innovation within...  ...and computational graph optimizations for next-generation NVIDIA...  ...problems for AI workloads (both inference and training) and successfully...  ...understanding of Large Language Model (LLM) inference and its profound... 
    Training

    NVIDIA

    Santa Clara, CA
    8 hours ago
  • A leading technology company located in Cupertino, California, is seeking a Senior Research Engineer focused on training data infrastructure for advanced AI models. The ideal candidate will possess strong skills in distributed systems and a deep understanding of Machine... 
    Senior
    Training

    Apple Inc.

    Cupertino, CA
    1 day ago
  • $152k - $241.5k

     ...looking for versatile software engineers for our XLA team. NVIDIA is...  ...role, develop compiler optimization algorithms for deep learning...  .... You will optimize inference and training performance for the JAX framework...  ...OpenAI Triton, deep learning models and algorithms, and deep... 
    Senior
    Training

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $190k - $235k

     ...Senior Perception Learning Engineer, you will lead...  ...object detection, world modeling, and multi-sensor fusion...  ...You will design and optimize deep learning models for...  ...pipelines for training, evaluation, and deployment...  ...training infrastructure, and inference frameworks to... 
    Senior
    Training
    Local area

    Synthesia

    Sunnyvale, CA
    8 hours ago
  •  ...Institute of Foundation Models We are a dedicated...  ...foundation model training, alongside world-class...  ...data scientists, and engineers, tackling the most fundamental...  ...state-of-the-art multimodal foundation models...  ...model modularity, and inference optimization. Build and improve... 
    Training

    Institute of Foundation Models

    Sunnyvale, CA
    2 days ago
  • $184k - $287.5k

     ...exceptional Senior Perception Engineer to help design and...  ...3D perception models using multi-camera inputs...  ...follow best practices for training and evaluation, using...  ...platforms, including optimization for latency, memory,...  ...optimizing training or inference pipelines through custom... 
    Senior
    Training

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $215.28k - $364.32k

     ...Staff Machine Learning Engineer – Autonomous Driving Model Quantization &...  ...lead the effort to optimize and deploy our VLA...  ...roadmap for large-scale multimodal models (...  ...innovate in PTQ (Post-Training Quantization), QAT...  ...deep knowledge of inference engines like TensorRT... 
    Training
    Full time

    XPENG

    Santa Clara, CA
    2 days ago
  • $184k - $287.5k

     ...Senior Performance Compiler Engineer to join our team and work...  ...breakthroughs in large language models, agents, and other high-...  ..., accelerating both training and inference. You will be immersed in a...  ...identify new opportunities for optimization. Designing and implementing... 
    Senior
    Training

    NVIDIA AI

    Santa Clara, CA
    4 days ago
  • A leading technology company is hiring a Machine Learning Systems Engineer in Cupertino, California. You will collaborate with Siri modeling teams to optimize model training and inference on Apple's custom Silicon. The ideal candidate has strong experience in ML models,... 
    Training

    Apple Inc.

    Cupertino, CA
    2 days ago
  •  ...will play a pivotal role in optimizing and developing deep learning...  ..., accelerating deep learning models, and enabling RL training and SOTA LLM and Multimodal inference at scale across multi-GPU and...  ...THE PERSON: Skilled engineer with strong technical and analytical... 
    Senior
    Training

    Advanced Micro Devices , Inc.

    Santa Clara, CA
    3 days ago
  • $89.3k - $157.55k

     ...Martin Space is seeking a Systems Engineer to support the system...  ...Kanban) • Understanding of Model-Based Systems Engineering concepts...  ...work experience, education/ training, key skills as well as market...  ...date in order to receive optimal consideration. At Lockheed... 
    Senior
    Training
    Full time
    Temporary work
    Work experience placement
    Work at office
    Remote work
    Relocation
    Flexible hours
    Shift work

    Lockheed Martin Corporation

    Sunnyvale, CA
    8 hours ago
  • $181.1k - $318.4k

     ...California, is looking for an experienced Machine Learning engineer to optimize and build production-grade solutions serving millions in real...  ...technologies, contributing directly to optimizing language and vision models. Applicants should have at least 5 years of industry... 

    Apple Inc.

    Santa Clara, CA
    2 days ago
  • $184k - $287.5k

     ...a Senior Software Engineer specializing in Deep Learning Inference for our growing team...  ..., build, and optimize the GPU-accelerated...  ...efficient large-scale model serving and...  ...domains like LLM, Multimodal and Generative AI....  ...Prior experience with training, deploying or optimizing... 
    Senior
    Training
    Remote work

    NVIDIA

    Santa Clara, CA
    7 days ago
  • $158.4k - $237.6k

     ...Staff Software Engineer Join the Qualcomm AI Hub team...  ...tools to help developers optimize and deploy machine learning models on edge and mobile hardware...  ...vision, audio, and multimodal networks for deployment...  ...or similar families) for inference optimization ~ Familiarity... 
    Work experience placement
    Immediate start
    Work from home

    Qualcomm

    Santa Clara, CA
    4 days ago
  • $174.72k - $295.68k

     ...Senior Computer Vision Engineer Santa Clara, CA XPENG...  ...role, you will develop and optimize multi-modal models and computer vision systems...  ...experience with multi-modal model training and optimization, a strong...  ..., fine-tuning, and inference optimization strategies,... 
    Senior
    Training
    Full time

    XPENG

    Santa Clara, CA
    4 days ago
  • $184k - $356.5k

     ...in California is seeking a Senior DL Algorithms Engineer to drive inference performance for Deep Learning workloads. The role involves implementing advanced model inference and collaborating with co-design teams to optimize performance across hardware and software... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    5 days ago
  • $184k - $356.5k

     ...leading AI computing company in California is seeking a Senior Deep Learning Software Engineer focused on performance optimization of LLM models. You will analyze and enhance LLM inference performance, working in cross-collaborative teams to implement cutting-edge... 
    Senior
    Full time

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Sr. Multimodal Model Training and Inference Optimization Engineer. Be the first to apply!