Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Sr. Multimodal Model Training and Inference Optimization Engineer

$244.8k

ByteDance

Responsibilitie

About the team The Vision-Applied Research team focuses on applied research in Generative AI and CV/Multimodal Understanding, and delivering intelligent solutions to TikTok, and Lemon8, enabling users to make and share creative content in a much easier way. The team has research groups dedicated to generative models for content creation, image generation, video synthesis, intelligent image/video editing, and virtual humans. We are seeking an experienced Multimodal Model Training and Inference Optimization Engineer with expertise in optimizing AI model training and inference, including distributed training/inference and acceleration. The ideal candidate will work at the cutting edge of AI efficiency, enhancing the performance, scalability, and deployment of large-scale generative AI models. Responsibilities - Optimize large model training pipelines to improve efficiency, speed, and scalability. - Develop and improve distributed training strategies such as data parallelism, model parallelism, pipeline parallelism and communication to accelerate model training. - Benchmark and profile deep learning models to identify performance bottlenecks and optimize computational resources.

Qualification

Minimum Qualifications: - M.S or PhD in Computer Science, Electrical Engineering, Artificial Intelligence, or a related field. - 3 years+ experience in AI model training optimization. - Strong software engineering skills, including proficiency in Python, C++, and CUDA. - Strong proficiency in deep learning frameworks such as PyTorch, Megatron and Deepspeed. - Experience with distributed training techniques such as data parallelism, model parallelism, and pipeline parallelism. - Knowledge of transformers and diffusion models. Preferred Qualifications: - Candidates with publications at conferences such as MLSys, NeurIPS, ICLR, or ICML are preferred. - Strong communication and teamwork skills. - Self-motivated and strong problem-solving skills. - Ability to work collaboratively in multi-functional teams. - Experienced in implementing and optimizing complex and performance-critical systems.

Job Information

[For Pay Transparency]Compensation Description (Annually)

The base salary range for this position in the selected city is $244800 - $450000 annually.


Compensation may vary outside of this range depending on a number of factors, including a candidate's qualifications, skills, competencies and experience, and location. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units.


Benefits may vary depending on the nature of employment and the country work location. Employees have day one access to medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short-term and long-term disability coverage, life insurance, wellbeing benefits, among others. Employees also receive 10 paid holidays per year, 10 paid sick days per year and 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure).


The Company reserves the right to modify or change these benefits programs at any time, with or without notice.


For Los Angeles County (unincorporated) Candidates:


Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state, and local laws including the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act. Our company believes that criminal history may have a direct, adverse and negative relationship on the following job duties, potentially resulting in the withdrawal of the conditional offer of employment:


1. Interacting and occasionally having unsupervised contact with internal/external clients and/or colleagues;


2. Appropriately handling and managing confidential information including proprietary and trade secret information and access to information technology systems; and


3. Exercising sound judgment.

About U

Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Lemon8, CapCut and Pico as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.

Why Join ByteDance

Inspiring creativity is at the core of ByteDance's mission. Our innovative products are built to help people authentically express themselves, discover and connect - and our global, diverse teams make that possible. Together, we create value for our communities, inspire creativity and enrich life - a mission we work towards every day.


As ByteDancers, we strive to do great things with great people. We lead with curiosity, humility, and a desire to make impact in a rapidly growing tech company. By constantly iterating and fostering an "Always Day 1" mindset, we achieve meaningful breakthroughs for ourselves, our Company, and our users. When we create and grow together, the possibilities are limitless. Join us.


Diversity & Inclusion


ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

Reasonable Accommodation

ByteDance is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at
Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Sr. Multimodal Model Training and Inference Optimization Engineer in San Jose, CA vacancy
  •  ...a skilled professional to enhance the performance of large-scale models through advanced optimization techniques in Santa Clara, California. Candidates should have a strong background in DL model training and deployment, ideally with a PhD or equivalent experience in Computer... 
    Senior
    Training

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • $224k - $356.5k

     .../ Principal Deep Learning Engineer — Model Evaluation & AI Systems, you...  ..., agents, and vision/multimodal models. Build and expand NeMo...  ...practices. Work alongside model training, inference, and product divisions to...  ...that inform release and optimization decisions. What we need to... 
    Senior
    Training

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • $184k - $287.5k

    Responsibilities Develop state‑of‑the‑art model optimization techniques—speculative...  ...strategies for inference, such as automated model sharding...  ...Computer Science, Computer Engineering, or a related technical...  .... A proven track record of training, deploying, or optimizing large... 
    Senior
    Training

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • This software engineering role involves developing datacenter‑scale performance‑modeling and prediction tools for AI researchers running AI workloads in GPU clusters...  ...as PyTorch and TensorFlow, distributed training and inference. Knowledge of GPU cluster job scheduling (Slurm... 
    Senior
    Training

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  •  ...to deliver industry-leading training and inference speeds and empowers machine...  ...customers include top model labs, global enterprises, and...  ...hiring a Senior Performance Engineer to join our Product team. You...  ...-LLM), GPU kernel-level optimization toolchains (CUDA, Triton),... 
    Senior
    Training
    Contract work
    Shift work

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    3 days ago
  • NVIDIA is seeking a Senior DL Algorithms Engineer to optimize LLM/Omni models and enhance performance across its software stack. The ideal candidate...  ...years of experience in deep learning, specifically in inference. This role involves profiling, analyzing bottlenecks, and... 
    Senior

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $229.9k - $262.4k

     ...Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform) Overview: At Capital One, we are...  ...to millions of customers. Our AI models and platforms empower teams across...  ...components including foundation model training, large language model inference,... 
    Senior
    Training
    Full time
    Part time
    Local area

    Capital One Financial Corp

    San Jose, CA
    1 day ago
  • $143.2k - $186k

     ...cutting-edge technologies to optimize Large Language Models (LLMs) and multimodal models, exploration and...  ...highly efficient LLM inference as well as deployment...  ...Science, Computer Engineering, Applied Mathematics, Communications...  ...with AI-related training and inference tools... 
    Training
    Full time
    Temporary work
    Flexible hours

    NIO

    San Jose, CA
    2 days ago
  • $184k - $356.5k

    NVIDIA Gruppe is looking for skilled software engineers to develop AI inference systems that operate with high efficiency. The role involves architecting high-performance inference frameworks and optimizing GPU processes. Ideal candidates should have extensive programming... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • $172.43k - $230.95k

     ...Senior Software Engineer For The Ai Model Lifecycle Team Crusoe...  ...maintain end-to-end training pipelines for Large...  ...pipelines (e.g., preference optimization, policy optimization...  ...Language Models, Multimodal). ~ Hands-on...  ...on GPU systems and inference frameworks. Benefits... 
    Senior
    Training
    Temporary work

    Crusoe

    Sunnyvale, CA
    6 days ago
  •  ...building a 100x better job search engine: fast, comprehensive, honest...  ...us turn powerful AI and ML models into fast, reliable...  ...infrastructure: deploying models, optimizing inference latency and throughput,...  ...Deploy and integrate researcher-trained model checkpoints into our... 
    Training
    Relocation package

    HiringCafe

    Cupertino, CA
    3 days ago
  • $2,000 per month

     ...the world’s first AI inference system purpose-built for...  ...time video generation models and extremely deep &...  ...and staffed by leading engineers, Etched is redefining...  ...and lead our efforts in optimizing thermal management for...  ...using more FLOPs to train and run models, and the... 
    Senior
    Training
    Work at office
    Relocation package

    Etched

    San Jose, CA
    9 days ago
  • $184k - $287.5k

    Senior DL Algorithms Engineer - Inference Performance page is loaded## Senior DL Algorithms Engineer...  ...mindful of performance analysis and optimization to help us squeeze every last clock...  ...be doing:*** Implement language and multimodal model inference as part of NVIDIA... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • A leading technology company located in Cupertino, California, is seeking a Senior Research Engineer focused on training data infrastructure for advanced AI models. The ideal candidate will possess strong skills in distributed systems and a deep understanding of Machine... 
    Senior
    Training

    Apple Inc.

    Cupertino, CA
    2 days ago
  •  ...knit group of researchers and engineers responsible for building...  ...large scale frontier foundation models at Apple. We believe the...  ...products. You will tackle core training challenges in instruction...  ...novel algorithms for preference optimization, model steering, and safety.... 
    Training

    Apple Inc.

    Cupertino, CA
    1 day ago
  •  ...Institute of Foundation Models We are a dedicated...  ...foundation model training, alongside world-class...  ...data scientists, and engineers, tackling the most fundamental...  ...state-of-the-art multimodal foundation models...  ...model modularity, and inference optimization. Build and improve... 
    Training

    Institute of Foundation Models

    Sunnyvale, CA
    23 days ago
  • $137k - $156k

     ...passionate, and committed engineers, technologists, and...  ...Summary: As a Sr. System Engineer, you'...  ...delivering impactful training sessions to customers...  ...and testing, providing optimized benchmarks for HPC/AI...  ...with MLPerf Training/Inference benchmark, LLM, HPL-AI... 
    Senior
    Training
    Worldwide

    Supermicro

    San Jose, CA
    2 days ago
  • $212.3k - $275.8k

     ...collaborate with product and engineering teams to deploy reliable,...  ...and observable AI services, optimizing inference performance from CPU and...  ...deployment automation, and model/service observability. This...  ...triage workflows. Support training and fine-tuning workflows for... 
    Training
    Full time
    Temporary work
    Local area
    Flexible hours
    3 days per week

    Cisco

    San Jose, CA
    4 days ago
  • $128.7k - $261.3k

     ...repeatable, high-velocity model deployments through...  ...deployment and infra engineers to ship numerically robust...  ...focused onmodel optimization and deployment, with significant...  .../ efficient inference or relevant experience...  ...toolingintegrated into training or evaluation pipelines... 
    Senior
    Training
    Local area
    Remote work
    Work from home
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    5 days ago
  •  ...Assistance, Inc., (SGA), is searching for a Sr. Business Model Strategy Product Manager for a...  ...We do this by driving initiatives that optimize new and existing offerings via product...  ...sales, and develop sales plays, and training as appropriate. Complete post-mortems... 
    Senior
    Training
    Contract work
    2 days per week
    3 days per week

    SGA

    San Jose, CA
    1 day ago
  •  ...AI platform, from chip to model, optimized for enterprise and government...  ...and driven ML performance engineer to optimize and scale...  ...performance for large-scale AI inference. Responsibilities...  ...on experience with LLM or multimodal model training and inference. Background... 
    Senior
    Training
    Full time
    Temporary work
    Local area
    Flexible hours

    SambaNova Systems

    San Jose, CA
    3 days ago
  • $224k - $356.5k

     ...searching for a senior or principal engineer who specializes in building...  ...for large‑scale foundation model training in the Generalist Embodied...  ...influential works on multimodal foundation models, large-scale...  ...foundation models for robotics. Optimize GPU and cluster utilization... 
    Senior
    Training
    Full time

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • $148.1k - $282.1k

     ...business-case for applying a SaaS business model to our perpetual products - a move that...  ...We do this by driving initiatives that optimize new and existing offerings via product configuration...  ...direct sales, develop sales plays and training as appropriate. Complete post-mortems... 
    Senior
    Training
    Temporary work
    Local area
    Worldwide

    Adobe

    San Jose, CA
    2 days ago
  • $100k

     ...unify innovations in software models, compilers, platforms,...  ...designs to architect, enable, and optimize the firmware that powers next...  ...stack. You'd rather debug a link-training state machine than a high-...  ...software teams to solve frontier engineering challenges. How to help... 
    Senior
    Training
    Permanent employment

    Tenstorrent

    Santa Clara, CA
    1 day ago
  •  ...for a Senior Staff AI Infra Engineer who is passionate about...  ...of hardware and software to optimize performance for next-generation...  ..., including Large Language Models (LLMs) and Agentic AI...  ...Optimize and accelerate LLM training and inference on AMD GPUs, improving kernel... 
    Training

    Advanced Micro Devices , Inc.

    Santa Clara, CA
    2 days ago
  •  ...an experienced and visionary Sr. Technologist to join our...  ...success metrics, and execution model aligned with organizational...  ...Drive end‑to‑end analysis of AI training and inference workloads, spanning models,...  ..., highly skilled team of engineers and researchers. Drive thought... 
    Senior
    Training
    Temporary work
    Remote work
    Flexible hours
    Shift work

    Sandisk

    Milpitas, CA
    17 days ago
  • $138.8k - $190.85k

     ...Senior MEMS Characterization and Metrology Engineer , you will use your technical and...  ...qualification and release-to-production. Optimization of system performance focusing on...  ...as location, experience, education, and training. In addition to base salary, this... 
    Senior
    Training

    SiTime Corporation

    Santa Clara, CA
    more than 2 months ago
  • $190k - $235k

     ...Senior Perception Learning Engineer Sunnyvale, CA...  ...object detection, world modeling, and multi-sensor...  ...You will design and optimize deep learning models for...  ...scalable pipelines for training, evaluation, and deployment...  ...infrastructure, and inference frameworks to accelerate... 
    Senior
    Training
    Local area

    Apptronik

    Sunnyvale, CA
    20 days ago
  • $190k - $235k

     ...Senior Learning Perception Engineer - Slam Sunnyvale, C...  ...odometry, world modeling, and learning-based perception...  ...You will design and optimize deep learning models...  ...pipelines for training, evaluation, and deployment...  ...training infrastructure, and inference frameworks to... 
    Senior
    Training
    Local area

    Apptronik

    Sunnyvale, CA
    1 day ago
  •  ...will play a pivotal role in optimizing and developing deep learning...  ..., accelerating deep learning models, and enabling RL training and SOTA LLM and Multimodal inference at scale across multi-GPU and...  ...THE PERSON: Skilled engineer with strong technical and analytical... 
    Senior
    Training

    Advanced Micro Devices , Inc.

    Santa Clara, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Sr. Multimodal Model Training and Inference Optimization Engineer. Be the first to apply!