Staff Machine Learning Training Framework Engineer, GenAI
$173k - $307kFrame.io
The Opportunity
Adobe Applied Science & Machine Learning ( ASML ) is seeking a Staff Machine Learning Training Framework Engineer to play a critical role in building and scaling the core training systems behind Adobe’s generative AI foundation models.
In this role, you will serve as a senior technical owner for key components of our training framework , translating research needs into reliable, scalable, and high‑performance training infrastructure. Rather than focusing on a single model, your work will enable multiple multimodal and video foundation models by strengthening the shared systems used to train them.
You will operate at the intersection of applied research and large‑scale systems execution, ensuring that training workflows are robust, reproducible, and performant across large GPU clusters. This role is ideal for a senior engineer who thrives on deep technical ownership, complex execution, and close collaboration with research teams.
Job Responsibilities
- Training Framework Ownership: Own the design and implementation of major components of the training framework, including abstractions for model configuration, optimizer and scheduler integration, checkpointing, and experiment management.
- Large‑Scale Training Execution: Implement and support distributed training strategies such as PyTorch FSDP, Tensor Parallelism, and Pipeline Parallelism, ensuring correctness, stability, and scalability across multi‑node GPU environments.
- Reliability & Fault Tolerance: Improve the resilience of long‑running training jobs by strengthening restartability, state management, and failure handling mechanisms.
- Performance‑Aware Framework Design: Identify framework‑level inefficiencies and reduce overhead related to memory usage, communication, or execution orchestration in large training runs.
- Research Enablement: Partner directly with applied researchers to support new model architectures and training requirements, ensuring the framework adapts quickly to evolving research needs.
- Training Pipeline Integration: Collaborate with infrastructure and platform teams to integrate the training framework with scheduling, storage, monitoring, and logging systems used in production‑scale environments.
What You’ll Need to Succeed
- Education: Master’s or PhD degree in Computer Science, Electrical Engineering, or a related field, or equivalent practical experience.
- Strong Systems Engineering Skills: Proficiency in Python and C++, with experience contributing to large, shared codebases that support multiple users or teams.
- Proven ML Training Experience: Hands‑on experience training models using PyTorch (or JAX), including multi‑GPU and multi‑node distributed training setups.
- Distributed Systems Understanding: Solid understanding of synchronization, state management, fault tolerance, and performance tradeoffs in distributed systems.
- Senior‑Level Execution: Demonstrated ability to independently own complex technical problems, drive solutions to completion, and deliver high‑quality systems relied upon by others.
Preferred Experience
- Experience supporting large‑scale foundation model training or long‑running multi‑node training jobs.
- Familiarity with ML training infrastructure such as DeepSpeed, Accelerate, or internal training platforms.
- Experience working closely with applied research teams on rapidly evolving model requirements.
- Exposure to profiling, debugging, and optimizing training performance at scale.
About Adobe
Adobe empowers everyone to create through innovative platforms and tools that unleash creativity, productivity and personalized customer experiences. Adobe’s industry-leading offerings including Adobe Acrobat Studio, Adobe Express, Adobe Firefly, Creative Cloud, Adobe Experience Platform, Adobe Experience Manager, and GenStudio enable people and businesses to turn ideas into impact, powered by AI and driven by human ingenuity.
Our 30,000+ employees worldwide are creating the future and raising the bar as we drive the next decade of growth. We’re on a mission to hire the very best and believe in creating a company culture where all employees are empowered to make an impact. At Adobe, we believe that great ideas can come from anywhere in the organization. The next big idea could be yours.
Let’s Adobe together
At Adobe, we believe in creating a company culture where all employees are empowered to make an impact. Learn more about Adobe life, including our values and culture , focus on people, purpose and community , Adobe for All , comprehensive benefits programs , the stories we tell , the customers we serve, and how you can help us advance our mission of empowering everyone to create.
Adobe is proud to be an Equal Employment Opportunity employer. We do not discriminate based on gender, race or color, ethnicity or national origin, age, disability, religion, sexual orientation, gender identity or expression, veteran status, or any other protected characteristic. Learn more.
Adobe aims to make our Careers website and recruiting process accessible to any and all users. If you have a disability or special need that requires accommodation to navigate our website or complete the application process, email View email address on swooped.co or call View phone number on swooped.co.
AI Use Guidelines for Interviews:
Our interviews are designed to reflect your own skills and thinking. The use of AI or recording tools during live interviews is not permitted unless explicitly invited by the interviewer or approved in advance as part of a reasonable accommodation. If these tools are used inappropriately or in a way that misrepresents your work, your application may not move forward in the process.
At Adobe, we empower employees to innovate with AI — and we look for candidates eager to do the same. As part of the hiring experience, we provide clear guidance on where AI is encouraged during the process and where it’s restricted during live interviews. See how we think about AI in the hiring experience .
Expected Pay Range:
Our compensation reflects the cost of labor across several U.S. geographic markets, and we pay differently based on those defined markets. The U.S. pay range for this position is $172,500 -- $306,625 annually. Pay within this range varies by work location and may also depend on job-related knowledge, skills, and experience. Your recruiter can share more about the specific salary range for the job location during the hiring process.
In California, the pay range for this position is $211,800 - $306,625
At Adobe, for sales roles starting salaries are expressed as total target compensation (TTC = base + commission), and short-term incentives are in the form of sales commission plans. Non-sales roles starting salaries are expressed as base salary and short-term incentives are in the form of the Annual Incentive Plan (AIP).
In addition, certain roles may be eligible for long-term incentives in the form of a new hire equity award.
State-Specific Notices:
California:
Fair Chance Ordinances
Adobe will consider qualified applicants with arrest or conviction records for employment in accordance with state and local laws and “fair chance” ordinances.
Colorado:
Application Window Notice
If this role is open to hiring in Colorado (as listed on the job posting), the application window will remain open until at least the date and time stated above in Pacific Time, in compliance with Colorado pay transparency regulations. If this role does not have Colorado listed as a hiring location, no specific application window applies, and the posting may close at any time based on hiring needs.
Massachusetts:
Massachusetts Legal Notice
It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability.
- ...Kernel Engineer Sunnyvale CA or Toronto Canada Cerebras Systems... ...Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large... ...Learning neural networks and frameworks such as TensorFlow and PyTorch....Training
$255.85k - $413.52k
...will establish scalable frameworks that automate... ...cycles, and increasing engineering efficiency. Key Responsibilities... ...the adoption of GenAI and AgenticAI... ...Background in data analytics, machine learning, or intelligent... ...relevant education or training. Your recruiter can share...TrainingFull timeInternshipLocal areaImmediate startShift work- ...to revolutionize how machines move, perceive, and interact... ...integrating control engineering, artificial intelligence, and machine learning at every level of... ...humanoid robots Design training pipelines including domain... ...on whole‑body control frameworks integrating learned...TrainingTemporary work
- ...Cerebras to deliver industry‑leading training and inference speeds and empowers machine learning users to effortlessly run large... ...hiring a Senior Performance Engineer to join our Product team. You... ...‑the‑art open‑source inference frameworks like vLLM, SGLang, or TensorRT‑...TrainingContract workShift work
$320k
...high-performance computing to machine learning applications for autonomous... ...centers is the ability to engineer integrated system designs in... ...sophisticated international regulatory frameworks and ITU-T standards to... ...requirements needed for AI training vs. inference. Deep...Training$184k - $287.5k
Intelligent machines powered by Artificial Intelligence computers that can learn, reason and interact with people... ...Senior Perception Engineer to develop and productize... ...using deep learning frameworks (e.g., PyTorch).... ...CUDA kernels as part of training or inference pipelines...TrainingWork experience placement$147.4k - $272.1k
Deep Learning Engineer - Perception Algorithms Sunnyvale, California, United States Machine Learning and AI Do you have a passion for deep learning... ...on perception tasks (training and evaluation sets).... ...utilizing distributed GPU training framework. Experience with advanced...TrainingRelocation$174k - $253k
...prompting, agent tooling, eval frameworks, and modern AI frameworks,... ...generation (RAG), machine learning templates, and document/image... ...product marketing management and engineering teams to stay on top of... ...and relevant education or training. US: $174,000 - $253,000 (...TrainingTemporary work$198.3k - $342.8k
Internationalization Engineering Manager - Intelligence... ..., United States Machine Learning and AI The International... ...software, models, and frameworks, combined with the... ...solutions, writing code, training models, and stepping... .... Experience using GenAI to prototype solutions...TrainingRelocation$150k - $188k
...Software Quality Assurance Engineer Applied Intuition is looking... ...projects Drive process training and awareness across engineering... ...and knowledge of process framework, hierarchy, and guideline... ...ASPICE PAM v4.0 processes MLE (Machine Learning Engineering) and HWE (...TrainingFull timeFor contractorsFor subcontractor$110k - $170k
...Artificial Intelligence (AI)/Machine Learning (ML) systems and other high-... ...Photonics Systems Test Engineer to own silicon photonics system... ...software using our Python-based framework; 4) Integrate and debug test... ...experience, skills, training, education, market demands and...Training$164.47k - $311.89k
...Join Intel's Hard IP Development Group (HIPD) within the Central Engineering Organization, where innovation meets execution. Our team... ...customers in a timely manner; provide IP reviews, lab demos, and training to customers as needed; generate industry‑standard IP documentation...TrainingLocal areaRemote work$224k - $356.5k
...As a Senior / Principal Deep Learning Engineer — Model Evaluation & AI... ...practices. Work alongside model training, inference, and product divisions... ...or assessing contemporary machine learning and deep learning... ...or improving evaluation frameworks, benchmarks, or ML infrastructure...Training- ..., California, United States Machine Learning and AI We are a tight-knit group of researchers and engineers responsible for building large... .... You will tackle core training challenges in instruction following... ...and a major deep learning framework such as JAX or PyTorch....Training
- ...leading technology company is hiring a Machine Learning Systems Engineer in Cupertino, California. You will... ...modeling teams to optimize model training and inference on Apple's custom Silicon... ...Python and knowledge of various ML frameworks. The role offers competitive compensation...Training
$181.1k - $318.4k
...California, United States Machine Learning and AI Scaling machine learning... ...challenges that few engineers ever encounter. In Apple’s... ...that powers large-scale ML training and inference workloads, bringing... ...developing with modern web frameworks and RESTful APIs. At Apple...TrainingRelocation- ...Sunnyvale, California, is looking for an experienced engineer to join its SOTA Training Platform team. The ideal candidate will have over 5 years... ...and a strong background in Python, C/C++, and deep learning frameworks. Responsibilities include bringing ML models to life...Training
$147.4k - $272.1k
...Computer Vision & Machine Learning Engineer Apple is where individual imaginations gather together... ...Python and in a modern deep learning framework such as PyTorch or JAX Experience... ...with foundation model architectures and training methodologies Experience working...TrainingRelocation$131.56k - $193.6k
...Lead System Engineer Sony Electronics Inc. is looking for the... ...design Computer Vision and/or Machine Learning technologies and solutions... ...Collect annotated datasets for training AI models using tools or... ...conversion, from common ML framework to our edge AI processor, toolchain...TrainingSummer workHome office$224k - $356.5k
## Software Engineering Manager, Robotics Neural Reconstruction... ...integration in robot learning developments.* Hands-... ..., autonomous driving, machine learning, or related... ...with Deep Learning frameworks (PyTorch, JAX, TensorFlow... ...large-scale model training on GPU clusters.* Hands...Training- ...leverages cutting‑edge generative AI to assist engineers in RTL design, simulation, and... ..., and architect multi‑node clusters for training and inference that push the limits of LLM... ...robust evaluation harnesses and benchmarking frameworks that measure accuracy, throughput,...Training
- ...Cerebras to deliver industry‑leading training and inference speeds and empowers machine learning users to effortlessly run large... ...a versatile and experienced engineer to join our SOTA Training... ...Experience with deep learning frameworks (e.g., PyTorch, TensorFlow) and...TrainingInternship
$184k - $287.5k
...Senior Robotics Research Engineer (Robotics & AI for... ...control, reinforcement learning, imitation learning, simulation... ...planning pipelines Training robots to solve... ...Qualifications A PhD in Robotics, Machine Learning, Computer... ...modern deep learning frameworks such as PyTorch and...Training$147.4k - $272.1k
Description As a Machine Learning Systems Engineer, you will work closely with Siri modeling teams and other... ...-functional teams to optimize model training and inference. You will be working... ...across ML infrastructure, inference, and framework teams. You will write production-...TrainingRelocation- ...Materials Science AI Engineer Location: Santa Clara, CA - 5D Onsite Duration... ...and C++. Experience with machine learning and deep learning frameworks (e.g., PyTorch, TensorFlow). Experience... ..., aggregating and structuring training data, statistical theory, and cloud...TrainingContract workWork experience placement
$152k - $241.5k
...looking for a Sr. Software Engineer specializing in... ...multimodal representation learning, model adaptation,... ...experience in deep learning, machine learning, computer... ...modern deep learning frameworks such as PyTorch or TensorFlow... ...span data curation, training, evaluation, export,...TrainingImmediate startShift work$126k - $423k
...intelligence to every moving machine on the planet. Applied... ...a passionate Research Engineer to join the Research... ...contribute to and learn from best practices in... ...and use them for RL training. Process human data for... ...‑focused software frameworks or tools. Compensation...TrainingFull timeFor contractorsFor subcontractorWork at officeImmediate startRemote workDay shift$160k - $180k
...’ll Do As a Senior AI Systems Engineer, you will architect, deploy, and... ...for large-scale AI model training and inference. You will ensure our machine learning platforms are robust and efficient... ...Optimization: Leverage specialized frameworks to maximize hardware...TrainingLocal area$152k - $241.5k
Join NVIDIA's Solution Engineering team that is shaping... ...future of autonomous machines. Our goal is to build... ...the domains of machine learning and reinforcement learning... ...related middleware frameworks. Experience with robot... ...reinforcement learning for training and validation of...Training$147.4k - $220.9k
Software Engineer - Customer Systems, Information Systems & Technology... ...solution hand‑off and training for new features to operations... ...and establish governance framework across the organization. Develop... ...skills. Knowledge of machine learning fundamentals. Compensation...TrainingRelocation package
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Staff Machine Learning Training Framework Engineer, GenAI. Be the first to apply!
- machine learning ai engineer San Jose, CA
- ai ml engineer San Jose, CA
- senior ml engineer San Jose, CA
- machine learning engineer San Jose, CA
- computer vision machine learning engineer San Jose, CA
- machine learning software engineer San Jose, CA
- machine learning remote San Jose, CA
- machine learning research scientist San Jose, CA
- machine learning San Jose, CA
- artificial intelligence - machine learning intern San Jose, CA

