Senior AI Runtime Engineer: Distributed Training & Scale
FlexAI
A forward-thinking AI infrastructure company is seeking a Staff AI Runtime Engineer to lead the design and optimization of their AI compute platform. In this leadership role, you'll enhance AI training and inference capabilities. Successful candidates will have over 8 years of experience in systems engineering, expertise with PyTorch and TensorFlow, and strong programming skills in Python and C++. This role is based in Santa Clara, CA, and offers a competitive salary along with the chance to work on cutting-edge technology. #J-18808-Ljbffr FlexAI
$180k - $225k
...Build and Deploy AI the right way, anywhere... ...teams are strategically distributed across Silicon Valley... ...designed for next-generation training and inference workloads. As a Staff AI Runtime Engineer , you'll play a... ...training and inference at scale. Design resilient...TrainingWork at office- ...Senior Principal AI Agent / ML Software Engineer The Senior Principal AI Agent /... ...applications used in large-scale, business-critical... ...combines deep distributed systems experience... ...GPU inference or training workloads for... ...inference gateways, agent runtimes, workflow engines,...SeniorTraining
$180k
A cutting-edge AI research firm in California seeks a Member of Technical Staff specializing... ...hands-on experience with multimodal pre-training and a strong proficiency in Python, JAX,... ...Responsibilities include designing large-scale systems and developing data pipelines to push...SeniorTraining$184k - $287.5k
...the unlimited potential of AI to define the next era of... ...looking for outstanding Senior High Performance AI Engineer to build groundbreaking... ...build innovative agentic runtimes and compiler-integrated orchestration... .../libraries, frameworks, distributed training, and inference/serving—...SeniorTraining$180k - $240k
...the role We are seeking a Senior AI Infrastructure Engineer to design, build, and scale the high-performance AI... ...infrastructure that enables distributed training, experiment tracking, and seamless... ...artifacts using TensorRT, ONNX Runtime, and Triton Inference Server,...SeniorTrainingOdd jobWork at office- ...Senior AI Systems Performance Engineer Palo Alto, California, United States... ...and operations at scale. SambaNova Suite... ...collaborating across compiler, runtime, and hardware... ...single-node and distributed systems. Basic... ...multimodal model training and inference....SeniorTraining
$168k - $322k
NVIDIA Gruppe is seeking a Senior AI Platform Engineer to improve engineering efficiency and data security... ...Cloud and AI/ML teams to build and scale infrastructure and shape the... ...strong Python skills, and expertise in distributed systems along with Kubernetes. Competitive...Senior$144.7k - $261.3k
...infrastructure, and ML/AI GPU platforms for AV... ...GM is looking for a Senior Performance Engineer to join the AV Capacity... ...input into large scale ML infrastructure strategy... ...of large-scale ML training and inference environments... ...within large-scale distributed production...SeniorTrainingLocal areaRemote workWork from homeFlexible hours3 days per week$155.42k - $395.9k
...supports the end-to-end AI lifecycle of ML... ...experimentation and large-scale training to evaluation, lineage... ...interfaces, enabling ML engineers and researchers to... ...The Role: As a Senior AI/ML Engineer, you will... ...implement, and test scalable distributed computing and data...SeniorTrainingLocal areaRemote workWork from homeRelocationRelocation packageFlexible hours$170.6k - $261.3k
...world! The Data Labeling Engineering team designs, builds, and operates... ..., data engineering, and AI/ML, defining the strategies... ...that create reliable training data at scale. Our tools and platform are... ...experience building robust distributed platforms and applications....SeniorTrainingLocal areaRemote workWork from homeFlexible hours$200k - $400k
...Institute Of Foundation Models Engineer The Institute of... ...and operates ultra-scale GPU supercomputing systems to train next-generation foundation... ...communication systems, runtime, and hardware topology.... ...communication performance, distributed reliability, and cross-layer...SeniorTrainingVisa sponsorship- ...NVIDIA's DGX Cloud AI Efficiency Team... ...AI workloads - pre‑training, post‑training, inference... ...resources and scale to foster... ...infrastructure software engineer to join our team.... ...AI systems. As a senior DGX Cloud AI Infrastructure... ...large‑scale distributed systems. Experience...SeniorTraining
$208k - $327.75k
...Manager to lead strategic AI platform initiatives... ...closely with engineering, architecture, and platform... ...that improve how large-scale systems are built and... ...infrastructure, including distributed training, inference... ...test, deployment, and runtime environments. Outstanding...SeniorTrainingTemporary work$174.72k - $295.68k
...Senior AI Data Infrastructure/Pipeline Engineer Santa Clara, CA XPENG is a leading smart... ...dataset production → model training / simulation input. In... ...daily flow of petabyte-scale sensor data. Key Responsibilities... ...I/O, etc., and build a distributed data processing system...SeniorTrainingFull timeOverseas$160k - $253k
AI Factories, powered by NVIDIA accelerated... ...software to power AI at scale. To help customers... ..., we are seeking a Senior Technical Marketing Engineer focused on scale‑out... ...inference and training performance and power... ...including cabling, power distribution, and thermal scaling...SeniorTraining- ...Alto seeks a Staff/Principal ML Systems Engineer to enhance training performance for their innovative humanoid robots. You will optimize distributed training systems and engage closely... ...paced environments, and possess strong debugging skills. #J-18808-Ljbffr Rhoda AISeniorTraining
$200k - $270k
...Samsung SDS America AI Team is researching the... ..., policy training, and deployment on physical... ...We are looking for a Senior Physical AI Engineer to join the team developing... ...manufacturing at scale across thousands of factory... ...GPU acceleration and distributed training systems...SeniorTrainingWorldwideFlexible hours$110k - $190k
...Role Overview We are hiring a Senior Software & AI Engineer to build production-grade AI systems... ...the right solution: data preparation, training, evaluation, deployment, and monitoring... ...core to how we create value, scale operations, and differentiate in the...SeniorTraining$140k - $215k
...world's most advanced AI-native platform. Our... ...Development Engineer role on the Cloud Runtime Protection team that... ...workloads deployed at scale Design and develop... ...work effectively in a distributed team #LI-JC1 Benefits... ..., selection, training, compensation, benefits...SeniorTrainingWork experience placementWork at officeLocal area2 days per week3 days per week$176.8k - $265.2k
...is building an enterprise-scale Agentic AI platform to enable secure,... ...Principal Software Development Engineer to serve as the technical... ...ideal candidate has strong distributed systems expertise, deep familiarity... ..., promotion, benefits, training, discipline, and...SeniorTrainingLocal area$152k - $287.5k
NVIDIA Gruppe is seeking a highly motivated Software Engineer to contribute to the design and development of large-scale AI systems. The successful candidate will work on scalable infrastructure for ML training and cloud-native platforms, leveraging cutting-edge technologies...SeniorTraining$209k
...Machine Learning Platform Engineer Immigration sponsorship is... ...downtime. • Enable support for distributed model training and hyperparameter... ...Optimize GPU utilization for large-scale training workloads, ensuring... ..., and resource-efficient AI workloads across multi-node...SeniorTrainingWork at officeRemote work1 day per week$318.24k
Crusoe is looking for a Senior Staff Software Engineer to develop a managed platform for the AI Model Lifecycle team. The position focuses on fine-tuning large-scale AI models and implementing training pipelines, requiring over 8 years of industry experience and hands-on...SeniorTraining$244.14k - $413.16k
...Senior Staff AI Engineer Santa Clara, CA XPENG is a leading smart technology company at the forefront... ...Senior Staff AI Engineer to build and scale production-grade AI systems that drive... ...experience, and relevant education or training. We are an Equal Opportunity Employer....SeniorTrainingFull time$123k - $215.25k
...Senior AI Engineer II - Agentic AI New York, NY, United States Sunrise... ...operate responsibly and at scale across the enterprise. Our... ...and services: REST, gRPC Distributed systems: event-driven... ...~ Career development and training opportunities For a full...SeniorTrainingFull timeWork at officeLocal areaRemote workVisa sponsorshipFlexible hoursShift work$223k - $306.5k
...Integrity, and Inclusion. We weave AI into the fabric of everything... ...As a Sr Principal AI Engineer, you will join a dynamic team... ...behavioral analysis, and adversarial training to protect model instructions... ...environments, delivering large-scale implementations with...SeniorTrainingFull timeWork at office$188k - $237.5k
...Senior AI Engineer At Sonatus, we're driving the transformation to AI-enabled software-defined... ...agility of a fast-growing company with the scale and impact of an established partner.... ...development, including modeling, training, tuning, validating, deploying, and maintaining...SeniorTrainingWork at officeLocal areaWorldwideFlexible hoursShift work$139k - $229k
...NYC Sr. AI Engineer: Assets, Formats & Placements The... ...schema, APIs, and delivery runtime that enable creative... ...can launch once and scale everywhere with... ...outcomes. We're seeking a Senior AI Engineer to shape... ...exploration to model training, evaluation, and deployment...SeniorTrainingFor contractorsWork at officeFlexible hours- ...experiences-from AI and data centers... ...looking for a Senior Staff AI Infra Engineer who is passionate... ...accelerate LLM training and inference on... ...including large-scale training and inference... ..., network, and runtime layers. •... ...infrastructure, distributed systems, or performance...Training
$195.2k - $275.58k
The Software and AI (SAI) organization is seeking a highly skilled Software Development Engineer to contribute to the development and optimization... ..., TensorFlow, PyTorch, ONNX Runtime, and many others. This is a... ...deep‑learning inference and training throughput on current and...SeniorTrainingLocal areaRemote workWorldwideFlexible hoursShift work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior AI Runtime Engineer: Distributed Training & Scale. Be the first to apply!
- ai engineer remote Santa Clara, CA
- ai prompt engineer Santa Clara, CA
- senior ai engineer Santa Clara, CA
- machine learning ai engineer Santa Clara, CA
- ai engineer Santa Clara, CA
- ai developer Santa Clara, CA
- ai ml engineer Santa Clara, CA
- senior automation controls engineer Santa Clara, CA
- senior brand designer Santa Clara, CA
- senior business analyst contract Santa Clara, CA

