Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior AI Runtime Engineer: Distributed Training & Scale

FlexAI

A forward-thinking AI infrastructure company is seeking a Staff AI Runtime Engineer to lead the design and optimization of their AI compute platform. In this leadership role, you'll enhance AI training and inference capabilities. Successful candidates will have over 8 years of experience in systems engineering, expertise with PyTorch and TensorFlow, and strong programming skills in Python and C++. This role is based in Santa Clara, CA, and offers a competitive salary along with the chance to work on cutting-edge technology. #J-18808-Ljbffr FlexAI

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Senior AI Runtime Engineer: Distributed Training & Scale in Santa Clara, CA vacancy
  • $180k - $225k

     ...Build and Deploy AI the right way, anywhere...  ...teams are strategically distributed across Silicon Valley...  ...designed for next-generation training and inference workloads. As a Staff AI Runtime Engineer , you'll play a...  ...training and inference at scale. Design resilient... 
    Training
    Work at office

    FlexAI

    Santa Clara, CA
    1 day ago
  •  ...Senior Principal AI Agent / ML Software Engineer The Senior Principal AI Agent /...  ...applications used in large-scale, business-critical...  ...combines deep distributed systems experience...  ...GPU inference or training workloads for...  ...inference gateways, agent runtimes, workflow engines,... 
    Senior
    Training

    Oracle

    Santa Clara, CA
    1 day ago
  • $180k

    A cutting-edge AI research firm in California seeks a Member of Technical Staff specializing...  ...hands-on experience with multimodal pre-training and a strong proficiency in Python, JAX,...  ...Responsibilities include designing large-scale systems and developing data pipelines to push... 
    Senior
    Training

    x.ai

    Palo Alto, CA
    1 day ago
  • $184k - $287.5k

     ...the unlimited potential of AI to define the next era of...  ...looking for outstanding Senior High Performance AI Engineer to build groundbreaking...  ...build innovative agentic runtimes and compiler-integrated orchestration...  .../libraries, frameworks, distributed training, and inference/serving—... 
    Senior
    Training

    2100 NVIDIA USA

    Santa Clara, CA
    3 days ago
  • $180k - $240k

     ...the role We are seeking a Senior AI Infrastructure Engineer to design, build, and scale the high-performance AI...  ...infrastructure that enables distributed training, experiment tracking, and seamless...  ...artifacts using TensorRT, ONNX Runtime, and Triton Inference Server,... 
    Senior
    Training
    Odd job
    Work at office

    Gatik AI

    Mountain View, CA
    1 day ago
  •  ...Senior AI Systems Performance Engineer Palo Alto, California, United States...  ...and operations at scale. SambaNova Suite...  ...collaborating across compiler, runtime, and hardware...  ...single-node and distributed systems. Basic...  ...multimodal model training and inference.... 
    Senior
    Training

    SambaNova Systems

    Palo Alto, CA
    3 days ago
  • $168k - $322k

    NVIDIA Gruppe is seeking a Senior AI Platform Engineer to improve engineering efficiency and data security...  ...Cloud and AI/ML teams to build and scale infrastructure and shape the...  ...strong Python skills, and expertise in distributed systems along with Kubernetes. Competitive... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $144.7k - $261.3k

     ...infrastructure, and ML/AI GPU platforms for AV...  ...GM is looking for a Senior Performance Engineer to join the AV Capacity...  ...input into large scale ML infrastructure strategy...  ...of large-scale ML training and inference environments...  ...within large-scale distributed production... 
    Senior
    Training
    Local area
    Remote work
    Work from home
    Flexible hours
    3 days per week

    General Motors

    Sunnyvale, CA
    3 days ago
  • $155.42k - $395.9k

     ...supports the end-to-end AI lifecycle of ML...  ...experimentation and large-scale training to evaluation, lineage...  ...interfaces, enabling ML engineers and researchers to...  ...The Role: As a Senior AI/ML Engineer, you will...  ...implement, and test scalable distributed computing and data... 
    Senior
    Training
    Local area
    Remote work
    Work from home
    Relocation
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    2 days ago
  • $170.6k - $261.3k

     ...world! The Data Labeling Engineering team designs, builds, and operates...  ..., data engineering, and AI/ML, defining the strategies...  ...that create reliable training data at scale. Our tools and platform are...  ...experience building robust distributed platforms and applications.... 
    Senior
    Training
    Local area
    Remote work
    Work from home
    Flexible hours

    General Motors

    Sunnyvale, CA
    a month ago
  • $200k - $400k

     ...Institute Of Foundation Models Engineer The Institute of...  ...and operates ultra-scale GPU supercomputing systems to train next-generation foundation...  ...communication systems, runtime, and hardware topology....  ...communication performance, distributed reliability, and cross-layer... 
    Senior
    Training
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    2 days ago
  •  ...NVIDIA's DGX Cloud AI Efficiency Team...  ...AI workloads - pre‑training, post‑training, inference...  ...resources and scale to foster...  ...infrastructure software engineer to join our team....  ...AI systems. As a senior DGX Cloud AI Infrastructure...  ...large‑scale distributed systems. Experience... 
    Senior
    Training

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $208k - $327.75k

     ...Manager to lead strategic AI platform initiatives...  ...closely with engineering, architecture, and platform...  ...that improve how large-scale systems are built and...  ...infrastructure, including distributed training, inference...  ...test, deployment, and runtime environments. Outstanding... 
    Senior
    Training
    Temporary work

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • $174.72k - $295.68k

     ...Senior AI Data Infrastructure/Pipeline Engineer Santa Clara, CA XPENG is a leading smart...  ...dataset production → model training / simulation input. In...  ...daily flow of petabyte-scale sensor data. Key Responsibilities...  ...I/O, etc., and build a distributed data processing system... 
    Senior
    Training
    Full time
    Overseas

    XPENG

    Santa Clara, CA
    4 days ago
  • $160k - $253k

    AI Factories, powered by NVIDIA accelerated...  ...software to power AI at scale. To help customers...  ..., we are seeking a Senior Technical Marketing Engineer focused on scale‑out...  ...inference and training performance and power...  ...including cabling, power distribution, and thermal scaling... 
    Senior
    Training

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  •  ...Alto seeks a Staff/Principal ML Systems Engineer to enhance training performance for their innovative humanoid robots. You will optimize distributed training systems and engage closely...  ...paced environments, and possess strong debugging skills. #J-18808-Ljbffr Rhoda AI
    Senior
    Training

    Rhoda AI

    Palo Alto, CA
    4 days ago
  • $200k - $270k

     ...Samsung SDS America AI Team is researching the...  ..., policy training, and deployment on physical...  ...We are looking for a Senior Physical AI Engineer to join the team developing...  ...manufacturing at scale across thousands of factory...  ...GPU acceleration and distributed training systems... 
    Senior
    Training
    Worldwide
    Flexible hours

    Samsung SDS America

    Mountain View, CA
    2 days ago
  • $110k - $190k

     ...Role Overview We are hiring a Senior Software & AI Engineer to build production-grade AI systems...  ...the right solution: data preparation, training, evaluation, deployment, and monitoring...  ...core to how we create value, scale operations, and differentiate in the... 
    Senior
    Training

    Covalent

    Sunnyvale, CA
    14 hours ago
  • $140k - $215k

     ...world's most advanced AI-native platform. Our...  ...Development Engineer role on the Cloud Runtime Protection team that...  ...workloads deployed at scale Design and develop...  ...work effectively in a distributed team #LI-JC1 Benefits...  ..., selection, training, compensation, benefits... 
    Senior
    Training
    Work experience placement
    Work at office
    Local area
    2 days per week
    3 days per week

    CrowdStrike Holdings, Inc.

    Sunnyvale, CA
    1 day ago
  • $176.8k - $265.2k

     ...is building an enterprise-scale Agentic AI platform to enable secure,...  ...Principal Software Development Engineer to serve as the technical...  ...ideal candidate has strong distributed systems expertise, deep familiarity...  ..., promotion, benefits, training, discipline, and... 
    Senior
    Training
    Local area

    F5

    San Jose, CA
    4 days ago
  • $152k - $287.5k

    NVIDIA Gruppe is seeking a highly motivated Software Engineer to contribute to the design and development of large-scale AI systems. The successful candidate will work on scalable infrastructure for ML training and cloud-native platforms, leveraging cutting-edge technologies... 
    Senior
    Training

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • $209k

     ...Machine Learning Platform Engineer Immigration sponsorship is...  ...downtime. • Enable support for distributed model training and hyperparameter...  ...Optimize GPU utilization for large-scale training workloads, ensuring...  ..., and resource-efficient AI workloads across multi-node... 
    Senior
    Training
    Work at office
    Remote work
    1 day per week

    Zoom Video Communications

    San Jose, CA
    3 days ago
  • $318.24k

    Crusoe is looking for a Senior Staff Software Engineer to develop a managed platform for the AI Model Lifecycle team. The position focuses on fine-tuning large-scale AI models and implementing training pipelines, requiring over 8 years of industry experience and hands-on... 
    Senior
    Training

    Crusoe

    Sunnyvale, CA
    2 days ago
  • $244.14k - $413.16k

     ...Senior Staff AI Engineer Santa Clara, CA XPENG is a leading smart technology company at the forefront...  ...Senior Staff AI Engineer to build and scale production-grade AI systems that drive...  ...experience, and relevant education or training. We are an Equal Opportunity Employer.... 
    Senior
    Training
    Full time

    XPENG

    Santa Clara, CA
    18 days ago
  • $123k - $215.25k

     ...Senior AI Engineer II - Agentic AI New York, NY, United States Sunrise...  ...operate responsibly and at scale across the enterprise. Our...  ...and services: REST, gRPC Distributed systems: event-driven...  ...~ Career development and training opportunities For a full... 
    Senior
    Training
    Full time
    Work at office
    Local area
    Remote work
    Visa sponsorship
    Flexible hours
    Shift work

    American Express

    Palo Alto, CA
    4 days ago
  • $223k - $306.5k

     ...Integrity, and Inclusion. We weave AI into the fabric of everything...  ...As a Sr Principal AI Engineer, you will join a dynamic team...  ...behavioral analysis, and adversarial training to protect model instructions...  ...environments, delivering large-scale implementations with... 
    Senior
    Training
    Full time
    Work at office

    Palo Alto Networks

    Santa Clara, CA
    4 days ago
  • $188k - $237.5k

     ...Senior AI Engineer At Sonatus, we're driving the transformation to AI-enabled software-defined...  ...agility of a fast-growing company with the scale and impact of an established partner....  ...development, including modeling, training, tuning, validating, deploying, and maintaining... 
    Senior
    Training
    Work at office
    Local area
    Worldwide
    Flexible hours
    Shift work

    Sonatus

    Sunnyvale, CA
    1 day ago
  • $139k - $229k

     ...NYC Sr. AI Engineer: Assets, Formats & Placements The...  ...schema, APIs, and delivery runtime that enable creative...  ...can launch once and scale everywhere with...  ...outcomes. We're seeking a Senior AI Engineer to shape...  ...exploration to model training, evaluation, and deployment... 
    Senior
    Training
    For contractors
    Work at office
    Flexible hours

    LinkedIn

    Sunnyvale, CA
    2 days ago
  •  ...experiences-from AI and data centers...  ...looking for a Senior Staff AI Infra Engineer who is passionate...  ...accelerate LLM training and inference on...  ...including large-scale training and inference...  ..., network, and runtime layers. •...  ...infrastructure, distributed systems, or performance... 
    Training

    Advanced Micro Devices , Inc.

    Santa Clara, CA
    14 hours ago
  • $195.2k - $275.58k

    The Software and AI (SAI) organization is seeking a highly skilled Software Development Engineer to contribute to the development and optimization...  ..., TensorFlow, PyTorch, ONNX Runtime, and many others. This is a...  ...deep‑learning inference and training throughput on current and... 
    Senior
    Training
    Local area
    Remote work
    Worldwide
    Flexible hours
    Shift work

    Intel Corporation

    Santa Clara, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior AI Runtime Engineer: Distributed Training & Scale. Be the first to apply!