Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior AI Systems Performance Engineer

SambaNova Systems

Senior AI Systems Performance Engineer

Palo Alto, California, United States

The era of pervasive AI has arrived. In this era, organizations will use generative AI to unlock hidden value in their data, accelerate processes, reduce costs, drive efficiency and innovation to fundamentally transform their businesses and operations at scale.

SambaNova Suite™ is the first full-stack, generative AI platform, from chip to model, optimized for enterprise and government organizations. Powered by the intelligent SN40L chip, the SambaNova Suite is a fully integrated platform, delivered on-premises or in the cloud, combined with state-of-the-art open-source models that can be easily and securely fine-tuned using customer data for greater accuracy. Once adapted with customer data, customers retain model ownership in perpetuity, so they can turn generative AI into one of their most valuable assets.

About the Role

We are seeking a talented and driven ML performance engineer to optimize and scale state-of-the-art foundation models on SambaNova's reconfigurable dataflow platform. You'll work hands-on with some of the most advanced models in the world — such as DeepSeek R1, GPT OSS, and other frontier architectures — to push the limits of throughput, latency, and efficiency. In this role, you'll bridge the gap between deep learning and systems performance, collaborating across compiler, runtime, and hardware layers to deliver world-record performance for large-scale AI inference.

Responsibilities
  • Bring up and optimize cutting-edge foundation models (e.g., DeepSeek, Llama, Qwen, and others) on the SambaNova platform through the SambaNova software stack.
  • Profile and enhance model performance across compiler, runtime, and hardware layers to achieve SOTA throughput and latency.
  • Collaborate with machine learning, compiler, runtime, and hardware teams to deliver co-designed, high-performance AI applications.
  • Integrate the latest advances in model architecture, quantization, scheduling, and memory optimization from both academia and industry.
  • Develop robust, scalable, and efficient end-to-end inference solutions aligned with customer needs.
  • Identify performance bottlenecks and propose dataflow or scheduling optimizations for both single-node and distributed systems.
Basic Qualifications
  • Bachelor's or higher degree in computer science, electrical engineering, or a related field (e.g., applied mathematics, physics, or statistics).
  • 3+ years of experience in one or more of the following areas:
    • Deep learning model development and performance optimization
    • Compiler, runtime, or kernel-level optimization
    • Software–hardware co-design or systems performance tuning
  • Proficiency in Python or C++, with strong foundations in algorithms, data structures, and numerical computing.
  • Experience with at least one major ML framework — PyTorch, TensorFlow, or JAX.
  • Demonstrated ability to analyze and optimize performance in real-world ML pipelines.
Preferred Qualifications
  • Hands-on experience with LLM or multimodal model training and inference.
  • Background in large-scale distributed training, continuous batching, and high-throughput inference systems.
  • Familiarity with quantization, graph optimization, kernel fusion, and model partitioning.
  • Experience with frameworks such as DeepSpeed, Megatron, vLLM, or TensorRT.
  • Strong GPU programming skills (CUDA, Triton, or OpenCL); experience with cuDNN, cuBLAS, or similar libraries is a plus.
  • Knowledge of memory hierarchy optimization, caching, and scheduling for large-scale model execution.
  • Publication record or open-source contributions in ML systems or performance optimization is a plus.

Submission Guidelines Please note that in order to be considered an applicant for any position at SambaNova Systems, you must submit an application form for each position for which you believe you are qualified.

EEO Policy SambaNova Systems is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard basis of age (40 and over), color, disability, gender identity, genetic information, marital status, military or veteran status, national origin/ancestry, race, religion, creed, sex (including pregnancy, childbirth, breastfeeding), sexual orientation, and any other applicable status protected by federal, state, or local laws.

Benefits Summary for US-Based, Full-Time Employment Positions SambaNova offers a competitive total rewards package, including the base salary, plus equity and benefits. We cover 95% premium coverage for employee medical insurance, and 77% premium coverage for dependents and offer a Health Savings Account (HSA) with employer contribution. We also offer Dental, Vision, Short/Long term Disability, Basic Life, Voluntary Life, and AD&D insurance plans in addition to Flexible Spending Account (FSA) options like Health Care, Limited Purpose, and Dependent Care. Our library of well-being benefits available to you and your dependents includes a full subscription to Headspace, Gympass+ membership with access to physical gyms, One Medical membership, counseling services with an Employee Assistance Program, and much more.

Vacancy posted 2 hours ago
Similar jobs that could be interesting for youBased on the Senior AI Systems Performance Engineer in Palo Alto, CA vacancy
  • $150k - $230k

     ...Senior Systems Engineer - AI Infrastructure On Site, Palo Alto, California About the Role We're building infrastructure for fault-tolerant, high-performance distributed GPU training. You'll work at the intersection of GPU systems, high-speed networking, and distributed... 
    Senior
    Performance

    Clockwork Systems

    Palo Alto, CA
    2 hours ago
  • $125k - $191.7k

     ...categorized as hybrid/Remote Role: As a Senior Software Systems Engineer on the Software Validation team...  ..., and verifying the safety and performance of autonomous systems. You will be responsible...  ...of evaluation methodologies for AI systems and other ADAS features, architecting... 
    Senior
    Performance
    Local area
    Remote work
    Work from home
    Flexible hours

    General Motors

    Mountain View, CA
    2 days ago
  • $160.36k - $240.54k

     ...Senior Software Engineer – GenAI Infrastructure & Agent Systems for Engineering Efficiency Mountain View, California (HQ)...  ...driver, combining cutting-edge AI with automotive-grade hardware....  ...PR review Detect and resolve performance and reliability issues automatically... 
    Senior
    Performance

    Nuro

    Mountain View, CA
    2 days ago
  • $160.36k - $240.54k

     ...Senior/Staff Systems Engineering Technical Program Manager Mountain View, California (HQ) Nuro is a...  ...scalable driver, combining cutting-edge AI with automotive-grade hardware. Nuro...  ...position is also eligible for an annual performance bonus, equity, and a competitive... 
    Senior
    Performance
    Odd job

    Nuro

    Mountain View, CA
    1 day ago
  • $184k - $287.5k

     ...are increasingly known as “the AI computing company.” We're...  ...Designing and developing performance optimized UEFI/BIOS solutions...  ...automation for qualifying the whole system software and firmware stack....  ...Degree or higher; in Electrical Engineering or Computer Science or... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    19 hours ago
  •  ...Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture...  ...a versatile and experienced engineer to join our SOTA Training Platform...  ...achieving unprecedented levels of performance, efficiency, and scalability for AI... 
    Senior
    Performance
    Internship

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    4 days ago
  •  ...You'll design, build, and ship core systems that power our AI platform. You'll work across the stack—...  ...research and product. Improve foundations: performance, reliability, observability, developer velocity. Raise the engineering bar through code review and design... 
    Senior
    Performance
    Internship
    Work at office

    Voltai

    Palo Alto, CA
    1 day ago
  • $184k - $287.5k

     ...into the unlimited potential of AI to define the next era of...  ...where everyone is motivated to perform at their highest level. Come join...  ...: Develop use cases and system requirements for L3 and L4 autonomous...  ...with Data Analytics, Test Engineering, and System Integration & Test... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    19 hours ago
  •  ...are Moveworks is the Agentic AI Assistant platform that...  ...converse with all of their business systems through natural language to...  ...automation with Moveworks' Reasoning Engine and natural language...  ...to achieve state-of-the-art AI performance in production, in every meaning... 
    Senior
    Performance
    Work at office
    Immediate start
    Remote work
    Flexible hours

    ServiceNow

    Mountain View, CA
    19 hours ago
  • $150k - $250k

     ...We Are HP IQ is HP's new AI innovation lab. Combining...  ...a diverse, world-class team-engineers, designers, researchers, and...  ...Learning Engineer - Recommender Systems, you'll play a central role in...  ...user interactions and system performance to guide algorithmic improvements... 
    Senior
    Performance
    Full time
    Temporary work
    Local area
    Flexible hours

    HP IQ

    Palo Alto, CA
    5 days ago
  • $131k - $175k

     ...Senior Hardware Systems Engineer – AI Rack & Cluster Infrastructure Arista Networks is an industry leader in data-driven, client-to-cloud networking...  ...to maintain the highest standards of quality and performance in everything we do. Job Description Who You'll Work... 
    Senior
    Performance
    Remote work
    Flexible hours

    Arista Networks, Inc.

    Santa Clara, CA
    2 days ago
  • $160.36k - $240.54k

     ...Senior Software Engineer, Distributed Compute System Mountain View, California (HQ) Who We Are Nuro is a self...  ...scalable driver, combining cutting-edge AI with automotive-grade hardware....  ...is also eligible for an annual performance bonus, equity, and a competitive... 
    Senior
    Performance

    Nuro

    Mountain View, CA
    1 day ago
  • $152k - $241.5k

     ...for a creative and experienced Software Systems Engineer to help bring NVIDIA's next generation...  ...compare the impact of ODDs on relevant performance metrics, translating data and analysis...  ...languages such as Python and the use of AI tooling to enhance requirement and test... 
    Senior
    Performance
    Odd job

    NVIDIA

    Santa Clara, CA
    9 days ago
  • $135.8k - $237.05k

     ...Mountain View, CA, USA Senior Backend Engineer, ML Inference Systems Location Mountain View, CA, USA Department AI & Machine Learning Requisition ID JOBREQ-26...  ...daily decisions, with a focus on the performance, reliability, and scalability of inference... 
    Senior
    Performance
    Work at office
    Worldwide
    Relocation package

    Unity Technologies

    Mountain View, CA
    3 days ago
  • $160k - $240k

     ...PhoenixAI is the Agentic AI Database, purpose-built...  ...in a single AI-native engine, combining the speed...  ...of StarRocks, our high-performance SQL engine purpose-built...  ...developing advanced database systems and enjoy solving...  ...is remote friendly to senior+ candidates. The... 
    Senior
    Performance
    Remote work

    CelerData, Inc.

    Menlo Park, CA
    4 days ago
  • $193.93k - $291.15k

     ...Senior Software Engineer, Networking & Real-Time Systems Mountain View, California (HQ) Who We Are Nuro is a self...  ...scalable driver, combining cutting-edge AI with automotive-grade hardware....  ...Correction) algorithms that out-perform standard protocols. About the... 
    Senior
    Performance
    Remote work

    Nuro

    Mountain View, CA
    4 days ago
  • $208k - $276k

     ...Senior Software Engineer, Prototyping - Warfighter Systems Mountain View, California, United States Anduril Industries...  ...is powered by Lattice OS, an AI-powered operating system that turns...  ...sensors, optimizing their performance and seamless integration into prototype... 
    Senior
    Performance
    Full time
    Contract work
    Work experience placement
    Immediate start

    anduril

    Mountain View, CA
    19 hours ago
  • $161.5k - $190k

     ...their careers. We're a high-performing, fast-moving team with ethics...  ...rewards. The Corporate Systems team focuses on maintaining...  ...closely with Security, IT, and Engineering partners to manage identity...  ...documentation. You will also use AI-assisted tools to enhance... 
    Senior
    Performance
    Work at office
    Flexible hours
    Shift work
    3 days per week

    Robinhood

    Menlo Park, CA
    3 days ago
  • $230k - $284k

     ...Senior Systems Engineer, Hardware Architecture Waymo is an autonomous driving technology company with the mission to be the world's most...  ...architecture of the next-generation Waymo Driver, balancing performance, reliability, cost, and other key factors Develop and... 
    Senior
    Performance
    Full time
    Remote work

    Waymo

    Mountain View, CA
    1 hour ago
  • $140k - $197k

     ...Senior Systems Engineer (Pre-Sales) Arista Networks is an industry leader in data-driven, client-to-cloud networking for large data center...  ...strive to maintain the highest standards of quality and performance in everything we do. Job Description This role requires... 
    Senior
    Performance
    Work experience placement
    Local area
    Immediate start

    Arista Networks, Inc.

    Santa Clara, CA
    29 minutes ago
  • $196k - $242k

     ...tens of billions in simulation across 15+ U.S. states. Waymo's Systems Engineering team works together to blend software and hardware systems in groundbreaking new ways. We set the high performance standards that ensure our vehicles run smoothly and keep passengers... 
    Senior
    Performance
    Full time
    Remote work

    Waymo

    Mountain View, CA
    2 hours ago
  • $224k - $356.5k

     ...tapping into the unlimited potential of AI to define the next era of computing. An era...  ...company at the forefront of AI and high-performance computing. As a Senior / Principal Deep Learning Engineer — Model Evaluation & AI Systems, you will play a meaningful role in crafting... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    1 day ago
  •  ...Social, we're building the AI-native social operating system that enables this new era...  ...by ex-Meta product and engineering leaders, we've raised over...  ...We're looking for a Senior Site Reliability Engineer...  ...prevent recurrence Improve performance, cost efficiency, and capacity... 
    Senior
    Performance
    Full time
    Shift work

    Nectar

    Palo Alto, CA
    1 day ago
  • $213k - $263k

     ...driving data into robust, generalizable, and performant deep neural networks. These models...  ...environments safely and efficiently. The system architecture team handles the onboard...  ...challenging real-world problems with ML and engineering solutions. Use state of the art... 
    Senior
    Performance
    Full time
    Contract work
    Internship
    Remote work

    Waymo

    Mountain View, CA
    2 hours ago
  • $152k - $241.5k

     ...imaging, personal gaming, and high-performance computing. Our success...  ...informative telemetry and data systems that provide real-time...  ...distributed infrastructure. As an engineer on our team, you will play a...  ...existing vacancy.  NVIDIA uses AI tools in its recruiting... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $168k - $270.25k

     ...Senior Engineer For Factory Infrastructure And Automation NVIDIA is the...  ...platform upon which every new AI-powered application is built....  ...NVIDIA optimizes and serves performant inferencing for every AI model...  ...distributed and compute systems, backend services, microservices... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $184k - $287.5k

     ...Automation Engineer NVIDIA's platform and innovations help developers...  ...technologies for Physical AI that uses gaussian splatting...  ..., Release automation, build systems, test infrastructure, or developer...  ...CI systems and reproducible performance/regression tracking for ML or... 
    Senior
    Performance

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $184k - $287.5k

     ...the unlimited potential of AI to define the next era of computing...  ..., and improve software systems for rack, networking, and...  ...and management. As a Senior Software Engineer - Datacenter Systems, you will...  ...infrastructure or systems in high-performance or distributed environments... 
    Senior
    Performance
    Remote work

    NVIDIA

    Santa Clara, CA
    19 hours ago
  •  ...Senior Software Engineer, Systems/Solutions Test This role has been designed as 'Hybrid' with an expectation...  ...reliability, scalability, and performance across complex network environments....  ...through emerging technologies, including AI-assisted testing workflows.... 
    Senior
    Performance
    Work at office
    2 days per week

    Hewlett Packard Enterprise

    Sunnyvale, CA
    2 days ago
  • $184k - $287.5k

     ...We are looking for a Senior Software Engineer to help build NeMo Platform, NVIDIA’s product for developing...  ...evaluating, deploying, and operating AI systems at scale. This role will focus on...  ..., observability, debuggability, and performance across NeMoStack services, SDKs,... 
    Senior
    Performance
    Remote work

    NVIDIA

    Santa Clara, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior AI Systems Performance Engineer. Be the first to apply!