Senior AI Systems Performance Engineer
SambaNova Systems
The era of pervasive AI has arrived. In this era, organizations will use generative AI to unlock hidden value in their data, accelerate processes, reduce costs, drive efficiency and innovation to fundamentally transform their businesses and operations at scale.
SambaNova Suite™ is the first full-stack, generative AI platform, from chip to model, optimized for enterprise and government organizations. Powered by the intelligent SN40L chip, the SambaNova Suite is a fully integrated platform, delivered on-premises or in the cloud, combined with state-of-the-art open-source models that can be easily and securely fine-tuned using customer data for greater accuracy. Once adapted with customer data, customers retain model ownership in perpetuity, so they can turn generative AI into one of their most valuable assets. About the role We are seeking a talented and driven ML performance engineer to optimize and scale state-of-the-art foundation models on SambaNova's reconfigurable dataflow platform. You'll work hands-on with some of the most advanced models in the world - such as DeepSeek R1, GPT OSS, and other frontier architectures - to push the limits of throughput, latency, and efficiency. In this role, you'll bridge the gap between deep learning and systems performance, collaborating across compiler, runtime, and hardware layers to deliver world-record performance for large-scale AI inference.Responsibilities
- Bring up and optimize cutting-edge foundation models (e.g., DeepSeek, Llama, Qwen, and others) on the SambaNova platform through the SambaNova software stack.
- Profile and enhance model performance across compiler, runtime, and hardware layers to achieve SOTA throughput and latency.
- Collaborate with machine learning, compiler, runtime, and hardware teams to deliver co-designed, high-performance AI applications.
- Integrate the latest advances in model architecture, quantization, scheduling, and memory optimization from both academia and industry.
- Develop robust, scalable, and efficient end-to-end inference solutions aligned with customer needs.
- Identify performance bottlenecks and propose dataflow or scheduling optimizations for both single-node and distributed systems.
- Bachelor's or higher degree in computer science, electrical engineering, or a related field (e.g., applied mathematics, physics, or statistics).
- 3+ years of experience in one or more of the following areas:
- Deep learning model development and performance optimization
- Compiler, runtime, or kernel-level optimization
- Software-hardware co-design or systems performance tuning
- Proficiency in Python or C++, with strong foundations in algorithms, data structures, and numerical computing.
- Experience with at least one major ML framework - PyTorch, TensorFlow, or JAX.
- Demonstrated ability to analyze and optimize performance in real-world ML pipelines.
- Hands-on experience with LLM or multimodal model training and inference.
- Background in large-scale distributed training, continuous batching, and high-throughput inference systems.
- Familiarity with quantization, graph optimization, kernel fusion, and model partitioning.
- Experience with frameworks such as DeepSpeed, Megatron, vLLM, or TensorRT.
- Strong GPU programming skills (CUDA, Triton, or OpenCL); experience with cuDNN, cuBLAS, or similar libraries is a plus.
- Knowledge of memory hierarchy optimization, caching, and scheduling for large-scale model execution.
- Publication record or open-source contributions in ML systems or performance optimization is a plus.
EEO Policy SambaNova Systems is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard basis of age (40 and over), color, disability, gender identity, genetic information, marital status, military or veteran status, national origin/ancestry, race, religion, creed, sex (including pregnancy, childbirth, breastfeeding), sexual orientation, and any other applicable status protected by federal, state, or local laws. Benefits Summary for US-Based, Full-Time Employment Positions
SambaNova offers a competitive total rewards package, including the base salary, plus equity and benefits. We cover 95% premium coverage for employee medical insurance, and 77% premium coverage for dependents and offer a Health Savings Account (HSA) with employer contribution. We also offer Dental, Vision, Short/Long term Disability, Basic Life, Voluntary Life, and AD&D insurance plans in addition to Flexible Spending Account (FSA) options like Health Care, Limited Purpose, and Dependent Care. Our library of well-being benefits available to you and your dependents includes a full subscription to Headspace, Gympass+ membership with access to physical gyms, One Medical membership, counseling services with an Employee Assistance Program, and much more.
Vacancy posted 5 days ago
Similar jobs that could be interesting for youBased on the Senior AI Systems Performance Engineer in Palo Alto, CA vacancy
$150k - $230k
...Senior Systems Engineer - AI Infrastructure On Site, Palo Alto, California About the Role We're building infrastructure for fault-tolerant, high-performance distributed GPU training. You'll work at the intersection of GPU systems, high-speed networking, and distributed...SeniorPerformance- ...robotics company in Palo Alto seeks a Staff/Principal ML Systems Engineer to enhance training performance for their innovative humanoid robots. You will... ...modern ML tools, thrive in fast-paced environments, and possess strong debugging skills. #J-18808-Ljbffr Rhoda AISeniorPerformance
- A leader in AI technology in Palo Alto is seeking a Senior AI Systems Performance Engineer to optimize the latest foundation models on their innovative platform. This role involves collaborating with cross-functional teams to push the performance limits of AI systems....SeniorPerformance
$125k - $191.7k
...categorized as hybrid/Remote Role: As a Senior Software Systems Engineer on the Software Validation team... ..., and verifying the safety and performance of autonomous systems. You will be responsible... ...of evaluation methodologies for AI systems and other ADAS features, architecting...SeniorPerformanceLocal areaRemote workWork from homeFlexible hours$262k - $365k
A leading technology firm is seeking a Senior Staff Research Engineer to work on cutting-edge AI projects. Responsibilities include developing evaluation frameworks... ..., optimizing data usage, and analyzing agent performance. Candidates should have at least 8 years of...SeniorPerformanceFull time$171k - $231.5k
...looking for a creative and enthusiastic Senior Design System Engineer to join our Design Technology group. A design... ...expected to architect highly scalable, performant component libraries while leveraging the latest generative AI tools (like GitHub Copilot, Cursor, or...SeniorPerformance$160.36k - $240.54k
...Senior Software Engineer – GenAI Infrastructure & Agent Systems for Engineering Efficiency Mountain View, California (HQ)... ...driver, combining cutting-edge AI with automotive-grade hardware.... ...PR review Detect and resolve performance and reliability issues automatically...SeniorPerformance$160.36k - $240.54k
...Senior/Staff Systems Engineering Technical Program Manager Mountain View, California (HQ) Nuro is a... ...scalable driver, combining cutting-edge AI with automotive-grade hardware. Nuro... ...position is also eligible for an annual performance bonus, equity, and a competitive...SeniorPerformanceOdd job$152k - $228k
...world‑class autonomous driving system that combines AV hardware with our generalized AI‑first self‑driving software. Built... ...work cross‑functionally with engineers, product managers, tooling &... ...deliverables meet technical and performance standards. Cross‑functional work...SeniorPerformance- A leading AI technology company located in Sunnyvale, California, is looking for an experienced engineer to join its SOTA Training Platform team. The ideal candidate will... ...bringing ML models to life on Cerebras CSX systems, performance tuning, and contributing to tool...SeniorPerformance
- Google Inc. seeks a Senior Software Engineer to work on TPU Performance and Hardware Software Co-Design. The role demands 5 years of experience in software... ...optimization. The engineer will manage projects, enhance ML systems, and ensure peak efficiency in a collaborative...SeniorPerformance
- ...leading technology company is seeking a skilled engineer to optimize deep learning frameworks and enhance GPU kernel performance. The ideal candidate excels in collaborative... ...a focus on innovative solutions and advancing AI technologies. #J-18808-Ljbffr Advanced Micro DevicesSeniorPerformance
- NVIDIA in Santa Clara is seeking an experienced engineer to design and optimize AI systems for the CUDA ecosystem. Ideal candidates will have strong C/C++ and Python skills, with a solid background in AI systems development. The position offers competitive salaries, equitably...SeniorPerformance
$120.1k - $225.7k
...: Design and implement high-performance inference frameworks; optimize... ...members to build a robust AI inference technical ecosystem... ...Computer Science, Electronic Engineering, AI, or related fields; significant... ...Intelligent Routing . Systems Proficiency: Expert in...SeniorPerformanceRelocation package$184k - $287.5k
...are increasingly known as “the AI computing company.” We're... ...Designing and developing performance optimized UEFI/BIOS solutions... ...automation for qualifying the whole system software and firmware stack.... ...Degree or higher; in Electrical Engineering or Computer Science or...SeniorPerformance$184k - $287.5k
...NVIDIA Autonomous Driving Systems Engineer Today, NVIDIA is tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU serves... ...environment where everyone is motivated to perform at their highest level. Come join the team...SeniorPerformance$168k - $258.75k
...recently, GPU deep learning ignited modern AI — the next era of computing. NVIDIA is... .... We are now looking for a System Design Engineer in the System Product Team. In this role... ...to pursue the balance of product cost, performance, and schedule under the guidance of system...SeniorPerformance$184k - $356.5k
NVIDIA Gruppe is looking for skilled software engineers to develop AI inference systems that operate with high efficiency. The role involves architecting high-performance inference frameworks and optimizing GPU processes. Ideal candidates should have extensive programming...SeniorPerformance- A leading technology company in Santa Clara is seeking a Senior AI-Native Systems Software Engineer to design an AI-native framework, optimizing performance for critical use cases. This role requires strong modern C++ skills, familiarity with deep learning frameworks, and...SeniorPerformance
$150k - $250k
...Senior Machine Learning Engineer, Recommender Systems Palo Alto, CA Who We Are HP IQ is HP's new AI innovation lab. Combining startup agility with HP's global scale, we're building... ...Analyze user interactions and system performance to guide algorithmic improvements...SeniorPerformanceFull timeTemporary workLocal areaFlexible hours- ...are Moveworks is the Agentic AI Assistant platform that... ...converse with all of their business systems through natural language to... ...automation with Moveworks' Reasoning Engine and natural language... ...to achieve state-of-the-art AI performance in production, in every meaning...SeniorPerformanceWork at officeImmediate startRemote workFlexible hours
$200k - $322k
...are seeking a self‑motivated senior engineer for the Aerial Omniverse... ...of emulated devices, across systems of potentially thousands of... ...we need to see: PhD in high‑performance computing, computer architecture... ...existing vacancy. NVIDIA uses AI tools in its recruiting...SeniorPerformance$224k - $356.5k
NVIDIA Corporation is seeking a System Software Engineer for Vision AI in Santa Clara, CA. In this impactful role, you will develop and optimize high-performance vision systems, creating AI pipelines that process video and 3D data. Your expertise in modern C++, deep learning...SeniorPerformance$152k - $241.5k
...for a creative and experienced Software Systems Engineer to help bring NVIDIA's next generation... ...compare the impact of ODDs on relevant performance metrics, translating data and analysis... ...languages such as Python and the use of AI tooling to enhance requirement and test...SeniorPerformanceOdd job- Job Description As a Senior Systems Research Engineer , you will join a future-forward team to explore and build embodied AI applications at the intersection of state-of-the-art AI... ...on technical role, you will recommend performant architectures, and iteratively develop...SeniorPerformance
$131k - $175k
...Senior Hardware Systems Engineer – AI Rack & Cluster Infrastructure Arista Networks is an industry leader in data-driven, client-to-cloud networking... ...to maintain the highest standards of quality and performance in everything we do. Job Description Who You'll Work...SeniorPerformanceRemote workFlexible hours- Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑scale architecture... ...a versatile and experienced engineer to join our SOTA Training Platform... ...achieving unprecedented levels of performance, efficiency, and scalability for AI...SeniorPerformanceInternship
$135.8k - $237.05k
...Mountain View, CA, USA Senior Backend Engineer, ML Inference Systems Location Mountain View, CA, USA Department AI & Machine Learning Requisition ID JOBREQ-26... ...daily decisions, with a focus on the performance, reliability, and scalability of inference...SeniorPerformanceWork at officeWorldwideRelocation package$160k - $240k
...PhoenixAI is the Agentic AI Database, purpose-built... ...in a single AI-native engine, combining the speed... ...of StarRocks, our high-performance SQL engine purpose-built... ...developing advanced database systems and enjoy solving... ...is remote friendly to senior+ candidates. The...SeniorPerformanceRemote work$193.93k - $291.15k
...Senior Software Engineer, Networking & Real-Time Systems Mountain View, California (HQ) Who We Are Nuro is a self... ...scalable driver, combining cutting-edge AI with automotive-grade hardware.... ...Correction) algorithms that out-perform standard protocols. About the...SeniorPerformanceRemote work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior AI Systems Performance Engineer. Be the first to apply!
Related searches
- ai research engineer Palo Alto, CA
- machine learning ai engineer Palo Alto, CA
- ai engineer remote Palo Alto, CA
- ai prompt engineer Palo Alto, CA
- ai developer Palo Alto, CA
- ai engineer Palo Alto, CA
- ai ml engineer Palo Alto, CA
- senior ai engineer Palo Alto, CA
- healthcare systems engineer Palo Alto, CA
- electronic systems engineer Palo Alto, CA

