Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

AI Inference Systems Engineer: High-Throughput

$135k

United States Digital Space LLC

United States Digital Space LLC in Palo Alto seeks an Application Software Engineer to develop high-performance AI inference systems. This role emphasizes the design and optimization of large-scale systems used for mission-critical applications. The ideal candidate possesses experience in full stack development, distributed systems, and is proficient in Rust or C++. This is an onsite position; remote work is not an option. Generous benefits include medical coverage, a 401(k) retirement plan, and three weeks of paid vacation annually, as well as competitive salaries starting from $135,000 per year. #J-18808-Ljbffr United States Digital Space LLC

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the AI Inference Systems Engineer: High-Throughput in Palo Alto, CA vacancy
  •  ...Department: Backend Engineer · Work type: On-...  ...About A rchetype AI Archetype AI is developing...  ...an exceptionally high-caliber team from...  ...distributed systems. You’ll work closely...  ...for latency and throughput, and contribute to...  ...-latency AI model inference and data services.... 
    Suggested
    Full time

    Neara

    Palo Alto, CA
    1 day ago
  • $180k - $250k

    A leading AI infrastructure firm is seeking a TPU Systems Engineer to develop high-performance systems using JAX, XLA, and Pallas. This role involves pushing large-model workloads on TPU hardware and optimizing performance across the stack. Candidates should have at least... 
    Suggested

    RadixArk

    Palo Alto, CA
    4 days ago
  • A leading AI software company in California is seeking a Software Engineer to develop and enhance runtime stacks for scalable ML...  ...applications. The role involves working on system software and collaborating with...  ...to support next-generation high-performance compute systems.... 
    Suggested

    SambaNova

    Palo Alto, CA
    2 days ago
  • $184k - $287.5k

    NVIDIA Gruppe is seeking talented AI systems engineers to advance innovative technologies in AI inference systems software. This role involves developing cutting-edge libraries...  ...for NVIDIA's architecture, emphasizing high-impact AI workloads. The ideal candidate will design... 
    Suggested

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • Senior AI Systems Performance Engineer Palo Alto, California, United States The era...  ...architectures — to push the limits of throughput, latency, and efficiency....  ...for large‑scale AI inference. Responsibilities Bring...  ...to deliver co‑designed, high‑performance AI... 
    Suggested
    Full time
    Temporary work
    Local area
    Flexible hours

    SambaNova

    Palo Alto, CA
    15 hours ago
  • $184k - $356.5k

    NVIDIA Gruppe is looking for skilled software engineers to develop AI inference systems that operate with high efficiency. The role involves architecting high-performance inference frameworks and optimizing GPU processes. Ideal candidates should have extensive programming... 

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  •  ...verification with agentic AI workflows. Our...  ...AI to assist engineers in RTL design, simulation...  ...are seeking an ML Systems Engineer to...  ...large language model inference powering our...  ...the limits of LLM throughput and latency. Your...  ...GPU computing, or high‑performance inference... 

    ScOp Venture Capital

    Santa Clara, CA
    1 day ago
  • ScOp Venture Capital is looking for an ML Systems Engineer to optimize LLM inference systems crucial for their AI platform. The role focuses on enhancing performance and efficiency via low-level systems optimization, directly impacting industry leader processes in semiconductor... 

    ScOp Venture Capital

    Santa Clara, CA
    1 day ago
  • Model AI is looking for a Founding Machine Learning Infrastructure Engineer in Palo Alto to help optimize infrastructure for AI systems. In this role, you will focus on enhancing model serving performance and cost efficiency. The ideal candidate will have strong experience... 

    Model AI

    Palo Alto, CA
    2 days ago
  • $128k - $312k

     ...Expect The Tesla AI Hardware team is at...  ...Comprising brilliant engineers and visionaries,...  ...advanced AI inference chips tailored to...  ...custom supercomputer system built to efficiently...  ...developers to deliver high‑performance, power...  ..., focusing on throughput, scalability, and... 
    Hourly pay
    Temporary work
    Flexible hours
    Night shift

    Tesla Motors, Inc.

    Palo Alto, CA
    1 day ago
  • $184k - $287.5k

    Position Overview We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You’ll architect and implement high-performance inference stacks, optimize GPU kernels and compilers... 

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • A leader in AI technology in Palo Alto is seeking a Senior AI Systems Performance Engineer to optimize the latest foundation models on their innovative platform. This role involves collaborating with cross-functional teams to push the performance limits of AI systems.... 

    SambaNova

    Palo Alto, CA
    3 days ago
  •  ...Micro Devices in Santa Clara is seeking a senior software engineer committed to enhancing AI performance on GPUs. You will work on cutting-edge...  ...architectures, and the ability to effectively mentor others in a high-impact role. #J-18808-Ljbffr Advanced Micro Devices

    Advanced Micro Devices

    Santa Clara, CA
    3 days ago
  • A leading AI infrastructure company in California is seeking a Member of Technical Staff — Inference to design and optimize large-scale AI inference systems. The role demands 5+ years in systems engineering and expertise in large-scale inference systems. Successful candidates... 
    Flexible hours

    RadixArk

    Palo Alto, CA
    4 days ago
  • $112k - $184k

    Systems Engineer II, AI Solutions & IT Operations Technology Shared Services Enterprise Technology About the Role We are seeking a versatile and...  ...: Architect, design, and consult with customers to develop high‑impact technical AI solutions. Apply creative freedom in solution... 
    Temporary work
    Immediate start

    Woven

    Palo Alto, CA
    4 days ago
  •  ...is looking for a Senior Manufacturing Test Engineer to design and implement comprehensive manufacturing test solutions for AI systems. Situated in Sunnyvale, CA, this position requires...  ...and collaboration across teams to ensure high-performance hardware. The ideal candidate... 

    Dormont Manufacturing Co

    Sunnyvale, CA
    1 day ago
  • $198k - $326k

     ...team. LinkedIn’s AI Infrastructure...  ...builds and extends high-performance serving...  ...Senior Staff Software Engineer with deep...  ...the intersection of systems, machine learning,...  ..., and large-scale inference. This is a highly...  ...including latency, throughput, GPU utilization,... 
    Full time
    For contractors
    Work at office
    Flexible hours

    LinkedIn

    Mountain View, CA
    8 hours ago
  • d-Matrix inc. is seeking a Staff Runtime Systems Engineer to join our team in Santa Clara, CA. This hybrid role involves working onsite three...  ...days a week and focuses on runtime software engineering for an AI compute platform. You will be responsible for the architecture... 
    3 days per week

    d-Matrix inc.

    Santa Clara, CA
    1 day ago
  • $125k - $191.7k

     ...hybrid/Remote Role: As a Senior Software Systems Engineer on the Software Validation team within...  ...future of evaluation methodologies for AI systems and other ADAS features,...  ...communication skills, capable of switching between high-level and detailed discussions What... 
    Local area
    Remote work
    Work from home
    Flexible hours

    General Motors

    Mountain View, CA
    2 days ago
  • $141.3k - $226k

     ...Description: Broadcom is looking for a Software Systems Engineer (P5) to join VMware Cloud Foundation's (VCF) AI and Advanced Services team. This position is key...  ...class private cloud AI platform. You will have a high impact by playing a critical role designing and implementing... 
    Full time
    Work at office
    Local area

    Broadcom Corporation

    Palo Alto, CA
    2 days ago
  • $171k - $231.5k

     ...creative and enthusiastic Senior Design System Engineer to join our Design Technology group. A design...  .... You will be expected to architect highly scalable, performant component libraries while leveraging the latest generative AI tools (like GitHub Copilot, Cursor, or Claude... 

    Intuit Inc.

    Mountain View, CA
    15 hours ago
  • Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel...  ...industry‑leading training and inference speeds and empowers...  ...key workloads with ultra high‑speed inference. Thanks to...  ...versatile and experienced engineer to join our SOTA Training... 
    Internship

    Cerebras

    Sunnyvale, CA
    4 days ago
  • $112.7k - $169.1k

     ...opportunity Unity's Vector AI team builds the machine learning systems that decide which ads...  ...the world's leading game engine. Recommendation and ranking...  ...experiments using causal inference, A/B testing, and offline...  ...track record of rigorous, high-quality research —... 
    Internship
    Work at office
    Worldwide
    Relocation package
    Shift work

    jobr.pro

    Mountain View, CA
    2 days ago
  • $152k - $287.5k

    NVIDIA Gruppe is seeking a Senior Machine Learning Applications and Compiler Engineer in Santa Clara, California. This role involves developing algorithms for their LPX inference and compiler stack, optimizing the performance of neural network workloads on NVIDIA platforms... 

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  •  ...for a Machine Learning Engineer to help build cutting edge...  ...-end machine learning systems. The ML infra team...  ...distributed training and inference pipeline for large language...  ...customers experience AI. Put another way, this...  ...you are looking for a high-impact, fast-moving role... 

    Moveworks

    Mountain View, CA
    2 days ago
  • A leading technology company is hiring a Machine Learning Systems Engineer in Cupertino, California. You will collaborate with Siri modeling teams to optimize model training and inference on Apple's custom Silicon. The ideal candidate has strong experience in ML models,... 

    Apple

    Cupertino, CA
    3 days ago
  • Senior Manufacturing Test Engineer - AI Inferencing Systems Location: Sunnyvale, CA Experience...  ...company that builds very high-performance, low-power generative AI inference systems. Our mission,...  ...human intervention and maximize throughput. Diagnostics & Utilities: Collaborate... 
    Contract work
    Remote work
    Flexible hours

    Dormont Manufacturing Co

    Sunnyvale, CA
    1 day ago
  • $155k - $207k

     ...and deploy cutting-edge AI technology to help...  ...veteran scientists and engineers. As a Senior/Staff Machine...  ...scale and optimize our ML systems—creating and...  ...tuning, post-training, and inference strategies for large language...  ...constraints. Is highly effective at cross-functional... 
    Permanent employment

    Cacheflow

    Mountain View, CA
    2 days ago
  •  ...optical semiconductor solutions for the AI infrastructure market, focused on...  ...solutions and architectures. We are seeking a Systems/Algorithms Engineer with experience in serial communication...  ...(CDR) systems that enable reliable high-speed data transmission. Key Responsibilities... 

    CspeedIO, Inc.

    Palo Alto, CA
    2 days ago
  •  ...is seeking a Senior Research Engineer focused on training data infrastructure for advanced AI models. The ideal candidate will...  ...strong skills in distributed systems and a deep understanding of Machine...  ...include designing high-throughput data ingestion systems, ensuring... 

    Apple Inc.

    Cupertino, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Inference Systems Engineer: High-Throughput. Be the first to apply!