Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Lead AI Inference Performance Engineer (GPU)

Advanced Micro Devices

A leading technology company is looking for a Principal AI Performance Engineer to optimize AI inference performance on GPUs. In this role, you will lead a team driving performance optimization across various configurations, diagnose complex performance issues, and interact with customers. Ideal candidates possess extensive experience in GPU computing, strong analytical skills, and a background in AI systems. The position is based in San Jose, CA, offering an innovative environment where you can make a significant impact. #J-18808-Ljbffr Advanced Micro Devices

Vacancy posted 14 hours ago
Similar jobs that could be interesting for youBased on the Lead AI Inference Performance Engineer (GPU) in San Jose, CA vacancy
  • $152k - $241.5k

     ...invention of the GPU 1999 sparked the growth...  ...ignited modern AI — the next era of...  ...-tier AI Compiler Engineers to drive innovation...  ...is possible in AI performance and help build the...  ...workloads (both inference and training) and...  ...design: Partner with leading experts across our... 
    Performance

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $152k - $241.5k

     ...NVIDIA's invention of the GPU 1999 sparked the...  ...learning ignited modern AI — the next era of...  ...Deep Learning Compiler Engineer. NVIDIA is hiring...  ...backbone of NVIDIA’s inference engine, spanning across...  ...compiler must deliver leading inference performance, fast build time, reduced... 
    Performance

    NVIDIA

    Santa Clara, CA
    2 days ago
  • A leading technology company is looking for a Principal GenAI Inference Optimization Engineer in San Jose, CA. This role will focus on optimizing performance and efficiency of generative AI on AMD GPU platforms. The ideal candidate will have significant expertise in GPU... 
    Performance

    Advanced Micro Devices

    San Jose, CA
    14 hours ago
  • $184k - $287.5k

     ...skilled and motivated software engineers to join us and build AI inference systems that serve large-...  ...architect and implement high-performance inference stacks, optimize GPU kernels and compilers, drive...  ...submissions to the industry-leading MLPerf Inference benchmarking... 
    Performance

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $160k - $253k

    Senior Technical Marketing Engineer - GPU and System Architecture page...  ...centers are transforming into AI factories, and NVIDIA...  ...center platforms integrate high performance compute, networking, and a full...  ...and efficiency for AI inference & training.**What you’ll be... 
    Performance

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  •  ...experiences-from AI and data centers,...  ...Software Quality Engineer to serve as the senior...  ...on AMD Instinct™ GPU platforms. You will...  ..., workload, performance, stress, stability...  ...meet them. ~ Lead system-level testing...  ...LLM training and inference (PyTorch, vLLM, Triton... 
    Performance
    Contract work
    Shift work

    Advanced Micro Devices , Inc.

    San Jose, CA
    4 days ago
  • A leading technology company based in California is seeking a Senior Compiler Engineer to shape the future of compiler technologies. This role...  ...design for next-generation GPU architectures. Ideal candidates...  ...and GPU computing, driving performance and efficiency in high-... 
    Performance

    Intel Corporation

    Santa Clara, CA
    2 days ago
  • $184k - $356.5k

    A leading AI computing company in California is seeking a Senior Deep Learning Software Engineer focused on performance optimization of LLM models. You will analyze and enhance LLM inference performance, working in cross-collaborative teams to implement cutting-edge algorithms... 
    Performance
    Full time

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • $207k - $300k

    Google is seeking an experienced AI/ML Software Engineer to enhance GPU architectures and optimize performance benchmarks. The role involves collaborating with teams to solve ML model challenges and architect transformative AI solutions, contributing to Google's machine... 
    Performance

    Google

    Sunnyvale, CA
    14 hours ago
  • $220.2k - $330.4k

     ...Technologies, Inc. Job Area: Engineering Group, Engineering...  ...edge, focusing on AI, edge computing and connectivity...  ...IoT (IE‑IoT) BU leads the transformation of...  ...for generative AI inference and computer vision workloads...  ...the accessibility and performance of a datacenter... 
    Performance
    Work experience placement
    Work at office

    Qualcomm

    Santa Clara, CA
    3 days ago
  • $150k - $275k

     ...cutting-edge tech company in San Jose is seeking a Supercomputing Engineer to ensure the reliability of its inference servers. This role involves designing and executing test suites, analyzing performance, and collaborating with engineering teams. Ideal candidates will... 
    Performance

    Etched

    San Jose, CA
    2 days ago
  •  ...generation computing experiences-from AI and data centers, to PCs,...  ...for a strategic software engineering lead who is passionate about improving the performance of key applications and benchmarks...  ...optimizing scale-up and scale-out inference. Develop methods and tooling... 
    Performance

    Advanced Micro Devices , Inc.

    Santa Clara, CA
    14 hours ago
  • $200k - $322k

     ...seeking a self‑motivated senior engineer for the Aerial Omniverse...  ...you will design and implement GPU kernels that apply time‑varying...  ...What we need to see:PhD in high‑performance computing, computer architecture...  ...existing vacancy.NVIDIA uses AI tools in its recruiting processes... 
    Performance

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • $184k - $287.5k

    Senior System Software Engineer - GPU Power Management page is loaded## Senior...  ...innovation that’s fueled by leading technology—and amazing people...  ...the unlimited potential of AI to define the next era of...  ...components with focus on power and performance.* Drive end-to-end GPU power... 
    Performance
    Work experience placement
    Local area
    Remote work

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  • $152k - $241.5k

     ...eager to work on cutting-edge AI technology for safety-...  ...TensorRT team as a Senior Software Engineer, and be at the forefront of technology, enabling high-performance AI inference solutions for automotive safety...  ...! What you'll be doing: Lead the design and development of... 
    Performance

    NVIDIA

    Santa Clara, CA
    14 hours ago
  • $140k - $224.25k

     ...NVIDIA Software Development Engineer in Test Are you passionate about...  ...substantial experience with AI technologies for automation of...  ...the testing workflows in GPU domain. Write maintainable,...  ...reviews including feedback on performance, scalability, and correctness... 
    Performance

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $184k - $356.5k

     ...seeking a Senior Deep Learning Software Engineer specializing in Inference to join their growing team in Santa...  ..., CA. The role involves optimizing GPU-accelerated software for advanced AI applications, including developing high-performance deep learning frameworks like... 
    Performance

    NVIDIA Corporation

    Santa Clara, CA
    14 hours ago
  •  ...Jose, California is seeking a skilled AI Systems Engineer to lead the development and support of AI...  ...infrastructure. The role requires managing GPU clusters, deploying advanced AI...  ...technical experience, focusing on high-performance computing or AI systems. The position... 
    Performance

    Cadence

    San Jose, CA
    14 hours ago
  • A leading technology company is seeking a skilled engineer to optimize deep learning frameworks and enhance GPU kernel performance. The ideal candidate excels in collaborative and independent work,...  ...innovative solutions and advancing AI technologies. #J-18808-Ljbffr... 
    Performance

    Advanced Micro Devices

    Santa Clara, CA
    2 days ago
  • $90 - $100 per hour

    Job Title GPU Design Verification Engineer Position Description Protingent Staffing has an exciting contract...  ...for verifying the functionality and performance of our cutting‑edge GPU designs....  ...innovation — from Software and Aerospace to AI, Clean Tech, Medical Devices, and... 
    Performance
    Contract work

    Protingent

    San Jose, CA
    4 days ago
  • $272k - $431.25k

     ...We are seeking software engineers to work on next-generation high-speed interconnect...  ...demanding high-speed IO applications a GPU or high-performance computing server will encounter in its...  ...interpersonal skills ~ Capability to use AI prompt tools Your base salary... 
    Performance

    NVIDIA

    Santa Clara, CA
    14 hours ago
  • $120k - $180k

     ...redefining the future of AI with our groundbreaking innovations...  ..., enabling superior performance and energy efficiency...  ...centers. Our talented team of engineers and industry‑leading executives drives this...  ...architecting ultra‑low‑power edge inference solutions around the SoC... 
    Performance
    For contractors

    TETRAMEM INC

    San Jose, CA
    14 hours ago
  • $184k - $356.5k

    A leading technology company in California is seeking a Senior DL Algorithms Engineer to drive inference performance for Deep Learning workloads. The role involves implementing advanced model...  ...of experience in deep learning and GPU programming. This position offers a competitive... 
    Performance

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  • $120k - $180k

    Application Engineer - Low Power Edge Inference (DIB Focus) About this Role We are seeking an Application Engineer...  ...the SoC Profile and improve system performance (latency, energy per inference,...  ...in deploying cutting‑edge edge AI silicon into real‑world, resource‑constrained... 
    Performance
    For contractors
    Internship

    TetraMem Inc

    San Jose, CA
    2 days ago
  • $184k - $287.5k

    Senior DL Algorithms Engineer - Inference Performance page is loaded## Senior DL Algorithms Engineer - Inference...  ...of the hardware/software stack from GPU architecture to Deep Learning...  ...fast-growing technology company that leads the AI revolution.**What you will be doing:... 
    Performance

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  •  ...Principal Machine Learning Engineer - Inference Serving Frameworks...  ...building rack-level AI inference systems. Our...  ...source software to serve leading‑edge models with...  ...experts to architect high‑performance inference stacks and design...  ...‑level debugging. GPU kernel development... 
    Performance
    Full time

    Acceler8 Talent

    Santa Clara, CA
    2 days ago
  •  ...builds the world's largest AI chip, 56 times larger than...  ...to deliver industry‑leading training and inference speeds and empowers machine...  ...Role We are hiring a Senior Performance Engineer to join our Product team....  ...vLLM, SGLang, TensorRT‑LLM), GPU kernel‑level optimization... 
    Performance
    Contract work
    Shift work

    Cerebras

    Sunnyvale, CA
    1 day ago
  • $224k - $356.5k

     ...unlimited potential of AI to define the next era...  ...computing. An era in which our GPU acts as the brains of...  ...of AI and high-performance computing. As a Senior...  ...Principal Deep Learning Engineer — Model Evaluation & AI...  ...alongside model training, inference, and product divisions... 
    Performance

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $2,000 per month

     ...building the world’s first AI inference system purpose‑built for transformers...  ...delivering over 10x higher performance and dramatically lower cost...  ...investors and staffed by leading engineers, Etched is redefining the...  ...workflows and monitoring GPU‑accelerated workloads.... 
    Performance
    Work at office
    Relocation package

    Etched

    San Jose, CA
    4 days ago
  •  ...builds the world's largest AI chip, 56 times larger...  ...Cerebras to deliver industry-leading training and inference speeds and empowers...  ...over 10 times faster than GPU-based hyperscale cloud...  ...About The Role Engineers on the inference performance team operate at the intersection... 
    Performance

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Lead AI Inference Performance Engineer (GPU). Be the first to apply!