Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

AI Inference Performance Architect

NVIDIA Gruppe

NVIDIA Gruppe is seeking a highly qualified candidate to contribute to workload characterization, deep learning software projects, and performance benchmarking in Santa Clara, CA. The position emphasizes collaboration with both internal teams and external AI startups to enhance NVIDIA's performance leadership. Candidates should have a Master's or PhD in Computer Science, solid software development experience, and proficiency in programming frameworks like PyTorch. The role offers competitive compensation, including equity and benefits. #J-18808-Ljbffr NVIDIA Gruppe

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the AI Inference Performance Architect in Santa Clara, CA vacancy
  • $224k - $356.5k

    NVIDIA Gruppe in Santa Clara is seeking an AI Storage Platform Architect to connect advanced GPU and CPU solutions with real-world AI deployments...  ...collaboration with ecosystem partners, ensuring optimal performance for modern AI workloads. Ideal candidates have over 12 years... 
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  •  ...generation computing experiences-from AI and data centers, to PCs, gaming...  .... Your expertise will drive 2-3x performance gains in both training and inference pipelines through innovative...  ...expert and data parallel dimensions Architect memory-efficient training systems... 
    Performance
    Remote work

    Advanced Micro Devices , Inc.

    Santa Clara, CA
    3 days ago
  • $224k - $356.5k

     ...Engineer in Santa Clara, California to optimize deep learning inference for autonomous vehicles and robotics. The ideal candidate will...  ...role involves working directly with automotive OEMs and pushing performance benchmarks across various platforms. A salary range of 224,000... 
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    14 hours ago
  • $190k - $280k

    MixMode is seeking a Principal Architect in Santa Clara to accelerate AI application performance through innovative hardware and software solutions. The role involves...  ...with product and hardware teams to enhance inference accelerators. The ideal candidate has a BSEE/MSEE... 
    Performance

    MixMode

    Santa Clara, CA
    1 day ago
  • $208k - $327.75k

     ...forefront of accelerated computing, AI, and autonomous machines....  ...state-of-the-art AI, high-performance compute, and scalable software...  ...are looking for a Senior AI Architect to help define the next generation...  ...systems, scaling laws, and inference optimization techniques.... 
    Performance
    Worldwide

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • $254k - $349.25k

     ...protect how people, data, and AI agents connect across email,...  ...are seeking a Principal ML Architect to lead the design and development...  ...continuously improve model performance and reliability...  ...time environments Optimize inference systems for low latency, high... 
    Performance
    Flexible hours

    Proofpoint

    Sunnyvale, CA
    4 days ago
  • $254k - $349.25k

     ...protect how people, data, and AI agents connect across email,...  ...We are seeking a Principal ML Architect to lead the design and...  ...to continuously improve model performance and reliability Productionization...  ...‑time environments Optimize inference systems for low latency, high... 
    Performance
    Flexible hours

    Proofpoint

    Sunnyvale, CA
    1 day ago
  •  ...Job Summary T he AI Interconnect Architect designs and engineers high-speed networking and communication systems for AI inference infrastructure, including servers, racks, and chips...  ...to optimize power, cost, and performance across diverse workloads. Key Responsibilities... 
    Performance

    Compunnel

    Milpitas, CA
    14 hours ago
  •  ...Snowflake Presales Architect (Data & AI)1 Role Overview We are looking for a highly skilled...  ...(dbt, Informatica, etc.) ~ SQL, performance tuning, and data modeling Experience...  ...engineering, model lifecycle, inference patterns) Exposure to Snowflake AI... 
    Performance

    Jade Global

    San Jose, CA
    3 days ago
  •  ...located in Sunnyvale, California, is seeking a Vector Compute Architect Intern to join our advanced architecture team. The intern...  ...defining and optimizing vector compute architectures for AI and high-performance computing platforms. The role involves driving architecture... 
    Performance
    Internship

    Jobleads-US

    Sunnyvale, CA
    4 days ago
  •  ...generation computing experiences-from AI and data centers, to PCs,...  ...We are seeking a Robotics AI Architect to define and scale next-...  ...to achieve production-grade performance targets. THE PERSON:...  ...Deep understanding of: AI inference runtimes and deployment tradeoffs... 
    Performance

    Advanced Micro Devices , Inc.

    San Jose, CA
    1 day ago
  • $206.4k - $379.1k

     ...produce impressive content. The AI Foundations team constructs a...  ...'re looking for a Principal Architect to build and implement the AI...  ...model orchestration, inference systems, data pipelines, caching...  ...frameworks. Develop high-performance runtime services for inference... 
    Performance
    Temporary work
    Local area
    Worldwide
    Flexible hours

    Adobe

    San Jose, CA
    1 day ago
  •  ...Job Description: Edge AI Architect - CUDA / C++ / Computer Vision Experience Level: 10+ Years Department: Edge AI & Embedded Systems...  ...-powered medical devices that operate at the edge, ensuring performance, reliability, and compliance with healthcare standards.... 
    Performance
    Contract work

    Codvo.ai

    Santa Clara, CA
    4 days ago
  •  ...Senior AI Architect Nexxa.ai is building artificial super intelligence for heavy industries — enabling machines, systems and operations...  ...science best practices to understand user behavior, system performance, and model impact Build and maintain data-driven context... 
    Performance

    Nexxa.ai

    Sunnyvale, CA
    14 hours ago
  •  ...recognized globally for innovation, performance and quality. Sandisk has...  ...Job Description In this AI/ML ASIC Architecture position...  ...product.  As an AI/ML ASIC Architect you will help drive new...  ...Architect memory-efficient inference/training systems utilizing techniques... 
    Performance
    Temporary work
    Remote work
    Flexible hours
    Shift work
    Night shift

    Sandisk

    Milpitas, CA
    22 days ago
  •  ...recognized globally for innovation, performance and quality. Sandisk has two facilities...  ...forward. Job Description An AI Interconnect Architect defines and engineers high-speed...  ...networking and communication systems for AI Inference infrastructure which include servers,... 
    Performance
    Temporary work
    Remote work
    Flexible hours
    Shift work

    Sandisk

    Milpitas, CA
    19 days ago
  •  ...Gruppe in Santa Clara is seeking a technical leader for the GPU AI/HPC Infrastructure team. You will design and implement cutting-edge GPU compute clusters, focusing on deep learning and high-performance computing. The ideal candidate will have at least 5+ years of experience... 
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • $184k - $356.5k

    NVIDIA Corporation is seeking a Senior CPU Performance Architect to join their CPU performance architecture team in Santa Clara, California. This...  ...the development of CPU technology for applications in AI, HPC, and gaming. The ideal candidate will have 7+ years of experience... 
    Performance

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • Netpreme is seeking an ASIC Architect to define the architecture and microarchitecture for their silicon product with offices in Boston, MA. Responsibilities include performance modeling for GPU memory systems, architecting data planes, and owning SoC architecture specs... 
    Performance

    Netpreme

    Santa Clara, CA
    3 days ago
  • $192k - $278k

    Google Inc. is looking for a Machine Learning Hardware Architect to innovate and lead TPU technology. You will collaborate with teams to develop high-performance systems and define requirements for AI workloads. Your expertise will influence technical roadmaps and drive... 
    Performance

    Google Inc.

    Sunnyvale, CA
    4 days ago
  • $184k - $287.5k

    NVIDIA Corporation is hiring a Senior Deep Learning Performance Architect in Santa Clara, CA. This role involves developing cutting-edge architectures to enhance deep learning performance and efficiency. Candidates should possess a strong background in performance analysis... 
    Performance

    NVIDIA Corporation

    Santa Clara, CA
    14 hours ago
  • codvo-team is seeking an Edge AI Architect in Santa Clara, California, to drive the development of AI-powered medical devices. The ideal...  ...in the Edge AI & Embedded Systems department, optimizing performance and ensuring compliance with healthcare standards. Join us for... 
    Performance

    codvo-team

    Santa Clara, CA
    1 day ago
  •  ...architecture specifications into micro-architecture feature requirements. The role involves performance analysis, modeling, and working with a breakthrough architecture that supports AI applications at scale. Ideal candidates will hold an advanced degree in Electrical or... 
    Performance

    Cerebras Systems

    Sunnyvale, CA
    14 hours ago
  • $224k - $431.25k

    NVIDIA Gruppe is seeking an experienced AI Platform Architect to build and enhance internal AI solutions used globally. The role requires a strong AI-first approach, with a focus on developing systems software and deploying AI solutions. The candidate should have extensive... 
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • NVIDIA Gruppe seeks a Principal Architect to drive the architectural vision for AI communication systems. This role involves setting the technical direction...  ...and systems software, particularly in high-performance environments, as well as a degree in Computer Science... 
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • NVIDIA Corporation is looking for a Technical Marketing Engineer (TME) in Santa Clara, California. In this role, you will architect high-performance demos and develop media pipelines while evangelizing artificial intelligence for live media. The ideal candidate will... 
    Performance

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  •  ...vision and architecture focused on measuring and visualizing AV model performance. This role will involve scaling state-of-the-art tools and partnering with cross-functional teams within the Embodied AI group. The ideal candidate will have strong backend and frontend... 
    Performance

    General Motors

    Sunnyvale, CA
    1 day ago
  • Advanced Micro Devices is looking for a Principal Engineer in Santa Clara, CA to lead AI infrastructure development, define GPU architecture specifications, and drive performance gains in ML systems. The role involves leading innovative techniques, collaborating with stakeholders... 
    Performance

    Advanced Micro Devices

    Santa Clara, CA
    4 days ago
  • NVIDIA Gruppe in Santa Clara is seeking a Deep Learning Communication Architect to optimize DNN models and enhance communication performance during distributed training. This role requires collaboration with hardware/software teams to implement efficient communication... 
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  •  ...improve CI reliability for their open-source LLM inference engine. The role requires 3+ years' experience in...  ...team that’s passionate about building world-class AI infrastructure, ensuring fast and secure system performance. This is an opportunity to make a significant... 
    Performance

    RadixArk

    Palo Alto, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Inference Performance Architect. Be the first to apply!