AI Inference Performance Architect
NVIDIA Gruppe
NVIDIA Gruppe is seeking a highly qualified candidate to contribute to workload characterization, deep learning software projects, and performance benchmarking in Santa Clara, CA. The position emphasizes collaboration with both internal teams and external AI startups to enhance NVIDIA's performance leadership. Candidates should have a Master's or PhD in Computer Science, solid software development experience, and proficiency in programming frameworks like PyTorch. The role offers competitive compensation, including equity and benefits. #J-18808-Ljbffr NVIDIA Gruppe
$224k - $356.5k
NVIDIA Gruppe in Santa Clara is seeking an AI Storage Platform Architect to connect advanced GPU and CPU solutions with real-world AI deployments... ...collaboration with ecosystem partners, ensuring optimal performance for modern AI workloads. Ideal candidates have over 12 years...Performance- ...generation computing experiences-from AI and data centers, to PCs, gaming... .... Your expertise will drive 2-3x performance gains in both training and inference pipelines through innovative... ...expert and data parallel dimensions Architect memory-efficient training systems...PerformanceRemote work
$224k - $356.5k
...Engineer in Santa Clara, California to optimize deep learning inference for autonomous vehicles and robotics. The ideal candidate will... ...role involves working directly with automotive OEMs and pushing performance benchmarks across various platforms. A salary range of 224,000...Performance$190k - $280k
MixMode is seeking a Principal Architect in Santa Clara to accelerate AI application performance through innovative hardware and software solutions. The role involves... ...with product and hardware teams to enhance inference accelerators. The ideal candidate has a BSEE/MSEE...Performance$208k - $327.75k
...forefront of accelerated computing, AI, and autonomous machines.... ...state-of-the-art AI, high-performance compute, and scalable software... ...are looking for a Senior AI Architect to help define the next generation... ...systems, scaling laws, and inference optimization techniques....PerformanceWorldwide$254k - $349.25k
...protect how people, data, and AI agents connect across email,... ...are seeking a Principal ML Architect to lead the design and development... ...continuously improve model performance and reliability... ...time environments Optimize inference systems for low latency, high...PerformanceFlexible hours$254k - $349.25k
...protect how people, data, and AI agents connect across email,... ...We are seeking a Principal ML Architect to lead the design and... ...to continuously improve model performance and reliability Productionization... ...‑time environments Optimize inference systems for low latency, high...PerformanceFlexible hours- ...Job Summary T he AI Interconnect Architect designs and engineers high-speed networking and communication systems for AI inference infrastructure, including servers, racks, and chips... ...to optimize power, cost, and performance across diverse workloads. Key Responsibilities...Performance
- ...Snowflake Presales Architect (Data & AI)1 Role Overview We are looking for a highly skilled... ...(dbt, Informatica, etc.) ~ SQL, performance tuning, and data modeling Experience... ...engineering, model lifecycle, inference patterns) Exposure to Snowflake AI...Performance
- ...located in Sunnyvale, California, is seeking a Vector Compute Architect Intern to join our advanced architecture team. The intern... ...defining and optimizing vector compute architectures for AI and high-performance computing platforms. The role involves driving architecture...PerformanceInternship
- ...generation computing experiences-from AI and data centers, to PCs,... ...We are seeking a Robotics AI Architect to define and scale next-... ...to achieve production-grade performance targets. THE PERSON:... ...Deep understanding of: AI inference runtimes and deployment tradeoffs...Performance
$206.4k - $379.1k
...produce impressive content. The AI Foundations team constructs a... ...'re looking for a Principal Architect to build and implement the AI... ...model orchestration, inference systems, data pipelines, caching... ...frameworks. Develop high-performance runtime services for inference...PerformanceTemporary workLocal areaWorldwideFlexible hours- ...Job Description: Edge AI Architect - CUDA / C++ / Computer Vision Experience Level: 10+ Years Department: Edge AI & Embedded Systems... ...-powered medical devices that operate at the edge, ensuring performance, reliability, and compliance with healthcare standards....PerformanceContract work
- ...Senior AI Architect Nexxa.ai is building artificial super intelligence for heavy industries — enabling machines, systems and operations... ...science best practices to understand user behavior, system performance, and model impact Build and maintain data-driven context...Performance
- ...recognized globally for innovation, performance and quality. Sandisk has... ...Job Description In this AI/ML ASIC Architecture position... ...product. As an AI/ML ASIC Architect you will help drive new... ...Architect memory-efficient inference/training systems utilizing techniques...PerformanceTemporary workRemote workFlexible hoursShift workNight shift
- ...recognized globally for innovation, performance and quality. Sandisk has two facilities... ...forward. Job Description An AI Interconnect Architect defines and engineers high-speed... ...networking and communication systems for AI Inference infrastructure which include servers,...PerformanceTemporary workRemote workFlexible hoursShift work
- ...Gruppe in Santa Clara is seeking a technical leader for the GPU AI/HPC Infrastructure team. You will design and implement cutting-edge GPU compute clusters, focusing on deep learning and high-performance computing. The ideal candidate will have at least 5+ years of experience...Performance
$184k - $356.5k
NVIDIA Corporation is seeking a Senior CPU Performance Architect to join their CPU performance architecture team in Santa Clara, California. This... ...the development of CPU technology for applications in AI, HPC, and gaming. The ideal candidate will have 7+ years of experience...Performance- Netpreme is seeking an ASIC Architect to define the architecture and microarchitecture for their silicon product with offices in Boston, MA. Responsibilities include performance modeling for GPU memory systems, architecting data planes, and owning SoC architecture specs...Performance
$192k - $278k
Google Inc. is looking for a Machine Learning Hardware Architect to innovate and lead TPU technology. You will collaborate with teams to develop high-performance systems and define requirements for AI workloads. Your expertise will influence technical roadmaps and drive...Performance$184k - $287.5k
NVIDIA Corporation is hiring a Senior Deep Learning Performance Architect in Santa Clara, CA. This role involves developing cutting-edge architectures to enhance deep learning performance and efficiency. Candidates should possess a strong background in performance analysis...Performance- codvo-team is seeking an Edge AI Architect in Santa Clara, California, to drive the development of AI-powered medical devices. The ideal... ...in the Edge AI & Embedded Systems department, optimizing performance and ensuring compliance with healthcare standards. Join us for...Performance
- ...architecture specifications into micro-architecture feature requirements. The role involves performance analysis, modeling, and working with a breakthrough architecture that supports AI applications at scale. Ideal candidates will hold an advanced degree in Electrical or...Performance
$224k - $431.25k
NVIDIA Gruppe is seeking an experienced AI Platform Architect to build and enhance internal AI solutions used globally. The role requires a strong AI-first approach, with a focus on developing systems software and deploying AI solutions. The candidate should have extensive...Performance- NVIDIA Gruppe seeks a Principal Architect to drive the architectural vision for AI communication systems. This role involves setting the technical direction... ...and systems software, particularly in high-performance environments, as well as a degree in Computer Science...Performance
- NVIDIA Corporation is looking for a Technical Marketing Engineer (TME) in Santa Clara, California. In this role, you will architect high-performance demos and develop media pipelines while evangelizing artificial intelligence for live media. The ideal candidate will...Performance
- ...vision and architecture focused on measuring and visualizing AV model performance. This role will involve scaling state-of-the-art tools and partnering with cross-functional teams within the Embodied AI group. The ideal candidate will have strong backend and frontend...Performance
- Advanced Micro Devices is looking for a Principal Engineer in Santa Clara, CA to lead AI infrastructure development, define GPU architecture specifications, and drive performance gains in ML systems. The role involves leading innovative techniques, collaborating with stakeholders...Performance
- NVIDIA Gruppe in Santa Clara is seeking a Deep Learning Communication Architect to optimize DNN models and enhance communication performance during distributed training. This role requires collaboration with hardware/software teams to implement efficient communication...Performance
- ...improve CI reliability for their open-source LLM inference engine. The role requires 3+ years' experience in... ...team that’s passionate about building world-class AI infrastructure, ensuring fast and secure system performance. This is an opportunity to make a significant...Performance
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to AI Inference Performance Architect. Be the first to apply!
- senior performance engineer Santa Clara, CA
- application performance engineer Santa Clara, CA
- performance engineer Santa Clara, CA
- performance food group Santa Clara, CA
- senior performance tester Santa Clara, CA
- system performance engineer Santa Clara, CA
- performance testing Santa Clara, CA
- high performance computing engineer Santa Clara, CA
- performance test engineer Santa Clara, CA
- acting performance Santa Clara, CA


