Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Hybrid Principal Architect: AI Inference Modeling

d-Matrix

d-Matrix, based in Santa Clara, CA, is looking for a Principal Architect specializing in Performance Analysis and Modeling to advance AI application performance at the hardware-software intersection. The role demands strong expertise in computer architecture and programming, with responsibilities including analyzing ML workloads, developing performance models, and collaborating with various teams. A minimum of 10 years of experience or 8 years with an MSEE is required, and the position offers a hybrid work environment. #J-18808-Ljbffr d-Matrix

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Hybrid Principal Architect: AI Inference Modeling in Santa Clara, CA vacancy
  • $254k - $349.25k

     ...how people, data, and AI agents connect across...  ...Overview We are seeking a Principal ML Architect to lead the design and...  ...deep expertise in model architecture, training...  ...retrieval‑augmented systems, hybrid models) Balance trade...  ...Optimize inference systems for low latency... 
    Principal
    Flexible hours

    Proofpoint

    Sunnyvale, CA
    2 days ago
  •  ...computing experiences—from AI and data centers, to...  ...career. THE ROLE As a Principal Engineer, you will...  ...specifications that enable massive model training at scale. Your...  ...in both training and inference pipelines through...  ...data parallel dimensions Architect memory‑efficient... 
    Principal
    Remote work

    Advanced Micro Devices

    Santa Clara, CA
    19 hours ago
  • $190k - $280k

     ...Full time Location Type Hybrid Department R&D - CTO & Architecture...  ...potential of generative AI to power the...  ...week Hybrid. The role: Principal Architect- Performance Analysis and Modeling d-Matrix is seeking outstanding...  ...(such as generative inference etc.). Our acceleration... 
    Principal
    Full time
    3 days per week

    MixMode

    Santa Clara, CA
    1 day ago
  • Proofpoint is seeking a Principal ML Architect to lead the design and development of next-generation AI systems focused on cybersecurity. In this role, you will leverage advanced...  ...with significant expertise in deep learning, model architecture, and security. The position... 
    Principal

    Proofpoint

    Sunnyvale, CA
    1 day ago
  • $190k - $280k

    MixMode is seeking a Principal Architect in Santa Clara to accelerate AI application performance through innovative hardware and software solutions. The role...  ...with product and hardware teams to enhance inference accelerators. The ideal candidate has a BSEE/MSEE with... 
    Principal

    MixMode

    Santa Clara, CA
    2 days ago
  • $272k - $431.25k

     ...Architecture group is solving some of AI’s hardest infrastructure...  ...interconnects. This Principal Architect role leads the research...  ...disaggregated prefill/decode, model parallelism). Integrating...  ...or distributed training and inference patterns. ~ Proficiency in... 
    Principal

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • $60 - $65 per hour

     ...Principal AI Architect Pay Range: $60hr - $65hr The Principal AI Architect...  ...multi-agent AI systems and hybrid retrieval architectures....  ...hallucination mitigation, and scalable inference systems. ~ Experience...  ...with machine learning models including XGBoost, NLP,... 
    Principal

    Cynet Systems

    Mountain View, CA
    1 day ago
  •  ...AI Architecture & System Design • AI system architectures...  ...layers, RAG pipelines, hybrid retrieval systems (...  ...engines, and real-time inference APIs. • Define and...  ...: data qualification, model training, evaluation,...  ...• Proven track record architecting and shipping multi-... 
    Principal

    E-Solutions

    San Jose, CA
    2 days ago
  • d-Matrix inc. is seeking a Principal Architect specializing in Performance Analysis and Modeling to join our Santa Clara, CA team. The role involves analyzing ML workloads, developing hardware/software features, and creating analytical models that enhance performance across... 

    d-Matrix inc.

    Santa Clara, CA
    3 days ago
  • Entrada Ventures is seeking a Principal Hardware Design Engineer to...  ...will play a pivotal role in architecting the physical foundation...  ...for our silicon-integrated AI accelerators. The ideal candidate...  ...Orcad. The position requires a hybrid work model with onsite presence 3-5... 
    Principal
    3 days per week

    Entrada Ventures

    Santa Clara, CA
    4 days ago
  • $208k - $327.75k

     ...forefront of accelerated computing, AI, and autonomous machines. From...  ...architectures.We are looking for a Senior AI Architect to help define the next generation of AI model paradigms for autonomous vehicles...  ...systems, scaling laws, and inference optimization techniques.... 
    Worldwide

    NVIDIA Corporation

    Santa Clara, CA
    19 hours ago
  • $192k - $267k

     ...Principal Architect, AI and Semiconductors, Google Cloud Google San Francisco, CA, USA ; Sunnyvale, CA, USA ~ Bachelor's degree in Computer...  ...with structured enterprise architecture practices, hybrid cloud deployments, and on-premise-to-cloud migration deployments... 
    Principal

    Google Inc.

    Sunnyvale, CA
    1 day ago
  • Intel is seeking an AI Algorithm Engineer to design and build generative AI agents and...  ...products utilizing local and cloud-based models, focusing on enhancing intelligent reasoning...  ...C++ and Python. This position supports a hybrid work model, combining on-site and remote work... 
    Local area
    Remote work

    Intel

    Santa Clara, CA
    2 days ago
  • Location: Bay Area (Onsite or Hybrid) About Velaura AI Velaura AI is building the next generation of...  ...are assembling a world‑class team of architects and engineers to tackle some of the...  ...Overview We are looking for a Performance Modeling Architect who will develop the... 

    Velaura

    Santa Clara, CA
    19 hours ago
  • $208k - $327.75k

    NVIDIA Corporation, located in Santa Clara, California, is seeking a Senior AI Architect to define next-generation AI model paradigms for autonomous vehicles. This pivotal role collaborates with elite AI researchers and silicon architects to shape autonomous mobility through... 

    NVIDIA Corporation

    Santa Clara, CA
    19 hours ago
  •  ...builds the world's largest AI chip, 56 times larger...  ...-leading training and inference speeds; over 10 times...  ...works with the leading model labs, global enterprises...  ...inference. We're hiring a Principal Engineer for our...  ...Reliability & Performance. Architect active‑active systems... 
    Principal

    Cerebras Systems, Inc.

    Sunnyvale, CA
    8 hours ago
  • $272k - $431.25k

    NVIDIA is the platform for every new AI-powered application. We seek a Principal Software Engineer - AI Inference to advance open-source LLM serving. This role involves...  ...systems. You will collaborate closely with internal model teams, infrastructure/SRE, and product to... 
    Principal

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  • $272k - $431.25k

     ...NVIDIA is seeking a world‑class Principal DRAM Architect to define, drive, and...  ...implementation of next‑generation AI and graphics memory...  ...(TSVs, interposers, CoWoS, hybrid bonding, FOWLP) to optimize...  ...specific roadmap alignment. Model and quantify system‑level trade... 
    Principal

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • $172.8k - $304.9k

     ...Engineering Group, Engineering Group ASICS EngineeringGeneral Summary:Qualcomm is looking for an experienced SoC architect to work on the next generation AI products in the datacenter.We are looking for a data center engineer whose expertise spans security, RAS and/or... 
    Principal
    Work experience placement
    Work from home

    Nutanix

    Santa Clara, CA
    1 day ago
  •  ...technology innovation company is looking for an experienced AI Security Architect to enhance security in their AI systems. This role involves...  ...architecture and secure computing. The position allows for hybrid work based in Santa Clara, CA, reflecting the company's commitment... 
    Principal

    d-Matrix

    Santa Clara, CA
    3 days ago
  • Cerebras Systems, Inc. is seeking a Principal Engineer to lead their Inference Cloud Platform team. This pivotal role involves identifying key platform issues...  ...This is a unique opportunity to work on groundbreaking AI technology and be part of a company that values... 
    Principal

    Cerebras Systems, Inc.

    Sunnyvale, CA
    8 hours ago
  • $272k - $431.25k

    NVIDIA Gruppe is looking for a Principal Software Engineer to advance open-source AI inference. This hands-on role emphasizes running high-performance inference on NVIDIA platforms and involves collaboration across various teams. Key responsibilities include optimizing... 
    Principal

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  • $216k - $345k

     ...NVIDIA Corporation is seeking a Principal Solutions Architect in Santa Clara to drive the technical strategy for semiconductor testing. This role involves leading the architectural vision and modernizing test infrastructure. Candidates should have over 15 years of relevant... 
    Principal

    Jobleads-US

    Santa Clara, CA
    2 days ago
  • JPMorgan Chase in Palo Alto seeks a Senior Principal Software Engineer to lead the architectural development of AI solutions. The role requires expertise in LLMs and GNNs, applying extensive experience in deploying model serving platforms to drive business impact. You will... 
    Principal

    TwinThread

    Palo Alto, CA
    4 days ago
  • $400k

    About Inflection AI Inflection AI is a Public Benefit...  ...AI’s foundation model, proving that AI can be...  ...transformer-based and hybrid architectures, including...  ...both training-time and inference-time performance. Lead...  ...leading, or serving as a principal contributor to, large-scale... 
    Principal
    Full time

    Inflection AI

    Palo Alto, CA
    2 days ago
  •  ...NVIDIA Gruppe seeks a Principal Architect to drive the architectural vision for AI communication systems. This role involves setting the technical direction, conducting original research on networking solutions, and mentoring engineers. The ideal candidate will have... 
    Principal

    Jobleads-US

    Santa Clara, CA
    2 days ago
  • d-Matrix is seeking a Principal Compute Design Engineer to be responsible for the micro-architecture and design of AI sub-system modules. You will collaborate with System Architects to develop efficient solutions, ensuring high performance and efficiency in RTL design.... 
    Principal
    3 days per week

    MixMode

    Santa Clara, CA
    4 days ago
  •  ...Young Oman is looking for a Senior Manager in AI Architecture to lead Knowledge based AI...  ...extensive experience with AI Foundation Models and governance tooling, coupled with the ability...  ...to influence stakeholders. EY offers a hybrid work model, comprehensive benefits, and a... 

    Ernst & Young Oman

    Palo Alto, CA
    2 days ago
  • Advanced Micro Devices is looking for a Principal Engineer in Santa Clara, CA to lead AI infrastructure development, define GPU architecture specifications, and drive performance gains in ML systems. The role involves leading innovative techniques, collaborating with stakeholders... 
    Principal

    Advanced Micro Devices

    Santa Clara, CA
    19 hours ago
  • NVIDIA Gruppe is seeking a Principal Engineer to lead the development of production-grade AI applications. You will be responsible for improving performance, reliability, and observability across various AI products while collaborating closely with cross-functional teams... 
    Principal

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Hybrid Principal Architect: AI Inference Modeling. Be the first to apply!