Chief Architect, AI Inference Supercomputing

Etched

A pioneering AI hardware company is seeking a Head of Supercomputing to define and lead the architecture and software for its cluster-scale AI compute systems. This role involves deep systems expertise and managing a talented engineering team. Responsibilities include setting the technical vision, overseeing development, and ensuring system reliability and efficiency. The ideal candidate will have extensive experience in system software and infrastructure, along with proven leadership abilities in a fast-paced environment.

Vacancy posted 2 days ago
Similar jobs that could be interesting for you
Based on the Chief Architect, AI Inference Supercomputing vacancy in San Jose, CA
  •  ...seeking a technical leader for their Advertising Performance group in San Jose, California. The role involves architecting and developing a state-of-the-art inference platform to enhance advertising performance. Ideal candidates should possess over 10 years of experience... 
    Suggested

    TryApplyNow

    San Jose, CA
    1 day ago
  • Automation Anywhere Inc. is seeking an experienced Business Development leader to drive revenue growth and establish strategic partnerships. The role involves managing the lifecycle of partnerships, collaborating with multiple departments, and conducting financial analysis...
    Suggested
    Remote job

    Automation Anywhere Inc.

    San Jose, CA
    5 days ago
  • d-Matrix inc. in Santa Clara, CA is seeking a skilled individual for FPGA design and verification for AI solutions. The role involves collaborating with teams to meet project specifications and implementing robust hardware and software modules. The ideal candidate has... 
    Suggested

    d-Matrix inc.

    Santa Clara, CA
    4 days ago
  • A leading AI technology company is seeking a Senior Product Architect specializing in storage. In this role, you will architect end-to-end reference architectures for AI deployments, ensuring that the hardware and software are optimized for performance. Candidates must... 
    Suggested

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  •  ...Founder Vice President, AI Inference Software About the Company Confidential AI systems company Industry Information Technology and Services Type Privately Held, VC-backed About the Role The Company is seeking a senior software leader to take on... 
    Suggested

    Confidential

    San Jose, CA
    5 days ago
  •  ...solutions. Building the world's leading AI-powered, cloud-native products that shape...  ...teams. We are looking for a hands-on AI Architect to deliver technical designs and...  ...Scalability: Design a model-agnostic inference layer that allows us to switch between models... 

    IBM

    San Jose, CA
    4 days ago
  • $124k - $195.5k

     ...We are now looking for a Deep Learning Architect, LLM Inference! NVIDIA is at the forefront of the generative AI revolution. The Inference Benchmarking (IB) team specifically focuses on inference server performance optimization for Large Language Models (LLMs). If you... 

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $55 - $60 per hour

     ...transformation, and validation pipelines using Kubeflow Pipelines and Vertex AI Pipelines. Implement scalable and reusable feature...  ...automation. Configure and manage low-latency production inference environments using Vertex AI Endpoints. Ensure scalable, reliable... 

    Cynet Systems

    San Jose, CA
    2 days ago
  •  ...We are looking for a skilled MLOps Architect to join our team and help us build, deploy, and...  ...services like Kubeflow Pipelines and Vertex AI Pipelines. • Feature Engineering:...  ...using Vertex AI Endpoints for real-time inference. Qualifications • Strong experience... 

    Yochana

    San Jose, CA
    5 days ago
  • $150k - $160k

     ...interested in working with the World's leading AI-powered Quality Engineering Company?...  ...for a Digital Engineering - Gen AI Architect to join our growing team in United States...  ...lifecycle management (training, fine-tuning, inference, observability). Guide engineering and... 
    Casual work
    Local area
    Remote work
    Flexible hours

    QualiTest Group

    Santa Clara, CA
    5 days ago
  •  ...Snowflake Presales Architect (Data & AI) 1 Role Overview We are looking for a highly skilled Snowflake Presales Architect with strong...  ...of AI/ML concepts (feature engineering, model lifecycle, inference patterns) Exposure to Snowflake AI capabilities (Snowpark... 

    Jade Global

    San Jose, CA
    1 day ago
  •  ...next-generation computing experiences-from AI and data centers, to PCs, gaming and...  ...We are seeking a Robotics AI Architect to define and scale next-generation Physical...  ...stakeholders ~ Deep understanding of: ~ AI inference runtimes and deployment tradeoffs ~ System... 

    Advanced Micro Devices, Inc.

    San Jose, CA
    3 days ago
  •  ...AI Architecture & System Design AI system architectures: multi-agent orchestration...  ...), text-to-SQL engines, and real-time inference APIs. Define and own technical blueprints...  ...retrieval. Proven track record architecting and shipping multi-agent systems, knowledge... 

    Yochana

    San Jose, CA
    7 days ago
  • $190k - $280k

    MixMode is seeking a Principal Architect in Santa Clara to accelerate AI application performance through innovative hardware and software solutions. The...  ...with product and hardware teams to enhance inference accelerators. The ideal candidate has a BSEE/MSEE with... 

    MixMode

    Santa Clara, CA
    4 days ago
  • $124k - $241.5k

    NVIDIA Corporation is seeking a Deep Learning Architect specializing in LLM Inference for New College Grads in Santa Clara, California. The role involves optimizing performance for Large Language Models, developing profiling tools, and collaborating on benchmarking methodologies... 

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  •  ...next-generation computing experiences—from AI and data centers, to PCs, gaming and...  ...x performance gains in both training and inference pipelines through innovative system design...  ...context, expert and data parallel dimensions Architect memory‑efficient training systems... 
    Remote work

    Advanced Micro Devices

    Santa Clara, CA
    2 days ago
  • $254k - $349.25k

     ...cybersecurity. We protect how people, data, and AI agents connect across email, cloud, and...  ...We are seeking a Principal ML Architect to lead the design and development of next...  ...scale, real-time environments Optimize inference systems for low latency, high throughput... 
    Flexible hours

    Proofpoint

    Sunnyvale, CA
    2 days ago
  •  ...Chief Blockchain Architect (CBA) About the Company Forward-thinking blockchain & gaming ecosystem Industry Computer Software Type Privately...  ..., or secure multi-party computation, and a background in AI-driven systems or developer tool ecosystems are considered... 
    Contract work

    Confidential

    San Jose, CA
    2 days ago
  •  ...memory it needs to keep our world moving forward. Job Description An AI Interconnect Architect defines and engineers high-speed networking and communication systems for AI Inference infrastructure which include servers, racks, and chips with a focus on bandwidth... 
    Temporary work
    Remote work
    Flexible hours
    Shift work

    Sandisk

    Milpitas, CA
    a month ago
  •  ...forward. Job Description In this AI/ML ASIC Architecture position, you will develop...  ...Accelerator product.  As an AI/ML ASIC Architect you will help drive new architecture...  ...bandwidth Architect memory-efficient inference/training systems utilizing techniques like... 
    Temporary work
    Remote work
    Flexible hours
    Shift work
    Night shift

    Sandisk

    Milpitas, CA
    5 days ago
  • $219k - $351k

    Principal engineer, AI Serving Framework Architect (Software) San Jose, California, United States Please Note: To provide the best candidate experience...  ...dynamic scheduling methodologies for maximizing AI inference performance in multi‑rack scale memory‑centric systems... 
    Work at office
    Flexible hours

    Conductor

    San Jose, CA
    1 day ago
  • Roku, Inc. in San Jose is looking for a strong technical leader to architect and develop a state-of-the-art inference platform for advertising systems. The ideal candidate will have over 10 years of experience in distributed systems and at least 5 years in a leadership... 

    Roku, Inc.

    San Jose, CA
    5 days ago
  • $254k - $349.25k

     ...cybersecurity. We protect how people, data, and AI agents connect across email, cloud, and...  ...Overview We are seeking a Principal ML Architect to lead the design and development of...  ...‑scale, real‑time environments Optimize inference systems for low latency, high throughput,... 
    Flexible hours

    Proofpoint

    Sunnyvale, CA
    4 days ago
  •  ...Key Responsibilities Design comprehensive AI solutions integrating the latest AI technology developments. Lead architectural...  ...and cloud architecture optimization for client solutions. Architect innovative AI solutions from ideation to MVP, delivering... 
    Work at office

    E-Solutions

    Santa Clara, CA
    2 days ago
  • $260k - $280k

     ...collegial atmosphere, then Lattice may well be just what you're looking for. Responsibilities & Skills We are looking for an AI Architect to design and lead the development of a large-scale Agentic AI system that automates complex engineering workflows using LLMs,... 
    Worldwide

    Lattice Semiconductor

    San Jose, CA
    3 days ago
  • $172.8k - $304.9k

     ...Job Title SoC Architect for Next Generation AI Products in Datacenter Job Description Qualcomm is looking for an experienced SoC architect to work on the next generation AI products in the datacenter. We are looking for a data center engineer whose expertise... 
    Work experience placement

    Qualcomm

    Santa Clara, CA
    3 days ago
  •  ...AI/ML Compliance Architect About this Featured Opportunity: The Privacy/Compliance team is looking for a Compliance Architect to ensure that all systems adhere to regulatory and industry standards/requirements. The Opportunity for You:... 
    Local area
    Flexible hours

    INSPYR Solutions

    Cupertino, CA
    3 days ago
  • $208k - $327.75k

     ...Parallel Compute Infrastructure, and Agentic AI - the biggest technology breakthroughs of...  ..., and we're seeking a visionary Product Architect with strong expertise in systems...  ...including agentic & RAG-based workflows, inference at scale, large scale training & fine-tuning... 
    Remote work

    NVIDIA

    Santa Clara, CA
    8 days ago
  • $100k

     ...Power Architect, AI Data Center Chiplets Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations... 

    Tenstorrent

    Santa Clara, CA
    3 days ago
  •  ...compassionate world. About the Role The Senior Principal Architect provides strategic and hands-on architectural leadership across...  ...scalable, secure, and future-ready solutions with a strong focus on AI-enabled capabilities and enterprise integrations. Lead end-to... 
    Work at office
    Local area
    Flexible hours

    Mini Med

    San Jose, CA
    5 days ago
