Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Founder Vice President, AI Inference Software

Confidential

Founder Vice President, AI Inference Software

About the Company

Confidential AI systems company

Industry
Information Technology and Services

Type
Privately Held, VC-backed

About the Role

The Company is seeking a senior software leader to take on a founder-level role in shaping the future of high-performance AI inference. The successful candidate will have the opportunity to influence architecture, team design, and product direction, with a focus on the transition from software acceleration to programmable hardware and future silicon. Key responsibilities include leading the architecture and delivery of a high-performance AI inference software stack, building and mentoring an elite engineering team, and partnering with founders to define the product roadmap and infrastructure strategy. The role also involves improving the efficiency of modern accelerator-based deployments, working with strategic customers, and establishing engineering standards in a fast-moving startup environment. Applicants for this role at the company should have a deep background in building or optimizing production-scale AI inference or distributed systems software, particularly in accelerator-rich environments. Strong technical skills in areas such as GPU programming, kernel optimization, and runtime design are essential. The ideal candidate will have a proven track record of delivering technically challenging platforms and experience in leading senior engineers. Familiarity with modern model serving patterns and performance measurement for large-scale AI workloads is also required. The role is suited to individuals who are comfortable in early-stage companies and can thrive in an environment of high ambiguity, where they will play a key role in defining the technical and operational playbook.

Travel Percent
Less than 10%

Functions

  • Engineering
Confidential
Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Founder Vice President, AI Inference Software in San Jose, CA vacancy
  • $230k - $250k

    Cerebras Systems is seeking a Sr. Member of Technical Staff in Sunnyvale, CA. This role involves designing resilient software features for cloud-based AI inference, leveraging AWS tools and services. Candidates should have a Master’s degree in Computer Science and experience... 
    Software

    Cerebras Systems

    Sunnyvale, CA
    5 days ago
  • $110k - $300k

     ...TetraMem, we are redefining the future of AI with our groundbreaking innovations in In...  .... Work closely with hardware and software teams to integrate ML models into production...  ...consumption for embedded AI applications. Improve inference efficiency and model compression... 
    Software

    TETRAMEM INC

    San Jose, CA
    2 days ago
  •  ...Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our...  ...to deliver industry-leading training and inference speeds and empowers machine learning users...  ...Qualifications ~8+ years of experience in software engineering, with substantial individual... 
    Software

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    5 days ago
  • $197.3k - $225.1k

     ...Lead AI Engineer (FM Hosting, LLM Inference) Overview At Capital One, we are creating responsible and reliable AI systems, changing banking...  ...One. Design, develop, test, deploy, and support AI software components including foundation model training, large language... 
    Software
    Full time
    Part time
    Local area

    Capital One Financial Corp

    San Jose, CA
    5 days ago
  • $229.9k - $262.4k

    ## Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform)Applylocations: San Jose, CA: San Francisco, CA: McLean, VA: Cambridge...  ...One.* Design, develop, test, deploy, and support AI software components including foundation model training, large language... 
    Software
    Full time
    Part time
    Local area

    Capital One

    San Jose, CA
    2 days ago
  •  ...Tech Lead, Data & Inference Engineer Sunnyvale, California, United States About the Job...  ...with a specialized vertical in Applied AI, Machine Learning, and Data Science. We stand...  .... We collaborate directly with Founders, CTOs, and Heads of AI in those themes who... 
    Full time

    Catalyst Labs, LLC

    Sunnyvale, CA
    4 days ago
  • $136.8k - $259.2k

     ...Software Engineer Graduate (Inference Infrastructure) - 2026 Start (PHD) Location: San Jose Team: Technology Employment Type: Regular The Inference...  ...—enabling both internal and external developers to bring AI workloads from research to production at scale. We are... 
    Software
    Temporary work

    Pangleglobal

    San Jose, CA
    2 days ago
  • $184k - $287.5k

     ...Position Overview We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You’ll architect and implement high-performance inference stacks, optimize GPU kernels and... 
    Software

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  • $208.8k

     ...A leading tech company in San Jose is looking for a Tech Lead Software Engineer specializing in AI Inference Infrastructure. This role entails designing container-based management systems and collaborating across teams to develop state-of-the-art inference solutions.... 
    Software

    ByteDance

    San Jose, CA
    2 days ago
  • $152k - $241.5k

    We optimize and benchmark GenAI inference on NVIDIA's latest accelerators, defining the industry...  ..., agentic workflows, and other emerging AI use cases. Collaborate with framework and...  ...experience. 5+ years of relevant software development experience. Strong Python or... 
    Software

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • $195k - $298k

     ...eligible for relocation assistance. About the Team The ML Inference Platform is part of the AI Compute Platforms organization within Infrastructure...  ...’ll be doing Design and implement core platform backend software components. Collaborate with ML engineers and researchers... 
    Software
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    2 days ago
  •  ...A leading AI technology company in Sunnyvale, California, is seeking a skilled software engineer to optimize its AI cloud platform for model training and inference. In this role, you'll enhance deployment efficiency and ensure system reliability and scalability. The ideal... 
    Software

    Cerebras

    Sunnyvale, CA
    2 days ago
  •  ...Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our...  ...to deliver industry-leading training and inference speeds and empowers machine learning users...  ...inference services. About The Role As a software engineer on our AI cloud platform, you will... 
    Software

    Cerebras

    Sunnyvale, CA
    2 days ago
  • $156k - $316.8k

     ...Responsibilitie About the Team The Inference Infrastructure team is the creator and open...  ...internal and external developers to bring AI workloads from research to production at...  ...have recently completed a PhD degree in Software Development, Computer Science, Computer Engineering... 
    Software
    Temporary work
    Local area

    ByteDance

    San Jose, CA
    4 days ago
  • $184k - $356.5k

    NVIDIA Gruppe is looking for skilled software engineers to develop AI inference systems that operate with high efficiency. The role involves architecting high-performance inference frameworks and optimizing GPU processes. Ideal candidates should have extensive programming... 
    Software

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  •  ...Introduction At IBM Software, we transform client challenges into solutions. Building the world's leading AI-powered, cloud-native products that shape the future of business...  ...see. Scalability: Design a model‑agnostic inference layer that allows us to switch between models... 
    Software

    IBM Computing

    San Jose, CA
    2 days ago
  •  ...About The Role The Inference ML Engineering team at Cerebras Systems is dedicated to enabling...  ...performance and usability. As a Senior Software Engineer on the Inference ML Engineering...  ...enable running state‑of‑the‑art generative AI models on our custom hardware. You will architect... 
    Software

    Dormont Manufacturing Company

    Sunnyvale, CA
    3 days ago
  • $224k - $356.5k

     ...tapping into the unlimited potential of AI to define the next era of computing. An era...  ...LLM serving performance across various inference frameworks. Hyperscalers, cloud providers...  ...equivalent experience. ~8+ overall years of software engineering experience building... 
    Software
    Local area
    Worldwide

    NVIDIA

    Santa Clara, CA
    7 days ago
  • $184k - $287.5k

     ...help in the enablement of Network Industry Software Vendors (ISVs). These ISVs are developing a network stack for distributed inference which will be used to orchestrate wide area...  ...a lasting impact on the world by applying AI inference aware technology to existing networks... 
    Software
    Work experience placement

    NVIDIA

    Santa Clara, CA
    5 days ago
  • $224k - $356.5k

     ...tapping into the unlimited potential of AI to define the next era of computing. An era...  ...LLM serving performance across various inference frameworks. Hyperscalers, cloud providers...  ...equivalent experience. 8+ overall years of software engineering experience building... 
    Software
    Local area
    Worldwide

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  •  ...Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our...  ...to deliver industry-leading training and inference speeds and empowers machine learning users...  ...content creators, influencers, and popular software communities Feature Cerebras in... 
    Software
    Shift work
    Night shift

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    3 days ago
  • $148k - $235.75k

     ...data center business and pivotal in our inference marketing. You will be focused on working...  ...strategy to showcase our leadership position in AI inference. Want to join a fun,...  ...-on Technical Competence – Background in software development, AI infrastructure, data center... 
    Software

    NVIDIA

    Santa Clara, CA
    7 days ago
  • $152k - $241.5k

     ...We are looking for a Senior System Software Engineer to work on. NVIDIA is hiring software...  ...are using GPUs to power a revolution in AI, enabling breakthroughs in problems from...  ...paced team building a highly-performant AI inference platform to make design and deployment of... 
    Software

    NVIDIA

    Santa Clara, CA
    2 days ago
  •  ...center business and will be pivotal in our inference marketing. You will work closely with...  ...strategy to showcase our leadership position in AI inference. What You’ll Be Doing Help...  ...Hands‑on technical competence—background in software development, AI infrastructure, data‑... 
    Software

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  • $216k - $414k

     ...We are looking for Software Engineering Manager to lead the development efforts for the Triton Inference Server team! Academic and commercial groups around the world are using...  ...developing, and optimizing software that streamlines AI inferencing. Ideal candidates will not only... 
    Software

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $332k

     ...are looking for a Senior leader to orchestrate embedded NVIDIA AI software Go‑To‑Market strategy and co‑sales with our partners. As...  ...and Solution Architects to Define and Implement a leadership Inference go‑to‑market strategy! Strategic Ecosystem Engagement: Identify... 
    Software
    Worldwide

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  •  ...next-generation computing experiences-from AI and data centers, to PCs, gaming and...  ...ROLE: As a senior member of the LLM inference framework team, you will be responsible for...  ...architectures and kernel development Software Engineering ~ Expertise in Python and... 
    Software

    Advanced Micro Devices , Inc.

    Santa Clara, CA
    4 days ago
  • OXMIQ designs GPU and AI silicon for large-scale model inference and training and is developing an infrastructure and AI service orchestration platform...  ...end; maintain onboarding runbooks and standard hardware/software configurations; conduct day‑one IT orientation and... 
    Software
    Work at office

    Oxmiq Labs

    Campbell, CA
    3 days ago
  •  ...class founding team, we build multi-agent AI systems that can automate complex...  ...AIs to interact with critical enterprise software platforms. Ensure AI designs align with...  ...platforms such as AWS, Azure, or GCP. Deploy inference endpoints and serve AI and LLM... 
    Software

    Tessera Labs

    San Jose, CA
    4 days ago
  •  ...confidence in Kai reflects what we've built: an AI-powered cybersecurity platform that...  ...demands. ~ Experienced founders: Our founding team consists of second-time...  ..., accurate inventory of all hardware and software assets across the organization Manage... 
    Software
    Work at office

    Kai Cyber, Inc.

    San Jose, CA
    7 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Founder Vice President, AI Inference Software. Be the first to apply!