Founder Vice President, AI Inference Software
Confidential
About the Company
Confidential AI systems company
Industry
Information Technology and Services
Type
Privately Held, VC-backed
About the Role
The Company is seeking a senior software leader to take on a founder-level role in shaping the future of high-performance AI inference. The successful candidate will influence architecture, team design, and product direction, with a focus on the transition from software acceleration to programmable hardware and future silicon.
Key responsibilities include leading the architecture and delivery of a high-performance AI inference software stack, building and mentoring an elite engineering team, and partnering with the founders to define the product roadmap and infrastructure strategy. The role also involves improving the efficiency of modern accelerator-based deployments, working with strategic customers, and establishing engineering standards in a fast-moving startup environment.
Applicants should have a deep background in building or optimizing production-scale AI inference or distributed systems software, particularly in accelerator-rich environments. Strong technical skills in areas such as GPU programming, kernel optimization, and runtime design are essential. The ideal candidate will have a proven track record of delivering technically challenging platforms and experience leading senior engineers. Familiarity with modern model serving patterns and performance measurement for large-scale AI workloads is also required.
The role suits individuals who are comfortable in early-stage companies and can thrive amid high ambiguity, where they will play a key role in defining the technical and operational playbook.
Travel Percent
Less than 10%
Functions
- Engineering
$230k - $250k

