AI Kernel Engineer
Quadric
Quadric has created an innovative general purpose neural processing unit (GPNPU) architecture. Quadric's co-optimized software and hardware is targeted to run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging from battery operated smart-sensor systems to high-performance automotive or autonomous vehicle systems. Unlike other NPUs or neural network accelerators in the industry today that can only accelerate a portion of a machine learning graph, the Quadric GPNPU executes both NN graph code and conventional C++ DSP and control code. Role The AI Kernel Engineer in Quadric plays the key role to enable a large number of AI kernels/operators to run efficiently on the Quadric platform. The AI Kernel Engineer at Quadric will [1] develop a highly efficient Quadric kernel library for a variety of AI/LLM models; [2] analyze the performance and optimize the kernel for different hardware configurations; This senior technical role demands deep knowledge of hardware architecture, compiler toolchain and optimization techniques. Responsibilities Develop AI/LLM kernels/operators on Quadric platform for efficient inference Optimize the kernel performance for different hardware configurations and workloads Profile and analyze kernel performance in terms of compute, data and parallelism; identify micro‑architecture and software bottlenecks and provide optimization solutions Optimize kernel C/C++ codes, maximize hardware utilization Make improvement to Quadric toolchain, compiler and runtime Provide technical support and documents to customers and developer community Requirements Bachelor’s or Master’s in Computer Science and/or Electrical Engineering. 5+ years of experience in AI kernel development and optimization Experience with model and kernel inference performance profiling Experience with at least one of the following compute development: CUDA, DSP, NEON, Triton‑lang Proficiency in C/C++ and Python, experience with assembly language a plus Demonstrate good capability in problem solving, debug and communication Life Insurance (Basic, Voluntary & AD&D) Paid Time Off (Vacation, Sick & Public Holidays) #J-18808-Ljbffr Quadric
- Quadric in San Francisco is looking for an experienced AI Kernel Engineer to develop and optimize AI kernels for their innovative neural processing platform. This role involves enhancing performance for various hardware configurations and providing technical support to...Suggested
- A leading AI technology firm located in San Francisco is seeking a Research Engineer specializing in AI Performance & Kernel Optimization. The role involves enhancing the performance of large-scale AI systems, optimizing kernels, and collaborating with various teams. Ideal...Suggested
- Asari AI in San Francisco is seeking individuals to optimize high-performance, mission-critical computing systems. You'll work with AI agents to improve performance and design complex systems. The ideal candidate has strong CUDA C experience and fluency in Python and C...SuggestedFlexible hours
$175k - $225k
...Our team is led by veteran operators and engineers, alumni of Sonos, Paypal, Tesla, Apple, and... .... The Role We're looking for an AI Inference Engineer who lives at the... ...the country. If you are obsessed with CUDA kernels, TensorRT optimizations, and the challenge...SuggestedLocal areaRemote work- ...you "get stuff done" end-to-end. You use AI to work smarter and solve problems faster... ...tooling across the spectrum: from prompt engineering and in-context learning to fine-tuned models... ..., LlamaIndex, AutoGen, CrewAI, Semantic Kernel, and emerging OSS stacks. Apply the...SuggestedWorldwide
- ...follow us on social media. Who You Are The Agentic AI Software Engineer - Cybersecurity Systems designs, develops, and deploys... ...frameworks (e.g., LangChain, LlamaIndex, AutoGen, Semantic Kernel, or similar). Experience implementing secure software development...Local areaWork from home
- ...Description The Senior Software Engineer will be a technical leader for the design... ...excited about finding new ways to work with AI, this is the role for you! Key Responsibilities... ...frameworks (LangChain, Semantic Kernel, or similar). ~ Solid understanding of...Remote workWorldwide3 days per week
- ...sized workplaces globally. Our Software Engineers are end to end owners who have the opportunity... ...engineering team building internal AI solutions that support teams across The Trade... ...like LangChain, LlamaIndex, and Semantic Kernel Develop Retrieval Augmented Generation...Full timeTemporary workLocal area
$124.9k - $228.9k
...to meet you. Whatwe do You'll join a software engineering team building internal AI solutions that support teams across The Trade Desk. We leverage... ...frameworks like LangChain, LlamaIndex, and Semantic Kernel. Develop Retrieval-Augmented Generation (RAG)...Full timeTemporary workLocal areaWorldwide- About the job FriendliAI is looking for a GPU Kernel Engineer to design, build, and optimize the low-level compute kernels that power our large-scale, GPU-accelerated AI inference platform. You will be delivering world-class inference speed across NVIDIA and AMD GPUs. With...Flexible hours
- ...level solutions designed for the unique demands of advanced AI workloads. The team is responsible for building the next... ...AI. About the Role We are looking for a systems‑minded engineer to help advance our kernel development, performance engineering, and hardware‑...
$175.82k - $263.82k
...Applied AI Engineer - Bay Area Redis Labs San Francisco, CA, US Job Type: Full-Time Function: Engineering Software Industry... ...to projects like Langchain, LlamaIndex, Semantic Kernel, Redis-related projects, or MLOps systems for feature orchestration...Full timeLocal areaWorldwide- ...processes, applications, and experiences. Its AI-powered platform enables teams to... ...conferences Mentor and collaborate with LLM engineers on implementation and deployment... ...Proficiency in CUDA programming and custom kernel development for LLM operations Background...InternshipWork at officeRemote workFlexible hours
$220k
Perplexity is looking for an engineer to join their team in San Francisco. You will work on building and operating the inference engine, supporting new models, migrating GPU kernels, and developing a Rust-based serving runtime. The ideal candidate has 3+ years of experience...$150k - $350k
...About Collate Collate is an AI document generation platform for life sciences. We automate paperwork with AI, helping our customers... ...at Y Combinator and founder of Lever. Our AI researchers, engineers, and designers have worked at Google, Nvidia, Meta, Netflix, Amazon...$150k - $250k
...Max AI – Stripe for Healthcare Max AI is the World’s first human-free, fully-autonomous medical billing AI agent. Many startups... ...research at MIT and Caltech for over 10 years. And our Head of Engineering was one of the earliest engineers at Figma. AI Engineer...- ...At Falconer, we’re transforming how engineers create, access, and share knowledge. We’re looking for a Founding AI Engineer to help us build an AI-powered knowledge platform that companies love. As a founding engineer, you won’t just help shape our product development...Work experience placementWork at officeFlexible hours
- ...Yutori is reimagining how people interact with the web by building AI agents that can reliably do everyday digital tasks. We are... ...multimodal LLMs acting in web environments Work closely with product engineers to translate cutting‑edge AI capabilities into elegant and...Work at officeRelocationVisa sponsorship
$124.9k - $228.9k
...to meet you. Whatwe do You'll join a business engineering team building internal AI solutions that support teams across The Trade Desk. We leverage... ...frameworks like LangChain, LlamaIndex, and Semantic Kernel. Contributring to Retrieval-Augmented Generation (RAG...Full timeTemporary workLocal areaWorldwide$190.9k - $232.8k
About This Role As a staff software engineer for GenAI Performance and Kernel, you will own the design, implementation, optimization, and correctness of the... ...32,800 USD About Databricks Databricks is the data and AI company. More than 10,000 organizations worldwide —...Local areaWorldwide$100k - $120k
Coda Robotics is looking for an experienced engineer to join their founding team, focusing on low-level compute kernels to enhance robotic foundation models. The ideal candidate will have substantial experience in systems programming (C/C++, assembly), expertise in GPU...$230k - $385k
...About the Team The Codex Core Agent team builds the kernel of Codex. We own making the agent better, accelerating research, and... ...over time. About the Role We're looking for applied AI engineers to help bring Codex agents from impressive demos to dependable...- ...Francisco, CA (Onsite | Remote) About Virtue AI Virtue AI sets the standard for... .... What You'll Do As an AI infra Engineer, you will own the reliability, scaling,... ...decoding or batching strategies) and inference kernels Startup experience: you move fast,...Remote work
- ...Inference Engine Engineer We build and run the inference engine behind every Perplexity query and deploy dozens of model architectures... ...to support in API Gateway. Port our in-house CUDA kernels to NVIDIA's CuTe DSL so they run on GB200 today and are portable...
- ...AI Systems Engineer - Codex Core Agents About The Team The Codex Core Agents team builds the agent harness that turns model capability... ...post-training feedback loops. Background in compilers, kernels, runtimes, inference optimization, GPU systems, benchmarking...
$150k - $250k
...Founding Software Engineer (Voice AI)$150,000 - $250,000 + Strong Equity + Full Benefits On-site, San Francisco Looking for a role where you’ll have real ownership, direct impact, and the ability to shape both product and technical direction from day one? This...Immediate start$200k - $350k
...AI Engineer - forward Deployed SF / Onsite - $200k - $350k I'm hiring on behalf of one of the most exciting names in frontier AI right now. A very well-funded research lab building open foundation models, with a founding team pulled from the biggest names in the...Relocation packageFlexible hours- AI Systems Engineer - Codex Core Agents Location San Francisco Employment Type Full time Department Applied AI Compensation 230K-385K Offers... ...or post‑training feedback loops. Background in compilers, kernels, runtimes, inference optimization, GPU systems, benchmarking...Full timeWork at officeLocal areaRelocation packageFlexible hours
- About the company Taste Labs is building the data and infrastructure layer for taste. Our goal is to end AI slop. To make AI feel right, not just be correct. We raised $18.5M in seed co-led by Amplify and CRV, and most frontier labs are already customers. AI has...
$99.6k - $234.6k
...Principal Software Engineer Join Oracle's Health Data Intelligence (HDI) team as a Principal... ...systems, automation frameworks, and AI-powered operational tooling that enable mission... ...as LangChain, AutoGen, CrewAI, Semantic Kernel, OpenAI, or equivalent AI platforms...Temporary workFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to AI Kernel Engineer. Be the first to apply!
- ai engineer remote San Francisco, CA
- ai prompt engineer San Francisco, CA
- senior ai engineer San Francisco, CA
- machine learning ai engineer San Francisco, CA
- ai engineer San Francisco, CA
- ai developer San Francisco, CA
- ai ml engineer San Francisco, CA
- ai research engineer San Francisco, CA
- embedded ai engineer
- ai network engineer



