AI Kernel Engineer

Quadric

Quadric has created an innovative general purpose neural processing unit (GPNPU) architecture. Quadric's co-optimized software and hardware are targeted to run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging from battery-operated smart-sensor systems to high-performance automotive or autonomous vehicle systems. Unlike other NPUs or neural network accelerators in the industry today, which can only accelerate a portion of a machine learning graph, the Quadric GPNPU executes both NN graph code and conventional C++ DSP and control code.

Role

The AI Kernel Engineer at Quadric plays a key role in enabling a large number of AI kernels/operators to run efficiently on the Quadric platform. The AI Kernel Engineer will (1) develop a highly efficient Quadric kernel library for a variety of AI/LLM models and (2) analyze performance and optimize kernels for different hardware configurations. This senior technical role demands deep knowledge of hardware architecture, compiler toolchains, and optimization techniques.

Responsibilities

  • Develop AI/LLM kernels/operators on the Quadric platform for efficient inference
  • Optimize kernel performance for different hardware configurations and workloads
  • Profile and analyze kernel performance in terms of compute, data, and parallelism; identify micro‑architecture and software bottlenecks and provide optimization solutions
  • Optimize kernel C/C++ code to maximize hardware utilization
  • Make improvements to the Quadric toolchain, compiler, and runtime
  • Provide technical support and documentation to customers and the developer community

Requirements

  • Bachelor’s or Master’s in Computer Science and/or Electrical Engineering
  • 5+ years of experience in AI kernel development and optimization
  • Experience with model and kernel inference performance profiling
  • Experience with at least one of the following compute development platforms: CUDA, DSP, NEON, Triton‑lang
  • Proficiency in C/C++ and Python; experience with assembly language a plus
  • Strong problem-solving, debugging, and communication skills

Benefits

  • Life Insurance (Basic, Voluntary & AD&D)
  • Paid Time Off (Vacation, Sick & Public Holidays)

Vacancy posted 19 hours ago
Similar jobs that could be interesting for you
Based on the AI Kernel Engineer in San Francisco, CA vacancy
  • Quadric in San Francisco is looking for an experienced AI Kernel Engineer to develop and optimize AI kernels for their innovative neural processing platform. This role involves enhancing performance for various hardware configurations and providing technical support to... 
    Suggested

    Quadric

    San Francisco, CA
    19 hours ago
  • A leading AI technology firm located in San Francisco is seeking a Research Engineer specializing in AI Performance & Kernel Optimization. The role involves enhancing the performance of large-scale AI systems, optimizing kernels, and collaborating with various teams. Ideal... 
    Suggested

    Zyphra

    San Francisco, CA
    1 day ago
  • Asari AI in San Francisco is seeking individuals to optimize high-performance, mission-critical computing systems. You'll work with AI agents to improve performance and design complex systems. The ideal candidate has strong CUDA C experience and fluency in Python and C... 
    Suggested
    Flexible hours

    Asari AI

    San Francisco, CA
    4 days ago
  • $175k - $225k

     ...Our team is led by veteran operators and engineers, alumni of Sonos, Paypal, Tesla, Apple, and...  .... The Role We're looking for an AI Inference Engineer who lives at the...  ...the country. If you are obsessed with CUDA kernels, TensorRT optimizations, and the challenge... 
    Suggested
    Local area
    Remote work

    Sauron

    San Francisco, CA
    1 day ago
  •  ...you "get stuff done" end-to-end. You use AI to work smarter and solve problems faster...  ...tooling across the spectrum: from prompt engineering and in-context learning to fine-tuned models...  ..., LlamaIndex, AutoGen, CrewAI, Semantic Kernel, and emerging OSS stacks. Apply the... 
    Suggested
    Worldwide

    Airwallex

    San Francisco, CA
    2 days ago
  •  ...follow us on social media. Who You Are The Agentic AI Software Engineer - Cybersecurity Systems designs, develops, and deploys...  ...frameworks (e.g., LangChain, LlamaIndex, AutoGen, Semantic Kernel, or similar). Experience implementing secure software development... 
    Local area
    Work from home

    Bishop Fox

    San Francisco, CA
    3 days ago
  •  ...Description The Senior Software Engineer will be a technical leader for the design...  ...excited about finding new ways to work with AI, this is the role for you! Key Responsibilities...  ...frameworks (LangChain, Semantic Kernel, or similar). ~ Solid understanding of... 
    Remote work
    Worldwide
    3 days per week

    OutSystems

    San Francisco, CA
    3 days ago
  •  ...sized workplaces globally. Our Software Engineers are end to end owners who have the opportunity...  ...engineering team building internal AI solutions that support teams across The Trade...  ...like LangChain, LlamaIndex, and Semantic Kernel Develop Retrieval Augmented Generation... 
    Full time
    Temporary work
    Local area

    The Trade Desk

    San Francisco, CA
    4 days ago
  • $124.9k - $228.9k

     ...to meet you. What we do You'll join a software engineering team building internal AI solutions that support teams across The Trade Desk. We leverage...  ...frameworks like LangChain, LlamaIndex, and Semantic Kernel. Develop Retrieval-Augmented Generation (RAG)... 
    Full time
    Temporary work
    Local area
    Worldwide

    The Trade Desk

    San Francisco, CA
    1 day ago
  • About the job FriendliAI is looking for a GPU Kernel Engineer to design, build, and optimize the low-level compute kernels that power our large-scale, GPU-accelerated AI inference platform. You will be delivering world-class inference speed across NVIDIA and AMD GPUs. With... 
    Flexible hours

    FriendliAI

    San Francisco, CA
    19 hours ago
  •  ...level solutions designed for the unique demands of advanced AI workloads. The team is responsible for building the next...  ...AI. About the Role We are looking for a systems‑minded engineer to help advance our kernel development, performance engineering, and hardware‑... 

    OpenAI

    San Francisco, CA
    3 days ago
  • $175.82k - $263.82k

     ...Applied AI Engineer - Bay Area Redis Labs San Francisco, CA, US Job Type: Full-Time Function: Engineering Software Industry...  ...to projects like Langchain, LlamaIndex, Semantic Kernel, Redis-related projects, or MLOps systems for feature orchestration... 
    Full time
    Local area
    Worldwide

    Softbank Investment Advisers

    San Francisco, CA
    4 days ago
  •  ...processes, applications, and experiences. Its AI-powered platform enables teams to...  ...conferences Mentor and collaborate with LLM engineers on implementation and deployment...  ...Proficiency in CUDA programming and custom kernel development for LLM operations Background... 
    Internship
    Work at office
    Remote work
    Flexible hours

    Workato

    San Francisco, CA
    19 hours ago
  • $220k

    Perplexity is looking for an engineer to join their team in San Francisco. You will work on building and operating the inference engine, supporting new models, migrating GPU kernels, and developing a Rust-based serving runtime. The ideal candidate has 3+ years of experience... 

    Perplexity

    San Francisco, CA
    4 days ago
  • $150k - $350k

     ...About Collate Collate is an AI document generation platform for life sciences. We automate paperwork with AI, helping our customers...  ...at Y Combinator and founder of Lever. Our AI researchers, engineers, and designers have worked at Google, Nvidia, Meta, Netflix, Amazon... 

    Collate

    San Francisco, CA
    3 days ago
  • $150k - $250k

     ...Max AI – Stripe for Healthcare Max AI is the World’s first human-free, fully-autonomous medical billing AI agent. Many startups...  ...research at MIT and Caltech for over 10 years. And our Head of Engineering was one of the earliest engineers at Figma. AI Engineer... 

    Maxcare

    San Francisco, CA
    3 days ago
  •  ...At Falconer, we’re transforming how engineers create, access, and share knowledge. We’re looking for a Founding AI Engineer to help us build an AI-powered knowledge platform that companies love. As a founding engineer, you won’t just help shape our product development... 
    Work experience placement
    Work at office
    Flexible hours

    Falconer

    San Francisco, CA
    3 days ago
  •  ...Yutori is reimagining how people interact with the web by building AI agents that can reliably do everyday digital tasks. We are...  ...multimodal LLMs acting in web environments Work closely with product engineers to translate cutting‑edge AI capabilities into elegant and... 
    Work at office
    Relocation
    Visa sponsorship

    Yutori

    San Francisco, CA
    3 days ago
  • $124.9k - $228.9k

     ...to meet you. What we do You'll join a business engineering team building internal AI solutions that support teams across The Trade Desk. We leverage...  ...frameworks like LangChain, LlamaIndex, and Semantic Kernel. Contributing to Retrieval-Augmented Generation (RAG... 
    Full time
    Temporary work
    Local area
    Worldwide

    The Trade Desk

    San Francisco, CA
    5 days ago
  • $190.9k - $232.8k

    About This Role As a staff software engineer for GenAI Performance and Kernel, you will own the design, implementation, optimization, and correctness of the...  ...32,800 USD About Databricks Databricks is the data and AI company. More than 10,000 organizations worldwide —... 
    Local area
    Worldwide

    Cacheflow

    San Francisco, CA
    19 hours ago
  • $100k - $120k

    Coda Robotics is looking for an experienced engineer to join their founding team, focusing on low-level compute kernels to enhance robotic foundation models. The ideal candidate will have substantial experience in systems programming (C/C++, assembly), expertise in GPU... 

    Coda Robotics

    San Francisco, CA
    2 days ago
  • $230k - $385k

     ...About the Team The Codex Core Agent team builds the kernel of Codex. We own making the agent better, accelerating research, and...  ...over time. About the Role We're looking for applied AI engineers to help bring Codex agents from impressive demos to dependable... 

    OpenAI

    San Francisco, CA
    2 days ago
  •  ...Francisco, CA (Onsite | Remote) About Virtue AI Virtue AI sets the standard for...  .... What You'll Do As an AI infra Engineer, you will own the reliability, scaling,...  ...decoding or batching strategies) and inference kernels Startup experience: you move fast,... 
    Remote work

    Virtue AI

    San Francisco, CA
    19 hours ago
  •  ...Inference Engine Engineer We build and run the inference engine behind every Perplexity query and deploy dozens of model architectures...  ...to support in API Gateway. Port our in-house CUDA kernels to NVIDIA's CuTe DSL so they run on GB200 today and are portable... 

    Perplexity AI

    San Francisco, CA
    2 days ago
  •  ...AI Systems Engineer - Codex Core Agents About The Team The Codex Core Agents team builds the agent harness that turns model capability...  ...post-training feedback loops. Background in compilers, kernels, runtimes, inference optimization, GPU systems, benchmarking... 

    OpenAI

    San Francisco, CA
    4 days ago
  • $150k - $250k

     ...Founding Software Engineer (Voice AI) $150,000 - $250,000 + Strong Equity + Full Benefits On-site, San Francisco Looking for a role where you’ll have real ownership, direct impact, and the ability to shape both product and technical direction from day one? This... 
    Immediate start

    Rise Technical

    San Francisco, CA
    19 hours ago
  • $200k - $350k

     ...AI Engineer - Forward Deployed SF / Onsite - $200k - $350k I'm hiring on behalf of one of the most exciting names in frontier AI right now. A very well-funded research lab building open foundation models, with a founding team pulled from the biggest names in the... 
    Relocation package
    Flexible hours

    scalr

    San Francisco, CA
    21 hours ago
  • AI Systems Engineer - Codex Core Agents Location San Francisco Employment Type Full time Department Applied AI Compensation 230K-385K Offers...  ...or post‑training feedback loops. Background in compilers, kernels, runtimes, inference optimization, GPU systems, benchmarking... 
    Full time
    Work at office
    Local area
    Relocation package
    Flexible hours

    Slope

    San Francisco, CA
    3 days ago
  • About the company Taste Labs is building the data and infrastructure layer for taste. Our goal is to end AI slop. To make AI feel right, not just be correct. We raised $18.5M in seed co-led by Amplify and CRV, and most frontier labs are already customers. AI has... 

    Taste Labs

    San Francisco, CA
    19 hours ago
  • $99.6k - $234.6k

     ...Principal Software Engineer Join Oracle's Health Data Intelligence (HDI) team as a Principal...  ...systems, automation frameworks, and AI-powered operational tooling that enable mission...  ...as LangChain, AutoGen, CrewAI, Semantic Kernel, OpenAI, or equivalent AI platforms... 
    Temporary work
    Flexible hours

    Oracle

    San Francisco, CA
    2 days ago
