Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Member of Technical Staff, Kernels

$200k - $350k

Inception LLC

The Role We’re looking for engineers and scientists to design, optimize, and maintain the compute foundations that power large-scale language model training and inference. You will develop high-performance ML kernels, enable efficient low-precision arithmetic, and improve the distributed compute stack that makes training and serving large models possible. Key Responsibilities Design and implement custom ML kernels (CUDA, CuTe, Triton) for core dLLM operations such as attention, matrix multiplication, gating, and normalization, optimized for modern GPU architectures. Design compute primitives to reduce memory bandwidth bottlenecks and improve kernel efficiency. Contribute to infrastructure stability and scalability, ensuring reproducibility, consistency across precision formats, and high utilization of compute resources. Qualifications BS/MS/PhD in Computer Science, Engineering, or a related field (or equivalent experience). Proficiency in CUDA, CuTe, Triton, or other GPU programming frameworks. Understanding of ML frameworks (PyTorch, TensorFlow) from a systems perspective. Background in performance optimization and profiling of ML systems. Experience implementing low-precision formats (FP8, INT8, block floating point) or contributing to related compiler stacks (XLA, TVM). Familiarity with distributed training techniques (data parallel, model parallel, pipeline parallel). Proficiency in Python and at least one systems programming language (C++/Rust/Go). Experience with containerization (Docker), orchestration (Kubernetes), and CI/CD pipelines. Preferred Skills Experience building and maintaining large-scale language models with tens of billions of parameters or more. Experience with distributed systems and cloud computing platforms (AWS/GCP/Azure). Familiarity with distributed frameworks such as PyTorch/XLA, DeepSpeed, Megatron-LM. Prior contributions to open-source deep learning infrastructure such as PyTorch, DeepSpeed, or XLA. Compensation The annual base salary range for this role is $200,000 – $350,000 USD. Final compensation is determined based on experience, skills, and qualifications. Equity and benefits are included in the total package. Why Join Inception Work with World-Class Talent : Collaborate with the inventors of diffusion models and leading AI researchers. Shape Foundational Technology : Your decisions will influence how the next generation of AI products are built and used. Immediate Impact : Join at the ground floor where your contributions directly shape product direction and company trajectory. Competitive salary and equity in a rapidly growing startup. Flexible vacation and paid time off (PTO). Health, dental, and vision insurance. Catered meals (breakfast, lunch, & dinner). A collaborative and inclusive culture. About Us Inception creates the world’s fastest, most efficient AI models. Today’s autoregressive LLMs generate tokens sequentially, which makes them painfully slow and expensive. Inception’s diffusion-based LLMs (dLLMs) generate answers in parallel. They are 5x faster and more efficient, while delivering best-in-class quality. Inception was co‑founded by Stanford professor Stefano Ermon, who co‑invented such breakthrough AI technologies as diffusion models, flash attention, and DPO, UCLA professor Aditya Grover, who co‑invented node2vec, decision transformers, and d1 reasoning, and Cornell professor and Afresh co‑founder Volodymyr Kuleshov, who co‑invented MDLM and Block Diffusion. We pioneered the application of diffusion to language, with world’s first (and only) commercially available dLLM, Mercury. We are currently deploying our large-scale diffusion LLMs at Fortune 500 companies. Diffusion is the technology behind today’s image and video AI, and we’re making it the standard for LLMs as well. Our team includes engineers from AWS, Google DeepMind, Meta AI, Microsoft, HashiCorp, and OpenAI. Based in Palo Alto, CA, we are backed by top‑tier venture capitalists, including Menlo Ventures, Mayfield, M12 (Microsoft’s venture fund), Snowflake Ventures, Databricks, and Innovation Endeavors, and by tech luminaries such as Andrew Ng, Andrej Karpathy, and Eric Schmidt. If you are talented, innovative, and ambitious, come help us invent the future of AI. We are an equal opportunity employer and encourage candidates of all backgrounds to apply. #J-18808-Ljbffr

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Member of Technical Staff, Kernels in San Francisco, CA vacancy
  • $150k - $350k

     ...power real production workloads built to scale to gigawatt‑class AI datacenters. Mission Gimlet Labs is seeking a Member of Technical Staff focused on kernels and GPU performance. In this role, you will work close to accelerators and execution hardware to extract maximum... 
    Suggested

    Gimlet Labs, Inc.

    San Francisco, CA
    4 days ago
  •  ...increasingly diverse compute architectures. Kernel engineers sit at the center of that...  ...datacenters for years to come. As an early member of our team, you will have significant ownership, work alongside highly technical engineers, and help shape both the systems... 
    Suggested

    Gimlet Labs

    San Francisco, CA
    2 days ago
  • $225k

     ...frontier-scale pre‑training, domain‑specific RL, ultra‑long context, and inference‑time compute to achieve this goal. About The Role As a Kernel Engineer, you will design, implement, and maintain high-performance kernels to optimize throughput and latency during training and... 
    Suggested
    Relocation
    Visa sponsorship

    Magic Inc

    San Francisco, CA
    4 days ago
  •  ...capabilities and alignment more reliably than humans can alone. Our technical approach combines frontier-scale pre-training, domain-specific...  ...utilization, and hardware efficiency. About the Role As a Kernel Engineer, you will design, implement, and maintain high-performance... 
    Suggested
    Work at office
    Visa sponsorship
    Relocation package
    Flexible hours

    Acceler8 Talent

    San Francisco, CA
    4 days ago
  •  ...to achieving its transformative potential. About the Role We’re hiring experienced Machine Learning Engineers for our Low Level / Kernels Capabilities team. The Kernels team builds reinforcement learning (RL) environments at the lowest layers of the stack. Think GPU... 
    Suggested
    Full time
    Visa sponsorship
    Relocation package

    Preference Model

    San Francisco, CA
    3 days ago
  • $150k - $300k

     ...systems into our RL training stack. Core Technical Responsibilities LLM Serving Multi‑...  ...performance. End‑to‑End Performance: Profile kernels, memory bandwidth and transport; apply...  ...in open development and encourage team members to contribute to the broader AI community... 
    Work at office
    Remote work
    Visa sponsorship
    Relocation package
    Flexible hours
    Shift work

    Prime Intellect

    San Francisco, CA
    3 days ago
  •  ...to gigawatt-class AI datacenters. Gimlet Labs is seeking a Member of Technical Staff focused on compilers. In this role, you will work on the core...  ...multi-level IRs spanning graph-level, tensor-level, and kernel-level representations Implement partitioning and lowering strategies... 

    Gimlet Labs

    San Francisco, CA
    4 days ago
  • $180k

     ...Member Of Technical Staff - Inference Palo Alto, CA About Xai Xai's mission is to create AI systems that can accurately understand the...  ...balancing, auto-scaling) to deep low-level optimizations (GPU kernels, quantization, speculative decoding, tail latency). This... 
    Temporary work

    Xai

    San Francisco, CA
    2 days ago
  • $150k - $250k

     ...results come when people take ownership, grow together, and share both the challenges and the wins. What you'll do Bring deep kernel expertise to our AI agents that optimize high-performance, mission-critical computing systems. You'll shape the tools and feedback... 
    Work at office
    Flexible hours

    Asari AI

    San Francisco, CA
    9 days ago
  •  ...Member of Technical Staff, ML Systems Mirendil Mirendil is a tech-first company focused on solving core bottlenecks that unlock step-change acceleration...  ...frontier models Designing and implementing custom ML kernels Optimizing performance (latency, throughput, cost)... 

    Mirendil

    San Francisco, CA
    4 days ago
  •  ...gigawatt‑class AI datacenters. Mission Gimlet Labs is seeking a Member of Technical Staff focused on ML systems and inference. In this role, you will...  ..., and system boundaries. Work closely with compilers, kernels, networking, and distributed systems to deliver end‑to‑end... 

    Gimlet Labs, Inc.

    San Francisco, CA
    3 days ago
  • $150k - $300k

     ...of your systems skills, from deep Linux kernel topics to high-level distributed system...  ...fast, robust, and reliable at scale. Core Technical Responsibilities Infrastructure...  ...in open development and encourage team members to contribute to the broader AI community... 
    Work at office
    Remote work
    Visa sponsorship
    Relocation package
    Flexible hours

    Prime Intellect, Inc.

    San Francisco, CA
    4 days ago
  •  ...Member of Technical Staff, Hardware Security Modules At Anchorage Digital, we are building the world’s most advanced digital asset platform for...  ...comprehensive experience with secure low‑level C/C++ programming, kernel development/hardening, and/or embedded systems. You have a... 
    Flexible hours

    Crypto Pro Network

    San Francisco, CA
    4 days ago
  •  ...Member of Technical Staff, ML Infrastructure & Inference Overview We are a cutting-edge AI infrastructure company is building a scalable cloud...  ..., runtimes, and distributed systems Partner with compiler, kernel, networking, and infrastructure teams to improve system-wide... 

    Acceler8 Talent

    San Francisco, CA
    4 days ago
  •  ...design and the responsibility to defend. About the Role As a Member of Technical Staff, Infrastructure & Training Systems at Radical Numerics, you...  ...how every layer of the stack affects research velocity: kernel performance, communication overhead, fault tolerance, observability... 
    Local area

    Radical Numerics Inc.

    San Francisco, CA
    11 hours ago
  •  ...Member Of Technical Staff - Image / Video Generation Freiburg (Germany) About Black Forest Labs We're the team behind Latent Diffusion...  ...Have experience writing forward and backward Triton kernels and ensuring their correctness while considering floating point... 
    Remote work
    Worldwide
    2 days per week

    Black Forest Labs

    San Francisco, CA
    21 hours ago
  •  ...Member of Technical Staff, Model Efficiency Who are we? Our mission is to scale intelligence to serve humanity. We’re training and deploying...  ...performance techniques, including GPU/CUDA optimizations, kernel-level improvements, and model execution strategies for MoE... 
    Full time
    Work at office
    Remote work
    Flexible hours

    Cohere

    San Francisco, CA
    4 days ago
  •  ...serving, such as batching, caching, load balancing, and parallelism , Worked on low-level optimizations for inference, such as GPU kernels and code generation , Worked on algorithmic optimizations for inference, such as quantization, distillation, and speculative decoding... 

    Xai

    San Francisco, CA
    4 days ago
  •  ...Braintrust, Modal, Hugging Face, and Dropbox. Working with some of the leading AI companies in the world to power their core agent products. We’re building our team of founding Members of Technical Staff to design the frontier of continually learning systems. #J-18808-Ljbffr... 

    Trajectory

    San Francisco, CA
    4 days ago
  • $227.5k - $401k

     ...your career. We are motivated individuals who tackle unique technical challenges at scale and solve them as a team, delivering...  ...AI research within the financial technology sector. As a Member of Technical Staff, you will operate with a high degree of autonomy and responsibility... 
    Work at office
    Immediate start
    Relocation
    Flexible hours

    Adyen

    San Francisco, CA
    3 days ago
  •  ...Member of Technical Staff – Product Eng at Composio We are building infrastructure that allows agents to communicate with the tools you use for work, including GitHub, Gmail, Notion, Salesforce, etc. We are a small team of engineers wrangling problems from context to search... 

    Resume Refiner

    San Francisco, CA
    4 days ago
  • $150k - $300k

     ...our Solutions Architect for GPU Infrastructure, you'll be the technical expert who transforms customer requirements into production‑ready...  ...GPFS) for optimal I/O performance Tune system performance from kernel parameters to CUDA configurations Production Operations &... 

    Prime Intellect

    San Francisco, CA
    3 days ago
  •  ...Pixeltable Inc. Member of Technical Staff San Francisco, CA·Full time Apply for Member of Technical Staff As a founding member of the engineering team, you will impact the design and direction of Pixeltable at a formative stage, contributing to some of our most foundational... 
    Full time
    Part time
    Work at office
    Work from home
    Flexible hours
    2 days per week

    Pixeltable, Inc.

    San Francisco, CA
    4 days ago
  •  ...maintain high quality software for the Atlas business. You will be the primary driver of the system architecture, technical direction and each team member’s technical skill development. At Anchorage Digital, we believe the product is never "done," we can always do better... 

    Motive Partners

    San Francisco, CA
    3 days ago
  •  ...Member of Technical Staff, Infrastructure Join us and help shape the future of AI by architecting next-generation knowledge systems. Join us and help shape the future of AI by defining the narrative around document understanding. About the Role The Infra team at LlamaIndex... 
    Work at office

    LlamaIndex, Inc.

    San Francisco, CA
    3 days ago
  • $225k

     ...for serving and RL systems Profile and eliminate performance bottlenecks across GPU, networking, and storage layers Collaborate with Kernels and Research to align execution systems with model architecture What we’re looking for Strong software engineering and distributed... 
    Relocation
    Visa sponsorship

    Magic Inc

    San Francisco, CA
    4 days ago
  •  ...inseparable. Radical Numerics was founded to develop both the power to design and the responsibility to defend. About the Role As a Member of Technical Staff, Mechanistic Interpretability at Radical Numerics, you will study how multimodal genome language models represent, process... 
    Local area

    Radical Numerics Inc.

    San Francisco, CA
    10 hours ago
  •  ...We’re a team of AI engineers and seasoned architects, bridging domain expertise with frontier technology. The Role Being a Member of Technical Staff at SketchPro means the problem in front of you will keep changing. You might spend a week designing how an agent... 
    Work at office
    Shift work

    SketchPro

    San Francisco, CA
    4 days ago
  •  ...What we are looking for? Seeking a Member of Technical Staff - Backend with 5+ years of experience. We are looking for an exceptional builder who seeks outsized responsibility and impact and has a demonstrated history of thoughtful, pragmatic decision‑making. This individual... 
    Work experience placement

    RST Recruitment

    San Francisco, CA
    4 days ago
  •  ...Moonlake is hiring a Member of Technical Staff — Diffusion Models to design and train advanced multimodal generative systems. This role focuses on developing diffusion architectures and large-scale training processes to enhance interactive world generation. The ideal candidate... 

    Moon Lake

    San Francisco, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Member of Technical Staff, Kernels. Be the first to apply!