Member of Technical Staff, Kernels

Inception LLC

The Role

We're looking for engineers and scientists to design, optimize, and maintain the compute foundations that power large-scale language model training and inference. You will develop high-performance ML kernels, enable efficient low-precision arithmetic, and improve the distributed compute stack that makes training and serving large models possible.

Key Responsibilities

  • Design and implement custom ML kernels (CUDA, CuTe, Triton) for core diffusion LLM (dLLM) operations such as attention, matrix multiplication, gating, and normalization, optimized for modern GPU architectures.
  • Design compute primitives to reduce memory bandwidth bottlenecks and improve kernel efficiency.
  • Contribute to infrastructure stability and scalability, ensuring reproducibility, consistency across precision formats, and high utilization of compute resources.

Qualifications

  • BS/MS/PhD in Computer Science, Engineering, or a related field (or equivalent experience).
  • Proficiency in CUDA, CuTe, Triton, or other GPU programming frameworks.
  • Understanding of ML frameworks (PyTorch, TensorFlow) from a systems perspective.
  • Background in performance optimization and profiling of ML systems.
  • Experience implementing low-precision formats (FP8, INT8, block floating point) or contributing to related compiler stacks (XLA, TVM).
  • Familiarity with distributed training techniques (data parallel, model parallel, pipeline parallel).
  • Proficiency in Python and at least one systems programming language (C++/Rust/Go).
  • Experience with containerization (Docker), orchestration (Kubernetes), and CI/CD pipelines.

Preferred Skills

  • Experience building and maintaining large-scale language models with tens of billions of parameters or more.
  • Experience with distributed systems and cloud computing platforms (AWS/GCP/Azure).
  • Familiarity with distributed frameworks such as PyTorch/XLA, DeepSpeed, Megatron-LM.
  • Prior contributions to open-source deep learning infrastructure such as PyTorch, DeepSpeed, or XLA.

Why Join Inception

  • Work with World-Class Talent: Collaborate with the inventors of diffusion models and leading AI researchers.
  • Shape Foundational Technology: Your decisions will influence how the next generation of AI products is built and used.
  • Immediate Impact: Join at the ground floor, where your contributions directly shape product direction and company trajectory.

Perks & Benefits

  • Competitive salary and equity in a rapidly growing startup
  • Flexible vacation and paid time off (PTO)
  • Health, dental, and vision insurance
  • Catered meals (breakfast, lunch, & dinner)
  • Commuter subsidies
  • A collaborative and inclusive culture

About Us

Inception creates the world's fastest, most efficient AI models. Today's autoregressive LLMs generate tokens sequentially, which makes them painfully slow and expensive. Inception's diffusion-based LLMs (dLLMs) generate answers in parallel. They are 5x faster and more efficient, while delivering best-in-class quality.

Inception was co-founded by Stanford professor Stefano Ermon, who co-invented such breakthrough AI technologies as diffusion models, flash attention, and DPO; UCLA professor Aditya Grover, who co-invented node2vec, decision transformers, and d1 reasoning; and Cornell professor and Afresh co-founder Volodymyr Kuleshov, who co-invented MDLM and Block Diffusion.

We pioneered the application of diffusion to language with Mercury, the world's first (and only) commercially available dLLM. We are currently deploying our large-scale diffusion LLMs at Fortune 500 companies. Diffusion is the technology behind today's image and video AI, and we're making it the standard for LLMs as well.

Our team includes engineers from AWS, Google DeepMind, Meta AI, Microsoft, HashiCorp, and OpenAI. Based in Palo Alto, CA, we are backed by top-tier venture capitalists, including Menlo Ventures, Mayfield, M12 (Microsoft's venture fund), Snowflake Ventures, Databricks, and Innovation Endeavors, and by tech luminaries such as Andrew Ng, Andrej Karpathy, and Eric Schmidt.

If you are talented, innovative, and ambitious, come help us invent the future of AI.

We are an equal opportunity employer and encourage candidates of all backgrounds to apply.
Vacancy posted 2 days ago