Member of Technical Staff, Kernels

Inception LLC

The Role

We're looking for engineers and scientists to design, optimize, and maintain the compute foundations that power large-scale language model training and inference. You will develop high-performance ML kernels, enable efficient low-precision arithmetic, and improve the distributed compute stack that makes training and serving large models possible.

Key Responsibilities

Design and implement custom ML kernels (CUDA, CuTe, Triton) for core dLLM operations such as attention, matrix multiplication, gating, and normalization, optimized for modern GPU architectures.
Design compute primitives to reduce memory bandwidth bottlenecks and improve kernel efficiency.
Contribute to infrastructure stability and scalability, ensuring reproducibility, consistency across precision formats, and high utilization of compute resources.

Qualifications

BS/MS/PhD in Computer Science, Engineering, or a related field (or equivalent experience).
Proficiency in CUDA, CuTe, Triton, or other GPU programming frameworks.
Understanding of ML frameworks (PyTorch, TensorFlow) from a systems perspective.
Background in performance optimization and profiling of ML systems.
Experience implementing low-precision formats (FP8, INT8, block floating point) or contributing to related compiler stacks (XLA, TVM).
Familiarity with distributed training techniques (data parallel, model parallel, pipeline parallel).
Proficiency in Python and at least one systems programming language (C++/Rust/Go).
Experience with containerization (Docker), orchestration (Kubernetes), and CI/CD pipelines.

Preferred Skills

Experience building and maintaining large-scale language models with tens of billions of parameters or more.
Experience with distributed systems and cloud computing platforms (AWS/GCP/Azure).
Familiarity with distributed frameworks such as PyTorch/XLA, DeepSpeed, Megatron-LM.
Prior contributions to open-source deep learning infrastructure such as PyTorch, DeepSpeed, or XLA.

Why Join Inception

Work with World-Class Talent: Collaborate with the inventors of diffusion models and leading AI researchers
Shape Foundational Technology: Your decisions will influence how the next generation of AI products is built and used
Immediate Impact: Join at the ground floor, where your contributions directly shape product direction and company trajectory

Perks & Benefits

Competitive salary and equity in a rapidly growing startup
Flexible vacation and paid time off (PTO)
Health, dental, and vision insurance
Catered meals (breakfast, lunch, & dinner)
Commuter subsidies
A collaborative and inclusive culture

About Us

Inception creates the world's fastest, most efficient AI models. Today's autoregressive LLMs generate tokens sequentially, which makes them painfully slow and expensive. Inception's diffusion-based LLMs (dLLMs) generate answers in parallel. They are 5x faster and more efficient than autoregressive LLMs, while delivering best-in-class quality.

Inception was co-founded by Stanford professor Stefano Ermon, who co-invented such breakthrough AI technologies as diffusion models, flash attention, and DPO; UCLA professor Aditya Grover, who co-invented node2vec, decision transformers, and d1 reasoning; and Cornell professor and Afresh co-founder Volodymyr Kuleshov, who co-invented MDLM and Block Diffusion.

We pioneered the application of diffusion to language with the world's first (and only) commercially available dLLM, Mercury. We are currently deploying our large-scale diffusion LLMs at Fortune 500 companies. Diffusion is the technology behind today's image and video AI, and we're making it the standard for LLMs as well.

Our team includes engineers from AWS, Google DeepMind, Meta AI, Microsoft, HashiCorp, and OpenAI. Based in Palo Alto, CA, we are backed by top-tier venture capitalists, including Menlo Ventures, Mayfield, M12 (Microsoft's venture fund), Snowflake Ventures, Databricks, and Innovation Endeavors, and by tech luminaries such as Andrew Ng, Andrej Karpathy, and Eric Schmidt.

If you are talented, innovative, and ambitious, come help us invent the future of AI.

We are an equal opportunity employer and encourage candidates of all backgrounds to apply.

Vacancy posted 2 days ago
