Member of Technical Staff, Kernels
Inception LLC
The Role We're looking for engineers and scientists to design, optimize, and maintain the compute foundations that power large-scale language model training and inference. You will develop high-performance ML kernels, enable efficient low-precision arithmetic, and improve the distributed compute stack that makes training and serving large models possible. Key Responsibilities
- Design and implement custom ML kernels (CUDA, CuTe, Triton) for core dLLM operations such as attention, matrix multiplication, gating, and normalization, optimized for modern GPU architectures.
- Design compute primitives to reduce memory bandwidth bottlenecks and improve kernel efficiency.
- Contribute to infrastructure stability and scalability, ensuring reproducibility, consistency across precision formats, and high utilization of compute resources.
- BS/MS/PhD in Computer Science, Engineering, or a related field (or equivalent experience).
- Proficiency in CUDA, CuTe, Triton, or other GPU programming frameworks.
- Understanding of ML frameworks (PyTorch, TensorFlow) from a systems perspective.
- Background in performance optimization and profiling of ML systems.
- Experience implementing low-precision formats (FP8, INT8, block floating point) or contributing to related compiler stacks (XLA, TVM).
- Familiarity with distributed training techniques (data parallel, model parallel, pipeline parallel).
- Proficiency in Python and at least one systems programming language (C++/Rust/Go).
- Experience with containerization (Docker), orchestration (Kubernetes), and CI/CD pipelines.
- Experience building and maintaining large-scale language models with tens of billions of parameters or more.
- Experience with distributed systems and cloud computing platforms (AWS/GCP/Azure).
- Familiarity with distributed frameworks such as PyTorch/XLA, DeepSpeed, Megatron-LM.
- Prior contributions to open-source deep learning infrastructure such as PyTorch, DeepSpeed, or XLA.
- Work with World-Class Talent : Collaborate with the inventors of diffusion models and leading AI researchers
- Shape Foundational Technology : Your decisions will influence how the next generation of AI products are built and used
- Immediate Impact : Join at the ground floor where your contributions directly shape product direction and company trajectory
- Competitive salary and equity in a rapidly growing startup
- Flexible vacation and paid time off (PTO)
- Health, dental, and vision insurance
- Catered meals (breakfast, lunch, & dinner)
- Commuter subsidies
- A collaborative and inclusive culture
Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Member of Technical Staff, Kernels in San Mateo, CA vacancy
$175k - $220k
...Member of Technical Staff, Performance Optimization San Mateo, CA About Us At Fireworks, we're building the future of generative AI infrastructure... ...performance at every layer of the stack—from low-level GPU kernels to large-scale distributed systems. A key focus will be...Suggested- ...throughput with large-batch serving and efficient resource utilization Implement efficient low-level code (CUDA, Triton, custom kernels) and integrate it seamlessly into high-level frameworks Optimize workloads for both throughput (batching, scheduling, quantization...Suggested
- ...convergence by profiling and eliminating bottlenecks across the foundation model training stack stack, from data pipelines to GPU kernels Design, build, and optimize distributed training systems (PyTorch) for multi-node GPU clusters, ensuring scalability, robustness, and...SuggestedRemote work
- ...Cost - deploying our models 2-10× faster & cheaper without quality regressions. Scope of Work: - GPU performance: CUDA/Triton kernels, FlashAttention family, paged attention, CUDA Graphs. - Serving stack: TensorRT-LLM/Triton Inference Server, vLLM/TGI; continuous...Suggested
$175k - $240k
...Member of Technical Staff, Research San Mateo, CA About Us At Fireworks, we're building the future of generative AI infrastructure. Our platform delivers the highest-quality models with the fastest and most scalable inference in the industry. We've been independently...SuggestedWork experience placementInternship- The Role We seek experienced scientists and engineers with deep expertise in post-training large language models through reinforcement learning. You will design and implement RL training pipelines for our diffusion LLMs, develop reward modeling strategies, and build...Immediate startFlexible hours
- Introducing Moonlake, AI for creating real-time interactive content Mission : As an applied AI Research Engineer: Code agents (post training + systems) Scope of Work : - Agentic systems design: Tool catalogs, function calling, program synthesis/repair loops, ...
- Job Title Develop a high-throughput, GPU-based simulation pipeline (primarily rigid body simulation for robots) to train robotics foundation models Implement essential robotics features, including actuators, sensors, and controllers, in collaboration with the robotics...
- Job Title What You'll Do Develop a high-throughput rendering pipeline for training robotics foundation models Design protocols and interfaces between the rendering pipeline, physics engine, and 3D generative models Build an efficient platform for large-scale...
- Job Title What You'll Do Develop and optimize a learning-based robotic manipulation control stack Design and maintain a teleoperation system with smooth, precise motion and low latency Train robotic policies for manipulation and locomotion with reinforcement...
$175k - $220k
...Member of Technical Staff, Software Engineer San Mateo, CA About Us At Fireworks, we're building the future of generative AI infrastructure. Our platform delivers the highest-quality models with the fastest and most scalable inference in the industry. We've been...- The Role We're looking for engineers and scientists to design, optimize, and scale the systems that power our diffusion LLMs in production. Your work will make inference faster, more cost-effective, and more reliable. Key Responsibilities Build and optimize ...Immediate startFlexible hours
- What You’ll Do Design, build, and maintain large-scale data pipelines (batch and streaming) for robotics foundation model training and evaluation at petabyte scale Own core data infrastructure: data model, storage systems, ingestion pipelines, transformation frameworks...Remote work
- Security Infrastructure Engineer What You'll Do Design, build, and scale security infrastructure from the ground up across our systems, networks, endpoints, and products Own and evolve security architecture across endpoint security, network security, application...Interim role
- The Role We seek experienced scientists and engineers with deep expertise in pre- and mid-training large language models. You will advance our diffusion-based LLM models, developing novel training techniques and pushing the boundaries of parallel token generation....Immediate startFlexible hours
$96.8k - $223.4k
Principal Member of Technical Staff-Bay Area Redwood City, CA, United States Job Description Design, develop, troubleshoot and debug software programs for databases and cloud services with emphasis on new extensions to SQL. Implement data structures and algorithms to accelerate...Temporary workFlexible hours- ...generation paradigm of physical data synthesis— combining simulation, generative models, and autonomous agents Deep curiosity and strong technical ownership, with a track record of driving complex, open‑ended projects from concept to implementation Experience with (multimodal)...Remote work
$175k - $220k
...to deliver unparalleled reliability, efficiency, and scalability, fueling the world's most innovative AI products.This is a highly technical role requiring deep expertise in distributed systems, cloud-native infrastructure, and machine learning platforms. You’ll partner...Full time- ...The Role We're hiring a hands-on Staff Security Engineer to build the security foundation for a frontier AI platform serving... ..., privacy, compliance, and infrastructure risk as we scale - a technical leader, not a friction point for the engineering team. What...Immediate startFlexible hours
- ...working with biological data at scale. Comfort working directly with enterprise customers and translating their scientific needs into technical requirements. Ability to move quickly in a fast-paced research and product environment. Nice to Have AI-native working style;...Work at office
- ...quality, close gaps in patient care, drive member enrollment, and patient acquisition,... ...reimbursement, scaling growth without hiring more staff. We are on a mission to improve the... ...What You'll Do: Provide on-site technical support to staff including access management...Work at officeRemote workMonday to Friday3 days per week
$18 per hour
...0 - $18.00 Hourly Overview The Service Desk Representative is a high-profile customer service position delivering beyond our member's expectations. They contribute to member retention, as well as new membership sales. This person has the responsibility of being responsive...Hourly payShift work- ...manufacturing, automotive, or supply chain environments is required. Technical Skills: Experience with SAP and/or PLM (3DX) is necessary.... ...our 'Welcome Packet' as well, which an Everforth Apex team member can provide. Everforth Apex Systems is an equal opportunity...Contract work
- About Phylo Phylo is an applied research lab building agentic intelligence to accelerate discovery for every biomedical scientist. We believe AI agents will fundamentally transform how biomedical research is done, enabling faster and more systematic scientific progress...Work at office
- Top Must Have's: MES Client-X, SAP EWM, Data and integration architecture, Process design in GMP manufacturing, Fit-gap analysis and requirements engineering • Translate NCF business needs into process designs, functional requirements, data requirements, and integration...
$200k - $300k
About Phylo Phylo is an applied research lab building agentic intelligence to accelerate discovery for every biomedical scientist. We believe AI agents will fundamentally transform how biomedical research is done. Our fast-growing team brings together researchers and engineers...Work at office- ...environment; self-motivate and work independently Strong interpersonal skills to build and maintain productive relationships with team members Provide constructive feedback during code reviews and be open to receiving feedback on your own code Problem-Solving and Analytical...
$102k
...solutions, as driven by the business. The position works with technical staff, business partners and senior management across... ...Methodology, and Key Controls Share knowledge amongst direct team members and project team members; contribute to domain knowledge library...Permanent employmentWork at officeLocal areaVisa sponsorshipWork visa- ...tune MicroStrategy reports to determine and fix data issues, incorrect joins, incorrect results, and performance issues. ~ Senior member of team that interacts with business engagement teams to define dashboards and reporting solutions to meet diverse, complex...Contract workImmediate startWork visa
- ...automation efforts to drive operational efficiencies Mentor team members in AI/ML business analysis and product development... ...ownership Not delivery or execution management Not a purely technical ML, data science or data analyst role We are a company committed...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Member of Technical Staff, Kernels. Be the first to apply!
Related searches
- technical support associate San Mateo, CA
- desktop support analyst San Mateo, CA
- customer support technician San Mateo, CA
- technical support analyst San Mateo, CA
- support analyst San Mateo, CA
- tech assistant San Mateo, CA
- technical support specialist San Mateo, CA
- technical support assistant San Mateo, CA
- systems support technician San Mateo, CA
- customer support analyst San Mateo, CA


