Member of Technical Staff, ML Kernels

Netpreme

About the Role

We are seeking a Member of Technical Staff, Machine Learning Kernels to design, optimize, and benchmark high-performance compute kernels for modern machine learning workloads. This role is for a deeply technical engineer who enjoys working close to hardware - writing CUDA kernels, investigating subtle performance artifacts, building benchmarks, and serving as a go-to expert on accelerator behavior.

You will act as a hands-on performance specialist, partnering closely with research, systems, and infrastructure teams to unlock efficiency gains across GPUs today and other accelerators (e.g., TPU, Trainium) as we expand our hardware partnerships.

This role will be performed onsite from one of our offices in Santa Clara, CA or Boston, MA.
Essential Duties & Responsibilities

Design, implement, and optimize high-performance ML kernels, primarily targeting GPUs (CUDA), with an emphasis on throughput, latency, and memory efficiency.
Profile, benchmark, and analyze performance across different hardware configurations, identifying bottlenecks and narrow artifacts.
Debug and reason about low-level performance issues involving memory hierarchy, scheduling, synchronization, and numerical formats.
Build and maintain benchmarking and evaluation tools to compare performance across GPUs and other accelerators.
Advise internal teams on GPU and accelerator performance characteristics, tradeoffs, and best practices.
Explore and prototype support for alternative accelerator platforms (e.g., TPU, Amazon Trainium) as partnerships and needs evolve.
Collaborate closely with ML researchers and systems engineers to translate algorithmic needs into efficient kernel implementations.

Qualifications

Strong experience writing and optimizing CUDA kernels or equivalent low-level accelerator code.
Deep understanding of GPU architecture, including memory systems, parallel execution, and performance tradeoffs.
Experience with performance profiling and benchmarking tools (e.g., Nsight Systems / Compute, nvprof, framework-level profilers).
Proficiency in C++ and low-level performance-oriented programming.
Ability to independently investigate ambiguous or poorly understood performance issues and drive them to resolution.
Comfortable switching between different hardware ecosystems and learning new accelerator stacks as needed.

Preferred Qualifications

Experience with ML framework internals (e.g., PyTorch, TensorFlow, XLA) and custom operator development.
Prior work with non-GPU accelerators such as TPU, Trainium, IPU, or similar.
Familiarity with mixed-precision and low-precision compute (e.g., FP16, BF16, FP8).
Contributions to open-source performance, systems, or ML infrastructure projects.

Compensation & Benefits

Competitive salary commensurate with experience including base salary, performance-based bonus, and early stage equity grant
Comprehensive benefits including health, dental, vision, and life insurance
Well-equipped, sunny offices in Santa Clara, CA and Boston, MA
Relocation assistance and visa sponsorship
Perks include a daily lunch stipend, 401k match, and more
A collaborative, continuous-learning work environment with smart, dedicated colleagues engaged in developing the next generation of architecture for high-performance computing

The Opportunity

Impact: We are tackling a fundamental challenge at the infrastructure layer: unlocking greater AI capability while dramatically improving efficiency. The work we do here compounds across state-of-the-art AI models, systems, and real-world applications.
Timing: Joining now means real ownership of the company and meaningful influence over product direction and execution. You'll work from first principles, move quickly from insight to execution, and see your contributions directly reflected in what we build.
Culture: You'll work alongside a group of people who care deeply about rigor, clarity, and impact. We value thoughtful disagreement, fast learning, and intellectual fearlessness. This is a place where strong ideas shine, curiosity is encouraged, and growth is a daily practice.

Apply

Vacancy posted 4 days ago

Similar jobs that could be interesting for youBased on the Member of Technical Staff, ML Kernels in Santa Clara, CA vacancy

Member of Technical Staff - Training
...the Role RadixArk is seeking a Member of Technical Staff - Training to build and scale the systems... ...This role sits at the intersection of ML, systems, and performance engineering.... ...training. Our team has optimized kernels serving billions of tokens daily,...
Suggested
Flexible hours
RadixArk
Palo Alto, CA
1 day ago
Member of Technical Staff - Supercomputing
...About the Role RadixArk is hiring a Member of Technical Staff - Supercomputing to help build,... ...infrastructure, LLM inference serving, ML systems, or large-scale training workloads... .... Our team has optimized kernels serving billions of tokens daily, designed...
Suggested
Flexible hours
RadixArk
Palo Alto, CA
2 days ago
Member of Technical Staff - CI Engineer
...About the Role RadixArk is hiring a Member of Technical Staff - CI Engineer to own the... ...experience in CI contexts ~ Familiarity with ML inference workloads (model loading, KV... ...and training. Our team has optimized kernels serving billions of tokens daily, designed...
Suggested
Flexible hours
Night shift
RadixArk
Palo Alto, CA
5 hours ago
Member of Technical Staff (Software Engineer)
$169.6k
...empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of... ...computation. Cerebras Systems Inc. has multiple openings for Member of Technical Staff (Software Engineer) Title : Member of Technical Staff (...
Suggested
Full time
Part time
Internship
Remote work
CEREBRAS SYSTEMS INC.
Sunnyvale, CA
1 day ago
Member of Technical Staff, Capacity & Efficiency Infrastructure - MAI Superintelligence Team
$119.8k - $234.7k
...Overview Microsoft AI is looking for a Member of Technical Staff - Capacity & Efficiency Infrastructure , to help us improve manage, and improve... ...to help us improve. This will require you to partner with ML researchers to scale up the latest research recipes,...
Suggested
Ongoing contract
Work at office
Local area
Microsoft Corporation
Mountain View, CA
1 day ago
Member of Technical Staff 2- AI/ML
$109.6k - $218.4k
...AI/ML Engineer Opportunity Hungry, Humble, Honest, with Heart. The Opportunity... ...who champions ML innovation and provides technical mentorship to help you grow as an ML engineer... ..., eagerness to learn from senior team members, and enthusiasm for contributing ideas, staying...
Work at office
Remote work
Relocation package
3 days per week
Nutanix
San Jose, CA
2 days ago
Member of Technical Staff - Mid-training
$180k
...Member Of Technical Staff - Mid-training Palo Alto, CA xAI's mission is to create AI systems that can accurately understand the universe and... ...checkpoints. Basic Qualifications: Expertise in ML and large model scaling, with familiarity across all kinds of...
Temporary work
Xai
Palo Alto, CA
1 day ago
Member of Technical Staff, Mid-training
$180k
...intelligent systems. About the Role We are looking for a Member of Technical Staff - Mid-Training to lead the development of training... ...programming systems, compilers, formal methods, or large-scale ML - rather than post-training specifically. The field is new enough...
Full time
Hark
San Jose, CA
2 days ago
Member of Technical Staff, Pre/Mid-Training
...Member Of Technical Staff Inception creates the world's fastest, most efficient AI models. Our Mercury model is the world's fastest reasoning... ...equivalent experience). At least 2 years of experience working on ML projects in PyTorch (or equivalent), preferably in a...
Immediate start
Flexible hours
Inception LLC
Palo Alto, CA
5 hours ago
Member of Technical Staff - Multimodal Understanding
$180k
...Member of Technical Staff - Multimodal Understanding Palo Alto, CA About xAI xAI's mission is to create AI systems that can accurately understand... ...track record building or optimizing large-scale distributed ML systems (training/inference optimization, GPU utilization,...
Temporary work
Xai
Palo Alto, CA
2 days ago
Member of Technical Staff - Data Platform
$180k
...Member Of Technical Staff - Data Platform Palo Alto, CA xAI's mission is to create AI systems that can accurately understand the universe and... ...Kafka, HDFS, Spark, Flink, and Trino, enabling real-time ML pipelines, feed ranking, experimentation, analytics, and observability...
Temporary work
Xai
Palo Alto, CA
1 day ago
Member of Technical Staff, ML Performance
...consumption on large GPU clusters. Partner with our elite team of ML researchers and engineers to ensure model architectures are... ...and inference infrastructure. Have significant autonomy in technical decisions. Use the latest-generation GPUs. Who You Are...
Odyssey
Santa Clara, CA
2 hours ago
Member of Technical Staff - Machine Learning (AI Team)
$119.8k - $234.7k
...Overview As a Member of Technical Staff - Machine Learning (AI Team), you will work to create LLM models for general purpose capabilities and... ...data meets the model needs. Generalize machine learning (ML) solutions into repeatable frameworks. Lead evaluation efforts...
Ongoing contract
Work at office
Local area
Microsoft Corporation
Mountain View, CA
4 days ago
Member of Technical Staff - Pre-Training
$180k
...Member Of Technical Staff - Pre-Training Palo Alto, CA About XAI XAI's mission is to create AI systems that can accurately understand the... ...skills with passion on model-hardware co-design. Expert in ML and large model scaling, familiar with all kinds of scaling...
Temporary work
Xai
Palo Alto, CA
4 days ago
Founding Member of Technical Staff - ML Infra
...limits of AI4EDA and building the intelligence layer for the hardware revolution. What You'll Do As a Founding Member of the Technical Staff (ML infra) at Architect, you'll be responsible for the critical algorithms and infrastructure that our researchers depend...
Architect Labs
Palo Alto, CA
4 days ago
Member of Technical Staff - Agent Platform (Agent OS)
$150k - $400k
...protocols. This requires deep collaboration with Application and ML Platform teams to meet tight latency, jitter, and quality of... ...and integrating with centralized policy/safety infrastructure. Technical Leadership: Strong technical communication skills and the ability...
Boson AI
Santa Clara, CA
22 hours ago
Member of Technical Staff - Geometry / Meshing Engineer
...points/cells). You will collaborate with our ML and physics simulation teams, contribute... .... Experience working in a relevant technical domain such as computational geometry, graphics... .../export (STEP, IGES, B-Rep) and geometry kernels (Parasolid, ACIS, OpenCASCADE)....
Vinci AI
Palo Alto, CA
3 days ago
Member of Technical Staff - Data Scientist
$119.8k - $234.7k
...scale experiments, and safely advance model capabilities. Each team member owns meaningful parts of the post-training pipeline and has... ...quantities of data Can design, run, and interpret large-scale ML experiments with careful statistical and empirical reasoning....
Ongoing contract
Local area
Worldwide
Microsoft Corporation
Mountain View, CA
4 days ago
Member of Technical Staff, Research Engineer
...what's possible next. Who You Are A staff-level or senior engineer with experience in large... ...optimization, or model deployment. Deeply technical and hands-on: you find energy in writing code, optimizing kernels, and making systems faster, more reliable, and...
Odyssey
Santa Clara, CA
22 hours ago
Member of Technical Staff, Software Engineer
...Member Of Technical Staff, Software Engineer Inception creates the world's fastest, most efficient AI models. Our Mercury model is the world's... ...management, and incident response. Collaborate across research, ML infrastructure, and product teams to understand pain points...
Immediate start
Flexible hours
Inception LLC
Palo Alto, CA
1 day ago
Senior Member of Technical Staff - AI Research
$148.5k - $223.9k
...iterate agentic AI systems with customers. With your strong technical competence, strategic thinking and customer engagement, you will... ...candidates who Has exceptional engineering skills. Has deep ML knowledge with meaningful implementation track records....
Salesforce.Com Inc
Palo Alto, CA
1 day ago
Member of Technical Staff, ASIC Verification
...About the Role We are seeking a Member of Technical Staff, Senior ASIC Verification Engineers. This role demands proven technical leadership in ASIC verification and simulation methodologies. You will propose and setup the optimum verification methodology. You...
Visa sponsorship
Relocation package
Netpreme
Santa Clara, CA
3 days ago
Member of Technical Staff - Software Engineer & Machine Learning
$119.8k - $234.7k
...Overview As a Member of Technical Staff - Software Engineer & Machine Learning, you will work building AI Insights, a Copilot analytics product... ...ingestion, anomaly detection, and cohort segmentation. Implement ML-driven insights (prompted classifiers, anomaly detection)...
Ongoing contract
Work at office
Local area
Microsoft Corporation
Mountain View, CA
7 days ago
Member of Technical Staff, Full Stack, LLM Applications
...interfaces and backend services — creating intuitive experiences and robust APIs capable of serving at scale. You'll work closely with ML engineers and researchers to bridge the gap between complex AI models and user-friendly products. Key Responsibilities...
Immediate start
Flexible hours
Inception LLC
Palo Alto, CA
1 day ago
Member of Technical Staff, Reinforcement Learning Systems - MAI Superintelligence Team
$142.8k - $274.8k
...Overview Microsoft AI is looking for a Member of Technical Staff - Reinforcement Learning Systems to help build the world's most advanced reinforcement learning systems. We are responsible for designing, developing, and operating the large-scale reinforcement learning...
Ongoing contract
Work at office
Local area
Shift work
Microsoft Corporation
Mountain View, CA
4 days ago
Principal Member of Technical Staff
...Job Title : Principal Member of Technical Staff Reports to : Head, ASIC Engineering Job Location : Santa Clara, CA About Exo At Exo, we're united by one mission, to take medical imaging where you never thought it could go - everywhere. Through the...
Immediate start
EXO
Santa Clara, CA
3 days ago
Sr. Member of Technical Staff
...implementation. You will also mentor junior engineers, new-grads, and interns to help them grow as engineers and become productive members of the team. You will primarily write code in Go and work with data pipeline using SQL or other types of interfaces. We leverage...
Immediate start
Illumio
Sunnyvale, CA
2 days ago
Member of Technical Staff
...Member Of Technical Staff Location – Santa Clara, CA 95054 Duration: 6 Months Full life cycle software development from design through development, validation and release. Implementation of software for inventory, configuration, firmware and software management including...
CapLeo Global
Santa Clara, CA
2 hours ago
Member of Technical Staff - Inference
$180k
...Member Of Technical Staff - Inference Palo Alto, CA About Xai Xai's mission is to create AI systems that can accurately understand the... ...balancing, auto-scaling) to deep low-level optimizations (GPU kernels, quantization, speculative decoding, tail latency). This...
Temporary work
Xai
Palo Alto, CA
5 hours ago
Sr. Technical Staff
$250k
...learning users to effortlessly run large-scale ML applications, without the hassle of... ...Inc. has multiple openings for Sr. Technical Staff. Title: Sr. Technical Staff Job Duties... ...model. Work with system team members to diagnose system related failures. Understand...
Remote work
CEREBRAS SYSTEMS INC.
Sunnyvale, CA
1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Member of Technical Staff, ML Kernels. Be the first to apply!