Member of Technical Staff, ML Kernels
Netpreme
About the Role
We are seeking a Member of Technical Staff, Machine Learning Kernels to design, optimize, and benchmark high-performance compute kernels for modern machine learning workloads. This role is for a deeply technical engineer who enjoys working close to hardware: writing CUDA kernels, investigating subtle performance artifacts, building benchmarks, and serving as a go-to expert on accelerator behavior. You will act as a hands-on performance specialist, partnering closely with research, systems, and infrastructure teams to unlock efficiency gains across GPUs today and other accelerators (e.g., TPU, Trainium) as we expand our hardware partnerships. This role will be performed onsite from one of our offices in Santa Clara, CA or Boston, MA.
Essential Duties & Responsibilities
- Design, implement, and optimize high-performance ML kernels, primarily targeting GPUs (CUDA), with an emphasis on throughput, latency, and memory efficiency.
- Profile, benchmark, and analyze performance across different hardware configurations, identifying bottlenecks and narrow artifacts.
- Debug and reason about low-level performance issues involving memory hierarchy, scheduling, synchronization, and numerical formats.
- Build and maintain benchmarking and evaluation tools to compare performance across GPUs and other accelerators.
- Advise internal teams on GPU and accelerator performance characteristics, tradeoffs, and best practices.
- Explore and prototype support for alternative accelerator platforms (e.g., TPU, Amazon Trainium) as partnerships and needs evolve.
- Collaborate closely with ML researchers and systems engineers to translate algorithmic needs into efficient kernel implementations.
Qualifications
- Strong experience writing and optimizing CUDA kernels or equivalent low-level accelerator code.
- Deep understanding of GPU architecture, including memory systems, parallel execution, and performance tradeoffs.
- Experience with performance profiling and benchmarking tools (e.g., Nsight Systems / Compute, nvprof, framework-level profilers).
- Proficiency in C++ and low-level performance-oriented programming.
- Ability to independently investigate ambiguous or poorly understood performance issues and drive them to resolution.
- Comfortable switching between different hardware ecosystems and learning new accelerator stacks as needed.
- Experience with ML framework internals (e.g., PyTorch, TensorFlow, XLA) and custom operator development.
- Prior work with non-GPU accelerators such as TPU, Trainium, IPU, or similar.
- Familiarity with mixed-precision and low-precision compute (e.g., FP16, BF16, FP8).
- Contributions to open-source performance, systems, or ML infrastructure projects.
Benefits
- Competitive salary commensurate with experience, including base salary, performance-based bonus, and early-stage equity grant
- Comprehensive benefits including health, dental, vision, and life insurance
- Well-equipped, sunny offices in Santa Clara, CA and Boston, MA
- Relocation assistance and visa sponsorship
- Perks include a daily lunch stipend, 401k match, and more
- A collaborative, continuous-learning work environment with smart, dedicated colleagues engaged in developing the next generation of architecture for high-performance computing
- Impact: We are tackling a fundamental challenge at the infrastructure layer: unlocking greater AI capability while dramatically improving efficiency. The work we do here compounds across state-of-the-art AI models, systems, and real-world applications.
- Timing: Joining now means real ownership of the company and meaningful influence over product direction and execution. You'll work from first principles, move quickly from insight to execution, and see your contributions directly reflected in what we build.
- Culture: You'll work alongside a group of people who care deeply about rigor, clarity, and impact. We value thoughtful disagreement, fast learning, and intellectual fearlessness. This is a place where strong ideas shine, curiosity is encouraged, and growth is a daily practice.
Vacancy posted 4 days ago

