Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Member of Technical Staff, ML Kernels

Netpreme

About the Role

We are seeking a Member of Technical Staff, Machine Learning Kernels to design, optimize, and benchmark high-performance compute kernels for modern machine learning workloads. This role is for a deeply technical engineer who enjoys working close to hardware - writing CUDA kernels, investigating subtle performance artifacts, building benchmarks, and serving as a go-to expert on accelerator behavior.

You will act as a hands-on performance specialist, partnering closely with research, systems, and infrastructure teams to unlock efficiency gains across GPUs today and other accelerators (e.g., TPU, Trainium) as we expand our hardware partnerships.

This role will be performed onsite from one of our offices in Santa Clara, CA or Boston, MA.
Essential Duties & Responsibilities
  • Design, implement, and optimize high-performance ML kernels, primarily targeting GPUs (CUDA), with an emphasis on throughput, latency, and memory efficiency.
  • Profile, benchmark, and analyze performance across different hardware configurations, identifying bottlenecks and narrow artifacts.
  • Debug and reason about low-level performance issues involving memory hierarchy, scheduling, synchronization, and numerical formats.
  • Build and maintain benchmarking and evaluation tools to compare performance across GPUs and other accelerators.
  • Advise internal teams on GPU and accelerator performance characteristics, tradeoffs, and best practices.
  • Explore and prototype support for alternative accelerator platforms (e.g., TPU, Amazon Trainium) as partnerships and needs evolve.
  • Collaborate closely with ML researchers and systems engineers to translate algorithmic needs into efficient kernel implementations.
Qualifications
  • Strong experience writing and optimizing CUDA kernels or equivalent low-level accelerator code.
  • Deep understanding of GPU architecture, including memory systems, parallel execution, and performance tradeoffs.
  • Experience with performance profiling and benchmarking tools (e.g., Nsight Systems / Compute, nvprof, framework-level profilers).
  • Proficiency in C++ and low-level performance-oriented programming.
  • Ability to independently investigate ambiguous or poorly understood performance issues and drive them to resolution.
  • Comfortable switching between different hardware ecosystems and learning new accelerator stacks as needed.
Preferred Qualifications
  • Experience with ML framework internals (e.g., PyTorch, TensorFlow, XLA) and custom operator development.
  • Prior work with non-GPU accelerators such as TPU, Trainium, IPU, or similar.
  • Familiarity with mixed-precision and low-precision compute (e.g., FP16, BF16, FP8).
  • Contributions to open-source performance, systems, or ML infrastructure projects.
Compensation & Benefits
  • Competitive salary commensurate with experience including base salary, performance-based bonus, and early stage equity grant
  • Comprehensive benefits including health, dental, vision, and life insurance
  • Well-equipped, sunny offices in Santa Clara, CA and Boston, MA
  • Relocation assistance and visa sponsorship
  • Perks include a daily lunch stipend, 401k match, and more
  • A collaborative, continuous-learning work environment with smart, dedicated colleagues engaged in developing the next generation of architecture for high-performance computing
The Opportunity
  • Impact: We are tackling a fundamental challenge at the infrastructure layer: unlocking greater AI capability while dramatically improving efficiency. The work we do here compounds across state-of-the-art AI models, systems, and real-world applications.
  • Timing: Joining now means real ownership of the company and meaningful influence over product direction and execution. You'll work from first principles, move quickly from insight to execution, and see your contributions directly reflected in what we build.
  • Culture: You'll work alongside a group of people who care deeply about rigor, clarity, and impact. We value thoughtful disagreement, fast learning, and intellectual fearlessness. This is a place where strong ideas shine, curiosity is encouraged, and growth is a daily practice.
Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Member of Technical Staff, ML Kernels in Santa Clara, CA vacancy
  •  ...the Role RadixArk is seeking a Member of Technical Staff - Training to build and scale the systems...  ...This role sits at the intersection of ML, systems, and performance engineering....  ...training. Our team has optimized kernels serving billions of tokens daily,... 
    Suggested
    Flexible hours

    RadixArk

    Palo Alto, CA
    1 day ago
  •  ...About the Role RadixArk is hiring a Member of Technical Staff - Supercomputing to help build,...  ...infrastructure, LLM inference serving, ML systems, or large-scale training workloads...  .... Our team has optimized kernels serving billions of tokens daily, designed... 
    Suggested
    Flexible hours

    RadixArk

    Palo Alto, CA
    2 days ago
  •  ...About the Role RadixArk is hiring a Member of Technical Staff - CI Engineer to own the...  ...experience in CI contexts ~ Familiarity with ML inference workloads (model loading, KV...  ...and training. Our team has optimized kernels serving billions of tokens daily, designed... 
    Suggested
    Flexible hours
    Night shift

    RadixArk

    Palo Alto, CA
    5 hours ago
  • $169.6k

     ...empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of...  ...computation. Cerebras Systems Inc. has multiple openings for Member of Technical Staff (Software Engineer) Title : Member of Technical Staff (... 
    Suggested
    Full time
    Part time
    Internship
    Remote work

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    1 day ago
  • $119.8k - $234.7k

     ...Overview Microsoft AI is looking for a Member of Technical Staff - Capacity & Efficiency Infrastructure , to help us improve manage, and improve...  ...to help us improve. This will require you to partner with ML researchers to scale up the latest research recipes,... 
    Suggested
    Ongoing contract
    Work at office
    Local area

    Microsoft Corporation

    Mountain View, CA
    1 day ago
  • $109.6k - $218.4k

     ...AI/ML Engineer Opportunity Hungry, Humble, Honest, with Heart. The Opportunity...  ...who champions ML innovation and provides technical mentorship to help you grow as an ML engineer...  ..., eagerness to learn from senior team members, and enthusiasm for contributing ideas, staying... 
    Work at office
    Remote work
    Relocation package
    3 days per week

    Nutanix

    San Jose, CA
    2 days ago
  • $180k

     ...Member Of Technical Staff - Mid-training Palo Alto, CA xAI's mission is to create AI systems that can accurately understand the universe and...  ...checkpoints. Basic Qualifications: Expertise in ML and large model scaling, with familiarity across all kinds of... 
    Temporary work

    Xai

    Palo Alto, CA
    1 day ago
  • $180k

     ...intelligent systems. About the Role We are looking for a Member of Technical Staff - Mid-Training to lead the development of training...  ...programming systems, compilers, formal methods, or large-scale ML - rather than post-training specifically. The field is new enough... 
    Full time

    Hark

    San Jose, CA
    2 days ago
  •  ...Member Of Technical Staff Inception creates the world's fastest, most efficient AI models. Our Mercury model is the world's fastest reasoning...  ...equivalent experience). At least 2 years of experience working on ML projects in PyTorch (or equivalent), preferably in a... 
    Immediate start
    Flexible hours

    Inception LLC

    Palo Alto, CA
    5 hours ago
  • $180k

     ...Member of Technical Staff - Multimodal Understanding Palo Alto, CA About xAI xAI's mission is to create AI systems that can accurately understand...  ...track record building or optimizing large-scale distributed ML systems (training/inference optimization, GPU utilization,... 
    Temporary work

    Xai

    Palo Alto, CA
    2 days ago
  • $180k

     ...Member Of Technical Staff - Data Platform Palo Alto, CA xAI's mission is to create AI systems that can accurately understand the universe and...  ...Kafka, HDFS, Spark, Flink, and Trino, enabling real-time ML pipelines, feed ranking, experimentation, analytics, and observability... 
    Temporary work

    Xai

    Palo Alto, CA
    1 day ago
  •  ...consumption on large GPU clusters. Partner with our elite team of ML researchers and engineers to ensure model architectures are...  ...and inference infrastructure. Have significant autonomy in technical decisions. Use the latest-generation GPUs. Who You Are... 

    Odyssey

    Santa Clara, CA
    2 hours ago
  • $119.8k - $234.7k

     ...Overview As a Member of Technical Staff - Machine Learning (AI Team), you will work to create LLM models for general purpose capabilities and...  ...data meets the model needs. Generalize machine learning (ML) solutions into repeatable frameworks. Lead evaluation efforts... 
    Ongoing contract
    Work at office
    Local area

    Microsoft Corporation

    Mountain View, CA
    4 days ago
  • $180k

     ...Member Of Technical Staff - Pre-Training Palo Alto, CA About XAI XAI's mission is to create AI systems that can accurately understand the...  ...skills with passion on model-hardware co-design. Expert in ML and large model scaling, familiar with all kinds of scaling... 
    Temporary work

    Xai

    Palo Alto, CA
    4 days ago
  •  ...limits of AI4EDA and building the intelligence layer for the hardware revolution. What You'll Do As a Founding Member of the Technical Staff (ML infra) at Architect, you'll be responsible for the critical algorithms and infrastructure that our researchers depend... 

    Architect Labs

    Palo Alto, CA
    4 days ago
  • $150k - $400k

     ...protocols. This requires deep collaboration with Application and ML Platform teams to meet tight latency, jitter, and quality of...  ...and integrating with centralized policy/safety infrastructure. Technical Leadership: Strong technical communication skills and the ability... 

    Boson AI

    Santa Clara, CA
    22 hours ago
  •  ...points/cells). You will collaborate with our ML and physics simulation teams, contribute...  .... Experience working in a relevant technical domain such as computational geometry, graphics...  .../export (STEP, IGES, B-Rep) and geometry kernels (Parasolid, ACIS, OpenCASCADE).... 

    Vinci AI

    Palo Alto, CA
    3 days ago
  • $119.8k - $234.7k

     ...scale experiments, and safely advance model capabilities. Each team member owns meaningful parts of the post-training pipeline and has...  ...quantities of data Can design, run, and interpret large-scale ML experiments with careful statistical and empirical reasoning.... 
    Ongoing contract
    Local area
    Worldwide

    Microsoft Corporation

    Mountain View, CA
    4 days ago
  •  ...what's possible next. Who You Are A staff-level or senior engineer with experience in large...  ...optimization, or model deployment. Deeply technical and hands-on: you find energy in writing code, optimizing kernels, and making systems faster, more reliable, and... 

    Odyssey

    Santa Clara, CA
    22 hours ago
  •  ...Member Of Technical Staff, Software Engineer Inception creates the world's fastest, most efficient AI models. Our Mercury model is the world's...  ...management, and incident response. Collaborate across research, ML infrastructure, and product teams to understand pain points... 
    Immediate start
    Flexible hours

    Inception LLC

    Palo Alto, CA
    1 day ago
  • $148.5k - $223.9k

     ...iterate agentic AI systems with customers. With your strong technical competence, strategic thinking and customer engagement, you will...  ...candidates who Has exceptional engineering skills. Has deep ML knowledge with meaningful implementation track records.... 

    Salesforce.Com Inc

    Palo Alto, CA
    1 day ago
  •  ...About the Role We are seeking a Member of Technical Staff, Senior ASIC Verification Engineers. This role demands proven technical leadership in ASIC verification and simulation methodologies. You will propose and setup the optimum verification methodology. You... 
    Visa sponsorship
    Relocation package

    Netpreme

    Santa Clara, CA
    3 days ago
  • $119.8k - $234.7k

     ...Overview As a Member of Technical Staff - Software Engineer & Machine Learning, you will work building AI Insights, a Copilot analytics product...  ...ingestion, anomaly detection, and cohort segmentation. Implement ML-driven insights (prompted classifiers, anomaly detection)... 
    Ongoing contract
    Work at office
    Local area

    Microsoft Corporation

    Mountain View, CA
    7 days ago
  •  ...interfaces and backend services — creating intuitive experiences and robust APIs capable of serving at scale. You'll work closely with ML engineers and researchers to bridge the gap between complex AI models and user-friendly products. Key Responsibilities... 
    Immediate start
    Flexible hours

    Inception LLC

    Palo Alto, CA
    1 day ago
  • $142.8k - $274.8k

     ...Overview Microsoft AI is looking for a Member of Technical Staff - Reinforcement Learning Systems to help build the world's most advanced reinforcement learning systems. We are responsible for designing, developing, and operating the large-scale reinforcement learning... 
    Ongoing contract
    Work at office
    Local area
    Shift work

    Microsoft Corporation

    Mountain View, CA
    4 days ago
  •  ...Job Title : Principal Member of Technical Staff Reports to : Head, ASIC Engineering Job Location : Santa Clara, CA About Exo At Exo, we're united by one mission, to take medical imaging where you never thought it could go - everywhere. Through the... 
    Immediate start

    EXO

    Santa Clara, CA
    3 days ago
  •  ...implementation. You will also mentor junior engineers, new-grads, and interns to help them grow as engineers and become productive members of the team. You will primarily write code in Go and work with data pipeline using SQL or other types of interfaces. We leverage... 
    Immediate start

    Illumio

    Sunnyvale, CA
    2 days ago
  •  ...Member Of Technical Staff Location – Santa Clara, CA 95054 Duration: 6 Months Full life cycle software development from design through development, validation and release. Implementation of software for inventory, configuration, firmware and software management including... 

    CapLeo Global

    Santa Clara, CA
    2 hours ago
  • $180k

     ...Member Of Technical Staff - Inference Palo Alto, CA About Xai Xai's mission is to create AI systems that can accurately understand the...  ...balancing, auto-scaling) to deep low-level optimizations (GPU kernels, quantization, speculative decoding, tail latency). This... 
    Temporary work

    Xai

    Palo Alto, CA
    5 hours ago
  • $250k

     ...learning users to effortlessly run large-scale ML applications, without the hassle of...  ...Inc. has multiple openings for Sr. Technical Staff. Title: Sr. Technical Staff Job Duties...  ...model. Work with system team members to diagnose system related failures. Understand... 
    Remote work

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Member of Technical Staff, ML Kernels. Be the first to apply!