Sr. ML Kernel Performance Engineer, AWS Neuron, Annapurna Labs

$193.3k - $261.5k

Amazon Locker

Description

The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon’s custom machine learning accelerators, Inferentia and Trainium.

The Acceleration Kernel Library team is at the forefront of maximizing performance for AWS's custom ML accelerators. Working at the hardware-software boundary, our engineers craft high-performance kernels for ML functions, ensuring every FLOP counts in delivering optimal performance for our customers' demanding workloads. We combine deep hardware knowledge with ML expertise to push the boundaries of what's possible in AI acceleration.

The AWS Neuron SDK, developed by the Annapurna Labs team at AWS, is the backbone for accelerating deep learning and GenAI workloads on Amazon's Inferentia and Trainium ML accelerators. This comprehensive toolkit includes an ML compiler, runtime, and application framework that seamlessly integrates with popular ML frameworks like PyTorch, enabling unparalleled ML inference and training performance.

As part of the broader Neuron Compiler organization, our team works across multiple technology layers - from frameworks and compilers to runtime and collectives. We not only optimize current performance but also contribute to future architecture designs, working closely with customers to enable their models and ensure optimal performance. This role offers a unique opportunity to work at the intersection of machine learning, high-performance computing, and distributed architectures, where you'll help shape the future of AI acceleration technology.

This is an opportunity to work on cutting-edge products at the intersection of machine learning, high-performance computing, and distributed architectures. You will architect and implement business-critical features, publish cutting-edge research, and mentor a brilliant team of experienced engineers. We operate in spaces that are very large, yet our teams remain small and agile. There is no blueprint. We're inventing. We're experimenting. It is a unique learning culture. The team works closely with customers on their model enablement, providing direct support and optimization expertise to ensure their machine learning workloads achieve optimal performance on AWS ML accelerators.

Key job responsibilities

Our kernel engineers collaborate across compiler, runtime, framework, and hardware teams to optimize machine learning workloads for our global customer base. Working at the intersection of software, hardware, and machine learning systems, you'll bring expertise in low-level optimization, system architecture, and ML model acceleration. In this role, you will:

Design and implement high-performance compute kernels for ML operations, leveraging the Neuron architecture and programming models
Analyze and optimize kernel-level performance across multiple generations of Neuron hardware
Conduct detailed performance analysis using profiling tools to identify and resolve bottlenecks
Implement compiler optimizations such as fusion, sharding, tiling, and scheduling
Work directly with customers to enable and optimize their ML models on AWS accelerators
Collaborate across teams to develop innovative kernel optimization techniques

About the team

#1. Diverse Experiences

AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.

#2. Why AWS

Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.

#3. Inclusive Team Culture

Here at AWS, we embrace our differences. We are committed to furthering our culture of inclusion. We have ten employee-led affinity groups, reaching 40,000 employees in over 190 chapters globally. We have innovative benefit offerings, and host annual and ongoing learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences. Amazon’s culture of inclusion is reinforced within our 16 Leadership Principles, which remind team members to seek diverse perspectives, learn and be curious, and earn trust.

#4. Work/Life Balance

Our team puts a high value on work-life balance. It isn’t about how many hours you spend at home or at work; it’s about the flow you establish that brings energy to both parts of your life. We believe striking the right balance between your personal and professional life is critical to life-long happiness and fulfillment. We offer flexibility in working hours and encourage you to find your own balance between your work and personal lives.

#5. Mentorship & Career Growth

Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge sharing and mentorship. We care about your career growth and strive to assign projects based on what will help each team member develop into a better-rounded professional and enable them to take on more complex tasks in the future.

Basic Qualifications

5+ years of non-internship professional software development experience
5+ years of programming experience with at least one software programming language
5+ years of experience leading design or architecture (design patterns, reliability, and scaling) of new and existing systems
5+ years of experience with the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations
Experience as a mentor, tech lead or leading an engineering team

Preferred Qualifications

Bachelor's degree in computer science or equivalent
6+ years of full software development experience
Expertise in accelerator architectures for ML or HPC such as GPUs, CPUs, FPGAs, or custom architectures
Experience with GPU kernel optimization and GPGPU computing such as CUDA, NKI, Triton, OpenCL, SYCL, or ROCm
Demonstrated experience with NVIDIA PTX and/or AMD GPU ISA
Experience developing high performance libraries for HPC applications
Proficiency in low-level performance optimization for GPUs
Experience with LLVM/MLIR backend development for GPUs
Knowledge of ML frameworks (PyTorch, TensorFlow) and their GPU backends
Experience with parallel programming and optimization techniques
Understanding of GPU memory hierarchies and optimization strategies

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.

Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other employees, supervisors, and staff; adhere to standards of excellence despite stressful conditions; communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service; and follow all federal, state, and local laws and Company policies. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness and professionalism, and safeguard business operations and the Company’s reputation. Pursuant to the Los Angeles County Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.

The base salary range for this position is listed below. Your Amazon package will include sign-on payments and restricted stock units (RSUs). Final compensation will be determined based on factors including experience, qualifications, and location. Amazon also offers comprehensive benefits including health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage), 401(k) matching, paid time off, and parental leave. Learn more about our benefits at .

USA, CA, Cupertino - 193,300.00 - 261,500.00 USD annually

Vacancy posted 2 days ago
