Sr. ML Kernel Performance Engineer, AWS Neuron, Annapurna Labs

$193.3k - $261.5k

Amazon Locker

Description

The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon’s custom machine learning accelerators, Inferentia and Trainium.

The Acceleration Kernel Library team is at the forefront of maximizing performance for AWS's custom ML accelerators. Working at the hardware-software boundary, our engineers craft high-performance kernels for ML functions, ensuring every FLOP counts in delivering optimal performance for our customers' demanding workloads. We combine deep hardware knowledge with ML expertise to push the boundaries of what's possible in AI acceleration.

The AWS Neuron SDK, developed by the Annapurna Labs team at AWS, is the backbone for accelerating deep learning and GenAI workloads on Amazon's Inferentia and Trainium ML accelerators. This comprehensive toolkit includes an ML compiler, runtime, and application framework that seamlessly integrates with popular ML frameworks like PyTorch, enabling unparalleled ML inference and training performance.

As part of the broader Neuron Compiler organization, our team works across multiple technology layers - from frameworks and compilers to runtime and collectives. We not only optimize current performance but also contribute to future architecture designs, working closely with customers to enable their models and ensure optimal performance. This role offers a unique opportunity to work at the intersection of machine learning, high-performance computing, and distributed architectures, where you'll help shape the future of AI acceleration technology.

This is an opportunity to work on cutting-edge products at the intersection of machine learning, high-performance computing, and distributed architectures. You will architect and implement business-critical features, publish cutting-edge research, and mentor a brilliant team of experienced engineers. We operate in spaces that are very large, yet our teams remain small and agile. There is no blueprint. We're inventing. We're experimenting. It is a unique learning culture. The team works closely with customers on their model enablement, providing direct support and optimization expertise to ensure their machine learning workloads achieve optimal performance on AWS ML accelerators.

Key job responsibilities

Our kernel engineers collaborate across compiler, runtime, framework, and hardware teams to optimize machine learning workloads for our global customer base. Working at the intersection of software, hardware, and machine learning systems, you'll bring expertise in low-level optimization, system architecture, and ML model acceleration. In this role, you will:

  • Design and implement high-performance compute kernels for ML operations, leveraging the Neuron architecture and programming models

  • Analyze and optimize kernel-level performance across multiple generations of Neuron hardware

  • Conduct detailed performance analysis using profiling tools to identify and resolve bottlenecks

  • Implement compiler optimizations such as fusion, sharding, tiling, and scheduling

  • Work directly with customers to enable and optimize their ML models on AWS accelerators

  • Collaborate across teams to develop innovative kernel optimization techniques

About the team

#1. Diverse Experiences

AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.

#2. Why AWS

Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.

#3. Inclusive Team Culture

Here at AWS, we embrace our differences. We are committed to furthering our culture of inclusion. We have ten employee-led affinity groups, reaching 40,000 employees in over 190 chapters globally. We have innovative benefit offerings, and host annual and ongoing learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences. Amazon’s culture of inclusion is reinforced within our 16 Leadership Principles, which remind team members to seek diverse perspectives, learn and be curious, and earn trust.

#4. Work/Life Balance

Our team puts a high value on work-life balance. It isn’t about how many hours you spend at home or at work; it’s about the flow you establish that brings energy to both parts of your life. We believe striking the right balance between your personal and professional life is critical to life-long happiness and fulfillment. We offer flexibility in working hours and encourage you to find your own balance between your work and personal lives.

#5. Mentorship & Career Growth

Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge sharing and mentorship. We care about your career growth and strive to assign projects based on what will help each team member develop into a better-rounded professional and enable them to take on more complex tasks in the future.

Basic Qualifications

  • 5+ years of non-internship professional software development experience

  • 5+ years of programming experience with at least one software programming language

  • 5+ years of experience leading design or architecture (design patterns, reliability, and scaling) of new and existing systems

  • 5+ years of experience with the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations

  • Experience as a mentor, tech lead or leading an engineering team

Preferred Qualifications

  • Bachelor's degree in computer science or equivalent

  • 6+ years of full software development experience

  • Expertise in accelerator architectures for ML or HPC such as GPUs, CPUs, FPGAs, or custom architectures

  • Experience with GPU kernel optimization and GPGPU computing such as CUDA, NKI, Triton, OpenCL, SYCL, or ROCm

  • Demonstrated experience with NVIDIA PTX and/or AMD GPU ISA

  • Experience developing high performance libraries for HPC applications

  • Proficiency in low-level performance optimization for GPUs

  • Experience with LLVM/MLIR backend development for GPUs

  • Knowledge of ML frameworks (PyTorch, TensorFlow) and their GPU backends

  • Experience with parallel programming and optimization techniques

  • Understanding of GPU memory hierarchies and optimization strategies

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.

Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other employees, supervisors, and staff; adhere to standards of excellence despite stressful conditions; communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service; and follow all federal, state, and local laws and Company policies. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness and professionalism, and safeguard business operations and the Company’s reputation. Pursuant to the Los Angeles County Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.

The base salary range for this position is listed below. Your Amazon package will include sign-on payments and restricted stock units (RSUs). Final compensation will be determined based on factors including experience, qualifications, and location. Amazon also offers comprehensive benefits including health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage), 401(k) matching, paid time off, and parental leave. Learn more about our benefits at .

USA, CA, Cupertino - 193,300.00 - 261,500.00 USD annually

Vacancy posted 2 days ago