Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Sr. Fellow Machine Learning Engineer

AMD

WHAT YOU DO AT AMD CHANGES EVERYTHING

At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond.

Together, we advance your career.
The Role

We are looking for a Fellow/Sr. Fellow Machine Learning Engineer to join our Training At Scale team. If you are excited by the challenge of distributed training of large models on a large number of GPUs, and if you are passionate about improving training efficiency while innovating and generating new ideas, then this role is for you. You will be part of a world class team focused on addressing the challenge of training generative AI.

The Person

The ideal candidate should have experience with distributed training pipelines, be knowledgeable in distributed training algorithms (Data Parallel, Tensor Parallel, Pipeline Parallel, Expert Parallel), and be familiar with training large models.

Key Responsibilities
  • Train large models to convergence on AMD GPUs at scale.
  • Improve the end-to-end training pipeline performance on large scale GPU cluster.
  • Improve the end-to-end debuggability on large scale GPU cluster.
  • Design and optimize the distributed training pipeline and software stack to scale out.
  • Contribute your changes to open source.
  • Stay up-to-date with the latest training algorithms/frameworks.
  • Influence the direction of AMD AI platform.
  • Collaborate across teams with various groups and stakeholders.
Preferred Experience
  • Strong background in machine learning, distributed systems, or AI infrastructure.
  • Proven experience building and optimizing distributed training systems for large models.
  • Prefer experience in both model and application-level development and optimization.
  • Strong familiarity with ML frameworks (PyTorch, JAX, TensorFlow) and distributed frameworks (TorchTitan, Megatron-LM).
  • Hands-on expertise with LLMs, recommendation systems, or ranking models.
  • Proficiency in Python and C++, including performance profiling, debugging, and large-scale optimization.
  • Experience collaborating across hardware, compiler, and system software layers.
  • Excellent communication, and problem-solving skills.
ACADEMIC CREDENTIALS

Master’s or Ph.D. in Computer Science, Artificial Intelligence, Machine Learning, or a related field.

LOCATION

San Jose, CA or Bellevue, WA preferred. Other U.S. locations near AMD offices may be considered.

#HYBRID

Benefits offered are described: AMD benefits at a glance.

Legal & Equal Opportunity Statement

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD's 'Responsible AI Policy' is available here.

This posting is for an existing vacancy.

#J-18808-Ljbffr
Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Sr. Fellow Machine Learning Engineer in San Jose, CA vacancy
  •  ...A leading technology company is seeking a Fellow/Sr. Fellow Machine Learning Engineer to join the Training At Scale team in San Jose, CA. The candidate will work on distributed training of large models and improve training efficiency. Responsibilities include enhancing... 
    Senior

    Advanced Micro Devices , Inc.

    San Jose, CA
    4 days ago
  •  ...A leading semiconductor company is seeking a Fellow/Sr. Fellow Machine Learning Engineer in San Jose, CA. This role involves training large models, optimizing distributed training systems, and contributing to an advanced AI platform. The ideal candidate will have a strong... 
    Senior

    AMD

    San Jose, CA
    4 days ago
  •  ...strengthening cyber resilience for the infrastructure, systems, and organizations that keep the world running. Our Team's Vision: Our Engineering team is shaping the future of cybersecurity. We thrive on visionary leadership, autonomy, and ownership, fostering a culture of... 
    Senior
    Immediate start

    Illumio

    Sunnyvale, CA
    4 days ago
  • $151.8k - $265.35k

     ...Senior Machine Learning Engineer We are looking for a Senior Machine Learning Engineer to join our team of driven machine learning and software engineers. This role covers system design, prompt engineering, ML model evaluation, building data pipelines, prototype creation... 
    Senior
    Local area

    Adobe

    San Jose, CA
    4 days ago
  •  ...delivered for millions of patients worldwide. We're a team of engineers, clinicians, and innovators united by one purpose: to make...  ..., prototype, and implement advanced computer vision and machine learning algorithms tailored for real-time processing of diverse... 
    Senior
    Local area
    Worldwide
    Flexible hours

    Intuitive

    Sunnyvale, CA
    3 days ago
  •  ...take care of ourselves, each other, and our communities. Job Summary: Job Description: PayPal, Inc. seeks Sr Machine Learning Engineer in San Jose, CA Job Duties: Conduct cutting-edge research in machine learning to develop solutions that address complex... 
    Senior
    Work at office
    Local area
    Immediate start
    Remote work
    Flexible hours

    PayPal

    San Jose, CA
    5 days ago
  • $184.5k - $258k

     ...parental leave, a flexible work model, and career development resources to fuel our employees’ passion for travel. Senior Machine Learning Engineer Expedia Technology teams partner with Product to create innovative products, services, and tools that deliver high‑quality... 
    Senior
    Flexible hours

    Traveltechessentialist

    San Jose, CA
    4 days ago
  •  ...to bring your talents to Zscaler to help shape the future of cybersecurity. Role We are looking for an experienced Sr. Machine Learning Engineer to join our Engineering team. This role is based in Bangalore, reporting to the Manager, Machine Learning Engineering.... 
    Senior
    Local area

    Zscaler

    San Jose, CA
    4 days ago
  • $181.1k - $318.4k

    Santa Clara, California, United States Machine Learning and AI Siri helps hundreds of millions of people find the information they are looking...  ...to users’ questions.We are looking for an experienced ML engineer with hands-on experience in search and recommendation and deploying... 
    Senior
    Local area
    Relocation

    Apple

    Santa Clara, CA
    2 days ago
  • $178.9k - $351.23k

     ...customers and all Adobe Products. We are looking for a senior ML engineering manager to spearhead our growing efforts in building world-...  ...industry experience. ~8+ years of experience in machine learning, including production-scale deployments. ~5+ years of engineering... 
    Senior
    Temporary work
    Local area
    Worldwide

    Adobe

    San Jose, CA
    4 days ago
  • $181.1k - $318.4k

    Santa Clara, California, United States We are looking for a hardworking and experienced Machine Learning Engineer to build intelligent search experiences. In this role, you will build intelligent search systems that deeply understand user intent and context to return highly... 
    Senior
    Relocation package

    Apple Inc.

    Santa Clara, CA
    1 day ago
  • $181.1k - $318.4k

    Sr. Machine Learning Engineer, Siri Speech Cupertino, California, United States Machine Learning and AI We are a group of engineers/researchers responsible for advancing Siri Conversational AI at Apple. Our mission is to build cutting‑edge infrastructure, datasets, and... 
    Senior
    Relocation

    Apple Inc.

    Cupertino, CA
    4 days ago
  • $212k - $318.4k

    Santa Clara, California, United States Machine Learning and AI Are you interested in enhancing the capabilities of Siri and Apple products...  ...range of backgrounds, including applied machine learning engineers with a focus on ML and LLM, and experienced distributed systems... 
    Senior
    Work experience placement
    Relocation

    Apple Inc.

    Santa Clara, CA
    2 days ago
  • $141k - $228.08k

     ...stronger relationships, and the kind of precision that drives great outcomes. Job Summary Job Summary We are seeking a Machine Learning Engineer to join our pioneering security team. This role is for a technical expert passionate about deconstructing complex threats... 
    Senior
    Full time
    Work at office

    Palo Alto Networks

    San Jose, CA
    2 days ago
  • $190.2k - $345.65k

     ...Adobe Firefly's Generative AI Services team is seeking Senior Machine Learning Engineers for our GenAI Services area. In this high-impact role, you will work with a team of talented engineers in building scalable, high-performance generative AI systems-powering... 
    Senior
    Temporary work
    Local area
    Worldwide

    Adobe

    San Jose, CA
    5 days ago
  • $181.1k - $318.4k

    AIML - Sr Machine Learning Engineer, Responsible AI Cupertino, California, United States Machine Learning and AI Would you like to play a part in building the next generation of generative AI applications at Apple? We’re looking for Machine Learning Engineers to work on... 
    Senior
    Relocation

    Apple Inc.

    Cupertino, CA
    1 day ago
  • $181.1k - $318.4k

    Sr. Machine Learning Engineer, Siri Global Cupertino, California, United States Machine Learning and AI Join the Siri team at Apple! Build and contribute to a product and company that is building products, personal devices, and software designed to enrich people’s lives... 
    Senior
    Relocation

    Apple

    Cupertino, CA
    5 days ago
  • $147k - $225.5k

     ...cybersecurity. We work fast, value ongoing learning, and we respect each employee as a...  ...Your Career Your Career Bring your machine learning and applied research expertise...  ...security solutions. As a Machine Learning Engineer, you will work on large-scale ML systems... 
    Senior
    Full time
    Casual work
    Work at office
    3 days per week

    Palo Alto Networks

    Santa Clara, CA
    4 days ago
  • $162.5k - $286.4k

    Sr. Machine Learning Engineer, ASR Infrastructure and Tools Cambridge, Massachusetts, United States Machine Learning and AI Want to join the team pushing the boundaries of AI and building an intelligent assistant that helps millions of people get things done? Join the Siri... 
    Senior
    Worldwide
    Relocation

    Apple Inc.

    Cupertino, CA
    5 days ago
  • $172.5k - $306.63k

     ...Senior Machine Learning Engineer At Adobe's Experience Platform, we are looking for a Senior Machine Learning Engineer to compose, build, and operate scalable intelligent AI systems that power end-user AI products. You will work closely with Adobe Research, product... 
    Senior
    Temporary work
    Local area
    Worldwide

    Adobe

    San Jose, CA
    5 days ago
  • $154k - $220k

     ...future of cybersecurity. Role We are looking for a Sr. Staff Software Engineer to join our Zscaler Digital Experience (Core...  ...-efficient production features, utilizing LLMs, various machine learning models, data processing, fine-tuning, and inference optimization... 
    Senior
    Full time
    Work at office
    Local area
    Worldwide
    3 days per week

    Zscaler

    San Jose, CA
    4 days ago
  • $157.5k - $225k

     ...future of cybersecurity. Role We are looking for a Sr. Staff Software Engineer to join our team. This is a hybrid, based in San Jose,...  ...-efficient production features, utilizing LLMs, various machine learning models, data processing, fine-tuning, and inference optimization... 
    Senior
    Full time
    Work at office
    Local area
    Worldwide
    3 days per week

    Zscaler

    San Jose, CA
    4 days ago
  • $147.4k - $272.1k

    AIML - Sr Machine Learning Engineer - Data and ML Innovation Cupertino, California, United States Machine Learning and AI Would you like to be a part of Apple’s AI and Machine Learning org, where we encourage and create groundbreaking technology for multi-modal models... 
    Senior
    Worldwide
    Relocation

    Apple Inc.

    Cupertino, CA
    4 days ago
  • $147.4k - $272.1k

    Cupertino, California, United States Machine Learning and AI Apple’s products combine the best hardware and incredible software to deliver...  ...on users’ devices. We are seeking exceptional software engineers with a strong machine learning background to join our team where... 
    Senior
    Relocation

    Apple Inc.

    Cupertino, CA
    3 days ago
  • $270k - $334k

     ...Senior Machine Learning R&D Engineer CoStar Group is a leading global provider of commercial and residential real estate information, analytics...  ...within a dynamic R&D environment, collaborating closely with fellow engineers, researchers, and product teams to translate... 
    Senior
    Work at office
    Remote work

    CoStar Group

    Sunnyvale, CA
    2 days ago
  • $110k - $145k

     ...Position Overview We are looking for a talented and experienced Deep Learning Engineer specializing in Large Language Models (LLMs) to join our dynamic team. In this role, you will play a pivotal part in enhancing the reliability, safety, and performance of AI models and... 
    Senior

    A10 Networks

    San Jose, CA
    4 days ago
  •  ...Job Description Job Description We are seeking a highly skilled Machine Learning Engineer with deep expertise in developing Bird’s Eye View (BEV) fusion models using multimodal sensor inputs, particularly LiDAR. You will play a central role in designing scalable perception... 
    Senior

    PlusAI

    Santa Clara, CA
    a month ago
  • $225k - $325k

     ...Senior Machine Learning Engineer ABOUT THE ROLE This is a hands-on, high-ownership role for ML engineers who want to build production models that actually ship, and perform under real-world constraints. As a Founding Senior Machine Learning Engineer, you’ll work... 
    Senior
    H1b

    kadence

    San Jose, CA
    1 day ago
  •  ...the future of autonomy, Plus is looking for talented individuals to join its fast-growing teams. We are seeking a Senior Machine Learning Engineer with expertise in deep learning and data analysis. In this role, you will apply data-driven techniques to develop high-... 
    Senior

    PlusAI

    Santa Clara, CA
    a month ago
  •  ...Design and deploy production-grade systems that integrate machine learning models into scalable pipelines. Develop features that leverage...  ...modern infrastructure. Approach problems with a software engineer’s mindset—prioritizing robustness, maintainability, and performance... 
    Senior

    Jecona

    Sunnyvale, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Sr. Fellow Machine Learning Engineer. Be the first to apply!