Sr. Fellow Machine Learning Engineer
AMD
WHAT YOU DO AT AMD CHANGES EVERYTHING
At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond.
Together, we advance your career.
The Role
We are looking for a Fellow/Sr. Fellow Machine Learning Engineer to join our Training At Scale team. If you are excited by the challenge of distributed training of large models on a large number of GPUs, and if you are passionate about improving training efficiency while innovating and generating new ideas, then this role is for you. You will be part of a world-class team focused on addressing the challenge of training generative AI.
The Person
The ideal candidate should have experience with distributed training pipelines, be knowledgeable in distributed training algorithms (Data Parallel, Tensor Parallel, Pipeline Parallel, Expert Parallel), and be familiar with training large models.
Key Responsibilities
- Train large models to convergence on AMD GPUs at scale.
- Improve end-to-end training pipeline performance on large-scale GPU clusters.
- Improve end-to-end debuggability on large-scale GPU clusters.
- Design and optimize the distributed training pipeline and software stack to scale out.
- Contribute your changes to open source.
- Stay up to date with the latest training algorithms and frameworks.
- Influence the direction of the AMD AI platform.
- Collaborate across teams with various groups and stakeholders.
Preferred Experience
- Strong background in machine learning, distributed systems, or AI infrastructure.
- Proven experience building and optimizing distributed training systems for large models.
- Experience in both model-level and application-level development and optimization.
- Strong familiarity with ML frameworks (PyTorch, JAX, TensorFlow) and distributed frameworks (TorchTitan, Megatron-LM).
- Hands-on expertise with LLMs, recommendation systems, or ranking models.
- Proficiency in Python and C++, including performance profiling, debugging, and large-scale optimization.
- Experience collaborating across hardware, compiler, and system software layers.
- Excellent communication and problem-solving skills.
ACADEMIC CREDENTIALS
Master’s or Ph.D. in Computer Science, Artificial Intelligence, Machine Learning, or a related field.
LOCATION
San Jose, CA or Bellevue, WA preferred. Other U.S. locations near AMD offices may be considered.
#HYBRID
Benefits offered are described in AMD benefits at a glance.
Legal & Equal Opportunity Statement
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.
AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD's 'Responsible AI Policy' is available here.
This posting is for an existing vacancy.



