Distributed Machine Learning Engineer
Institute of Foundation Models
Distributed ML Engineer
We are a dedicated research lab for building, understanding, using, and risk-managing foundation models. Our mandate is to advance research, nurture the next generation of AI builders, and drive transformative contributions to a knowledge-driven economy.
As part of our team, you'll have the opportunity to work on the core of cutting-edge foundation model training, alongside world-class researchers, data scientists, and engineers, tackling the most fundamental and impactful challenges in AI development. You will participate in the development of groundbreaking AI solutions that have the potential to reshape entire industries. Strategic and innovative problem-solving skills will be instrumental in establishing MBZUAI as a global hub for high-performance computing in deep learning, driving impactful discoveries that inspire the next generation of AI pioneers.
Key Responsibilities
- Understand, analyze, profile, and optimize deep learning workloads on state-of-the-art hardware and software platforms, and guide the team on improving their efficiency at different levels of optimization
- Design and implement performance benchmarks and testing methodologies to evaluate application performance
- Build tools to automate workload analysis, workload optimization, and other critical workflows
- Triage system issues and identify bottlenecks and inefficiencies by analyzing the sources of issues and their impact on hardware and network, and propose solutions to enhance GPU utilization
- Support the team to develop appropriate kernels and systems for new model architectures and algorithms
- Participate in or lead design reviews with peers and stakeholders to decide among available technologies.
- Review code developed by other developers and provide feedback to ensure best practices (e.g., style guidelines, checking code in, accuracy, testability, and efficiency).
- Contribute to existing documentation or educational content and adapt content based on product/program updates and user feedback.
- Represent MBZUAI at industry conferences and events, showcasing the institution's cutting-edge HPC and deep learning capabilities and establishing MBZUAI as a global leader in AI research and innovation.
- Perform all other duties as reasonably directed by the line manager that are commensurate with these functional objectives.
Academic Qualifications
- Ph.D. in CS, EE, or CSEE with 1+ years of working experience, OR
- Master's in CS, EE, or CSEE, or equivalent experience, with 2+ years of working experience
$150,000 - $450,000 a year
This position is eligible for visa sponsorship.
Benefits Include
- Comprehensive medical, dental, and vision benefits
- Bonus
- 401K Plan
- Generous paid time off, sick leave and holidays
- Paid Parental Leave
- Employee Assistance Program
- Life insurance and disability

