Senior Machine Learning Infrastructure Engineer

$160k - $200k

PlusAI, Inc.

PlusAI is a Physical AI company pioneering AI-based virtual driver software for factory-built autonomous trucks. Headquartered in Silicon Valley with operations in the United States and Europe, Plus was named by Fast Company as one of the World's Most Innovative Companies. Partners including TRATON GROUP's Scania, MAN, and International brands, Hyundai Motor Company, Iveco Group, Bosch, and DSV are working with Plus to accelerate the deployment of next-generation autonomous trucks. If you're ready to make a huge impact and drive the future of autonomy, Plus is looking for talented individuals to join its fast-growing teams.

As a Senior ML Infrastructure Engineer at Plus, you will design scalable architectures capable of handling petabytes of data while ensuring optimal performance for both training and inference. You will build robust pipelines and manage the model versioning systems and experiment tracking frameworks that are essential for reproducibility across experiments. Additionally, you will be responsible for managing large-scale GPU clusters. This role offers unparalleled opportunities, both technical and professional, for individuals passionate about solving challenging problems using modern cloud-native technologies. Ideal candidates thrive in environments that use Docker containers orchestrated with Kubernetes and integrated with state-of-the-art deep learning frameworks like PyTorch or TensorFlow. If you are eager to push the boundaries of what's possible in machine learning infrastructure and contribute to cutting-edge solutions, this position is an excellent fit!

Responsibilities:

  • Design and develop scalable, high-performance systems for training, serving, deploying, and monitoring ML models at scale.
  • Build and maintain efficient data pipelines, model versioning systems, and experiment tracking frameworks.
  • Collaborate with cross-functional teams, including ML researchers and engineers, to identify bottlenecks and improve platform usability.
  • Implement distributed systems and storage solutions optimized for machine learning workloads.
  • Drive improvements in CI/CD workflows for ML models and infrastructure.
  • Ensure high availability and reliability of the ML platform by implementing robust monitoring, logging, and alerting systems.
  • Stay current with industry trends and integrate relevant tools and frameworks to enhance the platform.
  • Mentor junior engineers and contribute to a culture of technical excellence.
  • Ensure that your work is performed in accordance with the company's Quality Management System (QMS) requirements and contribute to continuous improvement efforts.
  • Ensure team compliance with QMS, monitor quality, and drive process improvements.
Required Skills:
  • PhD or MS in Computer Science, Electrical Engineering, or a related field
  • Good oral and written communication skills
  • New PhD graduate, or Master's degree with 3+ years of software engineering experience focused on ML infrastructure or distributed systems.
  • Proficiency in Python, C++, and SQL
  • Deep understanding of containerization, orchestration technologies, distributed ML workloads, and experiment tracking tools (e.g., Docker, Kubernetes, multiprocessing, Kubeflow, and MLflow)
  • Experience deploying and managing resources across multiple platforms (AWS, GCP, or on-prem environments)
  • Proficiency in at least one deep learning framework, such as PyTorch, and in data pipeline tools (e.g., Apache Airflow, Prefect).
  • Strong knowledge of distributed systems, databases, and storage solutions.
  • Extensive software design and development skills.
  • Ability to learn and adapt to new technologies and contribute in a productive environment.
Preferred Skills:
  • Familiarity with fundamental deep learning architectures, such as Convolutional Neural Networks (CNNs) and Transformer models
  • Experience in building large-scale ML datasets, MLOps pipelines, and distributed computing frameworks like Ray
  • Experience working with autonomous vehicles or robotics
Salary Range:
  • $160,000 - $200,000 a year

Our compensation (cash and equity) is determined based on the position, your location, qualifications, and experience.

Your opportunities when joining PlusAI

Work, learn and grow in a highly future-oriented, innovative and dynamic field.

Wide range of opportunities for personal and professional development.

Catered free lunch, unlimited snacks and beverages.

Highly competitive salary and benefits package, including 401(k) plan.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
Vacancy posted 2 days ago