Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Machine Learning Infrastructure Engineer

$160k - $200k

PlusAI

Job Description

Job Description

PlusAI is a Physical AI company pioneering AI-based virtual driver software for factory-built autonomous trucks. Headquartered in Silicon Valley with operations in the United States and Europe, Plus was named by Fast Company as one of the World’s Most Innovative Companies. Partners including TRATON GROUP’s Scania, MAN, and International brands, Hyundai Motor Company, Iveco Group, Bosch, and DSV are working with Plus to accelerate the deployment of next-generation autonomous trucks. If you’re ready to make a huge impact and drive the future of autonomy, Plus is looking for talented individuals to join its fast-growing teams.

As a Senior ML Infrastructure Engineer at Plus, you will design scalable architectures capable of handling petabytes of data while ensuring optimal performance for both training and inference phases. You will build robust pipelines for managing model versioning systems and experiment tracking frameworks, which are essential for maintaining reproducibility across experiments. Additionally, you will be responsible for managing large-scale GPU clusters. This role offers unparalleled opportunities—both technically and professionally—for individuals passionate about solving challenging problems using modern cloud-native technologies. Ideal candidates thrive in environments that leverage tools such as Docker containers orchestrated via Kubernetes clusters, seamlessly integrated with state-of-the-art deep learning frameworks like PyTorch or TensorFlow. If you are eager to push the boundaries of what's possible in machine learning infrastructure and contribute to cutting-edge solutions, this position is an excellent fit!

Responsibilities:
  • Design and develop scalable, high-performance systems for training, inference, deploying, and monitoring ML models at scale.
  • Build and maintain efficient data pipelines, model versioning systems, and experiment tracking frameworks.
  • Collaborate with cross-functional teams, including ML researchers and engineers, to identify bottlenecks and improve platform usability.
  • Implement distributed systems and storage solutions optimized for machine learning workloadsDrive improvements in CI/CD workflows for ML models and infrastructure.
  • Ensure high availability and reliability of the ML platform by implementing robust monitoring, logging, and alerting systems.
  • Stay current with industry trends and integrate relevant tools and frameworks to enhance the platform.
  • Mentor junior engineers and contribute to a culture of technical excellence
  • Ensure that your work is performed in accordance with the company’s Quality Management System (QMS) requirements and contribute to continuous improvement efforts.
  • Ensure team compliance with QMS, monitor quality, and drive process improvements.
Required Skills:
  • Phd or MS in Computer Science, Electrical Engineering, or related field
  • Good oral and written communication skills
  • Phd new grad or Masters with 3+ years of software engineering experience with a focus on ML infrastructure or distributed systems.
  • Proficiency in in Python, C++, SQL
  • Deep understanding of containerization, orchestration technologies, distributed ML workload, and experiment tracking tools (e.g., Docker, Kubernetes, multiprocessing, Kubeflow, and mlflow)
  • Deploy and manage resources across multiple cloud platforms (AWS, GCP, or on-prem environments)
  • Proficiency in at least one deep learning framework, such as PyTorch and data pipeline tools (e.g., Apache Airflow, Prefect).
  • Strong knowledge of distributed systems, databases, and storage solutions.
  • Extensive software design and development skills.
  • Ability to learn and adapt to new technologies and contribute in a productive environment.
Preferred Skills:
  • Familiarity with fundamental deep learning architectures, such as Convolutional Neural Networks (CNNs) and Transformer models
  • Experience in building large-scale ML datasets, MLOps pipelines, and distributed computing frameworks like Ray
  • Experience working with autonomous vehicles or robotics
Salary Range:
  • $160,000 - $200,000 a year

Our compensations (cash and equity) are determined based on the position, your location, qualifications, and experience.

Your opportunities joining PlusAI

Work, learn and grow in a highly future-oriented, innovative and dynamic field.

Wide range of opportunities for personal and professional development.

Catered free lunch, unlimited snacks and beverages.

Highly competitive salary and benefits package, including 401(k) plan.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses and identifying potential inconsistencies or verification signals in application materials based on available information. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Vacancy posted 14 days ago
Similar jobs that could be interesting for youBased on the Senior Machine Learning Infrastructure Engineer in Santa Clara, CA vacancy
  • $212k - $386.3k

    Senior Staff Machine Learning Engineer, Apple Search & Knowledge Platforms Santa Clara, California, United States Machine Learning and AI Apple is...  ...iMessage, and operates the foundational platforms and infrastructure that keep these intelligent experiences running at hyperscale... 
    Senior
    Temporary work
    Worldwide
    Relocation

    Apple Inc.

    Santa Clara, CA
    4 days ago
  • $153k - $222k

    Decisive Point is hiring engineers in Sunnyvale, CA, to work on machine learning infrastructure. Responsibilities include designing GPU training approaches and building ML pipelines for product workflows. The ideal candidate should have a Bachelor's degree in Computer... 
    Senior

    Decisive Point

    Sunnyvale, CA
    4 days ago
  •  ...vehicle (AV) efforts. We provide an infrastructure platform for teams developing...  ...validation of state‑of‑the‑art (SOTA) machine learning models with an emphasis on...  .... About the Role We are seeking a Senior ML Infrastructure Engineer to build and scale robust compute... 
    Senior
    Local area
    Work from home

    General Motors

    Sunnyvale, CA
    2 days ago
  • $153.2k - $234.1k

     ...transportation on a global scale. Role As a Senior ML Infra Engineer, you will work on the core systems...  ...be to dramatically accelerate the machine learning development cycle from one modeling...  ...systems, applications, or ML infrastructure. Experience designing robust... 
    Senior
    Local area
    Remote work
    Relocation package
    Flexible hours

    Israelvcforum

    Sunnyvale, CA
    2 days ago
  • $170.6k - $261.3k

    A leading automotive company in Sunnyvale seeks a Senior AI/ML Full-Stack Engineer to design and build software products for machine learning infrastructure. This hands-on role requires expertise in full-stack development, cloud technologies, and system design. The ideal... 
    Senior

    General Motors

    Sunnyvale, CA
    13 hours ago
  • $152k - $241.5k

    NVIDIA’s deep learning and HPC platforms have made a huge impact in various fields and are...  ...with a team to develop leading machine learning frameworks, NVIDIA PhysicsNeMo...  ...science, mathematics, computational science/engineering, or related technical field, or equivalent... 
    Senior

    NVIDIA

    Santa Clara, CA
    2 days ago
  •  ...Build robust, scalable, and reliable infrastructure to support the deployment and operation...  ...product managers, UX designers, and other engineers to define requirements and deliver...  ...Qualifications Knowledge and passion in machine learning algorithms, GenAI, LLMs, and Agentic... 
    Senior
    Work experience placement

    Nutanix

    Santa Clara, CA
    2 days ago
  • $184k - $287.5k

    Intelligent machines powered by Artificial Intelligence computers that can learn, reason and interact with people are no longer science fiction. GPU Deep Learning...  ...relevant experience in Computer Science, Computer Engineering, or a related technical field. 2+ years of... 
    Senior
    Odd job
    Work experience placement

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $184k - $287.5k

    ## Senior Machine Learning Engineer, End‐to‐End Autonomous DrivingApplylocations: US, CA, Santa Claratime type: Full timeposted on: Posted Todayjob requisition id: JR2016052We are seeking a Senior Machine Learning Engineer to join our end‐to‐end autonomous driving team... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    13 hours ago
  •  ..., combining automation algorithms, deep‑learning models, and agentic workflows to accelerate...  ...AI systems that integrate with existing machine learning, design automation, and...  ...need to see: MS/PhD in Electrical/Computer Engineering, Computer Science, Applied Mathematics,... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    13 hours ago
  • General Motors is looking for a Senior ML Infrastructure Engineer to build robust compute platforms for AI validation. This role emphasizes driving efficiency and maximizing GPU utilization while improving platform reliability. You will collaborate with engineers to shape... 
    Senior

    General Motors

    Sunnyvale, CA
    2 days ago
  • $152k - $241.5k

    NVIDIA Corporation is seeking a Senior ML Platform Engineer to design and scale high-performance ML infrastructure. You'll utilize IaC techniques with Ansible and Terraform, collaborating closely with ML researchers and ensuring system reliability and performance. This... 
    Senior
    Remote job

    NVIDIA

    Santa Clara, CA
    2 days ago
  • NVIDIA Gruppe is seeking a ML Platform Engineer to architect and scale high-performance ML infrastructure using modern Infrastructure-as-Code practices. You will collaborate with ML researchers to build robust platforms for advanced ML model development. The ideal candidate... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $130k - $220k

     ...drive the future of autonomy, Plus is looking for talented individuals to join its fast-growing teams. We’re looking for a machine learning engineer to train and deploy the latest generation of ML-based planning algorithms on the extensive data we collect every day... 
    Senior

    PlusAI, Inc.

    Santa Clara, CA
    3 days ago
  • $174k - $253k

    Google Inc. is seeking a Senior Software Engineer for AI/ML GenAI at Google Cloud in Sunnyvale, CA. This role involves writing and testing development code, collaborating on best practices, and designing GenAI solutions. Ideal candidates possess a Bachelor’s degree or... 
    Senior

    Google Inc.

    Sunnyvale, CA
    2 days ago
  • $166k - $244k

    Carlsbad Tech is actively seeking a Senior Software Engineer to work on the Gemini Live API in...  ...This role involves building scalable infrastructure and collaborating with research...  ...high-performance solutions in AI and machine learning. The ideal candidate will have a Bachelor... 
    Senior

    Carlsbad Tech

    Sunnyvale, CA
    2 days ago
  • $160k - $200k

    PlusAI, based in Silicon Valley, is seeking a Senior ML Infrastructure Engineer to design scalable architectures for machine learning models. This role involves building robust data pipelines, managing GPU clusters, and collaborating with cross-functional teams. Candidates... 
    Senior

    PlusAI

    Santa Clara, CA
    4 days ago
  • General Motors is seeking a Senior ML Infrastructure Engineer to build and scale a robust platform for machine learning inference workflows. You will design backend software components, collaborate with ML engineers, and lead initiatives across GM's ML ecosystem. With over... 
    Senior
    Remote job

    General Motors

    Sunnyvale, CA
    4 days ago
  •  ...world. Role Overview As our Senior Staff Software Engineer, ML infra Engineer for Search &...  ...* Develop and scale data infrastructure that powers batch and real-time data...  ...professional experience in applied machine learning * Experience in machine learning,... 
    Senior
    Full time
    Temporary work
    Flexible hours

    Coupang

    Mountain View, CA
    4 days ago
  • $174k - $253k

    Google Inc. is seeking a Senior Software Engineer to lead innovation within Cloud and ML Infrastructure in Sunnyvale, CA. This role demands strong programming skills...  ...C++, complemented by extensive experience in machine learning infrastructure. You'll be at the forefront... 
    Senior
    Worldwide

    Google Inc.

    Sunnyvale, CA
    13 hours ago
  • $174k - $253k

    Google Inc. is seeking a Senior Software Engineer specialized in AI/ML for its Sunnyvale, CA location. The role requires expertise in developing and optimizing machine learning infrastructure, along with deep experience in programming with Python or C++. Candidates should... 
    Senior

    Google Inc.

    Sunnyvale, CA
    2 days ago
  • $224k - $356.5k

    We are seeking exceptional Senior Machine Learning and Simulation Engineers to join NVIDIA's Autonomous Vehicles (AV) Simulation team! This role requires...  ...-scale ML training, AV systems, simulation, and AI infrastructure development. Deep proficiency in RL algorithms,... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  •  ...system. At 42dot, our AD ML Platform Engineers build the core data platform and ML training...  ...TP) coverage. Bootstrap and maintain infrastructure for Data Platform components—Data...  ...as integrating data pipelines with machine learning models Extensive experience with data... 
    Senior
    Full time
    Work experience placement

    42dot Inc.

    Sunnyvale, CA
    4 days ago
  • Google Inc. is seeking a Senior Software Engineer for AI/ML in Sunnyvale, CA. The candidate will develop technologies that enhance user interaction and handle massive scale information. Responsibilities include writing code, testing, design collaboration, and ML solutions... 
    Senior

    Google Inc.

    Sunnyvale, CA
    4 days ago
  • $181.1k - $318.4k

    Senior machine learning platform engineer, Evaluation & Privacy Cupertino, California, United States Software and Services Imagine what you could do...  ...devices. Scale distributed training and evaluation infrastructure to support a growing portfolio of teams and products... 
    Senior
    Relocation

    Apple

    Cupertino, CA
    4 days ago
  • $152k - $241.5k

    ## Senior ML Platform EngineerApplylocations: US, CA, Santa Clara: US...  ...now looking for a ML Platform Engineer to help accelerate the next era of machine learning innovation.In this role, you will...  ...scale our high-performance ML infrastructure using modern Infrastructure-as-... 
    Senior
    Remote work

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  • $181.1k - $318.4k

    Sunnyvale, California, United States Machine Learning and AI Apple is where individual imaginations gather together, committing to the values...  ...are a team of computer vision and machine learning (CVML) engineers building real-time 3D perception and input systems for... 
    Senior
    Relocation

    Apple

    Sunnyvale, CA
    1 day ago
  • $170.7k - $300.2k

    A leading technology firm in Cupertino is seeking engineers to develop scalable machine learning approaches for autonomous systems. Candidates should possess a strong background in ML modeling frameworks, GPU computing, and software engineering. Responsibilities include... 
    Senior

    Career-Mover

    Cupertino, CA
    2 days ago
  •  ...is located in our Sunnyvale, CA office and has a schedule of 4 days on-site and 1 day remote. About the Role As a Senior Machine Learning R&D Engineer at Matterport, a part of CoStar Group, you will be at the forefront of innovating and advancing our spatial computing... 
    Senior
    Work at office
    Remote work

    Visual Lease

    Sunnyvale, CA
    3 days ago
  •  ...benchmarks, and LLM-as-judge systems. This is a high-leverage engineering role where your work directly gates what goes to production....  ...Comfort building robust data analysis and evaluation infrastructure, not just running experiments Experience with UI/UX and front... 
    Senior
    Work at office

    Hippocratic AI

    Palo Alto, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Machine Learning Infrastructure Engineer. Be the first to apply!