Senior AI Infrastructure Engineer
$180k - $240kFull-time
Gatik AI
WHO WE ARE
Gatik, the leader in autonomous middle-mile logistics, is revolutionizing the B2B supply chain with its autonomous transportation-as-a-service (ATaaS) solution and prioritizing safe, consistent deliveries while streamlining freight movement by reducing congestion. The company focuses on short-haul, B2B logistics for Fortune 500 retailers and in 2021 launched the world’s first fully driverless commercial transportation service with Walmart. Gatik's Class 3-7 autonomous trucks are commercially deployed across major markets, including Texas, Arkansas, and Ontario, Canada, driving innovation in freight transportation. The company's proprietary Level 4 autonomous technology, Gatik Carrier™, is custom-built to transport freight safely and efficiently between pick-up and drop-off locations on the middle mile. With robust capabilities in both highway and urban environments, Gatik Carrier™ serves as an all-encompassing solution that integrates advanced software and hardware powering the fleet, facilitating effortless integration into customers' logistics operations.ABOUT THE ROLE
We are seeking a Senior AI Infrastructure Engineer to design, build, and scale the high-performance AI platform powering our autonomous driving models. While researchers focus on developing perception, planning, and world models, you will be responsible for the underlying infrastructure that enables distributed training, experiment tracking, and seamless model deployment. You will bridge the gap between research and production, ensuring our AI stack is scalable, resilient, and highly efficient This role is onsite 5 days a week at our Santa Clara, CA office!WHAT YOU'LL DO
- Distributed Training & ML Systems Support
- Scale Research Workloads: Enable researchers to scale complex models (VLA,
- Agentic Infrastructure & Automation
- Self-Healing AI Infrastructure: Architect and deploy Autonomous AI Agents
- Model Management & Lifecycle (MLOps)
- Automated Lifecycle Management: Design and maintain ML infrastructure
- Cloud-Native Foundations & Data Integration
- Infrastructure as Code: Drive the "Everything as Code" philosophy using
- Monitoring & Observability
- System Metrics: Define and track key ML system metrics, including training
WHAT WE'RE LOOKING FOR
* Experience: 5+ years in ML infrastructure, MLOps, or DevOps supporting high-scale compute environments. * ML Expertise: Deep understanding of multi-GPU training strategies (FSDP, DeepSpeed, Ray Train) and high-performance networking (NCCL, InfiniBand). * Infrastructure Automation: Mastery of Kubernetes, Terraform, and Helm, with a focus on GPU-native orchestration. * AI Agent Frameworks: Proven experience building or supporting Agentic Workflows for infrastructure or data automation (e.g., using LLMs to drive DevOps tasks).- Platform Mastery: Expertise in MLFlow, Argo Workflows, and Kubernetes.
- Containerization: Strong experience with Docker, Kubernetes, and Helm.
- Data & CI/CD: Proficiency in Apache Airflow, Kafka, Spark, and GitOps
BONUS QUALIFICATIONS
* Advanced AI Protocols: Familiarity with the Model Context Protocol (MCP) to standardize how AI agents interact with internal databases and orchestration APIs. * Hybrid & Physical AI: Experience in hybrid cloud and on-prem GPU cluster management for Physical AI workloads (e.g., 3DGS, World Models). * Agentic Observability: Experience utilizing LLMs for semantic monitoring and log analysis to detect complex distributed system failures that traditional threshold-based alerts miss. Salary Ranges - $180,000- $240,000MORE ABOUT GATIK
Founded in 2017 by experts in autonomous vehicle technology, Gatik has rapidly expanded its presence to Mountain View, Dallas-Fort Worth, Arkansas, and Toronto. As the first and only company to achieve fully driverless middle-mile commercial deliveries, Gatik holds a unique and defensible position in the AV industry, with a clear trajectory toward sustainable growth and profitability. We have delivered complete, proprietary AV technology - an integration of software and hardware - to enable earlier successes for our clients in constrained Level 4 autonomy. By choosing the middle mile – with defined point-to-point delivery, we have simplified some of the more complex AV challenges, enabling us to achieve full autonomy ahead of competitors. Given extensive knowledge of Gatik’s well-defined, fixed route ODDs and hybrid architecture, we are able to hyper-optimize our models with exponentially less data, establish gate-keeping mechanisms to maintain explainability, and ensure continued safety of the system for unmanned operations. Visit us at Gatik [ for more company information and Careers at Gatik [ for more open roles.NOTABLE NEWS
* Bloomberg: Autonomous Trucking Firm Gatik Inks Contracts Worth $600 Million [ * Forbes: Hundreds’ Of Gatik Robot Delivery Trucks Headed For U.S. Roads [ * Forbes:Gatik And Loblaw Announce Largest Commercial Deployment Of AV Trucks [ * Forbes: Forget robotaxis. Upstart Gatik sees middle-mile deliveries as the path to profitable AVs [ * Tech Brew: Gatik AI exec unpacks the regulations that could shape the AV industry [ * Business Wire: Gatik Paves the Way for Safe Driverless Operations (‘Freight-Only’) at Scale with Industry-First Third-Party Safety Assessment Framework [ * Auto Futures: Autonomous Trucking Group Gatik Secures Investment From NIPPONEXPRESS HOLDINGS
[ * Automotive News: Gatik foresees hundreds of self-driving trucks on road soon, and that's just the beginning [ * Forbes: Isuzu And Gatik Go All In To Scale Up Driverless Freight Services [ * Bloomberg: Autonomous Vehicle Startup Takes Off by Picking Off Easier Routes [ * Reuters: Driverless vehicles on limited routes bump along despite US robotaxi scrutiny [TAKING CARE OF OUR TEAM
At Gatik, we connect people of extraordinary talent and experience to an opportunity to create a more resilient supply chain and contribute to our environment’s sustainability. We are diverse in our backgrounds and perspectives yet united by a bold vision and shared commitment to our values. Our culture emphasizes the importance of collaboration, respect and agility. We at Gatik strive to create a diverse and inclusive environment where everyone feels they have opportunities to succeed and grow because we know that together we can do great things. We are committed to an inclusive and diverse team. We do not discriminate based on race, color, ethnicity, ancestry, national origin, religion, sex, gender, gender identity, gender expression, sexual orientation, age, disability, veteran status, genetic information, marital status or any legally protected status.Vacancy posted 12 hours ago
Similar jobs that could be interesting for youBased on the Senior AI Infrastructure Engineer in Santa Clara, CA vacancy
$126k - $423k
Decisive Point is seeking a Research Engineer (AI/RL Infrastructure) in Sunnyvale, California to design and operate large-scale ML systems. You will collaborate with leading experts and contribute to next-generation physical AI, impacting self-driving technologies. This...Senior- NVIDIA Corporation in Santa Clara is seeking a Senior Software Engineer to lead the optimization of large-scale AI systems. This role will involve profiling and... ...will have over 8 years of experience in software infrastructure for AI systems, with expert-level programming...Senior
$182k - $242k
CoreWeave is seeking an experienced professional to contribute to building distributed systems and ML infrastructure. The successful candidate will play a pivotal role in designing an optimal research cluster experience, including a Python SDK, while collaborating closely...Senior$262k - $365k
Google Inc. seeks a Senior Staff Software Engineer for AI Infrastructure within Google Cloud. This role involves architecting high-performance, distributed infrastructure for agentic AI workflows, with responsibilities including system reliability and transitioning experimental...Senior$356.5k
NVIDIA Gruppe is seeking an experienced AI infrastructure software engineer to join its DGX Cloud AI Efficiency Team in Santa Clara, California. This role focuses on developing the infrastructure for optimizing AI workloads and ensuring high availability and efficiency...Senior$168k - $322k
NVIDIA Gruppe is seeking a Senior AI Platform Engineer to improve engineering efficiency and data security through AI-powered products. The... ...involves working with Cloud and AI/ML teams to build and scale infrastructure and shape the technological future of the organization....Senior- NVIDIA Corporation is seeking a Datacenter Product Engineer to join its Datacenter team in Santa Clara, California. This role focuses on launching AI supercomputing platforms and supporting GPU production. The ideal candidate will collaborate with NPI teams and implement...Senior
- NVIDIA Gruppe is looking for an experienced GPU Deployment Engineer to tackle end-to-end AI deployment challenges on the NVIDIA RTX AI platform. The role involves analyzing GPU-accelerated applications, improving user experiences, and collaborating with teams to influence...Senior
- NVIDIA Gruppe in Santa Clara is looking for an experienced engineer to support our new supercomputers and AI technologies. You will lead collaboration across various teams and work closely with customers to understand their needs and develop tailored features. The ideal...Senior
- ...searching for a high-level DevOps Platform Engineer to enhance its Multi-Cloud Platform. In this role, you will architect AI-driven workflows and lead production environments... ..., and Azure. You will build self-healing infrastructure and develop advanced CI/CD pipelines while...Senior
$184k - $356.5k
NVIDIA Gruppe is looking for skilled software engineers to develop AI inference systems that operate with high efficiency. The role involves architecting high-performance inference frameworks and optimizing GPU processes. Ideal candidates should have extensive programming...Senior$210k - $295k
...EXPLORATION TECHNOLOGIES CORP in Sunnyvale, CA, is seeking a Principal Software Engineer for the Platform Team. This role focuses on building foundational AI tooling and security infrastructure to enhance engineering workflows at SpaceX. The ideal candidate will have...Senior$174k - $253k
Google is seeking an Applied AI Customer Engineer in Sunnyvale, CA, offering a competitive salary ranging from $174,000 to $253,000 plus a bonus and equity. In this role, you will leverage your technical expertise to assist customers in adopting Conversational AI solutions...Senior- A technology firm specializing in AI solutions is seeking a Senior AI Engineer in Santa Clara, California. You will design and implement AI-powered software, managing everything from backend to frontend interfaces. Responsibilities include developing production-grade AI...Senior
- Joining NVIDIA's DGX Cloud AI Efficiency Team means contributing to the infrastructure that powers our innovative AI research... ...an AI infrastructure software engineer to join our team. You'll be instrumental... ...of AI systems. As a senior DGX Cloud AI Infrastructure software...Senior
- Lendistry, LLC. is seeking a Senior AI Engineer to lead the delivery of AI solutions, including document intelligence and risk assessment tools. In this role, you will be responsible for mentoring junior engineers and shaping AI-driven workflows, improving the borrower...Senior
$174k - $253k
Google Inc. is seeking a Senior Software Engineer specialized in AI/ML for its Sunnyvale, CA location. The role requires expertise in developing and optimizing machine learning infrastructure, along with deep experience in programming with Python or C++. Candidates should...Senior- A leading technology company in Santa Clara is seeking a Senior/Staff Software Engineer specializing in AI and search technologies. Candidates must possess strong skills in Kotlin or Java, have a minimum of 5 years' experience, and be proficient in containerized environments...Senior
- Google Inc. is seeking a Senior Software Engineer for AI/ML in Sunnyvale, CA. The candidate will develop technologies that enhance user interaction and handle massive scale information. Responsibilities include writing code, testing, design collaboration, and ML solutions...Senior
$224k - $356.5k
NVIDIA Corporation is seeking a Senior Software Engineer to drive architecture, optimization, and execution for autonomous driving software. The role involves deep learning inference and TensorRT to optimize DNN models for automotive compute platforms. Applicants should...Senior- NVIDIA is looking to hire a deeply technical, creative, and Senior AI Platform Engineer to build, support, and maintain the next generation of... ...era. What you will be doing: Define and lead AI-native infrastructure roadmaps and cross‑organizational initiatives. Architect...Senior
$283.4k
KLA is seeking a Sr. AI Infrastructure Software Engineer in Milpitas, California. This role focuses on C++ programming and involves designing core infrastructure for AI workloads. Join a top-notch team solving complex problems at the intersection of software and hardware...Senior- Drive Capital is seeking a Senior Customer Support Engineer in Campbell, CA. This role involves responding to customer inquiries, managing technical operations, and building strong relationships with customers based on technical excellence. The ideal candidate will have...Senior
- NVIDIA Gruppe is looking for a Product Manager to lead AI platform initiatives in Santa Clara. You will define product vision and execution with an emphasis on AI observability and developer workflows. The role involves collaborating with cross-functional teams to enhance...Senior
$208k - $327.75k
...experienced Product Manager to lead strategic AI platform initiatives in Santa Clara,... ...through close collaboration with engineering and platform teams. The ideal candidate... ...a strong technical background in AI/ML infrastructure. This position offers a competitive salary...Senior- Eridu AI in Saratoga, California, is seeking a Hardware Engineer to lead the design and production of advanced hardware systems. The role demands a solid foundation... ...across various teams to deliver innovative AI infrastructure solutions. The ideal candidate will possess at...Senior
$181.1k - $318.4k
...for its Special Projects team in Cupertino, California. The role focuses on building innovative applications and robust infrastructure to support AI research. Candidates should excel in programming languages like Go or Swift and have experience with web services and containers...Senior$174k - $253k
Google Inc. is seeking a Senior Software Engineer to support AI/ML Training Infrastructure in Mountain View, CA. The role involves building data and training foundations for AI innovation, collaborating with teams on design and code reviews, and ensuring effective operations...Senior- Google Inc. is seeking a Software Engineer III to focus on generative AI and infrastructure development in Mountain View, CA. The ideal candidate will possess strong software development skills and experience with GenAI techniques. This role offers a unique opportunity...Senior
$220k - $350k
United States Digital Space LLC seeks a Principal AI Engineer to integrate AI capabilities for U.S. federal agencies. Responsibilities include designing and optimizing models and ensuring effective deployment. Candidates should have extensive software engineering experience...Senior
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior AI Infrastructure Engineer. Be the first to apply!
Related searches
- ai engineer Santa Clara, CA
- machine learning ai engineer Santa Clara, CA
- ai ml engineer Santa Clara, CA
- senior ai engineer Santa Clara, CA
- ai prompt engineer Santa Clara, CA
- ai developer Santa Clara, CA
- ai engineer remote Santa Clara, CA
- infrastructure automation engineer Santa Clara, CA
- senior infrastructure engineer Santa Clara, CA
- security infrastructure engineer Santa Clara, CA
