Senior AI/ML Infrastructure Engineer - Reliability & Scale
Reducto, Inc.
Reducto, Inc. is seeking an Infrastructure Engineer to design, build, and maintain scalable infrastructure for AI and ML workloads. The role involves automating cloud infrastructure and implementing robust monitoring systems to ensure reliability. With a requirement of 5+ years of experience and a strong focus on quality solutions, this in-person position is based in San Francisco and offers several benefits. #J-18808-Ljbffr Reducto, Inc.
- ...A leading tech company in San Francisco seeks a Senior / Staff Network Reliability Engineer to enhance and maintain a high-performance networking stack.... ...retirement plan. Join a team focused on operational excellence in networking for AI and HPC workloads. #J-18808-Ljbffr...Senior
$190k - $270k
AI Chopping Block, Inc. in San Francisco is seeking an AI Infrastructure Engineer to maintain user-facing services and production systems. The role involves building and... ...tools like Ansible and Kubernetes, ensuring reliability and scalability. Candidates should have over...Senior- ...Public Benefit Corporation is looking for an AI Cloud Infra Engineer to join their team in San Francisco. You will ensure the reliability of backend systems and work closely with... .... The ideal candidate has strong cloud infrastructure expertise (AWS/GCP/Azure) and is excited...Senior
$300k - $430k
...leading conversational AI platform empowering... ...team. About the Team The ML Infrastructure team builds the... ...edge ML techniques into reliable, scalable systems that... ...Staff ML Infrastructure Engineer to own the platforms powering... ...and post-training at scale Implement and...SuggestedWork at office- OpenArt AI in San Francisco is seeking a Senior Platform & Reliability Engineer to design and improve the reliability of its infrastructure. The role emphasizes building and operating production systems while collaborating with product engineers to ensure platform scalability...Senior
$250k - $325k
...runs on the same infrastructure: agreements between... ...We're building the AI that finally... ...last 12 months. Engineering at Ivo Engineers... ...[2023] • Large-scale LLM-based legal fact... ...strategies to isolate ML vs API workloads... ...performance, and reliability Implement security...SeniorContract workWork at officeRemote work- A leading AI platform company in San Francisco is looking for a Senior Infrastructure Engineer to design and operate production infrastructure for high-scale, low-latency systems. Your focus will be on... ...critical services, improving reliability, and enhancing developer...Senior
- A pioneering tech startup in neurotechnology is seeking a Senior Machine Learning Infrastructure Engineer to design and scale critical infrastructure powering ML applications. This role involves creating robust data pipelines and optimizing modeling processes, essential...Senior
$261k - $326k
A technology company specializing in AI infrastructure is seeking a Principal Engineer to enhance reliability and scalability of cloud systems. This role demands over... ...expertise and systems fundamentals, especially in high-scale environments. Competitive compensation includes...Senior- BUILD in San Francisco is seeking a Senior Software Engineer to own the foundational systems for their agentic AI. This role requires expertise in infrastructure that orchestrates workflows and guarantees reliability at institutional scale. The ideal candidate will design...Senior
- ...technology firm in San Francisco is seeking a Machine Learning Engineer to enhance drug discovery processes using innovative machine learning techniques. You will design and maintain reliable ML infrastructure and collaborate with cross-functional teams to meet...Relocation package
- A leading AI infrastructure company is seeking a Staff Infrastructure Engineer in San Francisco. In this role, you will own the systems that power the company at scale, focusing on reliability, scalability, and developer velocity. You will be responsible for designing...SeniorWork at office
- ...leading market intelligence firm is seeking an experienced engineering leader to transform AI-powered systems. Responsibilities include designing... ...distributed systems, mentoring engineers, and collaborating with ML teams. Ideal candidates have 7+ years in distributed...Senior
- Zendesk, Inc. is looking for a Machine Learning Engineer in San Francisco, CA, who will drive the development of advanced ML and AI solutions. This role focuses on applying... ...hybrid working model and the opportunity to scale impactful solutions. #J-18808-Ljbffr Zendesk...Senior
- ...Design, deploy, and maintain large distributed ML training and inference clusters Develop... ...end-to-end pipelines to manage petabyte-scale datasets and model training throughout the... ...platforms (GCP, AWS, or Azure) and their ML/AI service offerings Familiarity with containerization...Senior
$190k - $270k
AI Chopping Block, Inc. seeks an AI Infrastructure Engineer in San Francisco to manage user-facing services and production systems. The role requires 5+ years of experience, a Bachelor's in Computer Science (or equivalent), and proficiency in Ansible, Terraform, and Kubernetes...Senior$190k - $270k
AI Chopping Block, Inc. is hiring an AI Infrastructure Engineer in San Francisco, California. This full-time role involves ensuring smooth operation of user-facing services and production systems, alongside building and running infrastructure with Ansible, Terraform, and...SeniorFull time$160k - $250k
...A leading AI firm in San Francisco seeks a skilled engineer to build large scale, fault-tolerant distributed systems. You will optimize for performance, work with Kubernetes, and contribute both software and Infrastructure as Code solutions. A strong background in programming...Senior- A dynamic tech company in San Francisco is seeking a seasoned ML Infrastructure Engineer to lead the development of innovative AI product systems. This role entails scaling ML product development infrastructure, collaborating with cross-functional teams, and mentoring...Senior
- ...Onos Health is looking for an AI/ML Engineer in San Francisco to develop robust AI-driven systems aimed at improving healthcare administration... ...a talented team, ensuring that AI solutions are precise, reliable, and integrated seamlessly into the healthcare platform. The...SeniorFlexible hours
- Wherobots, Inc. is seeking a Senior Machine Learning Engineer in San Francisco, California to lead the... ...of a scalable geospatial ML platform. The ideal candidate will... ...optimizing ML pipelines, and ensuring the reliability of large-scale data processing jobs. Wherobots offers...SeniorRemote job
- ...A modern technology firm in San Francisco is seeking an experienced Software Engineer with an infrastructure focus. The successful candidate will design, build, and scale backend services that support complex engineering workflows. Candidates should have 3+ years of experience...Senior
- ...professional to design and build production-ready AI agent systems for wealth management. This role emphasizes reliability and performance in live financial environments.... ...relevant experience with a strong background in ML systems, Python programming, and LLM frameworks....Senior
$166k - $225k
...of machine learning engineers and researchers, Mosaic AI enables companies... ...AI platform for the ML development lifecycle... ...the core platform infrastructure that supports our... ...features Ensure the reliability, security, and scalability... ...building large-scale systems Experience...SeniorLocal areaWorldwide$200k - $250k
...combines modern web tooling with AI-powered workflows. Our stack... ...and Kubernetes-based production infrastructure. We’re hiring a senior owner of stability and infrastructure... ...to ensure the platform is reliable, fast, and resilient as we scale. Role Mission Own service...SeniorPermanent employment$150k - $250k
...they had to do. Powerful AI will be the biggest... ...deploys frontier compute infrastructure fastest will decide... ...and software. Speed and scale are our key... ...is seeking a Network Engineer, Reliability & Observability to serve... ...Experience operating AI/ML or HPC fabrics with RDMA...SeniorLocal area- ...security, delivering an AI-powered platform that... .... As a Staff Platform Engineer, you will play a... ...leadership role. You will own reliability for major platform... ...maintaining the shared infrastructure services and platforms... ...Work on a large-scale, cloud-native SaaS platform...Senior
- Orb is looking for a senior member of the infrastructure team to maintain high reliability across its billing software. You will lead infrastructure resiliency efforts... ...critical systems, and collaborate with various engineering teams. Ideal candidates have 5+ years of...Senior
- A cutting-edge AI video platform is seeking a Senior Software Engineer (Infrastructure) to manage its GPU deployments and maintain a reliable AWS backbone. You will collaborate with specialized providers to ensure high availability and architect scalable systems, impacting...Senior
$171.6k - $302.2k
Senior Machine Learning Engineer, Machine Learning Platform Technologies Seattle... ...Machine Learning and AI Imagine what you... ...and build large-scale server-side functionality... ...users worldwide with the reliability and excellence Apple... ...system and ML Modeling (Search, Recommendation...SeniorWorldwideRelocation
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior AI/ML Infrastructure Engineer - Reliability & Scale. Be the first to apply!
- senior ai engineer San Francisco, CA
- ai ml engineer San Francisco, CA
- ai engineer remote San Francisco, CA
- ai engineer San Francisco, CA
- ai prompt engineer San Francisco, CA
- ai developer San Francisco, CA
- ai research engineer San Francisco, CA
- machine learning ai engineer San Francisco, CA
- machine learning software engineer San Francisco, CA
- graduate machine learning engineer San Francisco, CA


