AI DevOps Engineer Jobs
AI Chopping Block, Inc.
Senior Infrastructure Engineer – Bland
As a Senior Infrastructure Engineer at Bland, responsibilities include contributing to the design of scalable architecture by building distributed systems using Kubernetes that handle high-volume, real-time voice processing with strict latency and reliability requirements; building and supporting machine learning infrastructure including training pipelines and real-time inference serving across multiple regions; maintaining robust integrations with enterprise telephony systems, SIP trunks, and VoIP infrastructure; identifying architectural flaws and solving them; ensuring platform reliability through monitoring, alerting, and incident response systems to maintain enterprise-grade uptime; anticipating and solving scaling challenges related to exponential call volume growth; and implementing security best practices and compliance requirements for enterprise customers in regulated industries.

Lead – AI/ML Stack Infrastructure
Lead the team responsible for the infrastructure supporting AI/ML Stack, focusing on scalability and efficiency of the Machine Learning Operations platform. Develop and execute the long-term vision and roadmap for the MLOps team to support ML development and deployment across business units, balancing short-term tactical deliveries with long-term architectural transformation. Manage and mentor a team of 6-7+ engineers, allocating resources strategically to support existing services and execute key strategic initiatives. Collaborate cross-functionally with leaders in machine learning, data science, product engineering, and infrastructure to identify pain points, remove bottlenecks, and facilitate new solution deployment. Architect compute and storage pipelines for ML Engineers to manage large datasets and artifacts efficiently. Modernize the AI product inference stack for significant growth in global deployments. Work with Site Reliability Engineering to establish comprehensive system observability metrics. Conduct assessments for technology refresh and benchmark proprietary tools against commercial and open-source alternatives to meet future needs.

Infrastructure Engineer – AI/ML Workflows
The Infrastructure Engineer is responsible for building robust, secure, and scalable cloud infrastructure to support AI and machine learning workflows. This includes designing, building, and deploying cloud infrastructure, partnering with technical and non-technical stakeholders from idea generation through implementation and shipping, enabling Machine Learning Engineers and Data Scientists by contributing to internal best practices, standards, and reusable code repositories, proactively identifying and recommending ways customers can leverage cloud infrastructure to solve key challenges, creating and maintaining reusable, company-wide libraries and infrastructure-as-code, and researching and integrating the best open-source technologies to enhance Faculty's infrastructure capabilities.

Staff DevOps Engineer – AI Workloads
The Staff DevOps Engineer will design and architect secure, scalable cloud and edge infrastructure for deploying AI workloads across multi-cloud and hybrid environments. They will build and maintain production-grade Infrastructure as Code using tools like Terraform, Ansible, or Pulumi, managing over 100 resources with GitOps workflows and automated validation. The role includes designing and operating production Kubernetes clusters optimized for AI/ML workloads with GPU support, implementing container security, multi-tenancy, and resource optimization. They will implement secure CI/CD pipelines with integrated security controls and automated deployment workflows for containerized AI models. The engineer will lead MLOps infrastructure initiatives including model deployment pipelines, versioning, feature stores, experiment tracking, and monitoring for model performance and drift. Responsibilities also include designing comprehensive observability and monitoring solutions using tools like Prometheus, Grafana, ELK, or Datadog with distributed tracing, application performance monitoring, and real-time alerting. They will implement security best practices such as least-privilege access, encryption at rest and in transit, network segmentation, and automated compliance validation. The engineer will lead incident response and reliability initiatives, participate in on-call rotation, conduct post-mortems, and drive continuous improvement for system reliability. Architecting disaster recovery and business continuity strategies with automated backup, failover, and recovery processes is required. They will develop reusable infrastructure modules and templates to accelerate environment provisioning and standardize deployment patterns. Mentoring mid-level and senior engineers on cloud architecture, DevOps best practices, and platform reliability through design reviews and technical guidance is part of the role. They will also drive technical documentation and knowledge sharing including runbooks, architecture decision records, and infrastructure standards.

Site Reliability Engineer, Inference Infrastructure
As a Site Reliability Engineer on the Model Serving team, you will build self-service systems that automate managing, deploying, and operating services, including custom Kubernetes operators supporting language model deployments. You will automate environment observability and resilience, enabling all developers to troubleshoot and resolve problems, and take steps to ensure defined SLOs are met, including participating in an on-call rotation. Additionally, you will build strong relationships with internal developers and influence the Infrastructure team’s roadmap based on their feedback, as well as develop the team through knowledge sharing and an active review process.
Location: San Francisco or New York, United States
- A pioneering AI firm in San Francisco seeks its first full-time engineer. This role emphasizes ownership of product and infrastructure, requiring an individual who can make architectural decisions and ship features rapidly. Candidates should have a builder mindset, fluency... (Full time)
- ...Job Description Aircall is a unicorn, AI-powered customer communications platform used by 22,000+ companies worldwide to drive revenue, resolve... ...at home here. We are hiring a Software AI Engineer to join the Engineering Productivity (EngProd) team... (Worldwide)
$150k - $200k
...Understanding Recruitment provided pay range This range is provided by Understanding... ...with a leading AI company, who are looking for a DevOps Engineer to join their ever-expanding Engineering team who can... (Full time, Work at office, Immediate start, 2 days per week, 3 days per week)
- ...Software Engineer Develop high quality software solutions with comprehensive test coverage. Contribute to the technical design for system enhancements. Implement features following the guidance of senior members of the team. Provide technical assistance to non-technical...
- ...DevOps Engineer - Operational Support (Pulumi + ECS) Location: Onsite - SF Bay Area Start - April 2026 Job Summary We are seeking a DevOps Engineer to support and maintain cloud-based infrastructure and applications. This role requires strong experience...
$120k - $200k
...Jetsons, and internal systems used for testing and demos. Our engineering team works across robotics, embedded systems, and edge infrastructure... ...fast iteration and reliable releases. About the Role As a DevOps Engineer at Droyd, you will own the systems that make software... (Full time, Local area)
- ...Sr. DevOps Engineer - AWS & GitOps Focus Core Technical Skills 5+ years of experience, 10+ years desired, Required: • ArgoCD, including multi-cluster management • Kubernetes orchestration and management • Expertise in AWS services: EC2, EKS, RDS, IAM, ASM, ACM, PrivateLink...
- ...Job Description Join our infrastructure team as a DevOps Engineer responsible for designing, automating and maintaining CI/CD pipelines, cloud infrastructure (AWS/GCP), and monitoring systems. You will work to improve deployment reliability, scalability and security... (Remote work)
- ...tools consistently fail. We are a small, fast-growing team of engineers in San Francisco powering Fortune 100 enterprises, YC startups,... ...capacity planning and cost optimization Requirements 4+ years in DevOps or platform engineering Strong Kubernetes, Terraform, AWS, and... (Work at office, Visa sponsorship, Relocation package)
$180k - $275k
...yr – $275,000.00/yr We’re building an AI-native fintech platform that’s modernizing... ...workflows at speed. Position DevOps Engineer – Own and evolve the cloud infrastructure... ...level Employment Type Full‑time Job Function Finance and Accounting/Auditing... (Full time)
- ...Senior Site Reliability/DevOps Engineer SAN FRANCISCO, CA ENGINEERING FULL-TIME What you'll be doing: Provisioning and maintaining cloud infrastructure that will support training machine learning models over billions of historical data points collected from tens... (Full time, Work experience placement)
$150k - $250k
...Max AI – Stripe for Healthcare Max AI is the World’s first human-free, fully-autonomous medical billing AI agent. Many startups... ...research at MIT and Caltech for over 10 years. And our Head of Engineering was one of the earliest engineers at Figma. AI Engineer...
- ...answer their phone with voice AI. We are making sure that when... ...pressure environment to work in our engineering. This role is critical to the... ...looking for a cushy corporate job where nothing gets done, I... ...Qualifications: 5+ years as a DevOps engineer Experience writing async... (Full time, Work at office)
$150k - $350k
...About Collate Collate is an AI document generation platform for life sciences. We automate paperwork with AI, helping our customers... ...at Y Combinator and founder of Lever. Our AI researchers, engineers, and designers have worked at Google, Nvidia, Meta, Netflix, Amazon...
- ...in card issuance technology. Their team of 20 passionate professionals is seeking exceptional engineers to aid in scaling their operations. Job Description Senior DevOps Engineer (L1+): steering the evolution of infrastructure and serverless architecture within an... (Full time, Summer work, Work at office, Relocation)
- ...Yutori is reimagining how people interact with the web by building AI agents that can reliably do everyday digital tasks. We are... ...a superhuman generalist web‑agent Work closely with product engineers to translate cutting‑edge AI capabilities into reliable product... (Work at office, Relocation, Visa sponsorship)
- ...is to create the next generation of Gen AI-driven code reviewers: a symbiotic partnership... ...significantly outperforms individual engineers. We combine language models with human ingenuity... ...and quality. Role Overview As a DevOps Engineer at CodeRabbit, you’ll play a key... (Remote work)
- ...Job Description Skyfire empowers AI to present verified identities, access essential services and process payments without human intervention. From... ...reliability) and create actionable alerts so the on-call engineer knows exactly what to do - Know that a system has malfunctioned... (Work at office, Shift work)
$225k
...Engineering at Ivo Engineers at Ivo are inventors. Ivo was first-to-market with: An AI agent that lives in MS Word and edits the document for... ...What? We’re looking for a DevOps Engineer as part of Infrastructure... ...services (APIs, workers, jobs) and frontend apps (builds,... (Contract work, Work at office, Remote work)
- ...Job Title Execute and automate day-to-day project environment maintenance Help with our AWS to Azure cloud migration effort... ...Defining their own CI/CD- to keep what they have or move to Azure DevOps EC2 (currently) but moving to containers- managed container service...
- ...Job Title Required Skills: Experience with automated Windows OS image creation using a tool such as Windows ADK. Experience... ...development, test, release, update, and support processes for DevOps operation Document and validate build environments for customer...
- ...Job Requirements Hands on experience working with containers (Kubernetes) and serverless applications (Lambda, SQS, DynamoDB, S3). Understanding of SRE (Site Reliability Engineering) principles with supporting skills and experience in creating scalable and reliable...
$140k - $200k
...DevOps Engineer San Francisco, CA About Pump.co Cloud spend is a whopping $500 billion/yr, the biggest growing expense category for... ...is building the fastest way to save ~60% on cloud spend. Our AI-powered platform not only fully automates savings, but we also... (Work at office)
$150k - $240k
...DevOps Engineer Title of Role: DevOps Engineer Location: San Francisco, on-site or remote Company Stage of Funding: Series B Office... ...software development space, particularly within API SDKs and AI-driven solutions. With a strong commitment to innovation and... (Work at office, Remote work)
- ...Description We are seeking an AWS Developer Operations Engineer to join our team! You will design and develop solutions to complex... ...problems and deployment solutions. Looking to evolve "Coder" to a DevOps Master. Responsibilities: • Work with customers... (Contract work)
- ...About the role We’re looking for a top‑tier DevOps Engineer to join Linkup and work hand‑in‑hand with our CTO, Denis Charrier, and the... ...Build and operate large-scale, distributed systems powering AI-ready web search Design, implement, and maintain infrastructure... (Remote work, Flexible hours)
- ...DevOps Engineer Location: San Francisco, CA Duration: 6+ Months Responsibilities: Experience building APIs that power web or mobile applications. Experience working with Linux systems and comfortable using the command line. Fluent in Python or Node...
$230k - $320k
...DevOps Engineer At EliseAI, we're improving the industries that matter most: housing and healthcare... ...than they should be. By integrating AI agents deeply into existing workflows, we... ...make the move exciting, not painful! Job Compensation Range The salary range... (Work at office, Local area, Relocation)
- ...DevOps Engineer We are seeking a highly skilled individual to join our team as a DevOps Engineer. This role involves supporting Azure & AWS Databricks from a DevOps perspective, requiring a hybrid skill set that spans cloud infrastructure management, deployment, automation... (Work experience placement)
- ...Analytics Platform Engineer Supporting Azure & AWS Databricks from a DevOps perspective as an Analytics Platform Engineer requires a hybrid skill set spanning primarily cloud infrastructure management and deployment, automation, and CI/CD practices, and it is helpful...



