Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior AI-Native DevOps / Operations Engineer (AMER)

Valency Systems Inc

About Valency

Valency Systems is a small, dynamic team of engineers, scientists, and researchers building the global hub for the agentic research era.

We're based in Berkeley, California, and we're building something that matters. If you care about open science, advancing research at the speed of thought, and using AI to accelerate discovery, we'd love to talk.

Our team is hybrid. We come together in person 3 days a week, with the option for 2 days of remote work.

The Position

We're hiring an AI-native DevOps / Operations Engineer to help build and operate the platform behind Valency. This is not a narrow infrastructure maintenance role. We want builders who can design and harden production systems, improve CI/CD and release quality, raise reliability and response times, and create the observability, analytics, and guardrails needed to safely operate a rapidly evolving platform.

This role sits at the intersection of platform engineering, cloud infrastructure, production operations, and AI-era software delivery. You will help close the loop from agentically written software to reliable, performant systems in production. That means better tests, better release controls, stronger guardrails, richer production telemetry, clearer workflows for human approval, and tighter feedback into product and engineering.

This is an especially strong fit for someone who has helped scale high-growth SaaS systems, likes building from first principles, and wants to experience that kind of growth again in a new context.

What You'll Own
  • Design, build, and improve the production platform powering Valency
  • Tighten CI/CD processes so changes are tested, gated, observable, and safe to ship
  • Improve production reliability, latency, deployment safety, and incident response
  • Build the operational feedback loops that help engineering and product teams act on real production behavior
  • Establish the right logging, analytics, tracing, alerting, and workflow instrumentation as the platform scales
  • Define and implement guardrails for agent-involved software delivery and operations
  • Introduce human-in-the-loop approval flows where autonomy needs stronger controls
  • Improve cost efficiency across cloud infrastructure and platform operations
  • Help shape security, compliance, and auditability foundations for SOC 2, ISO 27001, and FedRAMP-oriented environments
  • Contribute to the long-term platform engineering direction as the team grows and specializes
As the senior engineer on-site, you will:
  • Own production operations and operational excellence for this function
  • Lead incident response expectations for the role
  • Establish the operating model the broader team will scale on
  • Work onsite in the SF Bay Area
What Success Looks Like

In the first 6-12 months, you will help Valency begin tracking and materially improve:
  • Deployment frequency and release confidence
  • Change failure rate and rollback quality
  • MTTR and incident handling
  • p95 / p99 latency and system responsiveness
  • Uptime and service reliability
  • Alert quality and signal-to-noise ratio
  • Infrastructure cost efficiency
  • Operational visibility into agent workflows and production behavior
  • Guardrail coverage for agent-authored or agent-assisted changes
What You'll Work With

Today the platform makes use of AWS and adjacent infrastructure including:
  • ECS / Fargate
  • EKS / container orchestration environments
  • RDS
  • S3
  • Cloudflare
  • CloudWatch
  • Queues, caches, schedulers, and batch / background processing systems
We currently use GitHub Actions and expect this person to help evolve that into a stronger long-term platform engineering and delivery foundation

Our observability and analytics stack is still open for innovation. We want someone who is comfortable evaluating the tradeoffs and building the right system as complexity grows.

What Makes This Role AI-Native

This is not "DevOps, but with AI in the title."

You will help build the operational system around software and workflows that increasingly involve agents. That includes:
  • Tracing workflows across agent-driven and human-driven systems
  • Developing production guardrails to keep systems from going off the rails
  • Designing approval paths for high-risk or high-impact actions
  • Turning production signals into actionable inputs for product and engineering
  • Helping close the loop between what the system is doing, how users experience it, and how the platform should evolve
We do not require prior experience operating AI-native systems at scale. We do require strong judgment, strong production systems experience, and a willingness to build the right AI-era operating model.

Responsibilities
  • Own and improve CI/CD pipelines, release controls, and deployment workflows
  • Build and maintain highly reliable AWS-based production systems
  • Improve observability across logs, metrics, traces, events, and workflow state
  • Instrument platform behavior so system issues, regressions, and slowdowns are quickly visible and actionable
  • Create operational analytics that help close the loop between engineering, product, and customer experience
  • Drive cost engineering and infrastructure efficiency as the system scales
  • Build safer operating patterns for agent-assisted code changes and operational actions
  • Implement testing, validation, approval, and rollback mechanisms that reduce operational risk
  • Improve batch, queue, cache, and job-processing reliability and monitoring
  • Support incident response, root cause analysis, postmortems, and follow-through
  • Partner with external vendors and partners when needed
  • Help define platform standards, reliability practices, and operational maturity across the company
What We're Looking For

Required
  • 8+ years of progressively increasing responsibility operating important production systems
  • Demonstrated success shipping and running high-reliability systems in production
  • Deep AWS experience in real production environments
  • Strong background in software engineering and testing, not just infrastructure administration
  • Experience designing or significantly improving CI/CD systems and release processes
  • Experience building or operating logging, monitoring, alerting, and observability systems
  • Experience improving production reliability, performance, and operational response
  • Comfort with container-based systems and orchestration platforms
  • Strong hands-on ability in at least some of: Python, Go, Elixir, CDK
  • Strong judgment around guardrails, operational safety, and change management
  • Ability to work in ambiguity and build systems that do not yet fully exist
Strongly Preferred
  • Startup experience, especially in fast-scaling environments
  • Experience at high-scale SaaS companies that have gone through periods of rapid growth
  • Experience owning or materially influencing platform engineering functions
  • Experience with cost engineering / FinOps in AWS-heavy environments
  • Experience designing systems for compliance-oriented environments
  • Experience with SOC 2, ISO 27001, or FedRAMP-related operational requirements
  • Experience evaluating or implementing modern observability and workflow tracing stacks
  • Experience creating human-in-the-loop approval systems for sensitive production workflows
Why This Role
  • You will help define how an AI-native research platform is actually operated in production
  • You will work on systems that connect agents, researchers, product behavior, and infrastructure reality
  • You will have broad scope across infrastructure, reliability, analytics, and operational guardrails
  • You will help build the production foundation for a category-defining company at an early stage
  • You will not inherit a frozen stack; you will help choose and build the right one

Compensation, Benefits & Equity
We offer a competitive salary, benefits, and meaningful equity in a company building something important from the ground up.

Work Authorization: Candidates must be legally authorized to work in the United States.
Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Senior AI-Native DevOps / Operations Engineer (AMER) in Berkeley, CA vacancy
  •  ..., by developing the first AI Hardware Engineer. Our goal is to democratize...  ...has to just work. As a DevOps Engineer, you'll work on the...  ..., availability, and operational health of production systems...  ...~ Experience with cloud-native and serverless platforms (GCP... 
    Senior
    Shift work

    Flux Protocol

    San Francisco, CA
    3 days ago
  • A technology startup in San Francisco is seeking a DevOps Engineer as part of their founding team. The role involves building...  ...scalable cloud infrastructure and ensuring the operational reliability of their AI-native enterprise platform. Candidates should have 4-10 years... 
    Suggested

    Fabrion

    San Francisco, CA
    4 days ago
  • $192k - $240k

     ...Security Operations Engineer Brex is the intelligent finance platform that enables companies...  ...and control spend effortlessly. Brex's AI-native automation and world-class service eliminate...  ...Familiarity with CI/CD systems and DevOps workflows (e.g. Buildkite, Flux, Git,... 
    Senior
    Work experience placement
    Work at office
    Remote work
    Work from home

    Brex

    San Francisco, CA
    5 days ago
  • $67k - $136.8k

     ...opportunity As an FSO DevOps Engineer Senior Analyst, you’ll be based in...  ...in driving the delivery and operations of the Web3 Platform that supports...  ..., IAM Policies, cloud-native security controls What...  ...markets. Enabled by data, AI and advanced technology, EY... 
    Senior
    Summer holiday
    Flexible hours

    EY

    San Francisco, CA
    3 days ago
  • Finalis is seeking a Senior Software Engineer to enhance our ai-native platform. This role involves collaborating with cross-functional teams to design and implement scalable services for compliance, data, and payments. Candidates should have over 5 years of experience,... 
    Senior
    Remote work

    Finalis

    San Francisco, CA
    5 days ago
  • OpenArt AI in San Francisco is seeking a Senior Platform & Reliability Engineer to design and improve the reliability of its infrastructure...  ...role emphasizes building and operating production systems while...  ...skills, and familiarity with cloud-native architectures like AWS or GCP.... 
    Senior

    OpenArt AI

    San Francisco, CA
    2 days ago
  •  ...We are looking for a Senior Automation / DevOps Engineer to support and evolve a modern integration and cloud...  ...role focuses on automating manual operational work, strengthening Infrastructure as...  ...this job, you agree to receive calls, AI-generated calls, text messages, or... 
    Senior

    Apex Systems

    San Francisco, CA
    1 day ago
  • A technology company in San Francisco is seeking a DevOps Engineer to enhance the reliability and operational health of their production systems. You will set observability standards, build internal tooling, and partner with engineers for system design. The ideal candidate... 
    Senior

    Flux Enterprise

    San Francisco, CA
    3 days ago
  • A healthcare AI company is seeking a Sr. Infrastructure Engineer to enhance and scale their systems. Responsibilities include managing infrastructure with Terraform and Kubernetes, creating monitoring solutions, and troubleshooting. Ideal candidates will have 5+ years in... 
    Senior
    Remote job

    AKASA

    San Francisco, CA
    1 day ago
  •  ...technology startup in the San Francisco Bay Area is seeking a DevOps Engineer to build and maintain scalable cloud infrastructure. The ideal...  ...team. If you're motivated by solving complex infrastructure challenges in AI, we want to hear from you. #J-18808-Ljbffr Fabrion

    Fabrion

    San Francisco, CA
    3 days ago
  • Icehouseventures is seeking a skilled DevOps Engineer in San Francisco to join their Infrastructure team. This role focuses on building and maintaining Docker-based environments, managing infrastructure using Pulumi, and optimizing CI/CD workflows. Ideal candidates will... 

    Icehouseventures

    San Francisco, CA
    5 days ago
  • Phonely in San Francisco is seeking an experienced DevOps Engineer to join our engineering team and help build reliable cloud infrastructure for voice AI systems. This role is fully on-site and essential to our fast-paced business environment. The ideal candidate will... 
    Senior

    Phonely

    San Francisco, CA
    5 days ago
  • A tech innovation firm in San Francisco seeks a DevOps Engineer to scale and secure their infrastructure for AI-driven developer tools. This hands-on role demands expertise in CI/CD systems, cloud providers, and observability tools, as well as a strong commitment to system... 
    Senior

    CodeRabbit

    San Francisco, CA
    1 day ago
  • $67 - $72 per hour

    Sr DevOps Automation Engineer Responsibilities Defines standards to guide solution decisions in relation to multiple integration technologies. Leads...  ...Grafana or equivalent is a must. Practical application of AI tooling in: Accelerating Terraform module development and... 
    Senior
    Hourly pay
    Work experience placement

    Crystal Equation Corporation

    San Francisco, CA
    4 days ago
  •  ...Senior DevOps Engineer Skyfire empowers AI to present verified identities, access essential services and process payments without human intervention...  ...to debug failures under time constraints for smooth operations. Quick learner : The ability to quickly experiment... 
    Senior
    Work at office
    Shift work

    Skyfire

    San Francisco, CA
    1 day ago
  •  ...is to create the next generation of Gen AI-driven code reviewers: a symbiotic partnership...  ...significantly outperforms individual engineers. We combine language models with human ingenuity...  ...and quality. Role Overview As a DevOps Engineer at CodeRabbit, you'll play a key... 
    Senior
    Remote work

    CodeRabbit

    San Francisco, CA
    2 days ago
  • $190k - $282k

     ...Senior Security Production Engineer Livingston, NJ / New York, NY / Sunnyvale...  ...Essential Cloud for AI™. Built for pioneers...  ...safe and efficient operations for enterprise and...  ...and cloud native technologies Build...  ...reliability engineering, DevOps, security engineering... 
    Senior
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours

    CoreWeave

    San Francisco, CA
    2 days ago
  • $230k - $320k

     ...DevOps Engineer At EliseAI, we're improving the industries that matter most: housing and healthcare...  ...than they should be. By integrating AI agents deeply into existing workflows,...  ...deployment workflows, and ensuring operational consistency as our infrastructure scales... 
    Senior
    Work at office
    Local area
    Relocation

    EliseAI

    San Francisco, CA
    2 days ago
  •  ...cutting-edge technology firm based in San Francisco is seeking a DevOps Engineer to enhance the reliability of their production systems. You...  ...platforms. Join us in our mission to revolutionize hardware design through innovative AI solutions. #J-18808-Ljbffr Flux Enterprise
    Senior

    Flux Enterprise

    San Francisco, CA
    3 days ago
  • $160k - $220k

     ...Fidelity, and employs a team of 450 engineers and entrepreneurs. Astranis designs, builds, and operates its satellites out of its 153,...  ...Northern California, USA. Senior DevOps Engineer As a Senior...  ...development workflow including AI assistant tools, language... 
    Senior
    Permanent employment
    Remote work
    Flexible hours

    Astranis

    San Francisco, CA
    3 days ago
  •  ...through intelligent automation. We build AI-powered software that eliminates...  ...a passionate team of former healthcare operators and world-class AI technologists, Plenful...  ...About the role We're hiring a Senior DevOps Engineer to join our engineering team at Plenful... 
    Senior
    Work at office
    Local area
    Remote work
    Flexible hours
    2 days per week

    Plenful

    San Francisco, CA
    2 days ago
  • MixedBread AI in San Francisco is seeking a DevOps Engineer to join their core infrastructure team. You will be responsible for building and maintaining CI/CD pipelines, automating infrastructure management, and ensuring high availability of systems to support rapid, safe... 
    Senior

    MixedBread AI

    San Francisco, CA
    5 days ago
  • $150k - $220k

     ...Senior DevOps Engineer/ AWS/ AI Infrastructure San Francisco, California Onsite Full Time $150k - $220k Our client is a high-growth...  ...tooling: Datadog, Prometheus, or Grafana ~ Comfortable operating in fast-moving, high-ownership environments Desired... 
    Senior
    Full time
    Work at office
    Immediate start

    Motion Recruitment

    San Francisco, CA
    2 days ago
  • $150k - $220k

     ...Senior Cloud DevSecOps Infrastructure Engineer Title of Role: Senior Cloud DevSecOps Infrastructure Engineer Location...  ...: Venture-Backed — Healthcare, AI, Security, Enterprise Office...  ...Background ~6+ years of experience in DevOps or Infrastructure Engineering, with... 
    Senior
    Work at office

    Recruiting from Scratch

    San Francisco, CA
    2 days ago
  • A technology solutions provider is seeking a Senior DevOps Engineer in San Francisco. You will build and maintain development platforms using data tools like Databricks and AWS. Responsibilities include defining CI/CD practices, managing data migration to the cloud, and... 
    Senior

    Compunnel, Inc.

    San Francisco, CA
    3 days ago
  •  ...Process Street in San Francisco is seeking a Senior Software Engineer to help architect innovative solutions using AI and modern technologies. This role demands expertise in backend systems and AI tools, along with a proactive approach to software development. Candidates... 
    Senior
    Full time
    Remote work

    Process Street

    San Francisco, CA
    13 days ago
  • $148.5k - $260.1k

     ...Job Category Software Engineering Job Details About Salesforce...  ...Salesforce Salesforce is the #1 AI CRM, where humans with agents...  .... Must be a U.S. Citizen operating on U.S. Soil with ability to...  ...looking for an experienced DevOps engineer to join a team of... 
    Senior
    Work experience placement
    Local area

    Salesforce.Com Inc

    San Francisco, CA
    4 days ago
  •  ...specialist to own end-to-end intelligence processes, create a best-in-class enablement toolkit, analyze sales patterns, and build an AI-native program. This role requires at least 5 years of experience in competitive intelligence within B2B SaaS, strong communication... 
    Senior

    Rippling

    San Francisco, CA
    2 days ago
  • A leading health technology company is seeking a Senior AI Native Product Designer for its Platform Team in San Francisco. The ideal candidate...  ...architect foundational systems that enable designers and engineers to create intelligent, context-aware experiences. The... 
    Senior

    ŌURA

    San Francisco, CA
    1 day ago
  • Everself is seeking a Senior Product Manager for their AI-native healthcare platform in San Francisco. This hybrid role focuses on the EHR and patient app, overseeing product strategy and execution. The ideal candidate has over 4 years of experience in product management... 
    Senior

    Everself

    San Francisco, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior AI-Native DevOps / Operations Engineer (AMER). Be the first to apply!