Senior AI-Native DevOps / Operations Engineer (AMER)
Valency Systems Inc
About Valency Valency Systems is a small, dynamic team of engineers, scientists, and researchers building the global hub for the agentic research era. We're based in Berkeley, California, and we're building something that matters. If you care about open science, advancing research at the speed of thought, and using AI to accelerate discovery, we'd love to talk. Our team is hybrid. We come together in person 3 days a week, with the option for 2 days of remote work. The Position We're hiring an AI-native DevOps / Operations Engineer to help build and operate the platform behind Valency. This is not a narrow infrastructure maintenance role. We want builders who can design and harden production systems, improve CI/CD and release quality, raise reliability and response times, and create the observability, analytics, and guardrails needed to safely operate a rapidly evolving platform. This role sits at the intersection of platform engineering, cloud infrastructure, production operations, and AI-era software delivery. You will help close the loop from agentically written software to reliable, performant systems in production. That means better tests, better release controls, stronger guardrails, richer production telemetry, clearer workflows for human approval, and tighter feedback into product and engineering. This is an especially strong fit for someone who has helped scale high-growth SaaS systems, likes building from first principles, and wants to experience that kind of growth again in a new context. What You'll Own
We offer a competitive salary, benefits, and meaningful equity in a company building something important from the ground up. Work Authorization: Candidates must be legally authorized to work in the United States.
- Design, build, and improve the production platform powering Valency
- Tighten CI/CD processes so changes are tested, gated, observable, and safe to ship
- Improve production reliability, latency, deployment safety, and incident response
- Build the operational feedback loops that help engineering and product teams act on real production behavior
- Establish the right logging, analytics, tracing, alerting, and workflow instrumentation as the platform scales
- Define and implement guardrails for agent-involved software delivery and operations
- Introduce human-in-the-loop approval flows where autonomy needs stronger controls
- Improve cost efficiency across cloud infrastructure and platform operations
- Help shape security, compliance, and auditability foundations for SOC 2, ISO 27001, and FedRAMP-oriented environments
- Contribute to the long-term platform engineering direction as the team grows and specializes
- Own production operations and operational excellence for this function
- Lead incident response expectations for the role
- Establish the operating model the broader team will scale on
- Work onsite in the SF Bay Area
- Deployment frequency and release confidence
- Change failure rate and rollback quality
- MTTR and incident handling
- p95 / p99 latency and system responsiveness
- Uptime and service reliability
- Alert quality and signal-to-noise ratio
- Infrastructure cost efficiency
- Operational visibility into agent workflows and production behavior
- Guardrail coverage for agent-authored or agent-assisted changes
- ECS / Fargate
- EKS / container orchestration environments
- RDS
- S3
- Cloudflare
- CloudWatch
- Queues, caches, schedulers, and batch / background processing systems
- Tracing workflows across agent-driven and human-driven systems
- Developing production guardrails to keep systems from going off the rails
- Designing approval paths for high-risk or high-impact actions
- Turning production signals into actionable inputs for product and engineering
- Helping close the loop between what the system is doing, how users experience it, and how the platform should evolve
- Own and improve CI/CD pipelines, release controls, and deployment workflows
- Build and maintain highly reliable AWS-based production systems
- Improve observability across logs, metrics, traces, events, and workflow state
- Instrument platform behavior so system issues, regressions, and slowdowns are quickly visible and actionable
- Create operational analytics that help close the loop between engineering, product, and customer experience
- Drive cost engineering and infrastructure efficiency as the system scales
- Build safer operating patterns for agent-assisted code changes and operational actions
- Implement testing, validation, approval, and rollback mechanisms that reduce operational risk
- Improve batch, queue, cache, and job-processing reliability and monitoring
- Support incident response, root cause analysis, postmortems, and follow-through
- Partner with external vendors and partners when needed
- Help define platform standards, reliability practices, and operational maturity across the company
- 8+ years of progressively increasing responsibility operating important production systems
- Demonstrated success shipping and running high-reliability systems in production
- Deep AWS experience in real production environments
- Strong background in software engineering and testing, not just infrastructure administration
- Experience designing or significantly improving CI/CD systems and release processes
- Experience building or operating logging, monitoring, alerting, and observability systems
- Experience improving production reliability, performance, and operational response
- Comfort with container-based systems and orchestration platforms
- Strong hands-on ability in at least some of: Python, Go, Elixir, CDK
- Strong judgment around guardrails, operational safety, and change management
- Ability to work in ambiguity and build systems that do not yet fully exist
- Startup experience, especially in fast-scaling environments
- Experience at high-scale SaaS companies that have gone through periods of rapid growth
- Experience owning or materially influencing platform engineering functions
- Experience with cost engineering / FinOps in AWS-heavy environments
- Experience designing systems for compliance-oriented environments
- Experience with SOC 2, ISO 27001, or FedRAMP-related operational requirements
- Experience evaluating or implementing modern observability and workflow tracing stacks
- Experience creating human-in-the-loop approval systems for sensitive production workflows
- You will help define how an AI-native research platform is actually operated in production
- You will work on systems that connect agents, researchers, product behavior, and infrastructure reality
- You will have broad scope across infrastructure, reliability, analytics, and operational guardrails
- You will help build the production foundation for a category-defining company at an early stage
- You will not inherit a frozen stack; you will help choose and build the right one
We offer a competitive salary, benefits, and meaningful equity in a company building something important from the ground up. Work Authorization: Candidates must be legally authorized to work in the United States.
Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Senior AI-Native DevOps / Operations Engineer (AMER) in Berkeley, CA vacancy
- ..., by developing the first AI Hardware Engineer. Our goal is to democratize... ...has to just work. As a DevOps Engineer, you'll work on the... ..., availability, and operational health of production systems... ...~ Experience with cloud-native and serverless platforms (GCP...SeniorShift work
- A technology startup in San Francisco is seeking a DevOps Engineer as part of their founding team. The role involves building... ...scalable cloud infrastructure and ensuring the operational reliability of their AI-native enterprise platform. Candidates should have 4-10 years...Suggested
$192k - $240k
...Security Operations Engineer Brex is the intelligent finance platform that enables companies... ...and control spend effortlessly. Brex's AI-native automation and world-class service eliminate... ...Familiarity with CI/CD systems and DevOps workflows (e.g. Buildkite, Flux, Git,...SeniorWork experience placementWork at officeRemote workWork from home$67k - $136.8k
...opportunity As an FSO DevOps Engineer Senior Analyst, you’ll be based in... ...in driving the delivery and operations of the Web3 Platform that supports... ..., IAM Policies, cloud-native security controls What... ...markets. Enabled by data, AI and advanced technology, EY...SeniorSummer holidayFlexible hours- Finalis is seeking a Senior Software Engineer to enhance our ai-native platform. This role involves collaborating with cross-functional teams to design and implement scalable services for compliance, data, and payments. Candidates should have over 5 years of experience,...SeniorRemote work
- OpenArt AI in San Francisco is seeking a Senior Platform & Reliability Engineer to design and improve the reliability of its infrastructure... ...role emphasizes building and operating production systems while... ...skills, and familiarity with cloud-native architectures like AWS or GCP....Senior
- ...We are looking for a Senior Automation / DevOps Engineer to support and evolve a modern integration and cloud... ...role focuses on automating manual operational work, strengthening Infrastructure as... ...this job, you agree to receive calls, AI-generated calls, text messages, or...Senior
- A technology company in San Francisco is seeking a DevOps Engineer to enhance the reliability and operational health of their production systems. You will set observability standards, build internal tooling, and partner with engineers for system design. The ideal candidate...Senior
- A healthcare AI company is seeking a Sr. Infrastructure Engineer to enhance and scale their systems. Responsibilities include managing infrastructure with Terraform and Kubernetes, creating monitoring solutions, and troubleshooting. Ideal candidates will have 5+ years in...SeniorRemote job
- ...technology startup in the San Francisco Bay Area is seeking a DevOps Engineer to build and maintain scalable cloud infrastructure. The ideal... ...team. If you're motivated by solving complex infrastructure challenges in AI, we want to hear from you. #J-18808-Ljbffr Fabrion
- Icehouseventures is seeking a skilled DevOps Engineer in San Francisco to join their Infrastructure team. This role focuses on building and maintaining Docker-based environments, managing infrastructure using Pulumi, and optimizing CI/CD workflows. Ideal candidates will...
- Phonely in San Francisco is seeking an experienced DevOps Engineer to join our engineering team and help build reliable cloud infrastructure for voice AI systems. This role is fully on-site and essential to our fast-paced business environment. The ideal candidate will...Senior
- A tech innovation firm in San Francisco seeks a DevOps Engineer to scale and secure their infrastructure for AI-driven developer tools. This hands-on role demands expertise in CI/CD systems, cloud providers, and observability tools, as well as a strong commitment to system...Senior
$67 - $72 per hour
Sr DevOps Automation Engineer Responsibilities Defines standards to guide solution decisions in relation to multiple integration technologies. Leads... ...Grafana or equivalent is a must. Practical application of AI tooling in: Accelerating Terraform module development and...SeniorHourly payWork experience placement- ...Senior DevOps Engineer Skyfire empowers AI to present verified identities, access essential services and process payments without human intervention... ...to debug failures under time constraints for smooth operations. Quick learner : The ability to quickly experiment...SeniorWork at officeShift work
- ...is to create the next generation of Gen AI-driven code reviewers: a symbiotic partnership... ...significantly outperforms individual engineers. We combine language models with human ingenuity... ...and quality. Role Overview As a DevOps Engineer at CodeRabbit, you'll play a key...SeniorRemote work
$190k - $282k
...Senior Security Production Engineer Livingston, NJ / New York, NY / Sunnyvale... ...Essential Cloud for AI™. Built for pioneers... ...safe and efficient operations for enterprise and... ...and cloud native technologies Build... ...reliability engineering, DevOps, security engineering...SeniorPermanent employmentTemporary workCasual workWork at officeRemote workFlexible hours$230k - $320k
...DevOps Engineer At EliseAI, we're improving the industries that matter most: housing and healthcare... ...than they should be. By integrating AI agents deeply into existing workflows,... ...deployment workflows, and ensuring operational consistency as our infrastructure scales...SeniorWork at officeLocal areaRelocation- ...cutting-edge technology firm based in San Francisco is seeking a DevOps Engineer to enhance the reliability of their production systems. You... ...platforms. Join us in our mission to revolutionize hardware design through innovative AI solutions. #J-18808-Ljbffr Flux EnterpriseSenior
$160k - $220k
...Fidelity, and employs a team of 450 engineers and entrepreneurs. Astranis designs, builds, and operates its satellites out of its 153,... ...Northern California, USA. Senior DevOps Engineer As a Senior... ...development workflow including AI assistant tools, language...SeniorPermanent employmentRemote workFlexible hours- ...through intelligent automation. We build AI-powered software that eliminates... ...a passionate team of former healthcare operators and world-class AI technologists, Plenful... ...About the role We're hiring a Senior DevOps Engineer to join our engineering team at Plenful...SeniorWork at officeLocal areaRemote workFlexible hours2 days per week
- MixedBread AI in San Francisco is seeking a DevOps Engineer to join their core infrastructure team. You will be responsible for building and maintaining CI/CD pipelines, automating infrastructure management, and ensuring high availability of systems to support rapid, safe...Senior
$150k - $220k
...Senior DevOps Engineer/ AWS/ AI Infrastructure San Francisco, California Onsite Full Time $150k - $220k Our client is a high-growth... ...tooling: Datadog, Prometheus, or Grafana ~ Comfortable operating in fast-moving, high-ownership environments Desired...SeniorFull timeWork at officeImmediate start$150k - $220k
...Senior Cloud DevSecOps Infrastructure Engineer Title of Role: Senior Cloud DevSecOps Infrastructure Engineer Location... ...: Venture-Backed — Healthcare, AI, Security, Enterprise Office... ...Background ~6+ years of experience in DevOps or Infrastructure Engineering, with...SeniorWork at office- A technology solutions provider is seeking a Senior DevOps Engineer in San Francisco. You will build and maintain development platforms using data tools like Databricks and AWS. Responsibilities include defining CI/CD practices, managing data migration to the cloud, and...Senior
- ...Process Street in San Francisco is seeking a Senior Software Engineer to help architect innovative solutions using AI and modern technologies. This role demands expertise in backend systems and AI tools, along with a proactive approach to software development. Candidates...SeniorFull timeRemote work
$148.5k - $260.1k
...Job Category Software Engineering Job Details About Salesforce... ...Salesforce Salesforce is the #1 AI CRM, where humans with agents... .... Must be a U.S. Citizen operating on U.S. Soil with ability to... ...looking for an experienced DevOps engineer to join a team of...SeniorWork experience placementLocal area- ...specialist to own end-to-end intelligence processes, create a best-in-class enablement toolkit, analyze sales patterns, and build an AI-native program. This role requires at least 5 years of experience in competitive intelligence within B2B SaaS, strong communication...Senior
- A leading health technology company is seeking a Senior AI Native Product Designer for its Platform Team in San Francisco. The ideal candidate... ...architect foundational systems that enable designers and engineers to create intelligent, context-aware experiences. The...Senior
- Everself is seeking a Senior Product Manager for their AI-native healthcare platform in San Francisco. This hybrid role focuses on the EHR and patient app, overseeing product strategy and execution. The ideal candidate has over 4 years of experience in product management...Senior
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior AI-Native DevOps / Operations Engineer (AMER). Be the first to apply!
Related searches
- ai developer Berkeley, CA
- ai engineer Berkeley, CA
- devops aws developer (remote) Berkeley, CA
- senior devops engineer remote Berkeley, CA
- production operations engineer Berkeley, CA
- post production engineer Berkeley, CA
- security operations center engineer Berkeley, CA
- operations engineer Berkeley, CA
- data center operations engineer Berkeley, CA
- senior lead project manager Berkeley, CA

