Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Site Reliability Engineer

$175k - $200k

Order.co

Order.co is the System of Action for the Office of the CFO, transforming the way businesses purchase and pay into an intuitive, B2C-like shopping experience. Order.co leverages embedded AI agents and embedded financial products to reinvent the way businesses connect with their vendors. End users enjoy a seamless, zero-training buying experience, while finance and procurement leaders gain a single platform to orchestrate how the business “should operate”. The result is an all-in-one solution that serves as a gravitational pull for spend and data, automating and eliminating procurement and finance workflows from requisition to reconciliation along the way. Order.co is on the cutting edge of B2B Agentic Commerce, poised to be the market leader in creating a more predictive, prescriptive, and personalized experience for users. Founded in 2016 and headquartered in New York City, Order.co oversees nearly half a billion in annualized spend across hundreds of customers like WeWork, SoulCycle, Lume, and solidcore. Order.co has raised $75M in funding from industry-leading investors like MIT, Stage 2 Capital, Rally Ventures, 645 Ventures, and more. Order.co has been proudly named a 50 to Watch by Spend Matters and a Best Place to Work by BuiltIn and Inc. Magazine. The Role As a Senior Site Reliability Engineer on the Platform team, you will ensure that software systems are reliable, scalable, performant, and operationally efficient. You blend software engineering skills with infrastructure and operations expertise to keep critical systems running smoothly while enabling rapid product development. Responsibilities Reliability Engineering & Infrastructure Ownership Design, build, and operate highly available, scalable, and fault-tolerant infrastructure and platform services Own reliability, availability, latency, and operational excellence for critical production systems and services Define and maintain service level objectives (SLOs), service level indicators (SLIs), and error budgets across platform systems Lead incident response efforts for complex production outages; drive root-cause analysis and long-term remediation actions Build resilient systems that gracefully handle failures, traffic spikes, dependency degradation, and regional outages Continuously improve system reliability through automation, observability, performance tuning, and capacity planning Develop infrastructure automation and self-service tooling to reduce operational toil and improve engineering velocity Build and maintain CI/CD pipelines, deployment automation, and release engineering workflows Implement infrastructure as code (IaC) practices using tools such as Terraform, CloudFormation, and container orchestration Improve developer experience by building reliable internal platforms, operational tooling, and standardized deployment patterns Drive adoption of GitOps, immutable infrastructure, and automated remediation patterns Observability & Operational Excellence Design and maintain comprehensive monitoring, logging, tracing, and alerting systems for distributed services Establish actionable alerting standards that reduce noise while improving incident detection and response times Analyze production trends, system bottlenecks, and failure patterns to proactively prevent incidents Lead operational readiness reviews, disaster recovery planning, and game‑day exercises Improve mean time to detect (MTTD) and mean time to recovery (MTTR) through tooling, automation, and process refinement Participate actively in architecture and infrastructure design reviews Propose scalable and reliable platform designs that account for multi‑region deployment, redundancy, failover, and security considerations Evaluate trade-offs between reliability, scalability, operational complexity, and engineering velocity Identify systemic risks and operational gaps before they become production incidents Partner with engineering teams to ensure services are designed with operability, observability, and resilience in mind from day one Security & Compliance Approach infrastructure and operational practices with a strong security mindset Implement and maintain secure cloud networking, secrets management, IAM policies, and infrastructure hardening standards Partner with Security and Compliance teams to ensure systems meet organizational and regulatory requirements Drive operational best practices around vulnerability management, patching, and production access controls End-to-End Ownership & Collaboration Scope and estimate infrastructure and reliability initiatives accurately Coordinate production rollouts, maintenance events, and reliability improvements across teams Communicate operational risks, dependencies, and incident impacts clearly to technical and non-technical stakeholders Collaborate closely with Software Engineering, Security, Product, and Operations teams to improve platform reliability and scalability Serve as a trusted escalation point during critical production incidents Mentorship & Technical Leadership Mentor junior and mid-level engineers on reliability engineering principles, operational excellence, and infrastructure best practices Raise the operational maturity of the engineering organization through documentation, reviews, and technical guidance Drive improvements in team standards around observability, incident management, automation, and infrastructure design Influence technical decisions through credibility, operational expertise, and strong engineering judgment Qualifications You are motivated by accountability — you own outcomes, not just tasks You are results‑oriented and measure success by shipped, working software You are motivated by correctness in code that touches money — the consequences of a bug land on real customer balances, and you take that seriously You love helping people on your team grow and improve Writing tests is an integral part of your development process, not an afterthought You know how to design and build software incrementally — you don't need a complete spec to make progress Collaborating with the people around you to achieve a goal motivates you You are collaborative, open‑minded, and actively developing your craft You are curious and pragmatic about AI‑driven solutions — you apply them where they add real value and stay skeptical where they don't Familiarity with AI‑assisted development tools — you understand how they work, where they help, and where they fail. Prior hands‑on use is a plus; intellectual curiosity and the instinct to evaluate AI output critically are what matter Technical Skills Strong foundation in computer science fundamentals: data structures, algorithms, and system design Familiarity with building production‑grade applications and services using Ruby and Ruby on Rails Deep expertise with Linux systems administration and production troubleshooting Strong experience operating cloud infrastructure at scale, particularly within AWS environments Experience with Kubernetes, container orchestration, and cloud‑native infrastructure patterns Proficiency with infrastructure as code tools such as Terraform or CloudFormation Expertise designing and operating CI/CD pipelines and deployment automation systems Deep understanding of observability tooling including Datadog, OpenTelemetry, or similar platforms Strong knowledge of distributed systems reliability patterns including redundancy, failover, autoscaling, rate limiting, and graceful degradation Experience building automation and operational tooling using languages such as Python, Go, Bash, or Ruby Strong understanding of networking fundamentals including DNS, load balancing, TLS, VPNs, firewalls, and service discovery Hands‑on experience with incident response, root‑cause analysis, and production operations in high‑availability environments Familiarity with SRE methodologies including SLOs, SLIs, error budgets, capacity planning, and operational maturity modeling Experience implementing secure infrastructure and cloud security best practices including IAM, secrets management, and vulnerability remediation Proven ability to design scalable, resilient, and maintainable platform systems and APIs Experience supporting distributed microservices architectures and event‑driven systems Strong understanding of operational excellence principles including automation‑first engineering and toil reduction Experience using AI‑assisted engineering tools (e.g., Claude, GitHub Copilot) as force multipliers while applying sound operational and engineering judgment Excellent debugging and systems thinking skills across infrastructure, networking, application, and platform layers What Great Looks Like A Senior Software Engineer on the Platform team who is thriving at this level demonstrates: Reliable delivery of complex work — consistently ships multi‑part solutions on time with low defect rates Low defects in owned areas — proactively monitors and improves the quality of the systems they own; that means incident‑free quarters in code paths that move funds and clean reconciliation against vendor reports Measurable mentorship impact — engineers around you write better code because of your reviews and guidance Someone we can depend on for the work that matters — especially the work that touches money. Failure Modes We Screen Against We actively evaluate candidates for the following anti‑patterns during the interview process: Failure Mode What It Looks Like Strong coder, weak owner Ships code but doesn't manage to the task — owns the merge, not the outcome; hands off and moves on without monitoring or fixing post‑release issues Hoards knowledge instead of sharing — becomes a single point of failure and blocks team growth Proposes solutions without considering trade‑offs — jumps to conclusions, resists alternative approaches Produces AI‑generated output without verifying it against the codebase, tests, or business context Interview Process Our 5‑round process is designed to evaluate you across all competency areas. AI tools are permitted in technical rounds. Round Format What We Evaluate 60 min, conversational Career trajectory, mentorship philosophy, technical influence examples, communication style 2 — Take‑Home + PR Discussion 72h take‑home + 60 min live Navigating unfamiliar code, ownership and decomposition discipline visible in your PR, root‑cause judgment, AI tool usage Requirements gathering, schema/API design, trade‑off articulation, calibrated code‑review judgment on a teammate's PR 4 — Team Interview (conditional) 30 min, behavioral Collaboration patterns, mentorship behavior, negotiation behavior with cross‑functional partners 5 — Culture Add 30 min, People Team Organizational values alignment Round 4 is conditional: it runs when the team needs additional behavioral signal after Rounds 2 and 3, and is otherwise skipped. Your recruiter will tell you whether it's scheduled before your loop is finalized. The Round 2 (Take‑Home + PR Discussion) and Round 3 (System Design) exercises are drawn from real problems so the technical evaluation is grounded in the work you'd actually be doing. What You’ll Receive Competitive compensation including base salary, bonus, and equity Employer‑sponsored 401(k) with match Comprehensive medical, dental, and vision coverage Flexible time off and hybrid work environment The anticipated annual salary range for this role is $175,000 - $200,000 . Actual compensation and title will be commensurate with experience, qualifications, knowledge, and skills. #J-18808-Ljbffr Order.co

Vacancy posted 15 hours ago
Similar jobs that could be interesting for youBased on the Senior Site Reliability Engineer in New York, NY vacancy
  •  ...A leading technology firm is seeking a Sr. Site Reliability Engineer in the United States. The ideal candidate will enhance system reliability and stability and should possess over 8 years of relevant experience in site reliability engineering. The position covers cloud... 
    Senior

    Jobgether

    New York, NY
    15 hours ago
  •  ...Upstart is seeking a Senior Staff Engineer to lead technical initiatives in shaping the applicant experience across the loan process. This role offers remote flexibility while ensuring collaboration through in-person sessions across the U.S. The ideal candidate has over... 
    Senior
    Remote work

    Israelvcforum

    New York, NY
    10 hours ago
  •  ...A leading company in crypto and Web3 is seeking a Senior Site Reliability Engineer to join their Onchain infrastructure team. The role involves automating and managing the infrastructure for digital asset access, focusing on cloud-based solutions. Ideal candidates have... 
    Senior
    Remote work

    WorksHub

    New York, NY
    2 days ago
  •  ...New York, United States | Posted on 11/13/2025 Title: Senior Site Reliability Engineer (SRE) Location: Remote AboutJanuary AtJanuary, we’re transforming the lives of borrowers by bringing humanity to consumer finance. Our data-driven products empower financial institutions... 
    Senior
    Remote work

    Govserviceshub

    New York, NY
    2 days ago
  •  ...critical services in a new public cloud platform? Join our IaaS Site Reliability Engineering (SRE) team. We design, develop, and operate infrastructure...  ...of a clear career path in our SRE team: SRE I → SRE II → Senior → Senior II → Principal → Senior Principal. Each step... 
    Senior
    Work at office
    Remote work

    Akamai

    New York, NY
    15 hours ago
  • $150k - $170k

     ...Senior Site Reliability Engineer – Zip CoJoin to apply for the Senior Site Reliability Engineer role at Zip CoAt Zip, we build cloud-native software applications that serve millions of customers and process billions of dollars in payments. We're looking for a seasoned... 
    Senior
    Casual work
    Work at office
    Remote work

    ZIP

    New York, NY
    2 days ago
  • $65 - $75 per hour

     ...Confluence, and IT Service Management tools. Description: As an Engineer 2, you will collaborate with management, departments, and...  ...event management, and automation across the IT organization. Seniority level Mid-Senior level Employment type Contract Job function Information... 
    Senior
    Contract work
    Remote work

    SBS Creatix

    New York, NY
    15 hours ago
  • $175k - $190k

     ...This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Senior Site Reliability Engineer - AWS in United States. This role sits at the core of a fast-growing, AI-driven engineering environment focused on building highly reliable... 
    Senior
    Full time
    Temporary work

    Jobgether

    New York, NY
    15 hours ago
  • $153k - $190k

     ...first interconnected health network and we want you to join us to change healthcare for the better! Job Description As a Senior Site Reliability Engineer you will be tasked with making sure we build a reliable, secure and efficient platform for the b. Well network. You... 
    Senior
    Full time
    Contract work
    Live in
    Remote work

    b.well Connected Health

    New York, NY
    15 hours ago
  •  ...Senior Site Reliability Engineer (Tax free & based in GCC) The ambition is to create a global leader in space – driving innovation globally for a better world, while transforming and inspiring Saudi society. Much attention has turned to the space sector in recent years... 
    Senior
    Local area

    Firstaff Personnel Consultants Ltd

    New York, NY
    15 hours ago
  •  ...our mission is to unlock the next era of financial, creative, and personal freedom. The Department: Onchain The Role: Senior Site Reliability Engineer The Onchain infrastructure team at Gemini creates and manages software tools and platforms, automates the creation and... 
    Senior
    Remote work
    Flexible hours

    WorksHub

    New York, NY
    2 days ago
  •  ...Senior Site Reliability Engineer – Azure Cloud Join to apply for the Senior Site Reliability Engineer role at Concord Technologies Concord Technologies is growing! Currently seeking a full‑time Senior Site Reliability Engineer (Sr. SRE) , with experience engineering solutions... 
    Senior
    Full time
    Local area
    Immediate start
    Remote work
    Flexible hours

    Concord Technologies

    New York, NY
    15 hours ago
  • $182.3k - $220k

     ...first - and that mission depends on reliable, secure, and scalable systems. As a Senior SRE on the infrastructure team,...  ...building tools that empower our engineers to ship safely and confidently....  ...throughout the year (i.e., during team on-sites).   At Ro, we believe that... 
    Senior
    Local area
    Flexible hours

    Ro

    New York, NY
    4 days ago
  • $127k - $249k

     ...The Team Platform Engineering is the department within SRE that is responsible for a range of critical infrastructure and operational...  ...fleet, alongside the critical components that ensure cluster reliability and security (e.g., CoreDNS, cert-manager, and Gatekeeper). As... 
    Senior
    Work at office
    Local area
    Remote work
    Worldwide
    Flexible hours

    MongoDB

    New York, NY
    15 hours ago
  • $165k - $235k

     ...it, and the SDF team is expanding to support the rapidly growing and changing Stellar ecosystem. SDF is looking for a Senior Site Reliability Engineer to help build and operate the foundation that powers our engineering teams. You’ll ensure the reliability and scalability... 
    Senior
    Temporary work
    Work at office
    Worldwide
    Flexible hours

    Crypto Pro Network

    New York, NY
    15 hours ago
  • $185k - $227k

     ...by this common purpose and we are hiring the world’s best engineers, scientists, designers, product managers, operations experts...  ...compelling, read on for more details. ROLE AND RESPONSIBILITIES A Senior Site Reliability Engineer (SRE) is expected to own the operational stability... 
    Senior
    Remote work

    JUUL Labs

    New York, NY
    15 hours ago
  • $140k - $205k

     ...Senior Technology Site Reliability Engineer Cooley is seeking a Senior Site Reliability Engineer to join the Infrastructure & Development Operationsteam. Position summary: The Senior Technology Site Reliability Engineer("SRE") is responsible for ensuring the reliability... 
    Senior
    Full time
    Temporary work
    Work at office
    Flexible hours
    Weekend work

    Cooley

    New York, NY
    3 days ago
  •  ...contribute to meaningful impact and be part of a team dedicated to enhancing security and fighting fraud. We are seeking a Senior Site Reliability Engineer (Senior SRE) to drive reliability improvements across our production SaaS environment. You’ll play a critical role in... 
    Senior
    Remote work
    Flexible hours
    Night shift

    CertifID LLC

    New York, NY
    3 days ago
  • $150k - $200k

     ...parts of eye care and continue shaping the future of practice management. About the Role We are looking for a seasoned Senior Site Reliability Engineer to join our dynamic team in a foundational role, owning reliability and infrastructure as our first SRE. This role... 
    Senior
    Work experience placement
    Remote work

    Barti

    New York, NY
    15 hours ago
  • $150k - $200k

     ...Join to apply for the Senior Site Reliability Engineer role at Gradle Inc. Develocity is a first‑of‑its‑kind toolchain observability and acceleration platform that helps software teams adopt and improve DORA capabilities (including continuous delivery) in order to achieve... 
    Senior
    Full time
    Local area
    Remote work
    Work from home

    Gradle Inc.

    New York, NY
    15 hours ago
  • jobr.pro is seeking a Senior Site Reliability Engineer in New York, NY, to enhance platform reliability and engineering excellence. You will be instrumental in implementing observability, security, and CI/CD practices. This role involves coaching teams and optimizing workflows... 
    Senior

    jobr.pro

    New York, NY
    15 hours ago
  •  ...Job Description A major financial services company in NYC is growing its team rapidly, and they are looking for a Senior DevOps Engineer / Site Reliability Engineer who can join. If you’re passionate about high-availability, reliability, automation, we’d be excited... 
    Senior

    The Greene Group

    New York, NY
    4 days ago
  • $206.7k - $330.3k

     ...in partnership with ZG security teams About The Role As a Senior Engineering Manager (M4) for FUB Infrastructure (SRE), you will lead a...  ...and security engineers responsible for the infrastructure, reliability, and developer experience that underpin Follow Up Boss. You... 
    Senior
    Permanent employment
    Live in
    Work at office
    Local area
    Immediate start
    Remote work
    Shift work

    Zillow Inc

    New York, NY
    15 hours ago
  •  ...requirements unforgiving, and the impact immediate. This isn’t a reactive firefighting role. It’s proactive, engineering-focused SRE where you’ll automate reliability, engineer for performance, and shape infrastructure strategy at the firm level. What they’re looking for:... 
    Senior
    Immediate start

    Campbell North Ltd.

    New York, NY
    3 days ago
  • $180k - $200k

    Parabola is looking for a Senior Site Reliability Engineer to improve performance and reliability of its software systems in New York. This role requires 5+ years of SRE or DevOps experience and expertise in AWS and containerization tools. Offering a salary of $180,000... 
    Senior
    Work at office
    3 days per week

    Parabola

    New York, NY
    15 hours ago
  • $160k - $195k

     ...federal, state and local agencies fuels the RapidSOS HARMONY AI engine that delivers this intelligence to those who need it most....  ...What this role is about Are you excited to work on systems where reliability directly impacts real‑world outcomes? At RapidSOS, we build... 
    Senior
    Local area
    Flexible hours

    RapidSOS

    New York, NY
    4 days ago
  •  ...the future of legal tech — we’re defining it. Ready to join us in building the intelligent future of law? The role As a Senior Site Reliability Engineer you'll join the founding SRE team at our new NYC engineering hub, sitting within Foundations. You'll own critical... 
    Senior
    Work at office

    Legora AB

    New York, NY
    4 days ago
  •  ...acquisition, and Connor was a machine learning research engineer at Scale AI. The rest of our team comes from...  ...redefining go-to-market with state-of-the-art AI. As a Senior SRE, you'll tackle the scaling and reliability challenges that come with adding terabytes of data... 
    Senior

    Unify

    New York, NY
    15 hours ago
  • We are hiring a Senior Site Reliability Engineer to help build and operate the infrastructure foundation that supports engineering teams. The role centers on reliability, scalability, cloud infrastructure, Kubernetes operations, and automation that allows developers to... 
    Senior

    Rad-Hires

    New York, NY
    8 days ago
  • $170k - $230k

     ...estate industry. Responsibilities As a Senior SRE, you will help own and improve the technical...  ...of Perchwell while exemplifying engineering rigor and excellence across our engineering...  ...to innovate faster in a safe and reliable way. Reliability, resiliency and adaptability... 
    Senior
    Work experience placement
    Work at office
    Flexible hours
    3 days per week
    1 day per week

    Perchwell

    New York, NY
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Site Reliability Engineer. Be the first to apply!