Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Site Reliability Engineer - 2

$86.33k - $191.9k

Traveltechessentialist

What You'll Do Building a fast moving, high growth service. Navan is revolutionizing travel and expense services for the enterprise, and the product is evolving quickly. You are comfortable in a startup environment, enjoy seeing the product take shape, and have strong ownership of the success of your services. Designing, implementing and operating cloud infrastructure . You’re a fit for us if you think in terms of infrastructure as code, deployment pipelines, and building the guardrails to make going fast also going safely. Identifying reliability anti-patterns and solving them systemically . You dive deep into the data to evaluate the health of your systems, and you use it to improve visibility and reliability across the fleet of services. Finding and automating the toil out of our processes . You’d prefer to automate it entirely, or build a tool to empower your users rather than be the gatekeeper to the tool. Leveraging AI tools and platforms in your daily work to achieve autonomous operations, reduce toil, and improve system observability. Contributing to the definition and adoption of system reliability standards, including formalizing SLO/SLI frameworks, observability standards, and blameless post‑mortem practices. Assisting in the adoption of AI‑assisted developer tools and platforms to increase engineering productivity, enforce code quality standards, and enable real‑time architectural validation. What We’re Looking For 2+ years of progressive experience as an SRE or equivalent role. Passionate about solving problems and learning new tools and technologies. Excellent communication skills working with stakeholders and domain experts across the company to design solutions to user problems. Thrive in a fast‑paced environment. Demonstrated ability to contribute to and take ownership of technical infrastructure projects. Operate with a strong sense of ownership demonstrated through shipping production‑quality code and infrastructure equipped with testing, monitoring and documentation. Hands‑on operational experience with Java based applications and services including JVM profiling and performance tuning (python, Node.js and Go are a plus). Hands‑on experience building and operating distributed systems in a public cloud environment (preferably AWS), using CI/CD to deploy, manage and operate production systems, focusing on tooling and automation using tools such as maven and Jenkins. Hands‑on experience with microservice architecture and related reliability and resiliency patterns such as throttling, queueing, and retries. Hands‑on experience with writing Infrastructure as Code in Terraform or Cloudformation or similar tools. A passion for automating away everything, using scripting languages such as python, bash, groovy (we prefer lazy engineers). Built, using, and automating monitoring systems such as NewRelic, DataDog, SignalFX, Kibana. Hands‑on experience deploying, operating, and monitoring production‑grade AI/ML microservices (e.g., RAG pipelines, agentic systems) on cloud platforms like AWS Fargate/ECS. Experience leveraging AI/LLM platforms (e.g., Gemini, Braintrust) and managing their secrets and infrastructure using Infrastructure as Code (Terraform) and AWS SSM. Demonstrated ability to integrate AI‑specific telemetry and advanced observability practices to enable predictive insights and systemic root‑cause analysis. Pay Range $86,325 – $191,900 USD Benefits Navan offers a comprehensive benefits program designed to support your well‑being, financial security, and life outside of work. Our benefits, thoughtfully tailored by country to meet local needs, include healthcare coverage, insurance offerings, and wellness resources for you and your family. We support long‑term financial growth through retirement savings programs and opportunities to participate in our equity plans, so you can share in Navan’s success. To promote balance, we offer flexible time off, country‑specific holidays, and paid parental leave for all new parents. Additional benefits include connectivity and commuting support, mental health resources, and exclusive travel‑related perks. Wherever you’re based, our benefits evolve with you. Equal Opportunity Navan is an equal opportunity employer. We make all employment decisions based solely on merit. We provide equal employment opportunity to all applicants and employees without discrimination on the bases of race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We prohibit any such discrimination or harassment. This policy applies to all terms and conditions of employment, including hiring. Accommodations Navan complies with the Americans with Disabilities Act (ADA), as amended by the ADA Amendments Act, and all applicable state or local law. Navan will reasonably accommodate qualified individuals with a disability in connection with applications for employment as required by law. #J-18808-Ljbffr Traveltechessentialist

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Site Reliability Engineer - 2 in Palo Alto, CA vacancy
  • $210k - $270k

    Your Impact on our Mission: Zocdoc is looking for a Senior Site Reliability Engineer to help develop, monitor, and maintain our distributed production...  ...Reliability Engineering or Production Engineering role 2+ years of on-call experience in a 24/7 cloud-based production... 
    Suggested
    Flexible hours

    GoTo Meeting

    Palo Alto, CA
    4 days ago
  • $169k - $224k

     ...disciplinary organization of scientists, engineers, and physicians and we are using the...  ...visit grail.com GRAIL is seeking a Staff Site Reliability / DevOps Engineer to lead the...  ...frameworks such as ISO 27001, NIST, SOC 2, or HIPAA Preferred Qualifications... 
    Suggested
    Full time
    Work at office
    Local area
    Flexible hours
    Shift work

    GRAIL

    Menlo Park, CA
    7 days ago
  • $168.93k - $192.5k

     ...identity. To learn more, visit Role Overview We are seeking a Site Reliability Engineer to join our Core Platform Engineering organization. The SRE...  ...Engineering, DevOps, or Infrastructure Engineering. ~2+ years of hands-on experience managing and scaling services... 
    Suggested
    Full time
    Temporary work
    Work at office
    Remote work
    Flexible hours

    ID.me

    Mountain View, CA
    23 days ago
  •  ...React Developer Location: Mountain View, CA (Hybrid, 2 days a week Onsite) Duration: Contract Rate: DOE US Citizens, GC, EAD (H4, L2), E3 TN visa holders preferred, NO third party corp to corp accepted for this job Required Skills: ~ BS/MS in Computer... 
    Suggested
    Contract work
    2 days per week

    Georgia IT Inc

    Mountain View, CA
    6 days ago
  • $207k - $300k

    Site Reliability Manager, Site Reliability Engineering Experience owning outcomes and decision making, solving ambiguous problems and influencing stakeholders; deep...  ...infrastructure, distributed systems/networks. 2 years of experience with distributed systems and system... 
    Suggested
    Full time

    Google Inc.

    Mountain View, CA
    4 days ago
  • $252k - $308k

     ...journey. POSITION SUMMARY Lead EarnIn's shift to AI-first reliability engineering. Define how AI transforms on-call, incident response, alert triage...  ...Mountain View (Headquarters) and will require in-office work 2 days a week. WHAT YOU'LL DO Set a reliability strategy... 
    Full time
    Work at office
    Local area
    Shift work
    2 days per week

    EarnIn

    Mountain View, CA
    16 hours ago
  •  ...Job Description Job Description Site Reliability Engineer Onsite- Bay Area, CA Skills Relevant Skills and Experience What You’ll Do (Day-to-Day) Own and manage our cloud infrastructure (GCP or AWS, on-prem). Build, maintain, and optimize Kubernetes... 

    Amiri Recruiting

    Mountain View, CA
    9 days ago
  •  ...spanning infrastructure. •       Work with engineering teams to make sure new features and...  ...Constantly improve our system performance and reliability through better tools, process and...  ...over 130 countries around the world. 2.\tHuobi has set 10 internal records of... 
    Worldwide

    Cryptoware Technologies Inc

    Santa Clara, CA
    5 days ago
  •  ...technologies. Our mission is to double America’s compute capacity without building new data centers. We are seeking a skilled Site Reliability Engineer to join our growing team. The ideal candidate will help ensure the reliability, scalability, and performance of our hybrid... 
    Work at office
    Weekend work

    FLUIX

    Palo Alto, CA
    4 days ago
  • $210k - $270k

    Zocdoc is seeking a Senior Site Reliability Engineer to develop and maintain distributed production systems. The ideal candidate will have over 5 years of experience in site reliability or production engineering, particularly in cloud environments like AWS. Responsibilities... 

    GoTo Meeting

    Palo Alto, CA
    4 days ago
  • $140k - $220k

    About the Job You’ll own reliability and operational excellence for Pylon’s production systems. This means designing and implementing...  ...scale as we grow. You’ll build tooling that makes the entire engineering team more effective, establish on‑call rotations and runbooks... 

    Pylon

    Palo Alto, CA
    1 day ago
  • $180k

     ...knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who...  ...Cybersecurity / SRE team is focused on ensuring the security and reliability of X Money. This role will primarily focus on the X Money platform... 
    Permanent employment
    Temporary work
    Relocation

    xAI

    Palo Alto, CA
    a month ago
  • $217.57k - $260k

     ...Identity Left Behind" to enable all people to have a secure digital identity. To learn more, visit Role Overview The Staff Site Reliability Engineer, Infrastructure role is building a high-scale infrastructure team responsible for owning environments with thousands of... 
    Full time
    Temporary work
    Work at office
    Remote work
    Flexible hours
    Shift work

    ID.me

    Mountain View, CA
    23 days ago
  • $232k - $263k

     ...scaling quickly toward long-term growth and IPO readiness. Join us as we define the future of SaaS security! Sr. Staff Site Reliability Engineer As a Sr. Staff SRE at Obsidian , you will define and drive the company-wide reliability vision for a complex, multi-tenant... 
    Work from home
    Flexible hours

    Obsidian Security

    Palo Alto, CA
    22 days ago
  •  ..., and the challenges of building in a high-growth startup, we’d love to talk. This is more than a job—it’s a journey. Site Reliability Engineers (SREs) are responsible for the overall performance and reliability of ASAPP's infrastructure and products. The team owns... 
    Remote work

    ASAPP

    Mountain View, CA
    14 days ago
  • $174k - $252k

    Senior Software Engineer, Site Reliability Engineering X Applicants in San Francisco: Qualified applications with arrest or conviction records will...  ...analyzing, and troubleshooting large-scale distributed systems. 2 years of experience leading projects and providing... 
    Full time

    Google Inc.

    Sunnyvale, CA
    4 days ago
  • $180k - $360k

     ...knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who...  ...Cybersecurity / SRE team is focused on ensuring the security and reliability of X Money. This role will primarily focus on the X Money platform... 
    Temporary work
    Relocation

    Pantera Capital

    Palo Alto, CA
    2 days ago
  •  ...Software Engineer 2 Location: Newark, CA Duration: 9 months (Contract to Hire) Role Overview We are seeking a skilled ADAS Sensor...  .... Optimize sensor data pipelines for performance and reliability on Linux-based systems. Participate in ADAS feature development... 
    Contract work
    Immediate start

    Kasmo Global

    Newark, CA
    5 days ago
  •  ...Engineer Software 2 A major Aerospace company is looking for an Engineer Software 2 with extensive LabVIEW programming experience related...  ...the LabVIEW programming language. This position will serve on-site at Sunnyvale, CA. Responsibilities: Designs and develops data... 

    Cenergy Corporation

    Sunnyvale, CA
    4 days ago
  • $180k - $260k

     ...facilitating effortless integration into customers’ logistics operations. About the role We are seeking an experienced Senior/Staff Site Reliability Engineer to support the operation, monitoring, and scaling of our growing fleet of autonomous vehicles. In this role, you will work... 
    Odd job
    Work at office
    Remote work

    Booster

    Mountain View, CA
    4 days ago
  • $250k

     ...the single source of truth—explainable, reliable, and maintainable—that serves as the...  ...scale. Position Overview As Director of Site Reliability Engineering, you will ensure that eGain’s AI...  ...section - this is a take-home test Step 2 Panel interview (in-person at eGain Sunnyvale... 
    Work at office

    eGain Corporation

    Sunnyvale, CA
    4 days ago
  • $175k - $215k

     ...Software Reliability Engineer Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since...  ...and practicing blameless retrospectives You have: ~2+ years of experience writing clean, efficient code in C++, Java... 
    Full time
    Remote work

    Waymo

    Mountain View, CA
    2 days ago
  •  ...environment. Required Qualifications: Bachelor's degree in System Engineering, Electrical Engineering, or Computer Science. 5+ years of...  .... Several years of working experience at automotive Tier 1/2 suppliers or OEMs, with a deep understanding of automotive processes... 
    Work experience placement

    Tranzeal

    Newark, CA
    3 days ago
  •  ...GraphQL, Kotlin) - Hybrid Location: Sunnyvale, CA (Hybrid - 2 days on Site) Duration: 3 months Rate: DOE U.S. Citizens and those...  ...design. Lead the work of other small groups of three to five engineers. Troubleshoot business and production issues. Ensure... 

    Staffing the Universe

    Sunnyvale, CA
    11 days ago
  •  ...following opportunity, please contact our Talent Specialist, Amit at (***) ***-**** or Vijay at (***) ***-**** Title: Endpoint Engineer - Hybrid (2 Openings) Duration: 6 Months Location: Onsite, Palo Alto, CA Only W2 candidates are eligible for this position.... 
    Contract work

    divihn.com

    Palo Alto, CA
    4 days ago
  •  ...We are seeking a hands-on Integration Test Engineer to lead system integration and white-box testing for in-vehicle infotainment (IBI) systems. This role focuses on hardware/software integration, troubleshooting, and validation of complex systems. If you love solving... 

    Insight Global

    Palo Alto, CA
    16 hours ago
  • A leading tech recruiting firm is seeking a Site Reliability Engineer to manage and optimize cloud infrastructure primarily using GCP or AWS. The role involves maintaining high availability through Kubernetes clusters and improving CI/CD pipelines with Terraform. Ideal... 

    Amiri Recruiting

    Mountain View, CA
    4 days ago
  • $141.3k - $226k

     ...account before you apply for a job. (Click Sign In Create Account) 2. If you already have a Candidate Account, please Sign-In before...  ...Job Description: OS kernel and system software development engineer ESX CPU and Server Platform At VMware by Broadcom, we are building... 
    Local area

    Broadcom Corporation

    Palo Alto, CA
    6 days ago
  • $84.65k - $111.55k

     ...Web Developer 2 – Hybrid or Remote The School of Humanities and Sciences (H&S) is...  ...technology that meets a critical need for research sites in diverse communities and can be used...  .../or modify clean, well-structured, search engine optimization-friendly documented code.... 
    Fixed term contract
    Remote work

    Stanford

    Stanford, CA
    1 day ago
  •  ...high-performance applications using Spring Boot and Kafka, as well as engaging in Agile development practices. The candidate will work 2 days a week in the office and will be considered for extension or conversion to full-time employment based on performance. #J-18808-... 
    Full time
    For contractors
    Work at office
    2 days per week

    Nerdleveltech

    Sunnyvale, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Site Reliability Engineer - 2. Be the first to apply!