Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Systems Engineer - Site Reliability Engineering

$96k - $151.8k

Marriott

Additional Information Bethesda, MD Pay Range: $96,000-$151,800 annuallyRemote Pay Range: $87,300-$138,000 annually

Job Number 26063090

Job Category Information Technology

Location 7750 Wisconsin Ave, Bethesda, Maryland, United States, 20814 VIEW ON MAP (

Schedule Full Time

Located Remotely? Y

Position Type Management

Bonus Eligible: Y

Expiration Date: 06/22/2026

JOB SUMMARY:

The Systems Engineer - Site Reliability Engineering (SRE) is responsible for the reliability, scalability, and performance of mission-critical cloud and on-prem services that support millions of Marriot customers globally. This role involves overseeing incident management, driving automation efforts, and working closely with cross-functional teams to ensure alignment between SRE strategy and business objectives. Partners closely with Product Teams, Applications teams, Infrastructure, and the broader Applications and Infrastructure Delivery teams to develop key metrics and KPIs to improve applications stability, availability and performance. The ideal candidate will bring strong communication skills, collaborating with key stakeholders across the company to optimize cloud infrastructure and uphold the highest standards of operational excellence in a dynamic, fast-paced environment.

CANDIDATE PROFILE:

Required :

  • Undergraduate degree in an engineering or computer science discipline and/or equivalent experience/certification

  • 5+ years of hands-on experience in designing, building and operating production grade systems including:

  • 2+ years of experience as a Site Reliability Engineer (SRE), building and managing highly available and mission critical systems

  • Deep understanding of SRE practices, such as Service Level Objectives, Error Budgets, Toil Management, Observability & Monitoring, Blameless Postmortems, Incident Response Process, Capacity Planning

  • Expertise in AWS services including designing highly available , multi-AZ and multi-region architectures, for example:

  • Compute: EC2, Auto Scaling, Lambda

  • Containers: EKS (Mandatory), ECS (good to have)

  • Networking: VPC, subnets, route tables, NAT gateways, Transit Gateway

  • Security: IAM roles/Policies, KMS, Secret manager

  • Storage and Databases: S3, EBS, EFS, RDS, DocumentDB.

  • Proven automation and programming experience in one or more of the following languages: Python, PowerShell

  • Experience using modern, continuous development techniques and pipelines (e.g. Agile, Kanban, Jira, CI/CD, Helm, Harness, Jenkins, Git, Artifactory, Vault)

  • Experience designing and implementing end-to-end observability solutions across metrics, logs, and traces using tools like Prometheus, Grafana, ELK Stack, and OpenTelemetry.

  • Hands-on experience with Linux administration(RHEL, Ubuntu, CentOS, AWS Linux)

  • Experience troubleshooting API-related issues in distributed systems, including latency, authentication/authorization failures, rate limiting, and upstream/downstream dependency failures.

  • Experience with containerization orchestration engines such as Kubernetes (EKS, AKS, ACK)

  • Familiarity with service mesh technologies to enable secure and resilient service communication, including mTLS, traffic shaping, and policy enforcement.

  • Familiarity with Infrastructure as Code (Iac) tools like Terraform and CloudFormation.

  • Familiarity with configuration management and automation tools such as Ansible.

  • Familiarity with vulnerability management, OS hardening, patching, security compliance of infrastructure, applications and databases

  • Understanding of basic networking fundamentals

Preferred :

  • Experience driving cloud cost optimization initiatives (rightsizing, reserved instances, autoscaling strategies, cost observability)

  • Networking expertise including Load Balancing, Firewalls, Security Groups, NACLs, TCP/IP, DNS, SSL/TLS etc

CORE WORK ACTIVITIES :

  • Ensure the reliability, availability, and performance of mission-critical cloud services, implementing best practices for monitoring, alerting, and incident management.

  • Oversee the management of high-severity incidents, driving quick resolution and post-incident analysis to identify root causes and prevent recurrence.

  • Drive the automation of operational processes and ensure systems can scale effectively to support growing user demand, optimizing cloud and on-prem infrastructure and resource usage.

  • Develop and execute the SRE strategy aligned with business goals, and communicate service health, reliability, and performance metrics to senior leadership and stakeholders

Drive Applications Performance Management and Monitoring :

  • Assess application architectures to identify key monitoring points

  • Identify Key Performance Indicators, apply monitoring, and report out on compliance.

  • Gather information to develop reporting metrics and KPIs

  • Ensure that all applications adhere to appropriate monitoring standards based on their technology/business process

  • Determine forums and cadence to provide regular monitoring updates

Building Successful Relationships :

  • Collaborates with Enterprise Application and Architecture and Infrastructure teams to continuously improve processes and procedures.

  • Liaises with vendors and Service Providers to select services and tools that best meet company goals

Managing Projects and Priorities :

  • Develops specific goals and plans to prioritize, organize, and accomplish work.

  • Champions leaders' vision for product and service delivery.

  • Executes the necessary decisions to keep moving forward toward achievement of goals.

  • Determines priorities, schedules, plans and necessary resources to promote completion of any projects on schedule.

Delivering on the Needs of Key Stakeholders :

  • Understands and meets the needs of key stakeholders.

  • Communicates concepts in a clear and persuasive manner that is easy to understand.

  • Demonstrates an understanding of business priorities.

  • Supports achievement of performance goals, budget goals, team goals, etc.

Providing Technical Support and Consultation :

  • Provides technical expertise within own and other teams.

  • Provides recommendations to improve the effectiveness of processes and programs.

  • Demonstrates advanced knowledge of job-relevant issues, products, systems, and processes.

  • Keeps up-to-date technically and applies new knowledge to job.

  • Performs other reasonable duties as required for this position.

At Marriott International, we are dedicated to being an equal opportunity employer, welcoming all and providing access to opportunity. We actively foster an environment where the unique backgrounds of our associates are valued and celebrated.?Our greatest strength lies in the rich blend of culture, talent, and experiences of our associates. ?We are committed to non-discrimination on any protected basis, including disability, veteran status, or other basis protected by applicable law.

All positions offer a 401(k) plan, stock purchase plan, discounts at Marriott properties, commuter benefits, employee assistance plan, and childcare discounts. Benefits are subject to terms and conditions, which may include rules regarding eligibility, enrollment, waiting period, contribution, benefit limits, election changes, benefit exclusions, and others. Click here ( to learn more.

Full-time positions also offer coverage for medical, dental, vision, health care flexible spending account, dependent care flexible spending account, life insurance, disability insurance, accident insurance, adoption expense reimbursements, paid parental leave and educational assistance.

Washington Applicants Only : Employees will accrue paid sick leave, 0.077 PTO balance for every hour worked and be eligible to receive a minimum of 9 holidays annually.

Marriott HQ is committed to a hybrid work environment that enables associates to Be connected. Headquarters-based positions are considered hybrid, for candidates within a commuting distance to Bethesda, MD; candidates outside of commuting distance to Bethesda, MD will be considered for Remote positions.

Marriott International is the world's largest hotel company, with more brands, more hotels and more opportunities for associates to grow and succeed. Be where you can do your best work,? begin your purpose, belong to an amazing global? team, and become the best version of you.

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Systems Engineer - Site Reliability Engineering in Bethesda, MD vacancy
  • Geico is seeking a Staff Engineer to innovate and enhance systems while mentoring engineers and collaborating across teams. This position involves utilizing programming languages like Go and Python, working with Azure services, Docker, and Kubernetes, and requires 6+ years... 
    Suggested

    Geico

    Bethesda, MD
    5 days ago
  • $110k - $230k

     ...and Great Careers. GEICO's Cyber Security Engineering & Analytics, Automation (SEA) team is seeking a Staff Cyber Site Reliability Engineer (SRE) — a hands-on, engineering-minded...  ...reliable, observable, and scalable systems at the intersection of security and infrastructure... 
    Suggested
    Hourly pay
    Full time
    Work experience placement
    Local area
    Flexible hours

    GEICO

    Bethesda, MD
    2 days ago
  • $51.9 per hour

     ...This job is responsible for the reliability, availability, and performance of critical healthcare IT systems, principally in the...  ...efficiency. This role blends software engineering, clinical engineering, and...  ...cross-functionally with AHN site leaders and teams to navigate... 
    Suggested
    For contractors
    Local area

    Highmark Health

    Washington DC
    4 days ago
  • $160k - $200k

     ...Engineering Leader Filevine is a Legal AI company delivering Legal Operating Intelligence...  ...of legal work. Grounded in a singular system of truth, Filevine brings together data...  ...engineering leader to spearhead system reliability, drive platform project execution, and... 
    Suggested
    Full time
    Temporary work
    Work experience placement

    Filevine

    Washington DC
    3 days ago
  • $55.2k - $126k

     ...to expect during your journey as a candidate with us. Engineering to make a system more resilient and efficient frees up time and money to...  ...a passion for making systems better, we need you! As a site reliability engineer on our team, you’ll help our Platform Engineering... 
    Suggested
    Full time
    Contract work
    Part time
    Local area
    Remote work

    Phase2 Technology

    Mc Lean, VA
    2 days ago
  • A leading technology company is seeking a Senior Site Reliability Engineer in Virginia. The role involves maintaining a Kubernetes-based platform, ensuring high availability, and automating infrastructure processes with tools like Terraform. The ideal candidate will have... 
    Remote job
    Flexible hours

    Workday, Inc.

    Mc Lean, VA
    1 day ago
  • Senior Site Reliability Engineer Job Description Overview CoStar Group (NASDAQ: CSGP) is a leading global provider of commercial and residential...  ...-time data, millions of active users, and mission-critical systems across a globally distributed platform. As we scale, we're... 
    Full time
    Work at office
    Work from home
    Monday to Thursday

    Visual Lease

    Arlington, VA
    1 day ago
  • $147.4k - $221.2k

    Senior Site Reliability Engineer page is loaded## Senior Site Reliability Engineerremote type: Flexlocations: USA, VA, McLean: USA.VA.Restontime type: Full Timeposted on: Posted Yesterdayjob requisition id: JR-0104084**Your work days are brighter here.**We’re obsessed with... 
    Work experience placement
    Work at office
    Remote work
    Home office
    Flexible hours

    Workday, Inc.

    Mc Lean, VA
    1 day ago
  • $166k - $220k

    ABOUT THE JOB As a site reliability engineer in Platform Discovery, you will solve a wide variety of problems involving networking, autonomy, systems integration, robotics, and more, while making pragmatic engineering tradeoffs along the way. Your efforts will ensure that... 
    Full time
    Work experience placement
    Relocation package

    Slope

    Washington DC
    1 day ago
  • Role Overview We are seeking a high-caliber Site Reliability Engineer (SRE) to join our Forward Engineering team. You will be the guardian of our...  .... This role is a hybrid of software engineering and systems architecture, with a specialized focus on MLOps —bridging the... 
    Local area

    Tiger Analytics Inc.

    Washington DC
    1 day ago
  • $125k - $200k

    Overview As a Site Reliability Engineer (SRE) , you will help design, build, and operate reliable, secure, and observable cloud‑native systems that support mission‑critical applications and services. You will blend software engineering, DevOps practices, and infrastructure... 
    Local area
    2 days per week

    Steampunk

    Mc Lean, VA
    4 days ago
  • Description Onsite in Washington, DC Our client seeks a Sr. Site Reliability Engineer III to design, automate, and operate mission-critical systems for federal environments. The role focuses on Kubernetes or VMWare platforms, CI/CD enablement, observability, and developer... 
    Permanent employment
    Full time
    Immediate start

    Eliassen Group

    Washington DC
    5 days ago
  •  ...customers rely on us in the moments that matter. Engineering delivers on that promise. The Senior Site Reliability Engineer is responsible for ensuring our SaaS...  ...experience. The SRE goal is to build automated systems that reduce or eliminate manual work to keep our... 
    Work experience placement
    Remote work
    Flexible hours

    Donnelley Financial, LLC

    Rockville, MD
    9 days ago
  • $60 per hour

     ...contribute to developing cutting-edge AI systems, while enjoying the flexibility of remote...  ...full-stack, machine learning, and other engineers — who are driving real-world impact in AI...  ...countries will not see work or assessments available on our site at this time.J-18808-Ljbffr... 
    Hourly pay
    Full time
    Remote work
    Flexible hours

    DataAnnotation

    Washington DC
    3 days ago
  • $55.2k - $126k

    A leading consulting firm in McLean, Virginia, is seeking a Site Reliability Engineer to enhance system resilience and efficiency. Key duties include developing robust infrastructure, implementing automation, and reducing manual tasks. The role requires experience with... 
    Remote job

    Booz Allen Hamilton

    Mc Lean, VA
    1 day ago
  • A technology solutions provider seeks a System Developer based in Washington, DC, to support operations for the Small Business Administration...  ...Entra services. A minimum of 5 years of experience in systems engineering is required along with a Bachelor's degree in Computer Science... 
    Local area

    Highlighttech

    Washington DC
    3 days ago
  • $126k - $248k

     ..., you will partner with SRE leaders and engineers to scale the platform that underpins all...  ...program execution, strengthen production reliability practices, and coordinate cross-...  ...complex, multi-team efforts Build Scalable Systems & Processes - Design lightweight frameworks... 
    Local area
    Remote work
    Worldwide
    Flexible hours

    MongoDB

    Washington DC
    1 day ago
  • Salesforce is seeking a Site Reliability Engineer in Washington, DC to ensure cloud services availability. This role involves monitoring services...  ...incident management, and driving automation for resilient systems. Candidates should have a Bachelor's in Computer Science or... 

    Salesforce

    Washington DC
    2 days ago
  • Overview of the Role: Join our Site Reliability Engineering (SRE) team, where you'll work alongside Infrastructure and Research & Development (R&...  ...incidents fast, drive automation, and help build the resilient systems that millions of customers depend on every day. This role... 
    Work experience placement

    Relha LLC

    Washington DC
    5 days ago
  • Job Category: Software Engineering About Salesforce Salesforce is the #1 AI CRM, where...  ...it all. Overview Of The Role Join our Site Reliability Engineering (SRE) team, where you’ll work...  ..., and help build the resilient systems that millions of customers depend on every... 
    Work experience placement

    Salesforce

    Washington DC
    4 days ago
  •  ...to protect our country from threats. Job Description SITE RELIABILITY ENGINEER (SRE) Own your opportunity. Make your impact As a Site...  ...more than 250 global sites. You will engineer and optimize systems, automate operational workflows, strengthen monitoring... 

    General Dynamics Information Technology

    Washington DC
    23 days ago
  • $150k - $195k

     ...Senior SRE. Responsibilities Own the reliability of one product domain end-to-end...  ...reliability reviews with that domain's product engineering team Participate in a 1-week-in-6...  ...engineering at companies running production systems for paying customers ~ Hands-on... 
    Full time
    Seasonal work
    Immediate start
    Remote work
    Home office
    Shift work

    Careerscape

    Washington DC
    a month ago
  • $84.24k - $142.48k

     ...collaboratively with our talented team of dynamic and passionate engineers to deliver capabilities that enable our customers to make a...  ...platform Design, implement, configure, and utilize monitoring systems to monitor the health of SaaS products Manage infrastructure... 
    Worldwide
    Flexible hours

    Esri

    Vienna, VA
    1 day ago
  • $100k - $160k

     ...innovative and trusted results. We are looking for a dynamic Site Reliability Engineer (SRE) with a Top Secret clearance to join our team! The...  ..., and infrastructure Security & Compliance: Ensure all systems follow best practices in terms of security and compliance... 
    Full time
    Temporary work

    CATHEXIS

    Tysons, VA
    a month ago
  • $120k - $214k

     ...industry, Anduril is changing how military systems are designed, built and sold. Anduril’s...  ...to our customers. System Deployment Engineers work in complex environments with...  ...demonstrations and exercises Work with site reliability engineers to provide and refine requirements... 
    Full time
    Temporary work
    Work experience placement
    Local area
    Relocation package

    Anduril Industries

    Washington DC
    more than 2 months ago
  •  ...Vexterra Group is currently searching for a Senior Systems Integrator Engineer to provide the following systems support in Reston VA or Bethesda, MD office: You will work closely with other infrastructure and network engineers, system engineers and O&M team members... 
    Work at office
    Remote work
    Flexible hours

    Vexterra Group

    Bethesda, MD
    2 days ago
  • $188k - $235k

     ...TechOps) team, we live this mission by building the most reliable and performant systems on the planet. We empower organizations to do their...  ...Role We are looking for an experienced Senior Site Reliability Engineer (SRE) who thrives on the challenge of managing large-... 
    Permanent employment
    Full time
    Work at office
    Local area
    Flexible hours

    Okta

    Washington DC
    more than 2 months ago
  • $194k - $267k

     ...to you. We are seeking a highly technical Observability Site Reliability Engineer with a specialty in Google Cloud, to own and expand our Observability...  ...of agents and collectors across complex distributed systems. Key Responsibilities Automated Infrastructure:... 
    Permanent employment
    Full time
    Work at office
    Local area
    Flexible hours

    Okta

    Washington DC
    more than 2 months ago
  • $194k - $267k

     ...educate on new concepts and tools. Position Overview: The Site Reliability Engineer (SRE) will play a key role in building and managing...  ...Troubleshooting: Respond to incidents, troubleshoot, and resolve system issues related to performance, availability, and security... 
    Permanent employment
    Work at office
    Local area
    Worldwide
    Flexible hours

    Okta

    Washington DC
    more than 2 months ago
  • $207k - $284.9k

     ...is an opportunity to do career-defining work. We're all in on this mission. If you are too, let's talk. Senior Manager, Site Reliability Engineering District of Columbia Area Secure Every Identity, from AI to Human Identity is the key to unlocking the potential... 
    Permanent employment
    Local area
    Worldwide
    Flexible hours

    Okta

    Washington DC
    8 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Systems Engineer - Site Reliability Engineering. Be the first to apply!