Systems Engineer - Site Reliability Engineering

$96k - $151.8k

Marriott

Bethesda, MD

Additional Information Bethesda, MD Pay Range: $96,000-$151,800 annuallyRemote Pay Range: $87,300-$138,000 annually

Job Number 26063090

Job Category Information Technology

Location 7750 Wisconsin Ave, Bethesda, Maryland, United States, 20814 VIEW ON MAP (

Schedule Full Time

Located Remotely? Y

Position Type Management

Bonus Eligible: Y

Expiration Date: 06/22/2026

JOB SUMMARY:

The Systems Engineer - Site Reliability Engineering (SRE) is responsible for the reliability, scalability, and performance of mission-critical cloud and on-prem services that support millions of Marriot customers globally. This role involves overseeing incident management, driving automation efforts, and working closely with cross-functional teams to ensure alignment between SRE strategy and business objectives. Partners closely with Product Teams, Applications teams, Infrastructure, and the broader Applications and Infrastructure Delivery teams to develop key metrics and KPIs to improve applications stability, availability and performance. The ideal candidate will bring strong communication skills, collaborating with key stakeholders across the company to optimize cloud infrastructure and uphold the highest standards of operational excellence in a dynamic, fast-paced environment.

CANDIDATE PROFILE:

Required :

Undergraduate degree in an engineering or computer science discipline and/or equivalent experience/certification
5+ years of hands-on experience in designing, building and operating production grade systems including:
2+ years of experience as a Site Reliability Engineer (SRE), building and managing highly available and mission critical systems
Deep understanding of SRE practices, such as Service Level Objectives, Error Budgets, Toil Management, Observability & Monitoring, Blameless Postmortems, Incident Response Process, Capacity Planning
Expertise in AWS services including designing highly available , multi-AZ and multi-region architectures, for example:
Compute: EC2, Auto Scaling, Lambda
Containers: EKS (Mandatory), ECS (good to have)
Networking: VPC, subnets, route tables, NAT gateways, Transit Gateway
Security: IAM roles/Policies, KMS, Secret manager
Storage and Databases: S3, EBS, EFS, RDS, DocumentDB.
Proven automation and programming experience in one or more of the following languages: Python, PowerShell
Experience using modern, continuous development techniques and pipelines (e.g. Agile, Kanban, Jira, CI/CD, Helm, Harness, Jenkins, Git, Artifactory, Vault)
Experience designing and implementing end-to-end observability solutions across metrics, logs, and traces using tools like Prometheus, Grafana, ELK Stack, and OpenTelemetry.
Hands-on experience with Linux administration(RHEL, Ubuntu, CentOS, AWS Linux)
Experience troubleshooting API-related issues in distributed systems, including latency, authentication/authorization failures, rate limiting, and upstream/downstream dependency failures.
Experience with containerization orchestration engines such as Kubernetes (EKS, AKS, ACK)
Familiarity with service mesh technologies to enable secure and resilient service communication, including mTLS, traffic shaping, and policy enforcement.
Familiarity with Infrastructure as Code (Iac) tools like Terraform and CloudFormation.
Familiarity with configuration management and automation tools such as Ansible.
Familiarity with vulnerability management, OS hardening, patching, security compliance of infrastructure, applications and databases
Understanding of basic networking fundamentals

Preferred :

Experience driving cloud cost optimization initiatives (rightsizing, reserved instances, autoscaling strategies, cost observability)
Networking expertise including Load Balancing, Firewalls, Security Groups, NACLs, TCP/IP, DNS, SSL/TLS etc

CORE WORK ACTIVITIES :

Ensure the reliability, availability, and performance of mission-critical cloud services, implementing best practices for monitoring, alerting, and incident management.
Oversee the management of high-severity incidents, driving quick resolution and post-incident analysis to identify root causes and prevent recurrence.
Drive the automation of operational processes and ensure systems can scale effectively to support growing user demand, optimizing cloud and on-prem infrastructure and resource usage.
Develop and execute the SRE strategy aligned with business goals, and communicate service health, reliability, and performance metrics to senior leadership and stakeholders

Drive Applications Performance Management and Monitoring :

Assess application architectures to identify key monitoring points
Identify Key Performance Indicators, apply monitoring, and report out on compliance.
Gather information to develop reporting metrics and KPIs
Ensure that all applications adhere to appropriate monitoring standards based on their technology/business process
Determine forums and cadence to provide regular monitoring updates

Building Successful Relationships :

Collaborates with Enterprise Application and Architecture and Infrastructure teams to continuously improve processes and procedures.
Liaises with vendors and Service Providers to select services and tools that best meet company goals

Managing Projects and Priorities :

Develops specific goals and plans to prioritize, organize, and accomplish work.
Champions leaders' vision for product and service delivery.
Executes the necessary decisions to keep moving forward toward achievement of goals.
Determines priorities, schedules, plans and necessary resources to promote completion of any projects on schedule.

Delivering on the Needs of Key Stakeholders :

Understands and meets the needs of key stakeholders.
Communicates concepts in a clear and persuasive manner that is easy to understand.
Demonstrates an understanding of business priorities.
Supports achievement of performance goals, budget goals, team goals, etc.

Providing Technical Support and Consultation :

Provides technical expertise within own and other teams.
Provides recommendations to improve the effectiveness of processes and programs.
Demonstrates advanced knowledge of job-relevant issues, products, systems, and processes.
Keeps up-to-date technically and applies new knowledge to job.
Performs other reasonable duties as required for this position.

At Marriott International, we are dedicated to being an equal opportunity employer, welcoming all and providing access to opportunity. We actively foster an environment where the unique backgrounds of our associates are valued and celebrated.?Our greatest strength lies in the rich blend of culture, talent, and experiences of our associates. ?We are committed to non-discrimination on any protected basis, including disability, veteran status, or other basis protected by applicable law.

All positions offer a 401(k) plan, stock purchase plan, discounts at Marriott properties, commuter benefits, employee assistance plan, and childcare discounts. Benefits are subject to terms and conditions, which may include rules regarding eligibility, enrollment, waiting period, contribution, benefit limits, election changes, benefit exclusions, and others. Click here ( to learn more.

Full-time positions also offer coverage for medical, dental, vision, health care flexible spending account, dependent care flexible spending account, life insurance, disability insurance, accident insurance, adoption expense reimbursements, paid parental leave and educational assistance.

Washington Applicants Only : Employees will accrue paid sick leave, 0.077 PTO balance for every hour worked and be eligible to receive a minimum of 9 holidays annually.

Marriott HQ is committed to a hybrid work environment that enables associates to Be connected. Headquarters-based positions are considered hybrid, for candidates within a commuting distance to Bethesda, MD; candidates outside of commuting distance to Bethesda, MD will be considered for Remote positions.

Marriott International is the world's largest hotel company, with more brands, more hotels and more opportunities for associates to grow and succeed. Be where you can do your best work,? begin your purpose, belong to an amazing global? team, and become the best version of you.

Apply

Vacancy posted 1 day ago

Similar jobs that could be interesting for youBased on the Systems Engineer - Site Reliability Engineering in Bethesda, MD vacancy

Staff Site Reliability Engineer (SRE) - Cloud & Systems
Geico is seeking a Staff Engineer to innovate and enhance systems while mentoring engineers and collaborating across teams. This position involves utilizing programming languages like Go and Python, working with Azure services, Docker, and Kubernetes, and requires 6+ years...
Suggested
Geico
Bethesda, MD
5 days ago
Staff Cyber Site Reliability Engineer (SRE)
$110k - $230k
...and Great Careers. GEICO's Cyber Security Engineering & Analytics, Automation (SEA) team is seeking a Staff Cyber Site Reliability Engineer (SRE) — a hands-on, engineering-minded... ...reliable, observable, and scalable systems at the intersection of security and infrastructure...
Suggested
Hourly pay
Full time
Work experience placement
Local area
Flexible hours
GEICO
Bethesda, MD
2 days ago
Manager Site Reliability Engineering
$51.9 per hour
...This job is responsible for the reliability, availability, and performance of critical healthcare IT systems, principally in the... ...efficiency. This role blends software engineering, clinical engineering, and... ...cross-functionally with AHN site leaders and teams to navigate...
Suggested
For contractors
Local area
Highmark Health
Washington DC
4 days ago
Manager, Site Reliability Engineering I
$160k - $200k
...Engineering Leader Filevine is a Legal AI company delivering Legal Operating Intelligence... ...of legal work. Grounded in a singular system of truth, Filevine brings together data... ...engineering leader to spearhead system reliability, drive platform project execution, and...
Suggested
Full time
Temporary work
Work experience placement
Filevine
Washington DC
3 days ago
University - Site Reliability Engineer
$55.2k - $126k
...to expect during your journey as a candidate with us. Engineering to make a system more resilient and efficient frees up time and money to... ...a passion for making systems better, we need you! As a site reliability engineer on our team, you’ll help our Platform Engineering...
Suggested
Full time
Contract work
Part time
Local area
Remote work
Phase2 Technology
Mc Lean, VA
2 days ago
Senior Site Reliability Engineer - Flexible/Remote (K8s)
A leading technology company is seeking a Senior Site Reliability Engineer in Virginia. The role involves maintaining a Kubernetes-based platform, ensuring high availability, and automating infrastructure processes with tools like Terraform. The ideal candidate will have...
Remote job
Flexible hours
Workday, Inc.
Mc Lean, VA
1 day ago
Senior Site Reliability Engineer
Senior Site Reliability Engineer Job Description Overview CoStar Group (NASDAQ: CSGP) is a leading global provider of commercial and residential... ...-time data, millions of active users, and mission-critical systems across a globally distributed platform. As we scale, we're...
Full time
Work at office
Work from home
Monday to Thursday
Visual Lease
Arlington, VA
1 day ago
Senior Site Reliability Engineer
$147.4k - $221.2k
Senior Site Reliability Engineer page is loaded## Senior Site Reliability Engineerremote type: Flexlocations: USA, VA, McLean: USA.VA.Restontime type: Full Timeposted on: Posted Yesterdayjob requisition id: JR-0104084**Your work days are brighter here.**We’re obsessed with...
Work experience placement
Work at office
Remote work
Home office
Flexible hours
Workday, Inc.
Mc Lean, VA
1 day ago
Site Reliability Engineer, Discovery
$166k - $220k
ABOUT THE JOB As a site reliability engineer in Platform Discovery, you will solve a wide variety of problems involving networking, autonomy, systems integration, robotics, and more, while making pragmatic engineering tradeoffs along the way. Your efforts will ensure that...
Full time
Work experience placement
Relocation package
Slope
Washington DC
1 day ago
Sr. Site Reliability Engineer
Role Overview We are seeking a high-caliber Site Reliability Engineer (SRE) to join our Forward Engineering team. You will be the guardian of our... .... This role is a hybrid of software engineering and systems architecture, with a specialized focus on MLOps —bridging the...
Local area
Tiger Analytics Inc.
Washington DC
1 day ago
Site Reliability Engineer
$125k - $200k
Overview As a Site Reliability Engineer (SRE) , you will help design, build, and operate reliable, secure, and observable cloud‑native systems that support mission‑critical applications and services. You will blend software engineering, DevOps practices, and infrastructure...
Local area
2 days per week
Steampunk
Mc Lean, VA
4 days ago
Sr. Site Reliability Engineer III
Description Onsite in Washington, DC Our client seeks a Sr. Site Reliability Engineer III to design, automate, and operate mission-critical systems for federal environments. The role focuses on Kubernetes or VMWare platforms, CI/CD enablement, observability, and developer...
Permanent employment
Full time
Immediate start
Eliassen Group
Washington DC
5 days ago
Senior Site Reliability Engineer - Cloud
...customers rely on us in the moments that matter. Engineering delivers on that promise. The Senior Site Reliability Engineer is responsible for ensuring our SaaS... ...experience. The SRE goal is to build automated systems that reduce or eliminate manual work to keep our...
Work experience placement
Remote work
Flexible hours
Donnelley Financial, LLC
Rockville, MD
9 days ago
Site Reliability Engineer - AI Trainer
$60 per hour
...contribute to developing cutting-edge AI systems, while enjoying the flexibility of remote... ...full-stack, machine learning, and other engineers — who are driving real-world impact in AI... ...countries will not see work or assessments available on our site at this time.J-18808-Ljbffr...
Hourly pay
Full time
Remote work
Flexible hours
DataAnnotation
Washington DC
3 days ago
Remote Site Reliability Engineer - Cloud, Kubernetes & CI/CD
$55.2k - $126k
A leading consulting firm in McLean, Virginia, is seeking a Site Reliability Engineer to enhance system resilience and efficiency. Key duties include developing robust infrastructure, implementing automation, and reducing manual tasks. The role requires experience with...
Remote job
Booz Allen Hamilton
Mc Lean, VA
1 day ago
Site Reliability & Automation Engineer
A technology solutions provider seeks a System Developer based in Washington, DC, to support operations for the Small Business Administration... ...Entra services. A minimum of 5 years of experience in systems engineering is required along with a Bachelor's degree in Computer Science...
Local area
Highlighttech
Washington DC
3 days ago
Staff Technical Program Manager, Site Reliability Engineering
$126k - $248k
..., you will partner with SRE leaders and engineers to scale the platform that underpins all... ...program execution, strengthen production reliability practices, and coordinate cross-... ...complex, multi-team efforts Build Scalable Systems & Processes - Design lightweight frameworks...
Local area
Remote work
Worldwide
Flexible hours
MongoDB
Washington DC
1 day ago
Site Reliability Engineer — Cloud Resilience & Automation
Salesforce is seeking a Site Reliability Engineer in Washington, DC to ensure cloud services availability. This role involves monitoring services... ...incident management, and driving automation for resilient systems. Candidates should have a Bachelor's in Computer Science or...
Salesforce
Washington DC
2 days ago
SRE Systems Engineer
Overview of the Role: Join our Site Reliability Engineering (SRE) team, where you'll work alongside Infrastructure and Research & Development (R&... ...incidents fast, drive automation, and help build the resilient systems that millions of customers depend on every day. This role...
Work experience placement
Relha LLC
Washington DC
5 days ago
SRE Systems Engineer - (TS/SCI Clearance)
Job Category: Software Engineering About Salesforce Salesforce is the #1 AI CRM, where... ...it all. Overview Of The Role Join our Site Reliability Engineering (SRE) team, where you’ll work... ..., and help build the resilient systems that millions of customers depend on every...
Work experience placement
Salesforce
Washington DC
4 days ago
Site Reliability Engineer - TS/SCI with Poly
...to protect our country from threats. Job Description SITE RELIABILITY ENGINEER (SRE) Own your opportunity. Make your impact As a Site... ...more than 250 global sites. You will engineer and optimize systems, automate operational workflows, strengthen monitoring...
General Dynamics Information Technology
Washington DC
23 days ago
Site Reliability Engineer
$150k - $195k
...Senior SRE. Responsibilities Own the reliability of one product domain end-to-end... ...reliability reviews with that domain's product engineering team Participate in a 1-week-in-6... ...engineering at companies running production systems for paying customers ~ Hands-on...
Full time
Seasonal work
Immediate start
Remote work
Home office
Shift work
Careerscape
Washington DC
a month ago
Sr. Site Reliability Engineer - AWS Geospatial Technology
$84.24k - $142.48k
...collaboratively with our talented team of dynamic and passionate engineers to deliver capabilities that enable our customers to make a... ...platform Design, implement, configure, and utilize monitoring systems to monitor the health of SaaS products Manage infrastructure...
Worldwide
Flexible hours
Esri
Vienna, VA
1 day ago
Site Reliability Engineer - Top Secret (req-236)
$100k - $160k
...innovative and trusted results. We are looking for a dynamic Site Reliability Engineer (SRE) with a Top Secret clearance to join our team! The... ..., and infrastructure Security & Compliance: Ensure all systems follow best practices in terms of security and compliance...
Full time
Temporary work
CATHEXIS
Tysons, VA
a month ago
Deployment Site Reliability Engineer - Connected Warfare
$120k - $214k
...industry, Anduril is changing how military systems are designed, built and sold. Anduril’s... ...to our customers. System Deployment Engineers work in complex environments with... ...demonstrations and exercises Work with site reliability engineers to provide and refine requirements...
Full time
Temporary work
Work experience placement
Local area
Relocation package
Anduril Industries
Washington DC
more than 2 months ago
Senior System Integrator Engineer (TS/SCI)
...Vexterra Group is currently searching for a Senior Systems Integrator Engineer to provide the following systems support in Reston VA or Bethesda, MD office: You will work closely with other infrastructure and network engineers, system engineers and O&M team members...
Work at office
Remote work
Flexible hours
Vexterra Group
Bethesda, MD
2 days ago
Staff Site Reliability Engineer, Kubernetes w/ active TS/SCI
$188k - $235k
...TechOps) team, we live this mission by building the most reliable and performant systems on the planet. We empower organizations to do their... ...Role We are looking for an experienced Senior Site Reliability Engineer (SRE) who thrives on the challenge of managing large-...
Permanent employment
Full time
Work at office
Local area
Flexible hours
Okta
Washington DC
more than 2 months ago
Staff Site Reliability Engineer - Observability
$194k - $267k
...to you. We are seeking a highly technical Observability Site Reliability Engineer with a specialty in Google Cloud, to own and expand our Observability... ...of agents and collectors across complex distributed systems. Key Responsibilities Automated Infrastructure:...
Permanent employment
Full time
Work at office
Local area
Flexible hours
Okta
Washington DC
more than 2 months ago
Staff Site Reliability Engineer - Kubernetes
$194k - $267k
...educate on new concepts and tools. Position Overview: The Site Reliability Engineer (SRE) will play a key role in building and managing... ...Troubleshooting: Respond to incidents, troubleshoot, and resolve system issues related to performance, availability, and security...
Permanent employment
Work at office
Local area
Worldwide
Flexible hours
Okta
Washington DC
more than 2 months ago
Senior Manager, Site Reliability Engineering (Federal)
$207k - $284.9k
...is an opportunity to do career-defining work. We're all in on this mission. If you are too, let's talk. Senior Manager, Site Reliability Engineering District of Columbia Area Secure Every Identity, from AI to Human Identity is the key to unlocking the potential...
Permanent employment
Local area
Worldwide
Flexible hours
Okta
Washington DC
8 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Systems Engineer - Site Reliability Engineering. Be the first to apply!