Lead Site Reliability Engineer
GMAC Financial Services
Job Description
Innovation isn't just a talking point at GM Financial, it's how we operate. From generative AI and cloud-native technologies to peer-led learning and hackathons, our tech teams are building real solutions that make a difference. We're committed to AI-powered transformation, using advanced machine learning and automation to help us reimagine customer interactions and modernize operations, positioning GM Financial as a leader in digital innovation within a dynamic industry. Join us and discover a workplace where your ideas matter, your development is prioritized, and you can truly make a global impact.
Responsibilities
About the Role:
We are expanding our efforts into complementary data technologies for decision support in areas of ingesting and processing large data sets. Our interests are in enabling data science and search based applications on large and low latent data sets in both a batch and streaming context for processing. To that end, this role will incorporates aspects of software engineering and operations, combining SRE and DevOps skills to come up with efficient ways of managing and operating applications. The role will require a high level of responsibility and accountability to deliver technical solutions. The data sets we deal with support both off-line and in-line machine learning training and model execution. Other data sets support search engine based analytics. Exploration and deployment of technologies activities include identifying opportunities that impact business strategy, selecting data solutions software, and defining hardware requirements based on business requirements. Responsibility also includes documentation of procedures for deployment, monitoring, managing and switching the environments in production and disaster recovery sites. This role participates along with team counterparts to architect an end-to-end framework developed on a group of core data technologies.
- Manage/Administer/Deploy Kubernetes and Spark cluster environments, on bare-metal and container infrastructure, including service allocation and configuration for the cluster, capacity planning, performance tuning, and ongoing monitoring.
- Define and refine processes and procedures for the site reliability engineering practice.
- Setup, manage and maintain Kubernetes based scalable environments for high-availability and work with vendors for smooth and continuous operations.
- Work closely with data scientists, data architects, data engineers, ETL developers, cybersecurity, network, Linux, other IT counterparts, and business partners to design and setup the environments to manage the ingested and processed datasets from the external sources, internal systems, and the data warehouse to extract features of interest.
- Evaluate, research, experiment with data processing, management and scalability technologies in a lab to keep pace with industry innovation while assessing business impact and viability for use cases associated with efforts in hand.
- Design, setup, test, deploy, monitor, document, and troubleshoot data processing and associated automation issues from the operations perspective.
- Work with IT Operations and Information Security Operations with monitoring and troubleshooting of incidents to maintain service levels.
- Work with Information Security Vulnerability Management and vendors to remediate known impacting vulnerabilities.
- Contribute to the evolving distributed systems architecture to meet changing requirements for scaling, reliability, performance, manageability, and cost.
- Report utilization and performance metrics to user communities.
- Contribute to planning and implementation of new/upgraded hardware and software releases.
- Responsible for monitoring the Linux, Kubernetes, Object Storage(MinIO), Feature Store, and Spark.
- Research and recommend innovative, and where possible, automated approaches for administration tasks.
- Identify approaches to efficiencies in resource utilization, provide economies of scale, and simplify support issues.
- Responsible for administration of Machine Learning platforms & Operations (MLOps) Such as Kubeflow/Jupyterhub/Python.
- This role will support GMF international operations and will closely align with our GMF IT NorthStar architecture and operating Principles.
Qualifications
What Makes You an Ideal Candidate?
- Excellent knowledge of Kubernetes Administration, Deployments & Upgrades.
- Excellent Knowledge on Apache Spark administration on various platforms.
- Strong working knowledge of Object Store(MinIO) and Spark cluster security, networking connectivity and IO throughput along with other factors that affect distributed system performance.
- Strong working knowledge of disaster recovery, incident management, and security best practices.
- Working knowledge of containers (e.g., docker) and major orchestrators (e.g., Mesos, Kubernetes, Docker Datacenter).
- Working knowledge of software defined networking.
- Working knowledge of hardening Data at Rest with key based encryption technologies.
- Working knowledge of setting up and customize interactive data analytics tools (e.g., Apache Zeppelin, Jupyter notebooks).
- Excellent knowledge on building the docker images to provide Containers-as-a-service.
- Working knowledge on Azure Administration, Azure DevOps & Azure Kubernetes Service (AKS).
- Working knowledge of Pipeline Automation: Azure DevOps (YAML, ARM), Terraform, Jenkins, Chef/Puppet, Ansible.
- Working knowledge of CICD methodologies like Artifactory/Git/Gitops/Jenkins.
- Working knowledge of Code Scanning tools: SonarQube, Checkmarx/Blackduck/Twistlock.
- Working knowledge of Object Storage like S3/MinIO, Bucket policies and administration.
- Working knowledge of Kubernetes Storage protocols.
- Experienced with networking infrastructure including VLAN and firewalls.
- Working knowledge of hardening Kubernetes clusters with network policies like Calico/Tigera, service meshes like Istio, Internal & external load balancers.
- Proven track record with Red Hat Enterprise Linux & Kubernetes administration.
- Proficiency in a high-level language like Python, Go, Ruby and/or Java.
- Solid experience in High Availability and distributed systems, Linux, Data and SAN Storage Networks, NAS and Networking, leveraging tools to instrument and automate proactively and eventually predictive availability solutions.
- Proven track record leading complex enterprise production support efforts adhering to a mix of DevOps & SRE frameworks.
- Experience transitioning platforms to the cloud, with knowledge of cloud frameworks & design patterns, micro-service architectures
- Extensive Knowledge of networking, including DNS, DHCP, firewalls, load balancers and IP routing.
- Experience in Monitoring tools - Splunk, Zenoss, Elastic, Appdynamics, Dynatrace, Grafana, Promotheus, Kiali etc.
- Ability to grasp difficult concepts, large architectures, and sophisticated designs quickly and troubleshoot with debugging skills across a variety of integrated platforms.
- Proven capability to provide operational visibility on environment health to Senior Leadership, Technology and Business partners.
- Receptive, approachable teammate, with the ability to positively interact with business partners, technology teams, offshore, and professional services.
- Strong customer advocate with excellent written and verbal communication skills.
Additional Knowledge and Skills
Working effectively within an AI enabled environment:
- Ability to use AI tools (e.g., Microsoft Copilot) to support daily work.
- Skills in evaluating AI outputs for accuracy, compliance, and bias.
- Experience integrating AI into workflows to improve efficiency or insights.
- Familiarity with AI assisted research, summarization, and content generation.
- Understanding of responsible AI use, including ethics and data protection.
Education and Experience:
- 5-7 years of hands-on experience with supporting Linux production environments required.
- 5-7 years of hands-on administration experience on Spark required.
- 3-5 years hands-on experience with scripting with bash, perl, ruby, or python required.
- 3-5 years experience with Docker Datacenter required.
- 2-4 years of hands-on administration experience on Machine learning platforms required.
- Minimum of 1 year of experience in Mesos, Kubernetes, OpenShift and/or Deis or other such container/platform-as-a-service orchestrator required.
- Minimum of 1 year of hands-on experience on CICD tools & Technologies required.
- Minimum of 1 year of lead experience of site reliability engineering team required.
- Hands-on experience in cloud technologies with Microsoft Azure required.
- High School Diploma or equivalent required.
- Bachelor's Degree in related field or equivalent experience required.
- Master's Degree preferred.
What We
- ...cloud-native platforms to advanced release engineering practices, our teams are redefining how... ...accelerate development and improve reliability. Your work will directly influence how... ...Scrum teams with demonstrated success leading improvements (getting better/faster/happier...SuggestedFull timeWork at officeRemote workFlexible hours2 days per week
- ...incorporates aspects of software engineering and operations, combining SRE... ...and disaster recovery sites. This role participates along... ...and procedures for the site reliability engineering practice. Setup... ...solutions. Proven track record leading complex enterprise production...SuggestedWork at officeFlexible hours2 days per week
- LH Arlington Operating Company, LLC in Arlington, TX is looking for an experienced Housekeeping Manager. This role involves overseeing the housekeeping team to maintain high cleanliness standards and guest satisfaction. Candidates should have a strong background in hotel...Suggested
- ...Senior / Lead .NET Software Engineer / SRE DETAILS Location : Arlington, TX 76014 (hybrid onsite 2-days per week) Openings... ...financial services platforms, making them more scalable, reliable, automated, and cloud-native. The .NET Engineer / SRE will...SuggestedHourly payWork at officeLocal area2 days per week
- ...Storeroom Lead Poly-America, L.P. is currently hiring Storeroom lead to join our team. Poly-America produces several lines of polyethylene products including high quality trash bags, construction films, and geomembrane liners. Poly-America is also the most technologically...SuggestedCurrently hiringNight shiftDay shift
- Lockheed Martin is seeking a highly skilled AI Engineer to lead the development of innovative AI solutions that drive business value. You’ll collaborate with cross-functional teams, deploy AI technologies, and apply advanced machine learning to real-world problems. This...Remote jobFlexible hours
- ...Overview: As a Security Lead you will have the opportunity to oversee your daily team, while assisting with daily operations on a new level! Responsibilities: Guests choose Six Flags Over Texas to have a fun filled day with their families and friends, with safety...Flexible hours
- ...responsible, light-hearted individuals with strong customer service skills. A strong team environment is crucial for our business and we need site managers who will embrace and promote that type of workplace. Our Management Team is trained to learn every nuance of the business,...Night shiftWeekend workWeekday work
- A technology solutions provider is seeking a Senior Developer to join the Enterprise Risk Management Technologies Team. In this role, the successful candidate will be responsible for configuring and integrating ServiceNow solutions to meet specific business needs. Key ...
$18 - $18.5 per hour
...Custodial Lead Position SBM Management is currently looking to hire a Custodial Lead to join their team! The Custodial Lead has responsibilities for overseeing activities within the assigned program. This includes the company employees and other temporary employees...Hourly payTemporary workCurrently hiringImmediate startMonday to FridayShift work$18.5 - $26.45 per hour
...Lead, Pharmacy Technician We're unique. You should be, too. We're changing lives every day. For both our patients and our team members. Are you innovative and entrepreneurial minded? Is your work ethic and ambition off the charts? Do you inspire others with your...Hourly payFull timeWork experience placementWork at officeFlexible hoursWeekend workAfternoon shift$21 - $29 per hour
...explanation, demonstration, action plans, and ethical issue resolution. Lead and participate in teams, share resources, determine customer... ...customers and members, deliver results, make decisions based on reliable information, balance short and long‑term priorities, and...Hourly payMinimum wageFull timeTemporary workPart time$19.5 - $32.5 per hour
...Available shifts: Location Walmart Supercenter #1801 4801 S COOPER ST, ARLINGTON, TX, 76017, US Job Overview AP Team Lead Benefits & perks At Walmart, we offer competitive pay as well as performance-based incentive awards and other great benefits for...Hourly payMinimum wageFull timeTemporary workPart timeShift work$17 per hour
...The opportunity Delaware North Sportservice is hiring seasonal Concessions Leads to join our team at Globe Life Field in Arlington, Texas. As Concessions Lead, you will supervise assigned concessions stand(s) and lead team members in delivering excellent guest service....Weekly payFull timePart timeSeasonal workFlexible hoursShift workWeekend workAfternoon shift$28 - $30 per hour
...JOB DESCRIPTION: A growing property management firm is seeking a Maintenance Lead to oversee an apartment complex Village at Johnson Creek located in Arlington , TX . This position will be fully accountable for the day-to-day operations and upkeep. WHAT...Hourly payWeekend workAfternoon shift- ...Catering Lead At Panera, our people come first. If you’re looking for a place where you can grow, feel supported, be yourself, enjoy... ...tips Free on-shift meals & unlimited fountain beverages Flexible & reliable scheduling Paid vacation, sick time, and holidays for full-time...Full timeLocal areaFlexible hoursShift workNight shift
- ...Store Shift Lead | Murphy Oil USA As one of the largest national gasoline and convenience retailers with more than 1,700 stores in... ...months of Cashier experience Must have valid driver's license and reliable transportation Must be able to perform repeated bending,...Daily paidPart timeLocal areaImmediate startShift work
$25 - $50 per hour
...Role Overview TSA is accepting applications for Lead and Supervisory Transportation Security Officers at airports in Grand Prairie. These roles are ideal for individuals looking to step into leadership positions within airport security operations. TSA provides training...Shift workNight shiftWeekend work$12 - $24.33 per hour
...Position Overview The Team Lead works closely with the Department Manager(s) or Store Manager to receive, price, and stock merchandise to meet the needs of the store's customers and drive sales and profits. Assist Team Members with completing the work within the...Part timeFlexible hoursAfternoon shift- ...management is not present. Reports disciplinary issues and customer complaints to management. Job ID: 1783181BR Title: Shift Lead Company Indicator: Walgreens Employment Type: Full-time Job Function: Retail Full Store Address: 2410 BALLPARK WAY,...Hourly payFull timeWork experience placementSeasonal workWork at officeLocal areaFlexible hoursShift workAfternoon shift
- ...Shift Lead You support the Restaurant General Manager (RGM) by running great work shifts and meeting Taco Bell standards. You take ownership and responsibility to solve problems, seek help when needed and are willing to help and guide others. Key responsibilities include...Shift work
- ...Lead Pharmacist Show All Jobs Apply Show Map Location 1525 Greenview drive, Grand Prairie, TX, 75050, United States Job Category #pharmacist, #lead Industry #longtermcare Employee Type Full Time Contact information Name Georgina Edmund Email...Full timeLocal areaShift work
$13 - $17 per hour
...of a team that treats every customer like family. As a Shift Lead, you'll play a key role in ensuring the store operates smoothly,... ...fast-paced situations with ease. A food handler's permit and reliable transportation. Regular, predictable attendance and the ability...Flexible hoursShift work- ...Summary: The Shift Lead supports the Restaurant General Manager and Store Assistant General Manager in their efforts to oversee all the restaurant operations. The Shift Lead assists in management activities including ensuring excellence in both product quality and customer...Work at officeAll shiftsFlexible hoursShift work
$14.25 - $16.2 per hour
...~ Medical, dental, and vision benefits And much, much more! This role is vital to the guest experience because you'll: Lead the experience: Check in with guests and make sure they are enjoying themselves Be the solution: Handle guest concerns and provide...Hourly payLocal areaFlexible hoursShift work- ...pets, becoming their trusted partner in a pet's lifelong health. Practice Your Best Medicine: From diagnosis to treatment, you'll lead patient care with the freedom to uphold the highest standards. Educate and Empower: Clearly communicate findings and treatment...Temporary workLocal area
- ...associates- Encourage, monitor and assist new techs through the technician training program- Ensure execution of department standards by leading by example and delegating as necessary- Serve as the primary representative for store-wide meetings/huddles- Help create and manage...Hourly payLocal area
- ...Location: 1620 East Copeland Road, Arlington, TX, 76011, United States Required Degree: None The Shift Lead is responsible for overseeing our team members on the floor throughout shifts. In addition, they support employees and the Management team to ensure that guests...Local areaShift work
$11 per hour
Description:This job posting is for a position in a restaurant that is owned and operated by Grissett Enterprises. At Grissett Enterprises we care about our team. We provide everyone with an opportunity to learn, grow and succeed everyday. If you're looking for a full-...Full timePart time- ...timeposted on: Posted Todayjob requisition id: JR103348## As a Shift Lead, you’ll help ensure that every shift runs smoothly by supporting... ...’ll engage with customers, assist with daily operations, uphold site standards, and step in to lead the team when management is not on...Shift work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Lead Site Reliability Engineer. Be the first to apply!
- on-site clinical research associate (traveling/remote) Arlington, TX
- junior website developer Arlington, TX
- IT site lead Arlington, TX
- site leader Arlington, TX
- site safety Arlington, TX
- on site coordinator Arlington, TX
- site services specialist Arlington, TX
- website coordinator Arlington, TX
- lead support engineer
- lead ios engineer

