Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Member of Technical Staff

xAI

About xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands‑on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All employees are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates. About the Role We are seeking a highly skilled Member of Technical Staff to join our team in managing and enhancing reliability across a multi‑data‑center environment. This role focuses on automating processes, building and implementing robust observability solutions, and ensuring seamless operations for mission‑critical AI infrastructure. The ideal candidate will combine strong coding abilities with hands‑on data‑center experience to build scalable reliability services, optimize system performance, and minimize downtime—including close partnership with facility operations to address physical infrastructure impacts. If you thrive in lightning‑fast, distributed environments and are passionate about leveraging automation to drive efficiency, this is an opportunity to make a significant impact on our infrastructure’s resilience and scalability. In an era where AI workloads demand near‑zero downtime, this position plays a pivotal role in bridging software engineering principles with physical data‑center realities. By prioritizing automation and observability, team members in this role can reduce mean time to recovery (MTTR) by up to 50% through proactive monitoring and automated remediation, based on industry benchmarks from high‑scale environments like those at hyperscale cloud providers. The primary objective of this team is to mitigate downtime and minimize impact to end‑users from both scheduled and unscheduled maintenance, as well as events affecting onsite data centers. This is achieved through proactive automation, robust observability, and integrated software‑physical reliability strategies, ensuring our AI infrastructure remains resilient, scalable, and at the cutting edge of innovation. Responsibilities Design, develop, and deploy scalable code and services (primarily in Python and Rust, with flexibility for emerging languages) to automate reliability workflows, including monitoring, alerting, incident response, and infrastructure provisioning. We value adaptability to new tools and paradigms in the fast‑evolving AI space. Implement and maintain observability tools and practices, such as metrics collection, logging, tracing, and dashboards, to provide real‑time insights into system health across multiple data centers—open to innovative stacks beyond traditional ones like ELK. Collaborate with cross‑functional teams—including software development, network engineering, site operations, and facility operations (critical facilities, mechanical/electrical teams, and data center infrastructure management)—to identify reliability bottlenecks, automate solutions for fault tolerance, disaster recovery, capacity planning, and physical/environmental risk mitigation (e.g., power redundancy, cooling efficiency, and environmental monitoring integration). This role encourages broad skill sets from diverse technical backgrounds to foster innovation. Troubleshoot and resolve complex issues in data center environments, including hardware failures, environmental anomalies, software bugs, and network‑related problems, while adhering to reliability principles like error budgets and SLAs. Key Insight: By applying SWE rigor to troubleshooting, team members can create reusable diagnostic tools that accelerate resolution, turning unscheduled events (e.g., hardware faults) into opportunities for system hardening and reducing overall end‑user impact through targeted SLAs that prioritize critical AI services. We seek versatile problem‑solvers who adapt to bleeding‑edge challenges. Optimize Linux‑based systems for performance, security, and reliability, including kernel tuning, container orchestration (e.g., Kubernetes or emerging alternatives), and scripting for automation. Understand network topologies and concepts in large‑scale, multi‑data‑center environments to effectively troubleshoot connectivity, routing, redundancy, and performance issues; integrate observability into data center interconnects and facility‑level controls for rapid diagnosis and automation. Key Insight: In multi‑site setups, network insights allow for automated failover mechanisms that handle both digital and physical disruptions, ensuring seamless continuity for end‑users during events like fiber cuts or power outages. This attracts candidates from varied networking and systems backgrounds to drive forward‑thinking solutions. Participate in on‑call rotations, post‑incident reviews (blameless postmortems), and continuous improvement initiatives to enhance overall site reliability, including joint exercises with facility teams for physical failover and recovery scenarios. We prioritize growth‑minded individuals who embrace evolving practices. Mentor junior team members and document processes to foster a culture of automation, knowledge sharing, and adaptability to new technologies. Required Qualifications Bachelor’s degree in Computer Science, Computer Engineering, Electrical Engineering, or a closely related technical field (or equivalent professional experience). 5+ years of hands‑on experience in site reliability engineering (SRE), infrastructure engineering, DevOps, or systems engineering, preferably supporting large‑scale, distributed, or production environments. Strong programming skills with proven production experience in Python (required for automation and tooling); experience with Rust or willingness to work in Rust is a plus, but strong coding fundamentals in at least one systems‑level language (e.g., Python, Go, C++) are essential. Solid experience with Linux systems administration, performance tuning, kernel‑level understanding, and scripting/automation in production environments. Practical knowledge of containerization and orchestration technologies, such as Docker and Kubernetes (or similar systems). Experience implementing observability solutions, including metrics, logging, tracing, monitoring tools (e.g., Prometheus, Grafana, or alternatives), alerting, and dashboards. Familiarity with troubleshooting complex issues in distributed systems, including software bugs, hardware failures, network problems, and environmental factors. Understanding of networking fundamentals (TCP/IP, routing, redundancy, DNS) in large‑scale or multi‑site environments. Experience participating in on‑call rotations, incident response, post‑incident reviews (blameless postmortems), and reliability practices such as error budgets or SLAs. Ability to collaborate effectively with cross‑functional teams (software engineers, network teams, site/facility operations, mechanical/electrical teams). Preferred Qualifications 7+ years of experience in SRE or infrastructure roles, ideally in hyperscale, cloud, or AI/ML training infrastructure environments with multi‑data‑center setups. Hands‑on experience operating or scaling Kubernetes clusters (or equivalent orchestration) at large scale, including automation for provisioning, lifecycle management, and high‑availability. Proficiency in Rust for systems programming and performance‑critical components. Direct experience integrating software reliability tools with physical data center infrastructure (e.g., power, cooling, environmental monitoring, facility controls) and automating responses to physical events. Exposure to advanced or innovative observability stacks beyond traditional tools (e.g., exploring cutting‑edge alternatives for metrics, logs, and tracing). Experience building automated remediation, fault tolerance, disaster recovery, capacity planning, or predictive failure detection systems. Background in optimizing Linux‑based systems for AI workloads, GPU clusters, or high‑throughput compute environments. Demonstrated success reducing downtime, MTTR, or improving resource efficiency (e.g., through automation or observability) in high‑stakes production settings. Prior work with bare‑metal provisioning, data center interconnects, or hybrid/multi‑site failover mechanisms. Mentoring experience, strong documentation skills, and a track record of fostering knowledge sharing and automation culture. Comfort with rapid technology adaptation in fast‑evolving domains like AI infrastructure. EEO Statement xAI is an equal opportunity employer. For details on data processing, view our Recruitment Privacy Notice. #J-18808-Ljbffr xAI

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Member of Technical Staff in Memphis, TN vacancy
  •  ...the Support Technician III is to Provides technical support to end-users for PC, server,...  ...change that could mean downtime for Executive Staff. • Work closely with our video...  ...current state of the system. • Assist other members of the team when appropriate to complete... 
    Suggested
    Local area
    Worldwide
    Flexible hours

    Jabil

    Memphis, TN
    3 days ago
  • $22 - $28 per hour

     ...Technical Support Specialist I The Technical Support Specialist I is responsible for providing technical assistance and support to end...  ...response deadlines Ability to work independently and as a member of a team Demonstrated experience and strong knowledge of computer... 
    Suggested
    Full time
    Work experience placement
    Work at office
    Monday to Friday

    Knipper Health

    Memphis, TN
    3 days ago
  • First South Financial Federal Credit Union in Memphis, TN is looking for a full-time Member Relationship Specialist. The role involves managing customer accounts, handling transactions, and supporting sales of products and services. Ideal candidates will possess a high... 
    Suggested
    Full time
    Work at office

    First South Financial Federal Credit Union

    Memphis, TN
    3 days ago
  •  ...Senior Production Support WMoS Technical Analyst Location: Memphis, TN, Byhalia, MS, Bethlehem, PA, Foothill Ranch, CA, Los Angeles,...  ...improvement Demonstrated ability to pull together diverse team members with different goals and impact outcomes across loosely coupled... 
    Suggested
    Work experience placement
    Worldwide

    Software Technology Inc

    Memphis, TN
    3 days ago
  •  ...between IT and users and have both business and technical expertise. Determines cause of application...  ...of other departmental business systems analyst staff; supports professional and technical capabilities of team members; guides business systems analyst staff in analyzing... 
    Suggested
    Full time
    Work experience placement
    Monday to Friday
    Flexible hours
    Afternoon shift

    Trustmark

    Memphis, TN
    1 day ago
  •  ...daily. Description Under supervision, provides support to front‑line customer service staff; including check‑in, club café, and customer service. Acts as a Club advocate for members, guests and support staff using proactive problem solving, multi‑tasking and excellent... 
    Part time
    Work at office
    Monday to Friday
    Shift work
    Weekend work

    Germantown Athletic Club

    Germantown, TN
    1 day ago
  • $17 - $25 per hour

    Sam's Club #8292 2150 Covington Pike Memphis, TN 38128-6907 Position Summary As a Member Frontline Services Associate, you will serve customers by operating the checkout system, resolving member concerns, and promoting Sam’s Club products and services. You will maintain... 
    Hourly pay
    Temporary work

    Walmart

    Memphis, TN
    3 days ago
  •  ...divh2Telemarketer - State Farm Agent Team Member/h2pAs a Telemarketer - State Farm Agent Team Member for Zach Jaworski - State Farm Agent, your creativity and strategy promote the continued growth of our agency. Your diversified marketing shapes our brands public image... 
    Hourly pay
    For contractors
    Work at office

    Zach Jaworski - State Farm Agent

    Memphis, TN
    1 day ago
  •  ...transactions. Handle balancing of ATM(s). Actively participate in needs-based sales program by offering products and services to members. Process loan applications from start to finish. This includes signing and funding approved loans as well as communicating... 
    Permanent employment
    Work experience placement
    Work at office
    Local area
    Night shift

    First South Financial

    Memphis, TN
    1 day ago
  •  ...Training & development ~401(k) ~ Bonus based on performance ROLE DESCRIPTION: As aTelemarketer - State Farm Agent Team Member for Zach Jaworski - State Farm Agent, your creativity and strategy promote the continued growth of our agency. Your diversified marketing... 

    Zach Jaworski - State Farm Agent

    Memphis, TN
    11 days ago
  •  ...) project that will focus on Dropship. This individual will be technical in nature but also responsible for functional responsibilities....  ...disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by... 

    Insight Global

    Memphis, TN
    6 days ago
  • $22 - $28 per hour

     ...Become one of our Contributors Join the Caretria Team! The Technical Support Specialist I is responsible for providing technical assistance...  ...response deadlines Ability to work independently and as a member of a team Demonstrated experience and strong knowledge of... 
    Full time
    Work experience placement
    Work at office
    Monday to Friday

    KnippeRx

    Memphis, TN
    2 days ago
  •  ...applications, and environments to ensure seamless operations. The Technical Support Specialist is expected to exercise sound judgment and...  ...IT standards and user requirements. Collaborate with IT team members to test and refine system images, ensuring consistent and... 
    Remote work

    ViziRecruiter,LLC.

    Memphis, TN
    2 days ago
  •  ...backup/DR functions as necessary for the site and work with other members of IT when needed to assist in system updates or new...  ...customer service; time management; identifying end user issues; technical troubleshooting; problem solving techniques; and report preparation... 
    Temporary work
    Worldwide
    Shift work

    Terumo Cardiovascular Group

    Southaven, MS
    1 day ago
  • $23.5 per hour

    Vaco is actively seeking a Technical Support Specialist for a Contract-to-Hire opportunity supporting a rapidly expanding distribution...  ...people of color, LGBTQ+ individuals, people with disabilities, members of ethnic minorities, foreign-born residents, and veterans to apply... 
    Hourly pay
    Contract work
    Work at office
    Local area
    Immediate start

    Vaco

    Memphis, TN
    3 days ago
  • $22 per hour

     ...Maintain accurate records of inventory, maintenance, and incidents Work closely with remote technical teams for hands-on support Communicate effectively with team members and stakeholders Compliance & Safety : Follow data center security protocols and safety guidelines... 
    Full time
    Remote work
    Monday to Friday
    Day shift

    InnoSource

    Memphis, TN
    3 days ago
  •  ...Job Overview: Search Solution Group is seeking an IT Technical Support Specialist on behalf of our client, a logistics and supply chain management organization that offers reverse logistics, finished goods operations, and repair services. This role is responsible for... 
    Work experience placement
    Weekend work

    Search Solution Group

    Olive Branch, MS
    1 day ago
  • $40 - $43 per hour

     ...comfortable partnering with researchers, operational leaders, and technical teams to improve service delivery and request outcomes. Key...  ...our 'Welcome Packet' as well, which an Everforth Apex team member can provide. Employee Type: Contract Location: Memphis,... 
    Hourly pay
    Contract work
    Remote work

    Stratacuity

    Memphis, TN
    3 days ago
  •  ...professional leadership. We are known for the care we take with our staff and clients. We are passionate about delivering the best...  .../enhancements and document business requirements. Serve as a member of a project team and/or work independently on projects. Support... 
    Full time
    Part time
    Work experience placement
    Work at office
    Remote work
    Work from home

    Work At Home Vintage Experts LLC

    Memphis, TN
    3 days ago
  •  ...Memphis and the Mid-South. The IAM Analyst II is an experienced member of the Identity and Access Management team that reports...  ...regulatory compliance requirements. The ideal candidate has a technical background and possesses 4-6 years of experience in technology... 
    Work experience placement
    Remote work

    Methodist Le Bonheur Healthcare

    Memphis, TN
    3 days ago
  •  ...providing quality support to users, employing a high degree of technical expertise, and timeliness. Troubleshoot problems and provide technical...  ...including system backups and recovery. Assist other IT Team members to monitor and test network performance and provide network... 
    For contractors
    Work experience placement
    Work at office

    PMC Group International, Inc.

    Memphis, TN
    4 days ago
  •  ...will partner with an existing senior team member to augment and enhance the current...  ...policies and regulatory requirements. Technical Analysis and Problem Resolution: Provides...  ...effectively in team environments, mentor junior staff, and present technical concepts to non-... 

    Mississippi Baptist Health Systems

    Memphis, TN
    3 days ago
  •  ...Real-time Payment) gateway implementation. Ability to provide technical roadmap to new enterprise-wide initiatives. Understand...  ...solution to different stake holders (Presentation & articulation skills) Mentoring skills – ability to coach members of the team.... 

    United IT

    Memphis, TN
    3 days ago
  •  ...access for integration users, including SFTP, SSH keys, and service accounts. ~ Configure and troubleshoot Oracle PaaS/SaaS technical components related to security, integrations, and environment performance. ~ Collaborate with IT, functional leads, and... 
    Work experience placement

    Baptist Memorial Healthcare Corporation

    Memphis, TN
    16 hours ago
  • $96.4k - $120.5k

     ...and dedication. Our goal is to cultivate a culture of belonging that encourages innovation, collaboration, and respect for all team members, ensuring that WWT remains a great place to work for All! If you have any questions or concerns about this posting, please email... 
    Full time
    Shift work

    World Wide Technology

    Southaven, MS
    2 days ago
  • Jobot Consulting is seeking an IT Support Specialist in Olive Branch, Mississippi. This role involves providing technical assistance for end users and troubleshooting hardware, software, and network-related issues. The ideal candidate will have a relevant degree or equivalent... 

    Jobot Consulting

    Olive Branch, MS
    2 days ago
  •  ...resources with over 60 branch locations across the U.S. working together to serve our customers. This growing network offers our team members constant opportunity for career growth and professional development. CULTURE – Barnhart has a strong team culture -- the “One... 
    Work experience placement

    Barnhart

    Memphis, TN
    5 days ago
  •  ...business processes are adequately documented. - Lead cooperative efforts among members of a project team. - Manage efficient execution of business meetings with internal project staff, client staff, and/or project vendors. - Act as advisor to project team members... 
    Minimum wage
    Contract work
    Temporary work
    Work experience placement

    MAXIMUS

    Memphis, TN
    2 days ago
  • $70k - $75k

    Vaco Recruiter Services is seeking an IT Systems Support Analyst in Memphis, TN, to provide day-to-day support while getting exposure to systems and infrastructure. The role involves troubleshooting user issues, managing accounts in Active Directory, and supporting system...
    Full time

    Vaco Recruiter Services

    Memphis, TN
    1 day ago
  • $120k - $140k

     ...configuration documentation, and test scripts. Act as a bridge between technical and non-technical teams, translating business requirements into...  ...communication skills to communicate with customers, team members, external data providers, and management. ~ Demonstrated... 
    Full time
    Local area
    Visa sponsorship
    Work visa

    Alphatec Spine

    Memphis, TN
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Member of Technical Staff. Be the first to apply!