Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior ML Infrastructure Engineer: Scale GPU‑Driven AI

$172.5k - $313.7k

Centaur Labs

Software Engineering About Salesforce Salesforce is the #1 AI CRM, where humans with agents drive customer success together. Here, ambition meets action. Tech meets trust. And innovation isn’t a buzzword — it’s a way of life. The world of work as we know it is changing and we’re looking for Trailblazers who are passionate about bettering business and the world through AI, driving innovation, and keeping Salesforce’s core values at the heart of it all. Ready to level‑up your career at the company leading workforce transformation in the agentic era? You’re in the right place! Agentforce is the future of AI, and you are the future of Salesforce. About Slack AI Slack AI’s mission is to transform how people work by making Slack an AI‑powered operating system. We’re tackling significant challenges like unlocking collective knowledge and reducing noise, all while building a seamless, consumer‑grade AI experience within users’ existing workflows. Join us in shaping the future of work through AI. About the Team The AI and ML Infrastructure team is part of Slack’s Core Infrastructure organization and is responsible for the foundational systems that enable machine learning and AI across the company. The team designs, builds, and operates reliable, scalable, and high‑performance platforms that allow product and ML teams to develop, deploy, and operate AI‑driven capabilities with confidence. The team owns shared infrastructure, services, and tooling that support the full ML lifecycle, including model training, deployment, inference, and monitoring. As Slack AI continues to grow, the team is evolving from traditional ML deployments toward large scale, highly distributed systems. This work involves deep architectural decisions around scalable model deployment strategies, real‑time feature serving at very high throughput, GPU‑accelerated inference at message scale, and responsible training of models on sensitive data with strong privacy and safety requirements. Core Focus Areas ML Infrastructure – The ML Infrastructure focus area is responsible for the low level systems that power training and inference at scale. This includes architecting and maintaining distributed systems for model training, serving, and deployment using Kubernetes‑based platforms, GPU infrastructure, and open source ML stacks such as KubeRay and vLLM. The team delivers platform capabilities that improve the speed, reliability, and quality of ML development, including training pipelines, feature generation systems, and compute orchestration. AI Platform – The AI Platform focus area builds the tooling and platform layers that enable AI development across Slack. This includes creating developer‑facing tools, SDKs, and workflows that allow product teams to integrate AI into Slack features efficiently and safely. The platform supports LLM efficiency and model transition initiatives through integrations with managed services across multiple cloud providers acting as the connective layer between core infrastructure and product engineering teams. About the Role We are looking for a Senior or Staff Software Engineer to join the ML Infrastructure focus area and help architect and operate the core systems that power AI at Slack. In this role, you will own foundational infrastructure for large‑scale model training and inference, and evolve it into a reliable, secure, and self‑service platform used across the company. You will work at the intersection of distributed systems, GPU infrastructure, and modern ML stacks, solving complex scalability and reliability challenges. This role blends deep systems engineering with a strong understanding of the ML lifecycle, and plays a critical part in shaping the long‑term technical foundations of Slack’s AI capabilities. What You Will Be Doing Design, build, and operate systems to train, serve, and deploy machine learning models at scale, with a focus on reliability, performance, and operational simplicity. Evolve GPU‑backed inference infrastructure to support high throughput, latency‑sensitive workloads, including large‑scale model serving. Architect and optimize distributed training and data processing systems using platforms such as Ray, Airflow, Spark, or similar technologies. Build and maintain Kubernetes‑based platforms and orchestration layers using tools such as KubeRay, vLLM, and internally developed services. Architect solutions that bridge legacy systems with modern technologies while maintaining monolithic application stability. Develop robust monitoring, observability, and alerting for production ML workloads to ensure operational excellence. Partner closely with AI Platform, ML modeling, security, and product engineering teams to design infrastructure that supports evolving AI use cases. Provide technical leadership through design reviews, mentorship, and by setting engineering standards and long‑term architectural direction for ML infrastructure. Author technical design and architecture documentation, and contribute thought leadership through engineering blog posts. What You Should Have Significant professional experience in software engineering with a strong focus on infrastructure, backend systems, platform engineering, or MLOps. Deep experience building and operating distributed systems, including expert‑level knowledge of Kubernetes and container‑based platforms. Hands‑on experience with modern ML infrastructure and serving stacks such as Ray or KubeRay, vLLM, or similar training and inference orchestration frameworks. Experience working with GPU infrastructure, including performance optimization and operational management at scale. Strong experience with data infrastructure and orchestration technologies such as Airflow, Spark, or similar systems. Experience building and operating cloud‑native systems on public cloud platforms such as AWS, GCP, or Azure, including infrastructure as code. A demonstrated ability to drive technical direction for complex systems and balance short‑term delivery with long‑term architectural goals. Excellent written communication, as well as the ability to thrive in an asynchronous and globally distributed infrastructure team. A related technical degree required. Compensation and Benefits In the United States, compensation offered will be determined by factors such as location, job level, job‑related knowledge, skills, and experience. Certain roles may be eligible for incentive compensation, equity, and benefits. Salesforce offers a variety of benefits to help you live well including: time‑off programs, medical, dental, vision, mental health support, paid parental leave, life and disability insurance, 401(k), and an employee stock‑purchasing program. The typical base salary range for this position is $172,500 - $313,700 annually. The range represents base salary only, and does not include company bonus, incentive for sales roles, equity or benefits, as applicable. Accommodations If you require assistance due to a disability applying for open positions please submit a request via the Accommodations Request Form. Posting Statement Salesforce is an equal‑opportunity employer and maintains a policy of non‑discrimination with all employees and applicants for employment. We believe in equality for all and create a workplace that’s inclusive and free from discrimination. Any employee or potential employee will be assessed on the basis of merit, competence and qualifications – without regard to race, religion, color, national origin, sex, sexual orientation, gender expression or identity, transgender status, age, disability, veteran or marital status, political viewpoint, or other classifications protected by law. This policy applies to current and prospective employees, no matter where they are in their Salesforce employment journey. It also applies to recruiting, hiring, job assignment, compensation, promotion, benefits, training, assessment of job performance, discipline, termination, and everything in between. Recruiting, hiring, and promotion decisions at Salesforce are fair and based on merit. The same goes for compensation, benefits, promotions, transfers, reduction in workforce, recall, training, and education. #J-18808-Ljbffr Centaur Labs

Vacancy posted 10 hours ago
Similar jobs that could be interesting for youBased on the Senior ML Infrastructure Engineer: Scale GPU‑Driven AI in Austin, TX vacancy
  • $155.42k - $395.9k

     ...the Team: The ML Inference Platform...  ...of the AV ML Infrastructure organization. Our...  ...powers GM’s AI efforts. We’re...  ...groups building AI-driven products for GM...  ...to maximizing GPU utilization...  ...are seeking a Senior ML Infrastructure engineer to help build and scale robust platforms... 
    Senior
    Remote work
    Relocation
    Relocation package
    Flexible hours

    General Motors

    Austin, TX
    7 days ago
  • ESP Engineered is seeking a Senior Customer Success Engineer to bridge technical expertise and customer outcomes. This role involves architecting self-service documentation, optimizing AI agent architectures, and managing technical escalations during customer onboarding... 
    Senior

    ESP Engineered

    Austin, TX
    19 hours ago
  • A leading automotive company seeks a Senior ML Infrastructure Engineer in Austin, Texas, to design and implement backend software for ML inference workflows. The engineer will collaborate with ML engineers to ensure efficient model serving and lead technical decisions on... 
    Senior
    Remote work

    General Motors

    Austin, TX
    2 days ago
  •  ...Consulting Member of Technical Staff for its AI Infrastructure team in Austin, Texas. This role focuses on building high-performance GPU platforms and overseeing the software...  .../MS in Computer Science, 6+ years in large-scale systems, and proficiency in programming languages... 
    Senior
    Flexible hours

    Oracle

    Austin, TX
    19 hours ago
  •  ...provider in Austin, Texas, is seeking a Senior Software Engineer to join the AI Agentic Platform Team. You will be...  ...scalable and secure agent infrastructures that enhance merchant interactions...  ...position requires a passion for AI-driven solutions and a commitment to continuous... 
    Senior

    BigCommerce

    Austin, TX
    2 days ago
  •  ...About Autonomize AI Autonomize AI is revolutionizing...  ...looking for bold, driven teammates to join us....  ...As a Senior Machine Learning Engineer at Autonomize, you will...  ...including data scientists,ml engineers, healthcare...  ...Experience in efficiently scaling ML model training and... 
    Senior
    Remote work

    Autonomize Inc

    Austin, TX
    19 hours ago
  • $184k - $287.5k

     ...unlimited potential of AI to define the next era of...  ...computing. An era in which our GPU acts as the brains of...  ...an outstanding Software Engineer to join our US-based...  ...a decision. ~ Result driven and comfortable multitasking...  ...performance tuning at scale. Experience building... 
    Senior
    Shift work

    NVIDIA

    Austin, TX
    3 days ago
  • $144.6k - $198.8k

    Document Crunch, Inc. in Austin, TX is seeking a senior engineer to lead the architectural design of an AI-driven platform. You will collaborate with cross-functional...  ...candidate has expertise in cloud architecture, AI/ML systems, and a proven track record in system... 
    Senior

    Document Crunch, Inc.

    Austin, TX
    2 days ago
  • bfiverecruiting is seeking a Senior Backend Engineer in Austin, Texas, to design, build, and support scalable backend systems for data-driven software. You will improve backend services, work...  ...data flows, and integrate with modern AI functionalities. Ideal candidates will... 
    Senior

    bfiverecruiting

    Austin, TX
    19 hours ago
  • $128.7k - $261.3k

     ...mobility. For the AI Kernels & Compilers...  ...on real vehicles at scale. We pioneer new approaches...  ..., and performance engineering so that every cycle...  ..., systems, and GPU engineers who enjoy...  .... The Role As a Senior Compiler Engineer on...  ...and effortless for ML engineers across the... 
    Senior
    Local area
    Flexible hours

    General Motors

    Austin, TX
    19 hours ago
  •  ...is looking for an experienced Systems Engineer to support large-scale technology transformations. The role focuses...  ...systems and involves working with AI-assisted workflows and cloud-native...  ...DevOps practices, microservices, and AI/ML implementation. You will design and... 
    Senior

    Compunnel, Inc.

    Austin, TX
    19 hours ago
  • $218.8k - $335.3k

     ...in Austin is seeking an experienced Staff AI/ML Engineer to join the AV ML Infra team. The role involves...  ...designing and implementing scalable ML infrastructure solutions and requires over 8 years of experience in large-scale distributed systems. Candidates should be proficient... 

    General Motors

    Austin, TX
    3 days ago
  • $90 - $100 per hour

     ...Eliassen Group is seeking a Senior AI/ML Engineer to design cloud-native machine learning solutions on AWS. The role involves LLM orchestration, building predictive models, and working with multi-agent systems. Applicants must have experience in AI engineering and a strong... 
    Senior
    Hourly pay
    Contract work
    Remote work

    Eliassen Group

    Austin, TX
    14 hours ago
  •  ...are seeking a Senior Principal...  ...Development Engineer (IC5) to lead...  ...platforms supporting GPU‑ and...  ...enabling OCI’s AI superclusters...  ...performance, cluster scale, and workload...  ...for AI/ML workloads (e....  ..., or offload‑driven architectures...  ...NIC NPI for AI infrastructure, from early... 
    Senior
    Temporary work
    Flexible hours

    Ll Oefentherapie

    Austin, TX
    19 hours ago
  •  ...highest performance scale-out networking solutions for AI and HPC datacenters....  ...maximize the efficiency of GPU, CPU and accelerator...  ...team of architects, engineers, and business...  ...talented and experienced Senior Software Engineer,...  ....Apply AI-driven techniques for automated... 
    Senior
    Full time
    Remote work
    Flexible hours

    Cornelis Networks

    Austin, TX
    2 days ago
  • $184k - $287.5k

    NVIDIA Corporation seeks a Machine Learning Compiler Engineer in Austin, Texas. This role requires deep expertise in compilers and machine...  .... Join a diverse environment and impact cutting-edge fields like AI and autonomous systems. #J-18808-Ljbffr NVIDIA Corporation
    Senior

    NVIDIA Corporation

    Austin, TX
    3 days ago
  •  ...Senior Principal Network Engineer Austin, Texas, United States Graphcore...  ...software and systems infrastructure that will unlock the next generation of AI breakthroughs and...  ...Architecture Lead to design and scale high‑performance...  ...fabrics supporting GPU clusters. You will... 
    Senior

    Graphcore

    Austin, TX
    2 days ago
  • $128.7k - $261.3k

     ...mission is two-fold: build the ML deployment platform that...  ...currently performed manually by engineers. Build the developer experience...  ...production platform or infrastructure systems where reliability, observability...  ...Familiarity with the NVIDIA GPU stack at the integration... 
    Senior
    Local area
    Remote work
    Work from home
    Relocation package
    Flexible hours
    Shift work

    General Motors

    Austin, TX
    4 days ago
  •  ...EDB provides a data and AI platform that enables organizations...  ...risk, manage costs and scale efficiently for a data and...  ...technology companies. EDB’s data-driven solutions enable customers...  ...looking for a confident Senior IT Infrastructure Engineer who has demonstrated... 
    Senior
    Remote work

    EDB

    Austin, TX
    19 hours ago
  • $150k - $300k

    A leading insurance provider is seeking a Senior Staff Machine Learning Engineer in Austin, TX, to drive the strategy and architecture of ML systems. This hands-on technical role involves building and integrating AI capabilities that enhance customer experiences. Candidates... 
    Senior

    GEICO

    Austin, TX
    19 hours ago
  •  ...seeking a Cloud MLOps Engineer to build and operate the cloud infrastructure that powers...  ...intersection of ML research,...  ...deploy models at scale, while ensuring telemetry-driven insights flow from...  ...Experience supporting GPU workloads in...  ...autonomy, or embodied AI environments... 

    Insight Global

    Austin, TX
    19 hours ago
  • $152k - $241.5k

     ...experienced Compiler Infrastructure Engineer to join our...  ...developer productivity at scale. This role sits at...  ...its mark on every GPU NVIDIA produces....  ..., and community‑driven compiler technology...  ...(including AI-assisted workflows...  ...experience applying AI or ML-based tools to... 
    Senior

    NVIDIA Corporation

    Austin, TX
    2 days ago
  • $124k - $210k

     ...Senior AI/ML Engineer, epocrates Join us as we work to create a thriving...  ...production-ready systems that can scale reliably and operate with...  ...and production-grade infrastructure that support high-volume workloads...  .... We unite as mission-driven problem-solvers with a deep... 
    Senior
    Full time
    Temporary work
    Work at office

    athenahealth

    Austin, TX
    1 day ago
  • $188k - $250k

     ...build, and productionize large-scale NLP and LLM systems that...  ...and LLM systems that analyze AI Answering engine outputs and public web content...  ...customer problems into measurable ML deliverables and ship...  ...compensation approach is data-driven, market-informed, and built to... 
    Senior
    Local area

    Meltwater

    Austin, TX
    1 day ago
  • Job Description As a Senior Core Infrastructure Engineer, you will own the software design...  ...be a rock-solid developer, driven problem solver and have experience supporting large scale data planes. You should be...  ...forward in the use of agentic AI development it is expected... 
    Senior
    Temporary work
    Flexible hours

    Ll Oefentherapie

    Austin, TX
    1 day ago
  •  ...Senior Machine Learning Engineer We are seeking a Senior Machine...  ...production ready AI systems for secure...  ...government cloud infrastructure to edge devices...  ...Engineer adaptive ML systems using LoRA...  ...into large scale distributed systems...  ...understanding of GPU computing, CUDA,... 
    Senior
    Live out
    Work at office
    Flexible hours

    webAI

    Austin, TX
    3 days ago
  •  ...global technology services company is seeking a Senior Backend / Product Engineer to help build and scale its AI-powered cost intelligence platform. This position...  ...development, focusing on core capabilities and AI-driven insights rather than client work. Ideal candidates... 
    Senior
    Remote work

    Virtasant

    Austin, TX
    2 days ago
  • $75 - $120 per hour

     ...Senior ML Engineer - Deployment and Databricks MLOps Location: Austin,...  ...hardening Databricks-based MLOps infrastructure and promoting models into...  ...model promotion using Git-driven development practices. Establish...  ...you agree to receive calls, AI-generated calls, text... 
    Senior
    Contract work

    Apex Systems

    Austin, TX
    4 days ago
  • $152k - $228k

     ...Description Job Description Senior ML Engineer About Invoca Invoca is an AI-powered revenue execution platform...  ...'ll be a primary driver of the infrastructure powering our Context Engine and...  ..., Baseten, and Kubernetes-based GPU infrastructure. Profile and tune... 
    Senior
    Currently hiring
    Remote work
    Flexible hours

    Invoca

    Austin, TX
    4 days ago
  • MontyCloud, located in Austin, Texas, is seeking a Senior Customer Success Engineer to bridge technical expertise and customer outcomes. You will be...  ...responsible for architecting technical self-service and optimizing AI agent deployments, enhancing customer support efficiency.... 
    Senior

    MontyCloud

    Austin, TX
    19 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior ML Infrastructure Engineer: Scale GPU‑Driven AI. Be the first to apply!