Senior ML Infrastructure Engineer: Scale GPU‑Driven AI

$172.5k - $313.7k

Centaur Labs

Software Engineering About Salesforce Salesforce is the #1 AI CRM, where humans with agents drive customer success together. Here, ambition meets action. Tech meets trust. And innovation isn’t a buzzword — it’s a way of life. The world of work as we know it is changing and we’re looking for Trailblazers who are passionate about bettering business and the world through AI, driving innovation, and keeping Salesforce’s core values at the heart of it all. Ready to level‑up your career at the company leading workforce transformation in the agentic era? You’re in the right place! Agentforce is the future of AI, and you are the future of Salesforce. About Slack AI Slack AI’s mission is to transform how people work by making Slack an AI‑powered operating system. We’re tackling significant challenges like unlocking collective knowledge and reducing noise, all while building a seamless, consumer‑grade AI experience within users’ existing workflows. Join us in shaping the future of work through AI. About the Team The AI and ML Infrastructure team is part of Slack’s Core Infrastructure organization and is responsible for the foundational systems that enable machine learning and AI across the company. The team designs, builds, and operates reliable, scalable, and high‑performance platforms that allow product and ML teams to develop, deploy, and operate AI‑driven capabilities with confidence. The team owns shared infrastructure, services, and tooling that support the full ML lifecycle, including model training, deployment, inference, and monitoring. As Slack AI continues to grow, the team is evolving from traditional ML deployments toward large scale, highly distributed systems. This work involves deep architectural decisions around scalable model deployment strategies, real‑time feature serving at very high throughput, GPU‑accelerated inference at message scale, and responsible training of models on sensitive data with strong privacy and safety requirements. Core Focus Areas ML Infrastructure – The ML Infrastructure focus area is responsible for the low level systems that power training and inference at scale. This includes architecting and maintaining distributed systems for model training, serving, and deployment using Kubernetes‑based platforms, GPU infrastructure, and open source ML stacks such as KubeRay and vLLM. The team delivers platform capabilities that improve the speed, reliability, and quality of ML development, including training pipelines, feature generation systems, and compute orchestration. AI Platform – The AI Platform focus area builds the tooling and platform layers that enable AI development across Slack. This includes creating developer‑facing tools, SDKs, and workflows that allow product teams to integrate AI into Slack features efficiently and safely. The platform supports LLM efficiency and model transition initiatives through integrations with managed services across multiple cloud providers acting as the connective layer between core infrastructure and product engineering teams. About the Role We are looking for a Senior or Staff Software Engineer to join the ML Infrastructure focus area and help architect and operate the core systems that power AI at Slack. In this role, you will own foundational infrastructure for large‑scale model training and inference, and evolve it into a reliable, secure, and self‑service platform used across the company. You will work at the intersection of distributed systems, GPU infrastructure, and modern ML stacks, solving complex scalability and reliability challenges. This role blends deep systems engineering with a strong understanding of the ML lifecycle, and plays a critical part in shaping the long‑term technical foundations of Slack’s AI capabilities. What You Will Be Doing Design, build, and operate systems to train, serve, and deploy machine learning models at scale, with a focus on reliability, performance, and operational simplicity. Evolve GPU‑backed inference infrastructure to support high throughput, latency‑sensitive workloads, including large‑scale model serving. Architect and optimize distributed training and data processing systems using platforms such as Ray, Airflow, Spark, or similar technologies. Build and maintain Kubernetes‑based platforms and orchestration layers using tools such as KubeRay, vLLM, and internally developed services. Architect solutions that bridge legacy systems with modern technologies while maintaining monolithic application stability. Develop robust monitoring, observability, and alerting for production ML workloads to ensure operational excellence. Partner closely with AI Platform, ML modeling, security, and product engineering teams to design infrastructure that supports evolving AI use cases. Provide technical leadership through design reviews, mentorship, and by setting engineering standards and long‑term architectural direction for ML infrastructure. Author technical design and architecture documentation, and contribute thought leadership through engineering blog posts. What You Should Have Significant professional experience in software engineering with a strong focus on infrastructure, backend systems, platform engineering, or MLOps. Deep experience building and operating distributed systems, including expert‑level knowledge of Kubernetes and container‑based platforms. Hands‑on experience with modern ML infrastructure and serving stacks such as Ray or KubeRay, vLLM, or similar training and inference orchestration frameworks. Experience working with GPU infrastructure, including performance optimization and operational management at scale. Strong experience with data infrastructure and orchestration technologies such as Airflow, Spark, or similar systems. Experience building and operating cloud‑native systems on public cloud platforms such as AWS, GCP, or Azure, including infrastructure as code. A demonstrated ability to drive technical direction for complex systems and balance short‑term delivery with long‑term architectural goals. Excellent written communication, as well as the ability to thrive in an asynchronous and globally distributed infrastructure team. A related technical degree required. Compensation and Benefits In the United States, compensation offered will be determined by factors such as location, job level, job‑related knowledge, skills, and experience. Certain roles may be eligible for incentive compensation, equity, and benefits. Salesforce offers a variety of benefits to help you live well including: time‑off programs, medical, dental, vision, mental health support, paid parental leave, life and disability insurance, 401(k), and an employee stock‑purchasing program. The typical base salary range for this position is $172,500 - $313,700 annually. The range represents base salary only, and does not include company bonus, incentive for sales roles, equity or benefits, as applicable. Accommodations If you require assistance due to a disability applying for open positions please submit a request via the Accommodations Request Form. Posting Statement Salesforce is an equal‑opportunity employer and maintains a policy of non‑discrimination with all employees and applicants for employment. We believe in equality for all and create a workplace that’s inclusive and free from discrimination. Any employee or potential employee will be assessed on the basis of merit, competence and qualifications – without regard to race, religion, color, national origin, sex, sexual orientation, gender expression or identity, transgender status, age, disability, veteran or marital status, political viewpoint, or other classifications protected by law. This policy applies to current and prospective employees, no matter where they are in their Salesforce employment journey. It also applies to recruiting, hiring, job assignment, compensation, promotion, benefits, training, assessment of job performance, discipline, termination, and everything in between. Recruiting, hiring, and promotion decisions at Salesforce are fair and based on merit. The same goes for compensation, benefits, promotions, transfers, reduction in workforce, recall, training, and education. #J-18808-Ljbffr Centaur Labs

Apply

Vacancy posted 10 hours ago

Similar jobs that could be interesting for youBased on the Senior ML Infrastructure Engineer: Scale GPU‑Driven AI in Austin, TX vacancy

Senior ML Infrastructure Engineer, Inference Platform
$155.42k - $395.9k
...the Team: The ML Inference Platform... ...of the AV ML Infrastructure organization. Our... ...powers GM’s AI efforts. We’re... ...groups building AI-driven products for GM... ...to maximizing GPU utilization... ...are seeking a Senior ML Infrastructure engineer to help build and scale robust platforms...
Senior
Remote work
Relocation
Relocation package
Flexible hours
General Motors
Austin, TX
7 days ago
Senior Cloud Success Engineer: AI-Driven Automation & Scale
ESP Engineered is seeking a Senior Customer Success Engineer to bridge technical expertise and customer outcomes. This role involves architecting self-service documentation, optimizing AI agent architectures, and managing technical escalations during customer onboarding...
Senior
ESP Engineered
Austin, TX
19 hours ago
Senior ML Inference Platform Engineer Scale & Serve
A leading automotive company seeks a Senior ML Infrastructure Engineer in Austin, Texas, to design and implement backend software for ML inference workflows. The engineer will collaborate with ML engineers to ensure efficient model serving and lead technical decisions on...
Senior
Remote work
General Motors
Austin, TX
2 days ago
Senior Cloud GPU AI Infrastructure Engineer
...Consulting Member of Technical Staff for its AI Infrastructure team in Austin, Texas. This role focuses on building high-performance GPU platforms and overseeing the software... .../MS in Computer Science, 6+ years in large-scale systems, and proficiency in programming languages...
Senior
Flexible hours
Oracle
Austin, TX
19 hours ago
Senior AI Agentic Engineer - Platform & Scale
...provider in Austin, Texas, is seeking a Senior Software Engineer to join the AI Agentic Platform Team. You will be... ...scalable and secure agent infrastructures that enhance merchant interactions... ...position requires a passion for AI-driven solutions and a commitment to continuous...
Senior
BigCommerce
Austin, TX
2 days ago
Senior ML Engineer - US( in Austin only )
...About Autonomize AI Autonomize AI is revolutionizing... ...looking for bold, driven teammates to join us.... ...As a Senior Machine Learning Engineer at Autonomize, you will... ...including data scientists,ml engineers, healthcare... ...Experience in efficiently scaling ML model training and...
Senior
Remote work
Autonomize Inc
Austin, TX
19 hours ago
Senior Software Engineer, AI Networking
$184k - $287.5k
...unlimited potential of AI to define the next era of... ...computing. An era in which our GPU acts as the brains of... ...an outstanding Software Engineer to join our US-based... ...a decision. ~ Result driven and comfortable multitasking... ...performance tuning at scale. Experience building...
Senior
Shift work
NVIDIA
Austin, TX
3 days ago
Senior Staff Engineer - AI-Driven Cloud Platform
$144.6k - $198.8k
Document Crunch, Inc. in Austin, TX is seeking a senior engineer to lead the architectural design of an AI-driven platform. You will collaborate with cross-functional... ...candidate has expertise in cloud architecture, AI/ML systems, and a proven track record in system...
Senior
Document Crunch, Inc.
Austin, TX
2 days ago
Senior Backend Engineer - AI-Driven Data Platform
bfiverecruiting is seeking a Senior Backend Engineer in Austin, Texas, to design, build, and support scalable backend systems for data-driven software. You will improve backend services, work... ...data flows, and integrate with modern AI functionalities. Ideal candidates will...
Senior
bfiverecruiting
Austin, TX
19 hours ago
Senior ML Compiler Engineer
$128.7k - $261.3k
...mobility. For the AI Kernels & Compilers... ...on real vehicles at scale. We pioneer new approaches... ..., and performance engineering so that every cycle... ..., systems, and GPU engineers who enjoy... .... The Role As a Senior Compiler Engineer on... ...and effortless for ML engineers across the...
Senior
Local area
Flexible hours
General Motors
Austin, TX
19 hours ago
Senior Systems Engineer - AI-Driven Cloud & Microservices
...is looking for an experienced Systems Engineer to support large-scale technology transformations. The role focuses... ...systems and involves working with AI-assisted workflows and cloud-native... ...DevOps practices, microservices, and AI/ML implementation. You will design and...
Senior
Compunnel, Inc.
Austin, TX
19 hours ago
Staff AI/ML Infra Engineer - AV Cloud-Scale Systems
$218.8k - $335.3k
...in Austin is seeking an experienced Staff AI/ML Engineer to join the AV ML Infra team. The role involves... ...designing and implementing scalable ML infrastructure solutions and requires over 8 years of experience in large-scale distributed systems. Candidates should be proficient...
General Motors
Austin, TX
3 days ago
Senior AI/ML Engineer - Remote, AWS Cloud-Native & LLMs
$90 - $100 per hour
...Eliassen Group is seeking a Senior AI/ML Engineer to design cloud-native machine learning solutions on AWS. The role involves LLM orchestration, building predictive models, and working with multi-agent systems. Applicants must have experience in AI engineering and a strong...
Senior
Hourly pay
Contract work
Remote work
Eliassen Group
Austin, TX
14 hours ago
Senior Principal Network Development Engineer (IC5) - Backend NIC Qualification & NPI (OCI AI2 [...]
...are seeking a Senior Principal... ...Development Engineer (IC5) to lead... ...platforms supporting GPU‑ and... ...enabling OCI’s AI superclusters... ...performance, cluster scale, and workload... ...for AI/ML workloads (e.... ..., or offload‑driven architectures... ...NIC NPI for AI infrastructure, from early...
Senior
Temporary work
Flexible hours
Ll Oefentherapie
Austin, TX
19 hours ago
Senior Kubernetes Platform Engineer
...highest performance scale-out networking solutions for AI and HPC datacenters.... ...maximize the efficiency of GPU, CPU and accelerator... ...team of architects, engineers, and business... ...talented and experienced Senior Software Engineer,... ....Apply AI-driven techniques for automated...
Senior
Full time
Remote work
Flexible hours
Cornelis Networks
Austin, TX
2 days ago
Senior ML Compiler Engineer — AI & GPU Systems
$184k - $287.5k
NVIDIA Corporation seeks a Machine Learning Compiler Engineer in Austin, Texas. This role requires deep expertise in compilers and machine... .... Join a diverse environment and impact cutting-edge fields like AI and autonomous systems. #J-18808-Ljbffr NVIDIA Corporation
Senior
NVIDIA Corporation
Austin, TX
3 days ago
Senior Principal Network Engineer
...Senior Principal Network Engineer Austin, Texas, United States Graphcore... ...software and systems infrastructure that will unlock the next generation of AI breakthroughs and... ...Architecture Lead to design and scale high‑performance... ...fabrics supporting GPU clusters. You will...
Senior
Graphcore
Austin, TX
2 days ago
Senior ML Inference Engineer - Platform
$128.7k - $261.3k
...mission is two-fold: build the ML deployment platform that... ...currently performed manually by engineers. Build the developer experience... ...production platform or infrastructure systems where reliability, observability... ...Familiarity with the NVIDIA GPU stack at the integration...
Senior
Local area
Remote work
Work from home
Relocation package
Flexible hours
Shift work
General Motors
Austin, TX
4 days ago
Senior IT Infrastructure Engineer
...EDB provides a data and AI platform that enables organizations... ...risk, manage costs and scale efficiently for a data and... ...technology companies. EDB’s data-driven solutions enable customers... ...looking for a confident Senior IT Infrastructure Engineer who has demonstrated...
Senior
Remote work
EDB
Austin, TX
19 hours ago
Senior Staff ML Engineer, GenAI Platform Lead
$150k - $300k
A leading insurance provider is seeking a Senior Staff Machine Learning Engineer in Austin, TX, to drive the strategy and architecture of ML systems. This hands-on technical role involves building and integrating AI capabilities that enhance customer experiences. Candidates...
Senior
GEICO
Austin, TX
19 hours ago
Cloud MLOps Engineer
...seeking a Cloud MLOps Engineer to build and operate the cloud infrastructure that powers... ...intersection of ML research,... ...deploy models at scale, while ensuring telemetry-driven insights flow from... ...Experience supporting GPU workloads in... ...autonomy, or embodied AI environments...
Insight Global
Austin, TX
19 hours ago
Senior Compiler Engineer Infrastructure
$152k - $241.5k
...experienced Compiler Infrastructure Engineer to join our... ...developer productivity at scale. This role sits at... ...its mark on every GPU NVIDIA produces.... ..., and community‑driven compiler technology... ...(including AI-assisted workflows... ...experience applying AI or ML-based tools to...
Senior
NVIDIA Corporation
Austin, TX
2 days ago
Senior AI/ML Engineer, epocrates
$124k - $210k
...Senior AI/ML Engineer, epocrates Join us as we work to create a thriving... ...production-ready systems that can scale reliably and operate with... ...and production-grade infrastructure that support high-volume workloads... .... We unite as mission-driven problem-solvers with a deep...
Senior
Full time
Temporary work
Work at office
athenahealth
Austin, TX
1 day ago
Senior AI/ML Engineer
$188k - $250k
...build, and productionize large-scale NLP and LLM systems that... ...and LLM systems that analyze AI Answering engine outputs and public web content... ...customer problems into measurable ML deliverables and ship... ...compensation approach is data-driven, market-informed, and built to...
Senior
Local area
Meltwater
Austin, TX
1 day ago
Senior Core Infrastructure Engineer
Job Description As a Senior Core Infrastructure Engineer, you will own the software design... ...be a rock-solid developer, driven problem solver and have experience supporting large scale data planes. You should be... ...forward in the use of agentic AI development it is expected...
Senior
Temporary work
Flexible hours
Ll Oefentherapie
Austin, TX
1 day ago
Senior Machine Learning Engineer
...Senior Machine Learning Engineer We are seeking a Senior Machine... ...production ready AI systems for secure... ...government cloud infrastructure to edge devices... ...Engineer adaptive ML systems using LoRA... ...into large scale distributed systems... ...understanding of GPU computing, CUDA,...
Senior
Live out
Work at office
Flexible hours
webAI
Austin, TX
3 days ago
Senior Backend Engineer, AI Platform (Remote, )
...global technology services company is seeking a Senior Backend / Product Engineer to help build and scale its AI-powered cost intelligence platform. This position... ...development, focusing on core capabilities and AI-driven insights rather than client work. Ideal candidates...
Senior
Remote work
Virtasant
Austin, TX
2 days ago
Senior ML Engineer - Deployment and Databricks MLOps
$75 - $120 per hour
...Senior ML Engineer - Deployment and Databricks MLOps Location: Austin,... ...hardening Databricks-based MLOps infrastructure and promoting models into... ...model promotion using Git-driven development practices. Establish... ...you agree to receive calls, AI-generated calls, text...
Senior
Contract work
Apex Systems
Austin, TX
4 days ago
Senior ML Engineer
$152k - $228k
...Description Job Description Senior ML Engineer About Invoca Invoca is an AI-powered revenue execution platform... ...'ll be a primary driver of the infrastructure powering our Context Engine and... ..., Baseten, and Kubernetes-based GPU infrastructure. Profile and tune...
Senior
Currently hiring
Remote work
Flexible hours
Invoca
Austin, TX
4 days ago
Senior AI-Driven Cloud Customer Success Engineer
MontyCloud, located in Austin, Texas, is seeking a Senior Customer Success Engineer to bridge technical expertise and customer outcomes. You will be... ...responsible for architecting technical self-service and optimizing AI agent deployments, enhancing customer support efficiency....
Senior
MontyCloud
Austin, TX
19 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior ML Infrastructure Engineer: Scale GPU‑Driven AI. Be the first to apply!