Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior ML Infrastructure Engineer: Scale AI

$172.5k - $313.7k

Centaur Labs

Software Engineering About Salesforce Salesforce is the #1 AI CRM, where humans with agents drive customer success together. Here, ambition meets action. Tech meets trust. And innovation isn’t a buzzword — it’s a way of life. The world of work as we know it is changing and we’re looking for Trailblazers who are passionate about bettering business and the world through AI, driving innovation, and keeping Salesforce’s core values at the heart of it all. Ready to level‑up your career at the company leading workforce transformation in the agentic era? You’re in the right place! Agentforce is the future of AI, and you are the future of Salesforce. About Slack AI Slack AI’s mission is to transform how people work by making Slack an AI‑powered operating system. We’re tackling significant challenges like unlocking collective knowledge and reducing noise, all while building a seamless, consumer‑grade AI experience within users’ existing workflows. Join us in shaping the future of work through AI. About the Team The AI and ML Infrastructure team is part of Slack’s Core Infrastructure organization and is responsible for the foundational systems that enable machine learning and AI across the company. The team designs, builds, and operates reliable, scalable, and high‑performance platforms that allow product and ML teams to develop, deploy, and operate AI‑driven capabilities with confidence. The team owns shared infrastructure, services, and tooling that support the full ML lifecycle, including model training, deployment, inference, and monitoring. As Slack AI continues to grow, the team is evolving from traditional ML deployments toward large scale, highly distributed systems. This work involves deep architectural decisions around scalable model deployment strategies, real‑time feature serving at very high throughput, GPU‑accelerated inference at message scale, and responsible training of models on sensitive data with strong privacy and safety requirements. Core Focus Areas ML Infrastructure – The ML Infrastructure focus area is responsible for the low level systems that power training and inference at scale. This includes architecting and maintaining distributed systems for model training, serving, and deployment using Kubernetes‑based platforms, GPU infrastructure, and open source ML stacks such as KubeRay and vLLM. The team delivers platform capabilities that improve the speed, reliability, and quality of ML development, including training pipelines, feature generation systems, and compute orchestration. AI Platform – The AI Platform focus area builds the tooling and platform layers that enable AI development across Slack. This includes creating developer‑facing tools, SDKs, and workflows that allow product teams to integrate AI into Slack features efficiently and safely. The platform supports LLM efficiency and model transition initiatives through integrations with managed services across multiple cloud providers acting as the connective layer between core infrastructure and product engineering teams. About the Role We are looking for a Senior or Staff Software Engineer to join the ML Infrastructure focus area and help architect and operate the core systems that power AI at Slack. In this role, you will own foundational infrastructure for large‑scale model training and inference, and evolve it into a reliable, secure, and self‑service platform used across the company. You will work at the intersection of distributed systems, GPU infrastructure, and modern ML stacks, solving complex scalability and reliability challenges. This role blends deep systems engineering with a strong understanding of the ML lifecycle, and plays a critical part in shaping the long‑term technical foundations of Slack’s AI capabilities. What You Will Be Doing Design, build, and operate systems to train, serve, and deploy machine learning models at scale, with a focus on reliability, performance, and operational simplicity. Evolve GPU‑backed inference infrastructure to support high throughput, latency‑sensitive workloads, including large‑scale model serving. Architect and optimize distributed training and data processing systems using platforms such as Ray, Airflow, Spark, or similar technologies. Build and maintain Kubernetes‑based platforms and orchestration layers using tools such as KubeRay, vLLM, and internally developed services. Architect solutions that bridge legacy systems with modern technologies while maintaining monolithic application stability. Develop robust monitoring, observability, and alerting for production ML workloads to ensure operational excellence. Partner closely with AI Platform, ML modeling, security, and product engineering teams to design infrastructure that supports evolving AI use cases. Provide technical leadership through design reviews, mentorship, and by setting engineering standards and long‑term architectural direction for ML infrastructure. Author technical design and architecture documentation, and contribute thought leadership through engineering blog posts. What You Should Have Significant professional experience in software engineering with a strong focus on infrastructure, backend systems, platform engineering, or MLOps. Deep experience building and operating distributed systems, including expert‑level knowledge of Kubernetes and container‑based platforms. Hands‑on experience with modern ML infrastructure and serving stacks such as Ray or KubeRay, vLLM, or similar training and inference orchestration frameworks. Experience working with GPU infrastructure, including performance optimization and operational management at scale. Strong experience with data infrastructure and orchestration technologies such as Airflow, Spark, or similar systems. Experience building and operating cloud‑native systems on public cloud platforms such as AWS, GCP, or Azure, including infrastructure as code. A demonstrated ability to drive technical direction for complex systems and balance short‑term delivery with long‑term architectural goals. Excellent written communication, as well as the ability to thrive in an asynchronous and globally distributed infrastructure team. A related technical degree required. Compensation and Benefits In the United States, compensation offered will be determined by factors such as location, job level, job‑related knowledge, skills, and experience. Certain roles may be eligible for incentive compensation, equity, and benefits. Salesforce offers a variety of benefits to help you live well including: time‑off programs, medical, dental, vision, mental health support, paid parental leave, life and disability insurance, 401(k), and an employee stock‑purchasing program. The typical base salary range for this position is $172,500 - $313,700 annually. The range represents base salary only, and does not include company bonus, incentive for sales roles, equity or benefits, as applicable. Accommodations If you require assistance due to a disability applying for open positions please submit a request via the Accommodations Request Form. Posting Statement Salesforce is an equal‑opportunity employer and maintains a policy of non‑discrimination with all employees and applicants for employment. We believe in equality for all and create a workplace that’s inclusive and free from discrimination. Any employee or potential employee will be assessed on the basis of merit, competence and qualifications – without regard to race, religion, color, national origin, sex, sexual orientation, gender expression or identity, transgender status, age, disability, veteran or marital status, political viewpoint, or other classifications protected by law. This policy applies to current and prospective employees, no matter where they are in their Salesforce employment journey. It also applies to recruiting, hiring, job assignment, compensation, promotion, benefits, training, assessment of job performance, discipline, termination, and everything in between. Recruiting, hiring, and promotion decisions at Salesforce are fair and based on merit. The same goes for compensation, benefits, promotions, transfers, reduction in workforce, recall, training, and education. #J-18808-Ljbffr

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Senior ML Infrastructure Engineer: Scale AI in Austin, TX vacancy
  • A leading automotive company seeks a Senior ML Infrastructure Engineer in Austin, Texas, to design and implement backend software for ML inference workflows. The engineer will collaborate with ML engineers to ensure efficient model serving and lead technical decisions on... 
    Senior
    Remote work

    General Motors

    Austin, TX
    2 days ago
  • $155.42k - $395.9k

     ...About the Team: The ML Inference Platform is part of the AV ML Infrastructure organization. Our team owns the...  ...that powers GM’s AI efforts. We’re proud to serve...  ...the Role: We are seeking a Senior ML Infrastructure engineer to help build and scale robust platforms for ML Inference... 
    Senior
    Remote work
    Relocation
    Relocation package
    Flexible hours

    General Motors

    Austin, TX
    12 days ago
  • $218.8k - $335.3k

     ...in Austin is seeking an experienced Staff AI/ML Engineer to join the AV ML Infra team. The role involves...  ...designing and implementing scalable ML infrastructure solutions and requires over 8 years of experience in large-scale distributed systems. Candidates should be proficient... 
    Suggested

    General Motors

    Austin, TX
    1 day ago
  • $150k - $300k

     ...A leading insurance provider is seeking a Senior Staff Machine Learning Engineer in Austin, TX, to drive the strategy and architecture of ML systems. This hands-on technical role involves building and integrating AI capabilities that enhance customer experiences. Candidates... 
    Senior

    GEICO

    Austin, TX
    1 day ago
  • $90 - $100 per hour

     ...Eliassen Group is seeking a Senior AI/ML Engineer to design cloud-native machine learning solutions on AWS. The role involves LLM orchestration, building predictive models, and working with multi-agent systems. Applicants must have experience in AI engineering and a strong... 
    Senior
    Hourly pay
    Contract work
    Remote work

    Eliassen Group

    Austin, TX
    4 days ago
  • $170k - $240k

     ...delivering-driven expert in ML Training Infrastructure with a strong ability to...  ...reliable, and high-performance AI/ML platform infrastructure...  ...initiatives. As a Senior ML Engineer, you will collaborate closely...  ...support model training at scale. Model training performance... 
    Senior
    Local area
    Remote work
    Work from home
    Relocation
    Relocation package
    Flexible hours

    General Motors

    Austin, TX
    4 days ago
  •  ...Be a Machine Learning Infrastructure Hero at Quora! (Remote-friendly...  ...you love solving large-scale problems using machine learning (ML)? Help Quora, a massive...  ...users with information and AI language models like GPT-...  .... Collaborating with ML engineers to help them be more effective... 
    Remote work

    Stackruit Ltd.

    Austin, TX
    1 day ago
  •  ...ESP Engineered is seeking a Senior Customer Success Engineer to bridge technical expertise and customer outcomes. This role involves architecting self-service documentation, optimizing AI agent architectures, and managing technical escalations during customer onboarding... 
    Senior

    ESP Engineered

    Austin, TX
    1 day ago
  • $128.7k - $261.3k

     ...accessible mobility. For the AI Kernels & Compilers...  ...on real vehicles at scale. We pioneer new approaches...  ..., and performance engineering so that every cycle on...  ...driving. The Role As a Senior Compiler Engineer on the...  ...reliable, and effortless for ML engineers across the AV... 
    Senior
    Local area
    Flexible hours

    General Motors

    Austin, TX
    1 day ago
  •  ...About Autonomize AI Autonomize AI is revolutionizing...  ...The Opportunity As a Senior Machine Learning Engineer at Autonomize, you will lead...  ...including data scientists,ml engineers, healthcare clients...  ...Experience in efficiently scaling ML model training and inferencing... 
    Senior
    Remote work

    Autonomize Inc

    Austin, TX
    5 days ago
  •  ...Roku, Inc. is looking for a Machine Learning Engineer to tackle challenging problems in advertising through model optimization and creative generation. Ideal candidates will have over 5 years of experience in developing Machine Learning platforms, particularly in deep... 
    Senior
    Flexible hours

    Roku

    Austin, TX
    1 day ago
  •  ...Description About the Team: The AI Validation Platform team...  ...’re proud to serve as the infrastructure platform for teams...  ...prioritizing high-impact, ML-centric use cases. About the...  ...a Staff ML Infrastructure engineer to help build and scale robust Compute platforms for... 
    Local area
    Work from home

    Israelvcforum

    Austin, TX
    2 days ago
  •  ...Senior Principal Network Engineer Austin, Texas, United States Graphcore is one of...  ...hardware, software and systems infrastructure that will unlock the next generation of AI breakthroughs and power the...  ...Architecture Lead to design and scale high‑performance computing (... 
    Senior

    Graphcore

    Austin, TX
    2 days ago
  • $128.7k - $261.3k

     ...hardware. Our mission is two-fold: build the ML deployment platform that makes model...  ...currently performed manually by engineers. Build the developer experience that...  ...building or operating production platform or infrastructure systems where reliability, observability... 
    Senior
    Local area
    Remote work
    Work from home
    Relocation package
    Flexible hours
    Shift work

    General Motors

    Austin, TX
    4 days ago
  •  ...About Us EDB provides a data and AI platform that enables organizations to...  ...enterprises to control risk, manage costs and scale efficiently for a data and AI led...  ...** We are looking for a confident Senior IT Infrastructure Engineer who has demonstrated experience... 
    Senior
    Remote work

    EDB

    Austin, TX
    4 days ago
  •  ...Job Description As a Senior Core Infrastructure Engineer, you will own the software design and development for...  ...and have experience supporting large scale data planes. You should be well versed...  ...move forward in the use of agentic AI development it is expected that you will... 
    Senior
    Temporary work
    Flexible hours

    Ll Oefentherapie

    Austin, TX
    1 day ago
  • Upstart is looking for a Principal Software Engineer focused on Machine Learning Simulations. This role involves building an MLOps platform to support machine learning model inference and automating processes. The ideal candidate has a strong background in Python, Kotlin... 
    Senior
    Remote job

    Upstart

    Austin, TX
    5 days ago
  • $128.7k - $261.3k

     ...accessible mobility. For the AI Kernels & Compilers...  ...on real vehicles at scale. We pioneer new approaches...  ..., and performance engineering so that every cycle on...  ...heart of our on‑vehicle ML inference for ADAS and...  ...and improve tooling and infrastructure that make it easier to... 
    Senior
    Full time
    Local area
    Work from home
    Relocation package
    Flexible hours

    General Motors

    Austin, TX
    18 hours ago
  •  ...A leading technology company in Austin is seeking a hands-on Machine Learning Engineer to enhance advertising systems. The role involves designing and building machine learning systems and data pipelines, defining innovation roadmaps, and optimizing model performance.... 

    Apple

    Austin, TX
    1 day ago
  • $156k - $231k

    Afresh is seeking a Senior Data Engineer to play a key role in scaling customer data integrations. In this role, you will design and implement ETLs using PySpark...  .... Join us in tackling food waste with innovative AI solutions while enjoying a collaborative work environment... 
    Senior

    Afresh

    Austin, TX
    1 day ago
  •  ...secure mobility, analytics / AI, digital marketing and...  ...looking for an experienced Senior Network Engineer to join our dynamic Network...  ...seamless operation of our network infrastructure, which is vital to our...  ...hands-on experience in large-scale, enterprise-class network... 
    Senior
    Casual work
    Work at office
    Local area
    Flexible hours

    Samsung SDS America

    Austin, TX
    9 days ago
  • $124k - $210k

     ...healthcare for all. Role summary: The Senior AI/ML Engineer, epocrates, will help design and...  ...production-ready systems that can scale reliably and operate with clear...  ...Implement ML pipelines and production-grade infrastructure that support high-volume workloads... 
    Senior
    Full time
    Temporary work
    Work at office
    Remote work

    athenahealth

    Austin, TX
    6 days ago
  • $188k - $250k

     ...Design, build, and productionize large-scale NLP and LLM systems that power information...  ...deploy NLP and LLM systems that analyze AI Answering engine outputs and public web content to...  ...turn customer problems into measurable ML deliverables and ship production features... 
    Senior
    Local area

    Meltwater

    Austin, TX
    6 days ago
  • $128.7k - $261.3k

     ...General Motors is hiring a Senior Compiler Engineer to join their AI Kernels & Compilers team in Austin, Texas. The role focuses on optimizing inference...  ...candidates have a background in compilers and experience with ML frameworks. This hybrid position requires in-office... 
    Senior
    Work at office

    General Motors

    Austin, TX
    1 day ago
  •  ...CloudFlare in Austin, Texas, seeks a talented engineer to join the Egress team. This role focuses...  ...grasp of networking protocols, and a commitment to integrating AI tools into their workflow. Join us to work on cloud networking challenges at scale. #J-18808-Ljbffr... 
    Senior

    Cloudflare Inc

    Austin, TX
    1 day ago
  • $125k - $156.3k

     ...Job Description Role Overview The Senior AI/ML Engineer is responsible for designing, building...  ...and machine learning engineering at scale, with a passion for building robust,...  ...evaluation pipelines Build the underlying infrastructure for autonomous and semi-autonomous AI... 
    Senior
    Work at office
    Immediate start
    Remote work
    Worldwide

    Natera

    Austin, TX
    7 days ago
  •  ...CesiumAstro is seeking a Senior Machine Learning Engineer II to develop and deploy machine learning solutions for distributed processing platforms. The ideal candidate has 6+ years of experience in ML systems, strong proficiency in Python, and expertise in managing workflows... 
    Senior

    Roman Health Pharmacy LLC

    Austin, TX
    1 day ago
  •  ...Rival is looking for a Senior Machine Learning Operations Engineer based in Austin, TX. This role will focus on designing and maintaining scalable ML systems, mentoring junior engineers, and collaborating across teams to drive key initiatives. The ideal candidate has... 
    Senior
    Casual work

    Rival Inc

    Austin, TX
    2 days ago
  •  ...leading fintech company is seeking a Senior Staff Machine Learning Engineer to revolutionize financial services through...  ...strategies, leading the design of ML systems, and mentoring engineers in a...  ...over 10 years of experience in large-scale ML systems, leadership in end-to-end... 
    Senior
    Remote work

    Affirm

    Austin, TX
    2 days ago
  • $96.8k - $223.4k

     ...Product Development Engineer AI2NE strives to be a global leader in the RDMA...  ...clusters tailored specifically for AI, ML, HPC workloads. We strive to be...  ...deployment, and operations of large-scale global Oracle Cloud Infrastructure (OCI). Primarily focused on the development... 
    Senior
    Temporary work
    Flexible hours

    Oracle

    Austin, TX
    6 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior ML Infrastructure Engineer: Scale AI. Be the first to apply!