On-prem Platform Engineer (LLM, Gen AI)

Ampstek

Hi,

Hope you are doing great!

We have an urgent opening with my client, described below. Please reply if you are interested.

Role: On-prem Platform Engineer

Location: Charlotte, NC

Long Term Contract

Must-Have Skills (Mandatory Keywords)

LLM Inference & Optimization

• vLLM, TensorRT-LLM, Triton Inference Server, SGLang

• Inference optimization techniques:

o Continuous batching

o Speculative decoding

o KV cache / prefix caching

• Model optimization:

o FP8, AWQ, GPTQ

Distributed & GPU Systems

• Tensor parallelism and large model scaling

• CUDA, NCCL, GPU architecture

• GPU partitioning & optimization (MIG)

Kubernetes & ML Serving

• Kubernetes-based ML serving platforms

• KServe, OpenShift AI

• Helm charts, Operators, platform automation

GPU Orchestration

• Run:AI or similar GPU scheduling/orchestration platforms

• Multi-tenant GPU workload management

Platform Engineering

• Experience building internal AI/ML platforms (on-prem or hybrid)

• Strong automation and system design mindset

Observability & Performance

• Prometheus, Grafana

• ML observability (model latency, throughput, drift, resource utilization)

• Performance benchmarking and tuning

Good to Have / Preferred Skills

• Experience with LLMOps / GenAI pipelines

• Exposure to hybrid cloud (on-prem + GCP/Azure integration)

• Familiarity with Inferentia / alternative accelerators

• Knowledge of service mesh / networking in GPU clusters

Responsibilities

· Build, configure, and operate on-prem Kubernetes/OpenShift AI platforms for deploying and serving GenAI models and LLM inference workloads.

· Design and optimize high-performance inference stacks using vLLM, TensorRT-LLM, Triton Inference Server, SGLang, and advanced techniques (continuous batching, speculative decoding, KV caching).

· Manage GPU orchestration and capacity using Run:AI, MIG, CUDA/NCCL, and tensor parallelism to maximize utilization and throughput.

· Deploy and operate Kubernetes ML serving frameworks (KServe, Helm, Operators) for scalable, reliable model serving.

· Drive inference optimization and benchmarking, leveraging FP8, AWQ, GPTQ, and performance tools such as GuideLLM and Locust.

· Implement observability and ML monitoring using Prometheus, Grafana, and Arize AI to ensure SLA/SLO compliance for GenAI services.

· Collaborate with ML and research teams to onboard new models, tune inference performance, and productionize GenAI use cases.

Thanks and Regards

Rohit Pathak | Technical Recruiter

Vacancy posted 3 hours ago