On-prem Platform Engineer (LLM, Gen AI)
Ampstek
Hi,
Hope you are doing great!
We have an urgent opening with our client, described below. Please reply if you are interested.
Role: On-prem Platform Engineer
Location: Charlotte, NC
Long-Term Contract
Must-Have Skills (Mandatory Keywords)
LLM Inference & Optimization
• vLLM, TensorRT-LLM, Triton Inference Server, SGLang
• Inference optimization techniques:
o Continuous batching
o Speculative decoding
o KV cache / Prefix caching
• Model optimization:
o FP8, AWQ, GPTQ
Distributed & GPU Systems
• Tensor parallelism and large model scaling
• CUDA, NCCL, GPU architecture
• GPU partitioning & optimization (MIG)
Kubernetes & ML Serving
• Kubernetes-based ML serving platforms
• KServe, OpenShift AI
• Helm charts, Operators, platform automation
GPU Orchestration
• Run:AI or similar GPU scheduling/orchestration platforms
• Multi-tenant GPU workload management
Platform Engineering
• Experience building internal AI/ML platforms (on-prem or hybrid)
• Strong automation and system design mindset
Observability & Performance
• Prometheus, Grafana
• ML observability (model latency, throughput, drift, resource utilization)
• Performance benchmarking and tuning
Good to Have / Preferred Skills
• Experience with LLMOps / GenAI pipelines
• Exposure to hybrid cloud (on-prem + GCP/Azure integration)
• Familiarity with Inferentia / alternative accelerators
• Knowledge of service mesh / networking in GPU clusters
Responsibilities
• Build, configure, and operate on-prem Kubernetes/OpenShift AI platforms for deploying and serving GenAI models and LLM inference workloads.
• Design and optimize high-performance inference stacks using vLLM, TensorRT-LLM, Triton Inference Server, SGLang, and advanced techniques (continuous batching, speculative decoding, KV caching).
• Manage GPU orchestration and capacity using Run:AI, MIG, CUDA/NCCL, and tensor parallelism to maximize utilization and throughput.
• Deploy and operate Kubernetes ML serving frameworks (KServe, Helm, Operators) for scalable, reliable model serving.
• Drive inference optimization and benchmarking, leveraging FP8, AWQ, and GPTQ quantization and performance tools such as GuideLLM and Locust.
• Implement observability and ML monitoring using Prometheus, Grafana, and Arize AI to ensure SLA/SLO compliance for GenAI services.
• Collaborate with ML and research teams to onboard new models, tune inference performance, and productionize GenAI use cases.
Thanks and Regards
Rohit Pathak | Technical Recruiter

