On-prem Platform Engineer (LLM, Gen AI)
Ampstek
Hi,
Hope you are doing great!
I have an urgent opening with my client, detailed below. Please reply if you are interested.
Role: On-prem Platform Engineer
Location: Charlotte, NC
Duration: Long-term contract
Must-Have Skills (Mandatory Keywords)
LLM Inference & Optimization
• vLLM, TensorRT-LLM, Triton Inference Server, SGLang
• Inference optimization techniques:
o Continuous batching
o Speculative decoding
o KV cache / Prefix caching
• Model optimization:
o FP8, AWQ, GPTQ
Distributed & GPU Systems
• Tensor parallelism and large model scaling
• CUDA, NCCL, GPU architecture
• GPU partitioning & optimization (MIG)
Kubernetes & ML Serving
• Kubernetes-based ML serving platforms
• KServe, OpenShift AI
• Helm charts, Operators, platform automation
GPU Orchestration
• Run:AI or similar GPU scheduling/orchestration platforms
• Multi-tenant GPU workload management
Platform Engineering
• Experience building internal AI/ML platforms (on-prem or hybrid)
• Strong automation and system design mindset
Observability & Performance
• Prometheus, Grafana
• ML observability (model latency, throughput, drift, resource utilization)
• Performance benchmarking and tuning
Good to Have / Preferred Skills
• Experience with LLMOps / GenAI pipelines
• Exposure to hybrid cloud (on-prem + GCP/Azure integration)
• Familiarity with Inferentia / alternative accelerators
• Knowledge of service mesh / networking in GPU clusters
Responsibilities
• Build, configure, and operate on-prem Kubernetes/OpenShift AI platforms for deploying and serving GenAI models and LLM inference workloads.
• Design and optimize high-performance inference stacks using vLLM, TensorRT-LLM, Triton Inference Server, and SGLang, applying advanced techniques such as continuous batching, speculative decoding, and KV caching.
• Manage GPU orchestration and capacity using Run:AI, MIG, CUDA/NCCL, and tensor parallelism to maximize utilization and throughput.
• Deploy and operate Kubernetes ML serving frameworks (KServe, Helm, Operators) for scalable, reliable model serving.
• Drive inference optimization and benchmarking using FP8, AWQ, and GPTQ, along with performance tools such as GuideLLM and Locust.
• Implement observability and ML monitoring using Prometheus, Grafana, and Arize AI to ensure SLA/SLO compliance for GenAI services.
• Collaborate with ML and research teams to onboard new models, tune inference performance, and productionize GenAI use cases.
Thanks and Regards,
Rohit Pathak | Technical Recruiter


