On-Premise LLM Inference & GPU Systems Engineer
NTT DATA
On-Premise LLM Inference & GPU Systems Engineer
NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now. We are currently seeking an On-Premise LLM Inference & GPU Systems Engineer to join our team in Charlotte, North Carolina (US-NC), United States (US).
Role Overview
We are seeking an AI Infrastructure Runtime Engineer to build and maintain large-scale on-prem LLM infrastructure. This is an enterprise private GenAI environment running on NVIDIA H200 GPU clusters and an OpenShift AI deployment ecosystem. You will manage production inference internally, including self-hosting open-source LLMs like Llama. We are focused exclusively on inferencing; this role involves no model training infrastructure or fine-tuning pipelines.
Key Responsibilities
- NVIDIA GPU Runtime Optimization: Drive extreme runtime efficiency and optimization for the token generation pipeline. Specifically manage prefill/decode optimization and KV cache management.
- Inference Serving: Deploy and manage inference engines including vLLM and TensorRT-LLM.
- Hardware Utilization: Optimize GPU throughput tuning, batching strategies, and latency optimization. Manage workload orchestration using RunAI and Kubernetes GPU orchestration.
- Model Lifecycle Management: Oversee the complete Hugging Face model lifecycle, including model onboarding, deployment, and retirement.
- Platform Operations: Operate and maintain the OpenShift AI ecosystem as the primary container platform for GenAI workloads.
Required Qualifications
- 5+ years expertise as an LLM Systems Engineer or AI Infrastructure Runtime Engineer.
- 5+ years hands-on experience with NVIDIA H200 clusters and runtime optimization techniques (KV Cache, prefill/decode).
- 3+ years experience in OpenShift AI and GPU orchestration tools like RunAI.
- Strong experience with modern inference frameworks, specifically vLLM and TensorRT-LLM.
- Proven track record managing the Hugging Face deployment lifecycle.
- ...part of an inclusive, adaptable, and forward-thinking organization, apply now. We are currently seeking a On-Premise LLM Inference & GPU Systems Engineer to join our team in Charlotte, North Carolina (US-NC), United States (US). Job Description: ~ Role Overview...SuggestedRemote work
- ...AI/ML Inference Engineer Major Financial Services Organization - Charlotte, NC 3 Open Roles... ...of large language model serving, GPU infrastructure, and enterprise MLOps - delivering... ...NVIDIA H200 GPU clusters using TensorRT-LLM, Triton Inference Server, and SGLang...SuggestedImmediate start
- ...end-users requirements. Coordinate the engineering and technical aspects of projects, including... ..., electrical and software aspects of system integration and conducting throughput analyses... ...such as probability and statistical inference, and fundamentals of plane and solid...SuggestedFlexible hours
$124k - $280k
...At PwC, our people in data and analytics engineering focus on leveraging advanced technologies... ...designing and optimising algorithms, models, and systems to enable intelligent decision-making and... ...data sources for use in AI and LLM-powered solutions Manage daily operations...SuggestedFull timeH1b$77k - $202k
...At PwC, our people in data and analytics engineering focus on leveraging advanced technologies... ...designing and optimising algorithms, models, and systems to enable intelligent decision-making and... ...- Experience in prompt engineering for LLM outputs - Designing thorough data...SuggestedFull timeH1b- ...Job Description Insight Global is seeking a Systems Engineer II to support a large-scale IT infrastructure environment within the logistics and technology industry. This engineer will be responsible for maintaining and optimizing a heavily on-prem environment while...
$68 - $73 per hour
...Senior Industrial Engineer – Supplier Production Systems Our client, a global energy technology and industrial manufacturing leader, is seeking a Senior Industrial Engineer – Supplier Production Systems to join their team. As a Senior Industrial Engineer, you will...Hourly pay$90k
...- as one team. We are bold in our ideas, accountable in our actions, and committed to doing the right things right. As our Systems Engineer, you will own the infrastructure and servers that host our SaaS solution that support thousands of companies worldwide. You will...WorldwideWeekend work- ...The Systems Engineer will be responsible for designing, implementing, and maintaining the organization's IT infrastructure and systems to ensure optimal performance, reliability, and security. This role will support servers, networks, and cloud environments while troubleshooting...
- ...Essential Functions & Responsibilities: Industrial/Commercial Control Systems Engineering service for ISI customers in the local area, relating to their needs in the following areas: SCADA Systems, PLC Programming, CAD Drawings, Industrial Networking, Systems Calibration...Contract workFor contractorsLocal areaNight shift
- ...System Engineer (with Linux & RHEL) Charlotte , NC / Jacksonville, FL (Hybrid role) """• Senior level experience Primarily with Linux • Experience with IBM AIX OS is a plus • Ability to Recommend, design, develop, and implement software and hardware solutions...Flexible hoursWeekend work
- ...for EV charging hardware, including coordination with internal engineering teams and external test laboratories. Regulatory Compliance... ...Systematic Approach: Use techniques like data mining, system log analysis, EMC/debug testing, and performance testing to troubleshoot...Hourly payFull timeTemporary workLocal area
- ...Webserver System Engineer A Few Words About Us Integrated Resources, Inc is a premier staffing firm recognized as one of the tri-states most well-respected professional specialty firms. IRI has built its reputation on excellent service and integrity since its inception...Weekend work
$64.5k - $129.5k
...Systems Engineer Carrier Global Corporation, global leader in intelligent climate and energy solutions, is committed to creating innovations that bring comfort, safety and sustainability to life. Through cutting-edge advancements in climate solutions such as temperature...Full timeTemporary workLocal areaRemote workMonday to Friday1 day per week$105.4k - $124k
...career. Try new things, learn new skills and discover what you excel at—all from Day One. Job Description The Nexthink Systems Engineer creates and integrates solutions and services. The Nexthink Systems Engineer directs or recommends enhancements for system performance...Temporary workWork experience placementLocal areaRemote work3 days per week$75k - $90k
...world leader in the field of professional mobile communications systems with an impressive heritage of technological innovations and a... ...the following: IP Networking, CAD, System Management, Network Engineering, Networking Equipment, Solution Architecture, ASTRO 25, WAVE...Contract workRelocation- ...team of high-performing business professionals and leaders in engineering, R&D, product management and business development areas at our... ...specific skillset and experience for the following role: The Systems Engineer, reporting to the Systems Engineering Manager, is...Permanent employmentWork at officeImmediate startWork visa
- This role requires a strong understanding of network loadbalancing solutions, along with hands on API development experience. Candidates must be proficient with Ansible and Python, specifically using the FastAPI and Django frameworks. Experience with Kubernetes or Red ...
$87 - $88 per hour
...Info Systems Engineer IV New York, New York, United States $ 87.00 - 88.00 (US Dollar) Info Systems Engineer IV needs 7+ years experience Info Systems Engineer IV requires: ~ Locations: NY, NY; Charlotte, NC; Iselin, NJ Info Systems Engineer IV...- ...Title: System Admin/Engineer Location: Charlotte, NC (Onsite) Duration: 12 Months Responsible for matching current technology with the current needs. As part of this task, engages in the evaluation and installation of software, hardware, and other types of support...
- ...Insight Global is seeking a Mac Systems Engineer for a large enterprise client undergoing modernization of their macOS environment. This engineer will play a critical role in validating and securing macOS endpoints by integrating Jamf Pro with Azure Active Directory,...
- ...Systems Engineer The Systems Engineer must be able to script, batch, do app support at the server or system layer. Looking for experience scripting to resolve server problems. The candidate needs to have worked with SQL, and Oracle would be a huge plus. This candidate...Work at office
$80k - $90k
...Systems Engineer Charlotte - Headquarters - Charlotte, NC 28217 Overview Salary Range $80,000.00 - $90,000.00 Salary/year Position Type Full Time Description Systems Engineer How This Role Makes An Impact The Imagine team is a growing company, and...Full timeWork experience placementWork at officeRemote workNight shift$76.1k - $104.6k
...design, configuration, and operation of complete building control systems including fire, security, and other low voltage control sub-... ...in release meeting with project field team. Performs value engineering to provide cost effective results while maintaining customer...Contract workFor contractorsWork experience placementFor subcontractor- ...strengthening industrial maturity of global supplier manufacturing systems by improving production stability, capacity, and operational... .... Requirement/Must Have Degree in Industrial Engineering, Manufacturing Engineering, Production Engineering, or Mechanical...
- ...Commvault Systems Engineer (Data Protection / Backup) Employment Type: Full-Time, Experienced CGS is seeking an experienced Commvault Data Protection Engineer with extensive knowledge and experience in designing, developing, configuring, implementing, testing, troubleshooting...Full timeFlexible hours
$80 - $81 per hour
...Senior Systems Engineer Charlotte, North Carolina, United States $ 80.00 - 81.00 (US Dollar) About the job Senior Systems Engineer Senior Systems Analyst needs 3 or more years working with Windows and Linux Operating Systems Senior Systems Analyst requires...$143.91k - $169.3k
...Senior Capital Markets Systems Engineer The Senior Capital Markets Systems Engineer is a senior front office leader accountable for building and operating critical quantitative trading and risk systems within Fixed Income. This role owns one or more production platforms...Temporary workWork at office$124k - $280k
...At PwC, our people in data and analytics engineering focus on leveraging advanced technologies... ...designing and optimising algorithms, models, and systems to enable intelligent decision-making and... ...- Experience in prompt engineering for LLM outputs - Developing scalable data...Full timeH1b$248k - $396.75k
...NVIDIA Software Engineer Position NVIDIA is hiring experienced software engineers with... ...node health monitoring and working with GPU resource scheduling. We welcome out-of-the... ...DGX Cloud team responsible for production systems that enable large scalable GPU clusters to...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to On-Premise LLM Inference & GPU Systems Engineer. Be the first to apply!
- operations support system engineer Charlotte, NC
- microsoft systems engineer Charlotte, NC
- mission system engineer Charlotte, NC
- unix linux systems engineer Charlotte, NC
- space systems engineer Charlotte, NC
- digital communications systems engineer Charlotte, NC
- application system engineer Charlotte, NC
- system performance engineer Charlotte, NC
- system engineer contract Charlotte, NC
- senior staff systems engineer Charlotte, NC

