Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

On-Premise LLM Inference & GPU Systems Engineer

NTT DATA

On-Premise LLM Inference & GPU Systems Engineer

NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now. We are currently seeking an On-Premise LLM Inference & GPU Systems Engineer to join our team in Charlotte, North Carolina (US-NC), United States (US).

Role Overview

We are seeking an AI Infrastructure Runtime Engineer to build and maintain large-scale on-prem LLM infrastructure. This is an enterprise private GenAI environment running on NVIDIA H200 GPU clusters and an OpenShift AI deployment ecosystem. You will manage production inference internally, including self-hosting open-source LLMs like Llama. We are focused exclusively on inferencing; this role involves no model training infrastructure or fine-tuning pipelines.

Key Responsibilities
  • NVIDIA GPU Runtime Optimization: Drive extreme runtime efficiency and optimization for the token generation pipeline. Specifically manage prefill/decode optimization and KV cache management.
  • Inference Serving: Deploy and manage inference engines including vLLM and TensorRT-LLM.
  • Hardware Utilization: Optimize GPU throughput tuning, batching strategies, and latency optimization. Manage workload orchestration using RunAI and Kubernetes GPU orchestration.
  • Model Lifecycle Management: Oversee the complete Hugging Face model lifecycle, including model onboarding, deployment, and retirement.
  • Platform Operations: Operate and maintain the OpenShift AI ecosystem as the primary container platform for GenAI workloads.
Required Qualifications
  • 5+ years expertise as an LLM Systems Engineer or AI Infrastructure Runtime Engineer.
  • 5+ years hands-on experience with NVIDIA H200 clusters and runtime optimization techniques (KV Cache, prefill/decode).
  • 3+ years experience in OpenShift AI and GPU orchestration tools like RunAI.
  • Strong experience with modern inference frameworks, specifically vLLM and TensorRT-LLM.
  • Proven track record managing the Hugging Face deployment lifecycle.
Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the On-Premise LLM Inference & GPU Systems Engineer in Charlotte, NC vacancy
  •  ...part of an inclusive, adaptable, and forward-thinking organization, apply now. We are currently seeking a On-Premise LLM Inference & GPU Systems Engineer to join our team in Charlotte, North Carolina (US-NC), United States (US). Job Description: ~ Role Overview... 
    Suggested
    Remote work

    The Nippon Telegraph and Telephone Corporation (NTT)

    Charlotte, NC
    5 days ago
  •  ...AI/ML Inference Engineer Major Financial Services Organization - Charlotte, NC 3 Open Roles...  ...of large language model serving, GPU infrastructure, and enterprise MLOps - delivering...  ...NVIDIA H200 GPU clusters using TensorRT-LLM, Triton Inference Server, and SGLang... 
    Suggested
    Immediate start

    Hallmark Global Solutions Ltd

    Charlotte, NC
    3 days ago
  •  ...end-users requirements. Coordinate the engineering and technical aspects of projects, including...  ..., electrical and software aspects of system integration and conducting throughput analyses...  ...such as probability and statistical inference, and fundamentals of plane and solid... 
    Suggested
    Flexible hours

    Murata Machinery USA Inc

    Charlotte, NC
    3 days ago
  • $124k - $280k

     ...At PwC, our people in data and analytics engineering focus on leveraging advanced technologies...  ...designing and optimising algorithms, models, and systems to enable intelligent decision-making and...  ...data sources for use in AI and LLM-powered solutions Manage daily operations... 
    Suggested
    Full time
    H1b

    PwC

    Charlotte, NC
    2 days ago
  • $77k - $202k

     ...At PwC, our people in data and analytics engineering focus on leveraging advanced technologies...  ...designing and optimising algorithms, models, and systems to enable intelligent decision-making and...  ...- Experience in prompt engineering for LLM outputs - Designing thorough data... 
    Suggested
    Full time
    H1b

    PwC

    Charlotte, NC
    3 days ago
  •  ...Job Description Insight Global is seeking a Systems Engineer II to support a large-scale IT infrastructure environment within the logistics and technology industry. This engineer will be responsible for maintaining and optimizing a heavily on-prem environment while... 

    Insight Global

    Charlotte, NC
    2 days ago
  • $68 - $73 per hour

     ...Senior Industrial Engineer – Supplier Production Systems Our client, a global energy technology and industrial manufacturing leader, is seeking a Senior Industrial Engineer – Supplier Production Systems to join their team. As a Senior Industrial Engineer, you will... 
    Hourly pay

    Manpower

    Charlotte, NC
    4 days ago
  • $90k

     ...- as one team. We are bold in our ideas, accountable in our actions, and committed to doing the right things right. As our Systems Engineer, you will own the infrastructure and servers that host our SaaS solution that support thousands of companies worldwide. You will... 
    Worldwide
    Weekend work

    Navex Inc

    Charlotte, NC
    3 days ago
  •  ...The Systems Engineer will be responsible for designing, implementing, and maintaining the organization's IT infrastructure and systems to ensure optimal performance, reliability, and security. This role will support servers, networks, and cloud environments while troubleshooting... 

    Ferretti Search

    Charlotte, NC
    3 days ago
  •  ...Essential Functions & Responsibilities: Industrial/Commercial Control Systems Engineering service for ISI customers in the local area, relating to their needs in the following areas: SCADA Systems, PLC Programming, CAD Drawings, Industrial Networking, Systems Calibration... 
    Contract work
    For contractors
    Local area
    Night shift

    Applied Industrial Technologies

    Charlotte, NC
    12 hours ago
  •  ...System Engineer (with Linux & RHEL) Charlotte , NC / Jacksonville, FL (Hybrid role) """• Senior level experience Primarily with Linux • Experience with IBM AIX OS is a plus • Ability to Recommend, design, develop, and implement software and hardware solutions... 
    Flexible hours
    Weekend work

    Apex Informatics

    Charlotte, NC
    5 days ago
  •  ...for EV charging hardware, including coordination with internal engineering teams and external test laboratories. Regulatory Compliance...  ...Systematic Approach: Use techniques like data mining, system log analysis, EMC/debug testing, and performance testing to troubleshoot... 
    Hourly pay
    Full time
    Temporary work
    Local area

    alpitronic

    Charlotte, NC
    5 days ago
  •  ...Webserver System Engineer A Few Words About Us Integrated Resources, Inc is a premier staffing firm recognized as one of the tri-states most well-respected professional specialty firms. IRI has built its reputation on excellent service and integrity since its inception... 
    Weekend work

    Careers Integrated Resources Inc

    Charlotte, NC
    12 hours ago
  • $64.5k - $129.5k

     ...Systems Engineer Carrier Global Corporation, global leader in intelligent climate and energy solutions, is committed to creating innovations that bring comfort, safety and sustainability to life. Through cutting-edge advancements in climate solutions such as temperature... 
    Full time
    Temporary work
    Local area
    Remote work
    Monday to Friday
    1 day per week

    Carrier Global Corp

    Charlotte, NC
    9 hours ago
  • $105.4k - $124k

     ...career. Try new things, learn new skills and discover what you excel at—all from Day One. Job Description The Nexthink Systems Engineer creates and integrates solutions and services. The Nexthink Systems Engineer directs or recommends enhancements for system performance... 
    Temporary work
    Work experience placement
    Local area
    Remote work
    3 days per week

    U.S. Bank

    Charlotte, NC
    5 days ago
  • $75k - $90k

     ...world leader in the field of professional mobile communications systems with an impressive heritage of technological innovations and a...  ...the following: IP Networking, CAD, System Management, Network Engineering, Networking Equipment, Solution Architecture, ASTRO 25, WAVE... 
    Contract work
    Relocation

    Motorola Solutions

    Charlotte, NC
    4 days ago
  •  ...team of high-performing business professionals and leaders in engineering, R&D, product management and business development areas at our...  ...specific skillset and experience for the following role: The Systems Engineer, reporting to the Systems Engineering Manager, is... 
    Permanent employment
    Work at office
    Immediate start
    Work visa

    Terrestrial Energy USA Inc

    Charlotte, NC
    3 days ago
  • This role requires a strong understanding of network loadbalancing solutions, along with hands on API development experience. Candidates must be proficient with Ansible and Python, specifically using the FastAPI and Django frameworks. Experience with Kubernetes or Red ...

    Experis/Manpower Group

    Charlotte, NC
    3 days ago
  • $87 - $88 per hour

     ...Info Systems Engineer IV New York, New York, United States $ 87.00 - 88.00 (US Dollar) Info Systems Engineer IV needs 7+ years experience Info Systems Engineer IV requires: ~ Locations: NY, NY; Charlotte, NC; Iselin, NJ Info Systems Engineer IV... 

    Global Channel Management

    Charlotte, NC
    3 days ago
  •  ...Title: System Admin/Engineer Location: Charlotte, NC (Onsite) Duration: 12 Months Responsible for matching current technology with the current needs. As part of this task, engages in the evaluation and installation of software, hardware, and other types of support... 

    Brahma Consulting Group

    Charlotte, NC
    11 hours ago
  •  ...Insight Global is seeking a Mac Systems Engineer for a large enterprise client undergoing modernization of their macOS environment. This engineer will play a critical role in validating and securing macOS endpoints by integrating Jamf Pro with Azure Active Directory,... 

    Insight Global

    Charlotte, NC
    4 days ago
  •  ...Systems Engineer The Systems Engineer must be able to script, batch, do app support at the server or system layer. Looking for experience scripting to resolve server problems. The candidate needs to have worked with SQL, and Oracle would be a huge plus. This candidate... 
    Work at office

    Software Technology Inc

    Charlotte, NC
    10 hours ago
  • $80k - $90k

     ...Systems Engineer Charlotte - Headquarters - Charlotte, NC 28217 Overview Salary Range $80,000.00 - $90,000.00 Salary/year Position Type Full Time Description Systems Engineer How This Role Makes An Impact The Imagine team is a growing company, and... 
    Full time
    Work experience placement
    Work at office
    Remote work
    Night shift

    ImagineSoftware

    Charlotte, NC
    2 days ago
  • $76.1k - $104.6k

     ...design, configuration, and operation of complete building control systems including fire, security, and other low voltage control sub-...  ...in release meeting with project field team. Performs value engineering to provide cost effective results while maintaining customer... 
    Contract work
    For contractors
    Work experience placement
    For subcontractor

    Johnson Controls

    Charlotte, NC
    5 days ago
  •  ...strengthening industrial maturity of global supplier manufacturing systems by improving production stability, capacity, and operational...  .... Requirement/Must Have Degree in Industrial Engineering, Manufacturing Engineering, Production Engineering, or Mechanical... 

    Cynet Systems

    Charlotte, NC
    4 days ago
  •  ...Commvault Systems Engineer (Data Protection / Backup) Employment Type: Full-Time, Experienced CGS is seeking an experienced Commvault Data Protection Engineer with extensive knowledge and experience in designing, developing, configuring, implementing, testing, troubleshooting... 
    Full time
    Flexible hours

    Contact Government Services LLC

    Charlotte, NC
    11 hours ago
  • $80 - $81 per hour

     ...Senior Systems Engineer Charlotte, North Carolina, United States $ 80.00 - 81.00 (US Dollar) About the job Senior Systems Engineer Senior Systems Analyst needs 3 or more years working with Windows and Linux Operating Systems Senior Systems Analyst requires... 

    Global Channel Management

    Charlotte, NC
    5 days ago
  • $143.91k - $169.3k

     ...Senior Capital Markets Systems Engineer The Senior Capital Markets Systems Engineer is a senior front office leader accountable for building and operating critical quantitative trading and risk systems within Fixed Income. This role owns one or more production platforms... 
    Temporary work
    Work at office

    U.S. Bancorp

    Charlotte, NC
    2 days ago
  • $124k - $280k

     ...At PwC, our people in data and analytics engineering focus on leveraging advanced technologies...  ...designing and optimising algorithms, models, and systems to enable intelligent decision-making and...  ...- Experience in prompt engineering for LLM outputs - Developing scalable data... 
    Full time
    H1b

    PwC

    Charlotte, NC
    4 days ago
  • $248k - $396.75k

     ...NVIDIA Software Engineer Position NVIDIA is hiring experienced software engineers with...  ...node health monitoring and working with GPU resource scheduling. We welcome out-of-the...  ...DGX Cloud team responsible for production systems that enable large scalable GPU clusters to... 

    NVIDIA

    Belmont, NC
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to On-Premise LLM Inference & GPU Systems Engineer. Be the first to apply!