Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

On-Premise LLM Inference & GPU Systems Engineer

NTT Data Americas, Inc.

Company Overview NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward‑thinking organization, apply now. We are currently seeking a On‑Premise LLM Inference & GPU Systems Engineer to join our team in Charlotte, North Carolina (US‑NC), United States (US). Job Description We are seeking an AI Infrastructure Runtime Engineer to build and maintain large‑scale on‑prem LLM infrastructure. This is an enterprise private GenAI environment running on NVIDIA H200 GPU clusters and an OpenShift AI deployment ecosystem. You will manage production inference internally, including self‑hosting open‑source LLMs like Llama. We are focused exclusively on inferencing; this role involves no model training infrastructure or fine‑tuning pipelines. Key Responsibilities NVIDIA GPU Runtime Optimization: Drive extreme runtime efficiency and optimization for the token generation pipeline. Specifically manage prefill/decode optimization and KV cache management. Inference Serving: Deploy and manage inference engines including vLLM and TensorRT‑LLM. Hardware Utilization: Optimize GPU throughput tuning, batching strategies, and latency optimization. Manage workload orchestration using RunAI and Kubernetes GPU orchestration. Model Lifecycle Management: Oversee the complete Hugging Face model lifecycle, including model onboarding, deployment, and retirement. Platform Operations: Operate and maintain the OpenShift AI ecosystem as the primary container platform for GenAI workloads. Required Qualifications 5+ years expertise as an LLM Systems Engineer or AI Infrastructure Runtime Engineer. 5+ years hands‑on experience with NVIDIA H200 clusters and runtime optimization techniques (KV Cache, prefill/decode). 3+ years experience in OpenShift AI and GPU orchestration tools like RunAI. Strong experience with modern inference frameworks, specifically vLLM and TensorRT‑LLM. Proven track record managing the Hugging Face deployment lifecycle. Equal Opportunity Employment NTT DATA is an equal opportunity employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability or protected veteran status. For our EEO Policy Statement, please click here. If you’d like more information on your EEO rights under the law, please click here. #J-18808-Ljbffr NTT Data Americas, Inc.

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the On-Premise LLM Inference & GPU Systems Engineer in Charlotte, NC vacancy
  •  ...On-Premise LLM Inference & GPU Systems Engineer NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now. We are currently seeking... 
    Suggested

    NTT DATA

    Charlotte, NC
    3 days ago
  •  ...Infrastructure Runtime Engineer to build and maintain large-scale on-prem LLM infrastructure. This is...  ...running on NVIDIA H200 GPU clusters and an OpenShift...  ...will manage production inference internally, including self...  ...years expertise as an LLM Systems Engineer or AI... 
    Suggested
    Local area

    TechnoGen

    Charlotte, NC
    5 days ago
  • NTT Data Americas, Inc. is looking for an On-Premise LLM Inference & GPU Systems Engineer to enhance our large-scale LLM infrastructure in Charlotte, North Carolina. The successful candidate will focus on managing NVIDIA H200 GPU clusters and optimizing runtime efficiency... 
    Suggested

    NTT Data Americas, Inc.

    Charlotte, NC
    4 days ago
  •  ...interested. Role : On-prem Platform Engineer Location : Charlotte, NC Long...  ...Must-Have Skills (Mandatory Keywords) LLM Inference & Optimization • vLLM, TensorRT-LLM...  ...o FP8, AWQ, GPTQ Distributed & GPU Systems • Tensor parallelism and large model... 
    Suggested
    Long term contract

    Ampstek

    Charlotte, NC
    2 days ago
  •  ...AI/ML Inference Engineer Major Financial Services Organization - Charlotte, NC 3 Open Roles...  ...of large language model serving, GPU infrastructure, and enterprise MLOps - delivering...  ...NVIDIA H200 GPU clusters using TensorRT-LLM, Triton Inference Server, and SGLang... 
    Suggested
    Immediate start

    Hallmark Global Solutions Ltd

    Charlotte, NC
    3 days ago
  •  ...end-users requirements. Coordinate the engineering and technical aspects of projects, including...  ..., electrical and software aspects of system integration and conducting throughput analyses...  ...such as probability and statistical inference, and fundamentals of plane and solid... 
    Flexible hours

    Murata Machinery USA Inc

    Charlotte, NC
    3 days ago
  • $119.04k - $144.4k

     ...Description U.S. Bank is seeking the position of Senior Systems Engineer in Charlotte, NC. Essential Responsibilities: The Senior...  ..., network devices, and enterprise applications across on-premises and cloud environments. The Senior Systems Engineer manages and... 
    Temporary work
    Bank staff
    Local area
    Work from home

    U.S. Bank

    Charlotte, NC
    3 days ago
  • $124k - $280k

     ...At PwC, our people in data and analytics engineering focus on leveraging advanced technologies...  ...designing and optimising algorithms, models, and systems to enable intelligent decision-making and...  ...data sources for use in AI and LLM-powered solutions Manage daily operations... 
    Full time
    H1b

    PwC

    Charlotte, NC
    2 days ago
  • $55 - $65 per hour

    AI/LLM Engineer - 18 month W2 contract, hybrid 3 days onsite / 2 days remote. Charlotte, NC...  ...Description - AI/LLM Engineer (RAG & Agentic Systems) We are seeking a highly motivated AI/...  ...techniques, prompt engineering, and inference optimization. Implement model safety... 
    Contract work
    Internship
    Local area
    Remote work

    The Matlen Silver Group, Inc.

    Charlotte, NC
    5 days ago
  • $77k - $202k

     ...At PwC, our people in data and analytics engineering focus on leveraging advanced technologies...  ...designing and optimising algorithms, models, and systems to enable intelligent decision-making and...  ...- Experience in prompt engineering for LLM outputs - Designing thorough data... 
    Full time
    H1b

    PwC

    Charlotte, NC
    3 days ago
  •  ...offer visa transfer or sponsorship now or in the future. Role Systems Operations Engineer Location Irving Texas, Charlotte, NC Mandatory skills...  ...support environments. Experience utilizing generative AI or LLM based tools to assist with diagnostics, system pattern... 
    Temporary work

    Cognizant

    Charlotte, NC
    2 days ago
  •  ...Essential Functions & Responsibilities: Industrial/Commercial Control Systems Engineering service for ISI customers in the local area, relating to their needs in the following areas: SCADA Systems, PLC Programming, CAD Drawings, Industrial Networking, Systems Calibration... 
    Contract work
    For contractors
    Local area
    Night shift

    Applied Industrial Technologies

    Charlotte, NC
    1 day ago
  •  ...The Systems Engineer will be responsible for designing, implementing, and maintaining the organization's IT infrastructure and systems to ensure optimal performance, reliability, and security. This role will support servers, networks, and cloud environments while troubleshooting... 

    Ferretti Search

    Charlotte, NC
    3 days ago
  • $80k - $90k

     ...Systems Engineer Charlotte - Headquarters - Charlotte, NC 28217 Overview Salary Range $80,000.00 - $90,000.00 Salary/year Position Type Full Time Description Systems Engineer How This Role Makes An Impact The Imagine team is a growing company, and... 
    Full time
    Work experience placement
    Work at office
    Remote work
    Night shift

    ImagineSoftware

    Charlotte, NC
    2 days ago
  • $90k

     ...Systems Engineer At NAVEX, we're transforming the world—making it safer, more ethical, and ensuring every voice is heard. That's real impact. Our high-performance culture is driven by our values. We move with speed, passion and purpose — as one team. We are bold... 
    Worldwide
    Weekend work

    NAVEX Global

    Charlotte, NC
    4 days ago
  •  ...Insight Global is seeking a Mac Systems Engineer for a large enterprise client undergoing modernization of their macOS environment. This engineer will play a critical role in validating and securing macOS endpoints by integrating Jamf Pro with Azure Active Directory,... 

    Insight Global

    Charlotte, NC
    4 days ago
  •  ...Title: System Admin/Engineer Location: Charlotte, NC (Onsite) Duration: 12 Months Responsible for matching current technology with the current needs. As part of this task, engages in the evaluation and installation of software, hardware, and other types of support... 

    Brahma Consulting Group

    Charlotte, NC
    1 day ago
  •  ...Systems Engineer The Systems Engineer must be able to script, batch, do app support at the server or system layer. Looking for experience scripting to resolve server problems. The candidate needs to have worked with SQL, and Oracle would be a huge plus. This candidate... 
    Work at office

    Software Technology Inc

    Charlotte, NC
    1 day ago
  • $87 - $88 per hour

     ...Info Systems Engineer IV New York, New York, United States $ 87.00 - 88.00 (US Dollar) Info Systems Engineer IV needs 7+ years experience Info Systems Engineer IV requires: ~ Locations: NY, NY; Charlotte, NC; Iselin, NJ Info Systems Engineer IV... 

    Global Channel Management

    Charlotte, NC
    1 day ago
  • $76.1k - $104.6k

     ...design, configuration, and operation of complete building control systems including fire, security, and other low voltage control sub-...  ...in release meeting with project field team. Performs value engineering to provide cost effective results while maintaining customer... 
    Contract work
    For contractors
    Work experience placement
    For subcontractor

    Johnson Controls

    Charlotte, NC
    5 days ago
  •  ...Job Description Insight Global is seeking a Systems Engineer II to support a large-scale IT infrastructure environment within the logistics and technology industry. This engineer will be responsible for maintaining and optimizing a heavily on-prem environment while... 

    Insight Global

    Charlotte, NC
    2 days ago
  • $68 - $73 per hour

     ...Senior Industrial Engineer – Supplier Production Systems Our client, a global energy technology and industrial manufacturing leader, is seeking a Senior Industrial Engineer – Supplier Production Systems to join their team. As a Senior Industrial Engineer, you will... 
    Hourly pay

    Manpower

    Charlotte, NC
    4 days ago
  •  ...for EV charging hardware, including coordination with internal engineering teams and external test laboratories. Regulatory Compliance...  ...Systematic Approach: Use techniques like data mining, system log analysis, EMC/debug testing, and performance testing to troubleshoot... 
    Hourly pay
    Full time
    Temporary work
    Local area

    alpitronic

    Charlotte, NC
    5 days ago
  •  ...System Engineer (with Linux & RHEL) Charlotte , NC / Jacksonville, FL (Hybrid role) """• Senior level experience Primarily with Linux • Experience with IBM AIX OS is a plus • Ability to Recommend, design, develop, and implement software and hardware solutions... 
    Flexible hours
    Weekend work

    Apex Informatics

    Charlotte, NC
    5 days ago
  •  ...Webserver System Engineer A Few Words About Us Integrated Resources, Inc is a premier staffing firm recognized as one of the tri-states most well-respected professional specialty firms. IRI has built its reputation on excellent service and integrity since its inception... 
    Weekend work

    Careers Integrated Resources Inc

    Charlotte, NC
    1 day ago
  •  ...team of high-performing business professionals and leaders in engineering, R&D, product management and business development areas at our...  ...specific skillset and experience for the following role: The Systems Engineer, reporting to the Systems Engineering Manager, is... 
    Permanent employment
    Work at office
    Immediate start
    Work visa

    Terrestrial Energy USA Inc

    Charlotte, NC
    3 days ago
  • This role requires a strong understanding of network loadbalancing solutions, along with hands on API development experience. Candidates must be proficient with Ansible and Python, specifically using the FastAPI and Django frameworks. Experience with Kubernetes or Red ...

    Experis/Manpower Group

    Charlotte, NC
    3 days ago
  • $64.5k - $129.5k

     ...follow on Carrier social media at @Carrier. Summary Conduct system calculation, product development activities and prepare test...  ...completion of the initial onboarding and training period, the Systems Engineer will be eligible for a hybrid work schedule, with the option to... 
    Full time
    Temporary work
    Local area
    Remote work
    Monday to Friday
    1 day per week

    Carrier

    Charlotte, NC
    4 days ago
  •  ...strengthening industrial maturity of global supplier manufacturing systems by improving production stability, capacity, and operational...  .... Requirement/Must Have Degree in Industrial Engineering, Manufacturing Engineering, Production Engineering, or Mechanical... 

    Cynet Systems

    Charlotte, NC
    4 days ago
  •  ...Commvault Systems Engineer (Data Protection / Backup) Employment Type: Full-Time, Experienced CGS is seeking an experienced Commvault Data Protection Engineer with extensive knowledge and experience in designing, developing, configuring, implementing, testing, troubleshooting... 
    Full time
    Flexible hours

    Contact Government Services LLC

    Charlotte, NC
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to On-Premise LLM Inference & GPU Systems Engineer. Be the first to apply!