On-Premise LLM Inference & GPU Systems Engineer
The Nippon Telegraph and Telephone Corporation (NTT)
Company Overview:
Req ID: 372211
NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now.
We are currently seeking a On-Premise LLM Inference & GPU Systems Engineer to join our team in Charlotte, North Carolina (US-NC), United States (US).
- Role Overview We are seeking an AI Infrastructure Runtime Engineer to build and maintain large-scale on-prem LLM infrastructure. This is an enterprise private GenAI environment running on NVIDIA H200 GPU clusters and an OpenShift AI deployment ecosystem. You will manage production inference internally, including self-hosting open-source LLMs like Llama. We are focused exclusively on inferencing; this role involves no model training infrastructure or fine-tuning pipelines.
- NVIDIA GPU Runtime Optimization: Drive extreme runtime efficiency and optimization for the token generation pipeline. Specifically manage prefill/decode optimization and KV cache management.
- Inference Serving: Deploy and manage inference engines including vLLM and TensorRT-LLM.
- Hardware Utilization: Optimize GPU throughput tuning, batching strategies, and latency optimization. Manage workload orchestration using RunAI and Kubernetes GPU orchestration.
- Model Lifecycle Management: Oversee the complete Hugging Face model lifecycle, including model onboarding, deployment, and retirement.
- Platform Operations: Operate and maintain the OpenShift AI ecosystem as the primary container platform for GenAI workloads.
- 5+ years expertise as an LLM Systems Engineer or AI Infrastructure Runtime Engineer.
- 5+ years hands-on experience with NVIDIA H200 clusters and runtime optimization techniques (KV Cache, prefill/decode).
- 3+ years experience in OpenShift AI and GPU orchestration tools like RunAI.
- Strong experience with modern inference frameworks, specifically vLLM and TensorRT-LLM.
- Proven track record managing the Hugging Face deployment lifecycle.
About NTT DATA: NTT DATA is a $30 billion trusted global innovator of business and technology services. We serve 75% of the Fortune Global 100 and are committed to helping clients innovate, optimize and transform for long term success. As a Global Top Employer, we have diverse experts in more than 50 countries and a robust partner ecosystem of established and start-up companies. Our services include business and technology consulting, data and artificial intelligence, industry solutions, as well as the development, implementation and management of applications, infrastructure and connectivity. We are one of the leading providers of digital and AI infrastructure in the world. NTT DATA is a part of NTT Group, which invests over $3.6 billion each year in R&D to help organizations and society move confidently and sustainably into the digital future. Visit us at us.nttdata.com NTT DATA endeavors to make accessible to any and all users. If you would like to contact us regarding the accessibility of our website or need assistance completing the application process, please contact us at This contact information is for accommodation requests only and cannot be used to inquire about the status of applications. NTT DATA is an equal opportunity employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability or protected veteran status. For our EEO Policy Statement, please click here. If you'd like more information on your EEO rights under the law, please click here. For Pay Transparency information, please click here.
- ...Infrastructure Runtime Engineer Local candidates only... ...large-scale on-prem LLM infrastructure. This is... ...running on NVIDIA H200 GPU clusters and an OpenShift... ...will manage production inference internally, including self... ...expertise as an LLM Systems Engineer or AI Infrastructure...SuggestedLocal area
- ...interested. Role : On-prem Platform Engineer Location : Charlotte, NC Long... ...Must-Have Skills (Mandatory Keywords) LLM Inference & Optimization • vLLM, TensorRT-LLM... ...o FP8, AWQ, GPTQ Distributed & GPU Systems • Tensor parallelism and large model...SuggestedLong term contract
- ...end-users requirements. Coordinate the engineering and technical aspects of projects, including... ..., electrical and software aspects of system integration and conducting throughput analyses... ...such as probability and statistical inference, and fundamentals of plane and solid...SuggestedFlexible hours
- ...AI/ML Inference Engineer Major Financial Services Organization - Charlotte, NC 3 Open Roles... ...of large language model serving, GPU infrastructure, and enterprise MLOps - delivering... ...NVIDIA H200 GPU clusters using TensorRT-LLM, Triton Inference Server, and SGLang...SuggestedImmediate start
$119.04k - $144.4k
...Senior Systems Engineer U.S. Bank is seeking the position of Senior Systems Engineer in Charlotte, NC. Essential Responsibilities:... ...servers, network devices, and enterprise applications across on-premises and cloud environments. The Senior Systems Engineer manages and...SuggestedTemporary workBank staffWork from home$55 - $65 per hour
AI/LLM Engineer - 18 month W2 contract, hybrid 3 days onsite / 2 days remote. Charlotte, NC... ...Description - AI/LLM Engineer (RAG & Agentic Systems) We are seeking a highly motivated AI/... ...techniques, prompt engineering, and inference optimization. Implement model safety...Contract workInternshipLocal areaRemote work$124k - $280k
...At PwC, our people in data and analytics engineering focus on leveraging advanced technologies... ...designing and optimising algorithms, models, and systems to enable intelligent decision-making and... ...data sources for use in AI and LLM-powered solutions Manage daily operations...Full timeH1b$77k - $202k
...At PwC, our people in data and analytics engineering focus on leveraging advanced technologies... ...designing and optimising algorithms, models, and systems to enable intelligent decision-making and... ...- Experience in prompt engineering for LLM outputs - Designing thorough data...Full timeH1b- ...offer visa transfer or sponsorship now or in the future. Role Systems Operations Engineer Location Irving Texas, Charlotte, NC Mandatory skills... ...support environments. Experience utilizing generative AI or LLM based tools to assist with diagnostics, system pattern...Temporary work
- ...System Engineer (with Linux & RHEL) Charlotte , NC / Jacksonville, FL (Hybrid role) """• Senior level experience Primarily with Linux • Experience with IBM AIX OS is a plus • Ability to Recommend, design, develop, and implement software and hardware solutions...Flexible hoursWeekend work
- ...for EV charging hardware, including coordination with internal engineering teams and external test laboratories. Regulatory Compliance... ...Systematic Approach: Use techniques like data mining, system log analysis, EMC/debug testing, and performance testing to troubleshoot...Hourly payFull timeTemporary workLocal area
- ...Webserver System Engineer A Few Words About Us Integrated Resources, Inc is a premier staffing firm recognized as one of the tri-states most well-respected professional specialty firms. IRI has built its reputation on excellent service and integrity since its inception...Weekend work
- ...Essential Functions & Responsibilities: Industrial/Commercial Control Systems Engineering service for ISI customers in the local area, relating to their needs in the following areas: SCADA Systems, PLC Programming, CAD Drawings, Industrial Networking, Systems Calibration...Contract workFor contractorsLocal areaNight shift
- ...The Systems Engineer will be responsible for designing, implementing, and maintaining the organization's IT infrastructure and systems to ensure optimal performance, reliability, and security. This role will support servers, networks, and cloud environments while troubleshooting...
$64.5k - $129.5k
...follow on Carrier social media at @Carrier. Summary Conduct system calculation, product development activities and prepare test... ...completion of the initial onboarding and training period, the Systems Engineer will be eligible for a hybrid work schedule, with the option to...Full timeTemporary workLocal areaRemote workMonday to Friday1 day per week- ...team of high-performing business professionals and leaders in engineering, R&D, product management and business development areas at our... ...specific skillset and experience for the following role: The Systems Engineer, reporting to the Systems Engineering Manager, is...Permanent employmentWork at officeImmediate startWork visa
- This role requires a strong understanding of network loadbalancing solutions, along with hands on API development experience. Candidates must be proficient with Ansible and Python, specifically using the FastAPI and Django frameworks. Experience with Kubernetes or Red ...
- ...Title: System Admin/Engineer Location: Charlotte, NC (Onsite) Duration: 12 Months Responsible for matching current technology with the current needs. As part of this task, engages in the evaluation and installation of software, hardware, and other types of support...
- ...Systems Engineer The Systems Engineer must be able to script, batch, do app support at the server or system layer. Looking for experience scripting to resolve server problems. The candidate needs to have worked with SQL, and Oracle would be a huge plus. This candidate...Work at office
$76.1k - $104.6k
...design, configuration, and operation of complete building control systems including fire, security, and other low voltage control sub-... ...in release meeting with project field team. Performs value engineering to provide cost effective results while maintaining customer...Contract workFor contractorsWork experience placementFor subcontractor$87 - $88 per hour
...Info Systems Engineer IV New York, New York, United States $ 87.00 - 88.00 (US Dollar) Info Systems Engineer IV needs 7+ years experience Info Systems Engineer IV requires: ~ Locations: NY, NY; Charlotte, NC; Iselin, NJ Info Systems Engineer IV...$90k
...Systems Engineer At NAVEX, we're transforming the world—making it safer, more ethical, and ensuring every voice is heard. That's real impact. Our high-performance culture is driven by our values. We move with speed, passion and purpose — as one team. We are bold...WorldwideWeekend work$80k - $90k
...Systems Engineer Charlotte - Headquarters - Charlotte, NC 28217 Overview Salary Range $80,000.00 - $90,000.00 Salary/year Position Type Full Time Description Systems Engineer How This Role Makes An Impact The Imagine team is a growing company, and...Full timeWork experience placementWork at officeRemote workNight shift- ...Insight Global is seeking a Mac Systems Engineer for a large enterprise client undergoing modernization of their macOS environment. This engineer will play a critical role in validating and securing macOS endpoints by integrating Jamf Pro with Azure Active Directory,...
$105.4k - $124k
...career. Try new things, learn new skills and discover what you excel at—all from Day One. Job Description The Nexthink Systems Engineer creates and integrates solutions and services. The Nexthink Systems Engineer directs or recommends enhancements for system performance...Temporary workWork experience placementLocal areaRemote work3 days per week$68 - $73 per hour
...Our client, a global energy technology and industrial manufacturing leader, is seeking a Senior Industrial Engineer - Supplier Production Systems to join their team. As a Senior Industrial Engineer, you will be part of the Strategic Procurement / Supplier Development...Hourly pay$75k - $90k
...world leader in the field of professional mobile communications systems with an impressive heritage of technological innovations and a... ...the following: IP Networking, CAD, System Management, Network Engineering, Networking Equipment, Solution Architecture, ASTRO 25, WAVE...Contract workRelocation- ...Job Description Insight Global is seeking a Systems Engineer II to support a large-scale IT infrastructure environment within the logistics and technology industry. This engineer will be responsible for maintaining and optimizing a heavily on-prem environment while...
- ...Commvault Systems Engineer (Data Protection / Backup) Employment Type: Full-Time, Experienced CGS is seeking an experienced Commvault Data Protection Engineer with extensive knowledge and experience in designing, developing, configuring, implementing, testing, troubleshooting...Full timeFlexible hours
- ...strengthening industrial maturity of global supplier manufacturing systems by improving production stability, capacity, and operational... .... Requirement/Must Have Degree in Industrial Engineering, Manufacturing Engineering, Production Engineering, or Mechanical...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to On-Premise LLM Inference & GPU Systems Engineer. Be the first to apply!
- healthcare systems engineer Charlotte, NC
- wireless systems engineer Charlotte, NC
- unix linux systems engineer Charlotte, NC
- systems engineer Charlotte, NC
- ground systems engineer Charlotte, NC
- operations support system engineer Charlotte, NC
- digital communications systems engineer Charlotte, NC
- data systems engineer Charlotte, NC
- sr systems engineer Charlotte, NC
- entry level systems engineer Charlotte, NC

