Senior AI Infrastructure Engineer: GPU Clusters & LLM Ops
$136.5k - $253.5kCadence
Cadence is seeking a highly skilled AI Systems Engineer to join their team in San Jose, CA. This hands-on, senior role will lead the AI infrastructure development, including architecting high-performance GPU clusters and deploying advanced AI models. Ideal candidates will have over 10 years of experience in technical roles, specifically in AI infrastructure management and cloud services integration. Competitive compensation includes an annual salary range of $136,500 to $253,500, plus bonuses and benefits such as paid vacation and 401(k) plan with employer match. #J-18808-Ljbffr Cadence
$152k - $287.5k
A leading technology company is seeking a Senior Software Engineer to develop solutions for GPU clusters aimed at enhancing machine learning innovation. The ideal... ...engineering with significant involvement in ML infrastructure, strong coding skills in Python, C++, or Rust,...Senior- Cadence in San Jose, California is seeking a skilled AI Systems Engineer to lead the development and support of AI infrastructure. The role requires managing GPU clusters, deploying advanced AI models, and utilizing public cloud services. Ideal candidates have over 10 years...Senior
- ...Senior AI/ML DevOps Engineer Join Cisco's CX AI Incubation Team as a Senior AI/... ...Engineer and help productionize LLM/SLM capabilities for... ...intelligence, network automation, infrastructure testing, and intelligence... ...small GPUs to large multi-GPU servers, including air-...Senior
$250.8k - $286.2k
Capital One National Association in San Jose is seeking a Senior Lead AI Engineer to design, develop, and support AI-powered products that enhance customer interactions. The ideal candidate has a strong foundation in programming, extensive experience in AI algorithms, and...Senior$200k - $400k
A dedicated research lab is seeking a Network Engineer to design and optimize low-latency, high-bandwidth networking solutions for AI supercomputing clusters. You will work on cutting-edge technologies in collaboration with world-class researchers. The ideal candidate has...Senior$272k - $431.25k
NVIDIA Corporation seeks a Principal AI and ML Infra Software Engineer in Santa Clara, California, to... ...the efficiency of AI/ML research on GPU Clusters. The role involves collaboration with various teams, monitoring infrastructure performance, and implementing improvements...$124.09k - $210k
...Senior AI Data Infrastructure Engineer Santa Clara, CA XPENG is a leading smart technology company at the forefront of innovation, integrating... ...throughput for large-scale training on 10,000+ GPU clusters. Infrastructure Evolution: Support the seamless transition...SeniorFull timeWork experience placement$184k - $287.5k
...NVIDIA's DGX Cloud AI Efficiency Team... ...contributing to the infrastructure that powers our innovative... ...software engineer to join our team.... ...systems. As a senior DGX Cloud AI Infrastructure... ...the large scale clusters Experience in... .... The GPU, our invention, serves...Senior- A financial services company is seeking a Distinguished AI Engineer in San Jose to develop and support AI-powered products that enhance customer interactions. The ideal candidate will have 8+ years of experience in AI development, expertise in programming languages like...Senior
- IBM Computing in San Jose is looking for an experienced software engineer to design and maintain scalable AI platforms that support various engineering teams. Your role will involve developing core services, engineering autonomous AI workflows, and collaborating on tooling...Senior
- ...Introduction We are hiring a senior engineer to design and deliver a BYOC (Bring Your... ...is a strong plus), with a focus on GPU-enabled infrastructure. This role will lead architecture and... ...optimization. • Experience with multi-cluster/multi-region platform design. •...Senior
$176k - $276k
...and Visualization. The GPU, our invention, serves... ...team of innovative engineers who develop and maintain... ...maintaining large GPU clusters interconnected via NVLink... ...switches, and related infrastructure. ~ Automation expert... .... NVIDIA uses AI tools in its recruiting...Senior- NVIDIA Corporation is seeking a Senior Software Engineer to join its DGX Cloud Production Engineering... ...operational systems for large-scale GPU clusters, ensuring reliability and... ...8 years of experience in production infrastructure along with strong programming skills...Senior
$163.5k - $212.4k
NIO is seeking a Senior AI Inference Infrastructure Software Engineer in San Jose, CA, specializing in building scalable inference systems for large language and... ...and strong skills in performance optimization and GPU programming. The position offers a competitive salary...Senior$163.5k - $212.4k
...dependability. Partner with engineering teams to understand... ...and development for AI model training, and/or... ...Experience with cloud infrastructure and training (Azure, AWS... ...etc) Familiar with LLM and optimization on resource... ..., CPU / MPU, GPU / NPU. Solid understanding...SeniorFull timeTemporary workFlexible hours$209k
...pipelines for data preprocessing, feature engineering, and dataset versioning. •... ...and maintaining the high-performance LLM training GPU infrastructure and cluster. • Optimize GPU utilization for... ...fault-tolerant, and resource-efficient AI workloads across multi-node GPU...SeniorWork at officeRemote work1 day per week$191k - $315k
...Senior Staff AI Engineer, Network Growth AI LinkedIn is the world's largest professional network... ...techniques including Sequence Modeling, LLM, EBR, GNN, etc, and we're continuing... ...experience with large scale ML data infrastructure Experience with developing and...SeniorFor contractorsWork at officeFlexible hours- A leading technology company is seeking a Senior System Software Engineer to develop GPU-accelerated AI inference serving software. The ideal candidate will have over 5 years of experience with deep learning software, strong skills in Rust and C++, and a collaborative approach...Senior
- Advanced Micro Devices is seeking a principal software developer to join the ROCm GPU-compute team in Santa Clara, California. The ideal candidate will have over 10 years of software development experience in C/C++, Python, and GPU technologies. This role involves developing...Senior
$184k - $356.5k
NVIDIA Corporation is seeking a Senior Deep Learning Software Engineer specializing in Inference to join their growing team in Santa Clara, CA. The role involves optimizing GPU-accelerated software for advanced AI applications, including developing high-performance deep...Senior$229.9k - $262.4k
...Senior Lead AI Engineer (LLM Gateway, FM Hosting) Overview: At Capital One, we are creating responsible and reliable AI systems, changing... ...customer experiences. Our investments in technology infrastructure and world-class talent — along with our deep experience...SeniorFull timePart timeLocal area$272k - $431.25k
...Principal Ai And Ml Infra Software Engineer, Gpu Clusters We are seeking a Principal AI and ML Infra Software Engineer, GPU Clusters at NVIDIA to join our Hardware Infrastructure team. As an Engineer, you will have a pivotal role in enhancing efficiency for our researchers...$314.8k - $359.3k
...Sr. Distinguished AI Engineer (Agentic AI Platform) Overview... ...in technology infrastructure and world-class talent... ...Staff, Principal and Senior engineers, authoring technical... ...or technologies (e.g. LLM Inference, Similarity... ...mastery (multi-region clusters, service mesh) ~...SeniorFull timePart timeWork at officeLocal area- Advanced Micro Devices, Inc. is seeking a Senior Staff Software Developer who will play a pivotal role in shaping the future of AI and improving performance in key applications.... ...expertise in high-performance C++ programming and GPU technologies, with experience optimizing AI...Senior
$168k - $270.25k
...technical, creative, and Senior AI Platform Engineer to build, support,... ...and lead AI-native infrastructure roadmaps and cross-organizational... ...Architect and scale LLM/ML infrastructure across cloud-native clusters and on-premises... ..., model serving, and GPU-accelerated...$229.9k - $262.4k
Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI) Overview: At... .... Our investments in technology infrastructure and world-class talent - along with our... ...Invent and introduce state-of-the-art LLM optimization techniques to improve the...SeniorFull timePart timeLocal area$152k - $241.5k
...NVIDIA seeks a senior software engineer to join the AI Networking co-design and benchmark... ...workloads across large GPU and CPU clusters, thereby ensuring the... ..., particularly within LLM training and inference... ...opportunity to support the core infrastructure powering the next...Senior- A technology firm specializing in AI solutions is seeking a Senior AI Engineer in Santa Clara, California. You will design and implement AI-powered software... ...in software engineering and a strong background in LLM-powered systems, demonstrating the ability to deliver projects...Senior
$152k - $241.5k
...Performance Computing and Visualization. The GPU, our invention, serves as the visual... ...We are looking for highly motivated Senior Software Engineers to work on our GPU Fabric Networking team... ...existing vacancy. NVIDIA uses AI tools in its recruiting processes....SeniorRemote work- ...in Sunnyvale, California, is seeking an experienced MongoDB Database Administrator. You will provide onsite administration, create clusters, and manage database environments with a strong focus on MongoDB, especially in cloud setups. Candidates with over 8 years of...Senior
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior AI Infrastructure Engineer: GPU Clusters & LLM Ops. Be the first to apply!
- machine learning ai engineer San Jose, CA
- senior ai engineer San Jose, CA
- ai engineer remote San Jose, CA
- ai ml engineer San Jose, CA
- ai engineer San Jose, CA
- ai developer San Jose, CA
- ai research engineer San Jose, CA
- ai prompt engineer San Jose, CA
- data infrastructure engineer San Jose, CA
- infrastructure engineering manager San Jose, CA

