Sr. Staff AI Engineer - On-Prem AI Infrastructure & Agentic Systems
$140k - $165k
SK hynix memory solutions America Inc.
About the Company:
At SK Hynix Memory Solution, we're at the forefront of semiconductor innovation, developing advanced memory solutions that power everything from smartphones to data centers. As a global leader in DRAM and NAND flash technologies, we advance mobile technology, empower cloud computing, and pioneer future technologies. Our memory technologies are essential in today's most advanced electronic devices and IT infrastructure, enabling enhanced performance and user experiences across the digital landscape.

We're looking for innovative minds to join our mission of shaping the future of technology. At SK Hynix Memory, you'll be part of a team that's pioneering breakthrough memory solutions while maintaining a strong commitment to sustainability. We're not just adapting to technological change; we're driving it, with significant investments in artificial intelligence, machine learning, and eco-friendly solutions and operational practices. As we continue to expand our market presence and push the boundaries of what's possible in semiconductor technology, we invite you to join us in creating the next generation of memory solutions that will define the future of computing.

Why Join Us?
- Build foundational AI infrastructure that powers next-gen enterprise systems.
- Work on cutting-edge agentic AI: not just chatbots, but autonomous systems that reason, plan, and act.
- Influence AI strategy, deployment, and governance in a high-impact environment.

About the Role:
We are seeking a hands-on AI Engineer to design, deploy, and maintain on-prem AI infrastructure and build agentic AI systems that drive real-world automation. You'll be responsible for setting up scalable AI environments, implementing RAG pipelines, fine-tuning embedded models, and architecting AI agents that operate autonomously in enterprise settings.
This role sits at the intersection of AI systems engineering and applied ML: you'll bridge infrastructure, model deployment, and agent logic.

Responsibilities:
- Design and deploy on-prem AI infrastructure, including GPU clusters, model serving (e.g., vLLM, TGI, Triton), vector DBs (e.g., Milvus, Qdrant, FAISS), and orchestration (Kubernetes, Helm, Docker).
- Build and optimize RAG pipelines, including document chunking, retrieval strategies (hybrid, re-ranking), and evaluation of retrieval accuracy and latency.
- Develop agentic AI systems: design stateful agents with memory, tool use, and planning capabilities (e.g., using LangGraph, AutoGen, or custom frameworks).
- Fine-tune and deploy embedded models: work with LoRA, QLoRA, or full fine-tuning for domain-specific tasks; optimize for edge/on-device inference.
- Implement Model Control Protocols (MCP): ensure model governance, versioning, access control, and monitoring for production AI systems.
- Collaborate with product and engineering teams to integrate AI capabilities into enterprise workflows, especially in storage, QA, or systems engineering contexts.
- Automate and monitor AI pipelines: build CI/CD for model deployment, logging, and performance tracking.

Minimum Qualifications:
- 2+ years of experience in AI/ML engineering, with hands-on deployment of AI systems on-prem or in a private cloud.
- Proven experience building agentic AI systems, including state management, tool integration, and multi-step reasoning.
- Strong working knowledge of RAG architectures: chunking, retrieval, re-ranking, evaluation metrics.
- Experience with model fine-tuning (LoRA, QLoRA, full fine-tuning) and embedding models for retrieval.
- Familiarity with Model Control Protocols (MCP) or similar governance frameworks (model versioning, access control, audit trails).
- Proficiency in Python, Linux, Docker/Kubernetes, and vector databases (e.g., Milvus, Qdrant, Pinecone).
- Experience with AI serving frameworks (vLLM, TGI, Triton, Ollama, etc.).
Preferred Qualifications:
- Experience deploying AI in enterprise storage or hardware-adjacent environments.
- Background in systems engineering or QA automation; bonus if you've used AI to automate testing or validation.
- Familiarity with embedded AI or edge inference (ONNX, TensorRT, GGUF, etc.).
- Experience with AI agent frameworks (LangGraph, AutoGen, BabyAGI, etc.).
- Knowledge of AI observability tools (LangSmith, Weights & Biases, Prometheus/Grafana for AI).
- As a storage company, we consider knowledge of the storage area/NVMe a plus.

Education Requirement:
Bachelor of Science in CS, EE, ME, or other applicable Engineering field.

COMPENSATION: $140,000/yr - $165,000/yr
REGARDING COMPENSATION:
SK hynix memory solutions America Inc. offers you the opportunity to apply your skills to exciting projects while working with innovative teams. Our compensation package is complemented by a generous benefits package including medical, dental, vision, life insurance, and a company 401(k) match, as well as a cafeteria, onsite gym, and much more. If you are motivated by technical challenges, we offer a collaborative work environment that encourages career growth. The salary offered to a selected candidate will be tailored based on several factors, including location, job grade, relevant knowledge, skills, and experience. We also take into account internal equity among our current team members to ensure fairness and competitiveness.


