Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Sr. Staff AI Engineer - On-Prem AI Infrastructure & Agentic Systems

$140k - $165k
Full-time

SK hynix memory solutions America Inc.

About the Company: At SK Hynix Memory Solution, we're at the forefront of semiconductor innovation, developing advanced memory solutions that power everything from smartphones to data centers. As a global leader in DRAM and NAND flash technologies, we drive the evolution of advancing mobile technology, empowering cloud computing, and pioneering future technologies. Our cutting-edge memory technologies are essential in today's most advanced electronic devices and IT infrastructure, enabling enhanced performance and user experiences across the digital landscape. We're looking for innovative minds to join our mission of shaping the future of technology. At SK Hynix Memory, you'll be part of a team that's pioneering breakthrough memory solutions while maintaining a strong commitment to sustainability. We're not just adapting to technological change – we're driving it, with significant investments in artificial intelligence, machine learning, and eco-friendly solutions and operational practices. As we continue to expand our market presence and push the boundaries of what's possible in semiconductor technology, we invite you to be part of our journey to creating the next generation of memory solutions that will define the future of computing. Why Join Us? Build foundational AI infrastructure that powers next-gen enterprise systems. Work on cutting-edge agentic AI — not just chatbots, but autonomous systems that reason, plan, and act. Opportunity to influence AI strategy, deployment, and governance in a high-impact environment. About the Role: We are seeking a hands-on AI Engineer to design, deploy, and maintain on-prem AI infrastructure and build agentic AI systems that drive real-world automation. You’ll be responsible for setting up scalable AI environments, implementing RAG pipelines, fine-tuning embedded models, and architecting AI agents that operate autonomously in enterprise settings. This role sits at the intersection of AI systems engineering and applied ML — you’ll bridge infrastructure, model deployment, and agent logic. Responsibilities: Design and deploy on-prem AI infrastructure — including GPU clusters, model serving (e.g., vLLM, TGI, Triton), vector DBs (e.g., Milvus, Qdrant, FAISS), and orchestration (Kubernetes, Helm, Docker). Build and optimize RAG pipelines — including document chunking, retrieval strategies (hybrid, re-ranking), and evaluation of retrieval accuracy and latency. Develop agentic AI systems — design stateful agents with memory, tool use, and planning capabilities (e.g., using LangGraph, AutoGen, or custom frameworks). Fine-tune and deploy embedded models — work with LoRA, QLoRA, or full fine-tuning for domain-specific tasks; optimize for edge/on-device inference. Implement Model Control Protocols (MCP) — ensure model governance, versioning, access control, and monitoring for production AI systems. Collaborate with product and engineering teams to integrate AI capabilities into enterprise workflows — especially in storage, QA, or systems engineering contexts. Automate and monitor AI pipelines — build CI/CD for model deployment, logging, and performance tracking. Minimum Qualifications: 2+ years of experience in AI/ML engineering, with hands-on deployment of AI systems on-prem or private cloud. Proven experience building agentic AI systems — including state management, tool integration, and multi-step reasoning. Strong working knowledge of RAG architectures — chunking, retrieval, re-ranking, evaluation metrics. Experience with model fine-tuning (LoRA, QLoRA, full fine-tuning) and embedding models for retrieval. Familiarity with Model Control Protocols (MCP) or similar governance frameworks (model versioning, access control, audit trails). Proficiency in Python, Linux, Docker/Kubernetes, and vector databases (e.g., Milvus, Qdrant, Pinecone). Experience with AI serving frameworks (vLLM, TGI, Triton, Ollama, etc.). Preferred Qualifications: Experience deploying AI in enterprise storage or hardware-adjacent environments. Background in systems engineering or QA automation — bonus if you’ve used AI to automate testing or validation. Familiarity with embedded AI or edge inference (ONNX, TensorRT, GGUF, etc.). Experience with AI agent frameworks (LangGraph, AutoGen, BabyAGI, etc.). Knowledge of AI observability tools (LangSmith, Weights & Biases, Prometheus/Grafana for AI). As a Storage company, knowledge of storage area/NVMe is a PLUS. Education Requirement: Bachelor of Science in CS, EE, ME, or other applicable Engineering field. COMPENSATION: $140,000/yr - $165,000/yr

REGARDING COMPENSATION:

SK hynix memory solutions America Inc. offers you the opportunity to apply your skills to exciting projects while working with innovative teams. Our compensation package is complimented by a generous benefits package including medical, dental, vision, life insurance and a company 401(k) match, as well as cafeteria, onsite gym and much more. If you are motivated by technical challenges, we offer a collaborative work environment that encourages career growth. The salary offered to a selected candidate will be tailored based on several factors, including the location, job grade, relevant knowledge, skills, and experience. We also take into account the internal equity among our current team members to ensure fairness and competitiveness

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Sr. Staff AI Engineer - On-Prem AI Infrastructure & Agentic Systems in San Jose, CA vacancy
  • Applied Materials is hiring an Agentic AI Systems Engineer in Santa Clara, CA. This role involves designing the infrastructure for GenAI applications, bridging AI and software needs. Candidates must have 7+ years of experience with a strong proficiency in programming languages... 
    Suggested

    Applied Materials

    Santa Clara, CA
    13 hours ago
  • $262k - $365k

    Google Inc. seeks a Senior Staff Software Engineer for AI Infrastructure within Google Cloud. This role involves architecting...  ...performance, distributed infrastructure for agentic AI workflows, with responsibilities including system reliability and transitioning... 
    Senior

    Google Inc.

    Sunnyvale, CA
    1 day ago
  • $152k - $208.5k

     ...leader in materials engineering solutions used to...  ...our world - like AI and IoT. If you want...  ...in intricate systems, deciphering code,...  ...display sectors. Role: Agentic AI Systems...  ...building the agentic infrastructure that enables the next...  ...on exposure to on‑prem LLM deployment and... 
    Suggested
    Full time
    Relocation

    Applied Materials

    Santa Clara, CA
    13 hours ago
  • $207k - $300k

    Google Inc. is looking for a Staff Software Engineer for AI and Infrastructure to contribute to Google Cloud's mission. The ideal candidate will have deep experience...  ...include designing and implementing computer systems, collaborating on impactful projects, and providing technical... 
    Suggested

    Google Inc.

    Sunnyvale, CA
    13 hours ago
  • $287.8k - $328.5k

     ...Distinguished AI Engineer (Agentic AI Platform Infrastructure) At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in using machine learning to create real-time, personalized... 
    Suggested
    Local area

    Capital One National Association

    San Jose, CA
    3 days ago
  • ## Sr AI/Agentic EngineerApplylocations: On-Site - Los Angeles...  ...Life**The Senior AI Engineer will deliver the...  ...lead, and the Senior Staff Engineer, AI. You will...  ..., retrieval systems, and borrower- and operator...  ...evaluation harness, retrieval infrastructure, and the inference... 
    Senior
    Full time
    Work at office
    Local area
    Shift work

    Lendistry, LLC.

    Santa Clara, CA
    2 days ago
  • $176.1k - $308.2k

     ...all started when engineer Fred Luddy wrote code...  ...ServiceNow is the AI control tower for...  ...the TeamThe Agentic Engineering organization...  ...in this role:As a Staff AI Engineer, you...  ...agent harness — the infrastructure layer that enables...  ...prompt management systems — versioning,... 
    Full time
    Work at office
    Immediate start
    Remote work
    Flexible hours

    ServiceNow

    Santa Clara, CA
    1 day ago
  •  ...Gruppe in Santa Clara seeks a Software Engineer to join the Managed AI Research Superclusters team. You'll design and operate cutting-edge infrastructure to enable AI research, collaborating...  ...years of experience in distributed systems, excellent programming skills in C++... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $229.9k - $262.4k

     ...Senior Lead AI Engineer(MLX, Agentic AI, Gen AI platform Services) Overview At...  ...responsible and reliable AI systems, changing banking for good....  ...investments in technology infrastructure and world‑class talent — along...  ...: $229,900 - $262,400 for Sr. Lead AI Engineer;... 
    Senior
    Local area

    Capital One National Association

    San Jose, CA
    4 days ago
  •  ...looking for a Senior Software Engineer to help build NeMo Platform,...  ..., deploying, and operating AI systems at scale. This role will...  ...workflows, teams need practical infrastructure for observing behavior,...  ...is increasingly focused on agentic development and automated agent... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • Apple Inc. in Sunnyvale is seeking an experienced engineer to design and implement cutting-edge agentic systems leveraging large language models (LLMs). This role focuses...  ...of agentic systems, strong proficiency with AI coding, and the ability to design end-to-end solutions... 

    Apple Inc.

    Sunnyvale, CA
    3 days ago
  • $174k - $252k

    Senior Software Engineer, Infrastructure, CoreOS Agentic Engineering Sunnyvale, CA, USA Mid Experience driving progress...  ...-scale infrastructure, distributed systems or networks, or experience with...  ...role. Experience with either AI or Linux kernel development. Experience... 
    Senior
    Full time
    Worldwide

    Google Inc.

    Sunnyvale, CA
    2 days ago
  • $181.1k - $318.4k

     ...United States Machine Learning and AI Apple is where individual...  ...something! Description As a Senior/Staff Engineer on the Foundation Model Compute Infrastructure team, you will lead the design...  ...of scheduling and orchestration systems for large‑scale TPU workloads across... 
    Senior
    Relocation

    Apple

    Santa Clara, CA
    3 days ago
  • Build and Deploy AI the right way, anywhere. The FlexAI Compute Infrastructure Platform provides an "end...  ...public, hybrid, or on-prem). It brings together...  ...inference workloads. As a Staff AI Runtime Engineer , you’ll play a...  ...role - perfect for a systems‑minded software engineer... 
    Work at office

    FlexAI

    Santa Clara, CA
    13 hours ago
  • $314.8k - $359.3k

    Senior. Distinguished AI Engineer (Agentic AI Platform) At...  ...responsible and reliable AI systems, changing banking for...  ...in technology infrastructure and world-class talent...  ...office hours, mentoring Staff, Principal and Senior...  ...4,800 - $359,300 for Sr. Distinguished AI Engineer... 
    Senior
    Full time
    Part time
    Work at office
    Local area

    Capital One

    San Jose, CA
    3 days ago
  • $126k - $423k

    Decisive Point is seeking a Research Engineer (AI/RL Infrastructure) in Sunnyvale, California to design and operate large-scale ML systems. You will collaborate with leading experts and contribute to next-generation physical AI, impacting self-driving technologies. This... 
    Senior

    Decisive Point

    Sunnyvale, CA
    1 day ago
  •  ...Corporation in Santa Clara is seeking a Senior Software Engineer to lead the optimization of large-scale AI systems. This role will involve profiling and tuning...  ...have over 8 years of experience in software infrastructure for AI systems, with expert-level programming in... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • $269.1k - $307.2k

     ...Distinguished AI Engineer (Agentic AI Platform) At Capital One, we are...  ...responsible and reliable AI systems, changing banking for good....  ...investments in technology infrastructure and world-class talent - along...  ...office hours, mentoring Staff, Principal and Senior engineers... 
    Full time
    Part time
    Work at office
    Local area

    Capital One Financial Corp

    San Jose, CA
    3 days ago
  • $224k - $356.5k

    NVIDIA is hiring engineers to scale up the introduction...  ...into its EDA Infrastructure. We expect you to have...  ...introductions (NPIs), distributed systems, familiarity with...  ...between cloud and on-prem Providing expertise in...  ...the crowd Developing ML/AI infrastructure. Developing... 

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  •  ...deeply technical, creative, and Senior AI Platform Engineer to build, support, and maintain the...  ...future of our organization, ensuring our systems are scalable, reliable, and ready for...  ...be doing: Define and lead AI-native infrastructure roadmaps and cross‑organizational... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $181.1k - $318.4k

    Senior Software Engineer, On-Device Health Agentic Systems Cupertino, California, United States Hardware We are seeking a senior iOS Software Engineer with...  ...that serve as the primary interface for on‑device AI. A key part of your role will be to explore and define the... 
    Senior
    Relocation

    Apple

    Cupertino, CA
    4 days ago
  •  ...Corporation is seeking a Senior Software Engineer for its Generative AI Research team in Santa Clara,...  ...you will design and operate scalable infrastructure to support large-scale AI models. With a strong focus on distributed systems and data pipelines, you will work collaboratively... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  • $152k - $241.5k

     ...redefining how software is built in the age of Generative AI? Join NVIDIA’s TensorRT team to help lead a first...  ...software at an unprecedented scale. If you are a systems‑thinking C++ engineer who wants to help scale out an agentic development framework, stay on top of state‑of‑... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $218.8k - $335.3k

     ...unique demands of AI and ML innovation,...  ...productivity of ML engineers, and drive the...  ...techniques. Our ML infrastructure includes: AI Validation...  ...cloud and on‑prem compute resources....  ..., these tools and systems empower GM to tackle...  ...Overview: As a Staff AI/ML Full‑Stack Engineer... 
    Flexible hours

    General Motors

    Sunnyvale, CA
    1 day ago
  • $272k - $431.25k

     ...people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our...  ...on the world. At NVIDIA, as a Principal Rack Scale Systems Infrastructure Engineer, you will build and guide the development of software systems... 
    Shift work

    Jobleads-US

    Santa Clara, CA
    1 day ago
  • $148.7k - $297.3k

     ...routine fingersticks. Principal AI/ML Engineer Location: Santa Clara, CA....  ...secure, scalable cloud/on-prem hybrid ML infrastructure. Engage directly with ML...  ...Experience with GenAI and Agentic AI development...  ...architecture and distributed systems. Experience in reviewing and... 

    Abbott Laboratories company

    Santa Clara, CA
    4 days ago
  •  ...is looking for an outstanding, passionate, and dedicated Senior AI Infrastructure Engineer to join our DGX Cloud group. This engineering role will design, build and maintain large‑scale production systems with high efficiency and availability using a combination of software... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  • Joining NVIDIA's DGX Cloud AI Efficiency Team means contributing to the infrastructure that powers our innovative AI research...  ...an AI infrastructure software engineer to join our team. You'll be instrumental...  ...implementing software and systems engineering practices to ensure... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $180k - $280k

     ...unleashing the potential of generative AI to power the transformation of...  ...3 days per week. The role: Senior Staff SW Engineer (Systems) What you will do: The role requires...  ...compiler experts to build out the compiler infrastructure working closely with other software (... 
    Senior
    Work experience placement
    3 days per week

    d-Matrix

    Santa Clara, CA
    more than 2 months ago
  • $181.1k - $318.4k

    Senior Software Engineer: Agentic Systems Cupertino, California, United States Software and Services Join a passionate, dynamic team responsible...  ...automation expertise with a with a demonstrated ability to integrate AI-driven agents into complex content and engineering... 
    Senior
    Relocation

    Apple

    Cupertino, CA
    13 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Sr. Staff AI Engineer - On-Prem AI Infrastructure & Agentic Systems. Be the first to apply!