AI Infrastructure & Experience Engineer

$70 - $79 per hour

FocusKPI Inc.

FocusKPI is seeking an AI Infrastructure & Experience Engineer to join one of our clients, a high-tech SaaS company. 

Work Location: Mountain View, CA (Onsite role, 5 days/week onsite)

Duration: 4-month contract 

Pay Range: $70 - $79/hr

No C2C resumes are considered

Position Responsibilities:

  • Inference Optimization: Deploy and tune multiple LLMs and generative multimodal models on local inference hardware. Optimize performance metrics (TTFT, tokens/sec) via model quantization, caching strategies, and architecture-specific adjustments.

  • Systems Engineering & CUDA: Leverage deep knowledge of the CUDA environment to build custom kernels that maximize utilization of low-cost GPU compute.

  • Orchestration & Integration: Seamlessly bridge inference backends with orchestration layers (LiteLLM, Ollama, etc.) and frontends like OpenWebUI.

  • Rapid Prototyping: Build functional, high-fidelity demos showcasing model memory capabilities, agentic workflows, and context-aware web search.

  • Peripheral Connectivity: Implement communication protocols to bridge local AI compute with peripheral devices, including smart TVs, household appliances, and XR hardware.

Requirements/Technical qualifications:

  • Recent experience in model optimization is required

  • Hardware & Compute: Proven experience with NVIDIA ecosystems and ARM64 architecture.

  • Systems Programming: Advanced proficiency in C++, Python, and Rust. Deep familiarity with CUDA and the ability to author/debug custom CUDA kernels for compute-intensive tasks.

  • AI/ML Frameworks: Extensive experience with modern inference engines (llama.cpp, TensorRT-LLM, Ollama) and orchestration frameworks (LiteLLM).

  • Software Engineering: Robust understanding of asynchronous programming (FastAPI), containerization (Docker/Kubernetes), sandbox environments, and API design for low-latency communication.

  • Full-Stack Prototyping: Ability to quickly spin up modern frontend UIs (React, Next.js, or similar) to present AI-driven intelligence to end users.

  • Communication Protocols: Familiarity with WebSockets, gRPC, and REST for device-to-device communication in a local network environment.

  • Overall mandatory skills: recent model optimization experience, inference optimization, NVIDIA ecosystems, custom CUDA kernel development, ARM64 architecture, Python

Ideal Candidate Profile:

  • A minimum of 3 years of relevant industry experience is required

  • The "Builder" Mindset: You are energized by the prospect of building proofs-of-concept in days rather than months. You thrive in environments where speed and creativity are paramount.

  • Problem Solver: You approach unsolved, messy engineering challenges with enthusiasm rather than trepidation.

  • Architectural Vision: You see the "big picture" of how AI becomes part of consumers' daily lives, not just how the model generates text.

  • Agile & Adaptable: You are comfortable working in a fast-paced environment where priorities shift based on the results of rapid experimentation.

  • Degree in Computer Science, Machine Learning, or Artificial Intelligence Specialization preferred, but not required

Thank you!

FocusKPI Hiring Team

Founded in 2010, FocusKPI, Inc. (FocusKPI) is a data science and technology firm specializing in predictive analytics practice and methodologies. FocusKPI is a US company headquartered in Silicon Valley, California, with an East Coast office in Boston, Massachusetts.

NOTICE: Please be aware of fraudulent emails regarding job postings, job offers, and fake checks. FocusKPI's recruiting team will reach out only from the @focuskpi.com email domain. If you have received fraudulent emails now or in the past, please report them to . The domain @focuskpijobs.com is fraudulent and not related to FocusKPI. Please do not reply to or communicate with anyone using an @focuskpijobs.com address.

Vacancy posted 21 hours ago