AI Infrastructure & Experience Engineer
FocusKPI Inc.
AI Infrastructure & Experience Engineer
FocusKPI is seeking an AI Infrastructure & Experience Engineer to join one of our clients, a high-tech SaaS company.
Work Location: Mountain View, CA (Onsite role, 5 days/week onsite) Duration: 4-month contract Pay Range: $70 - 79/hr **No C2C resumes are considered**
Position Responsibilities:
- Inference Optimization: Deploy and tune multiple LLMs and generative multimodal models on local inference hardware. Optimize performance metrics (TTFT, tokens/sec) via model quantization, caching strategies, and architecture-specific adjustments.
- Systems Engineering & CUDA: Leverage deep knowledge of the CUDA environment to build custom kernels, ensuring maximum utilization of the low-cost GPU compute.
- Orchestration & Integration: Seamlessly bridge inference backends with orchestration layers (LiteLLM, Ollama, etc.) and frontends like OpenWebUI.
- Rapid Prototyping: Build functional, high-fidelity demos showcasing model memory capabilities, agentic workflows, and context-aware web search.
- Peripheral Connectivity: Implement communication protocols to bridge local AI compute with peripheral devices, including smart TVs, household appliances, and XR hardware.
Requirements/Technical Qualifications:
- Recent experience in model optimization is required
- Hardware & Compute: Proven experience with NVIDIA ecosystems and ARM64 architecture.
- Systems Programming: Advanced proficiency in C++, Python, and Rust. Deep familiarity with CUDA and the ability to author/debug custom CUDA kernels for compute-intensive tasks.
- AI/ML Frameworks: Extensive experience with modern inference engines (llama.cpp, TensorRT-LLM, Ollama) and orchestration frameworks (LiteLLM).
- Software Engineering: Robust understanding of asynchronous programming (FastAPI), containerization (Docker/Kubernetes), sandbox environments, and API design for low-latency communication.
- Full-Stack Prototyping: Ability to quickly spin up modern frontend UIs (React, Next.js, or similar) to present AI-driven intelligence to end users.
- Communication Protocols: Familiarity with WebSockets, gRPC, and REST for device-to-device communication in a local network environment.
- Overall Mandatory skills required: Model optimization recent exparience, Interference Optimization, NVIDIA ecosystems, Custom CUDA Kernel Development, ARM64 architecture, Python
Ideal Candidate Profile:
- A minimum of 3 years of relevant industry experience is required
- The "Builder" Mindset: You are energized by the prospect of building proofs-of-concept in days rather than months. You thrive in environments where speed and creativity are paramount.
- Problem Solver: You approach unsolved, messy engineering challenges with enthusiasm rather than trepidation.
- Architectural Vision: You see the "big picture" of how AI becomes part of consumers' daily lives, not just how the model generates text.
- Agile & Adaptable: You are comfortable working in a fast-paced environment where priorities shift based on the results of rapid experimentation.
- Degree in Computer Science, Machine Learning, or Artificial Intelligence Specialization preferred, but not required
**No C2C resumes are considered**
Thank you!
FocusKPI Hiring Team
$147k - $237.5k
...Software Engineer At Palo Alto Networks®, we're united by a shared mission—to protect our... ..., Integrity, and Inclusion. We weave AI into the fabric of everything we do and use... ...Job Summary The Cortex Vulnerability Experience Platform team is expanding, and we're looking...SuggestedFull timeWork at office- ...looking for an enthusiastic and talented Infrastructure Engineering Leader to join our cloud... ...Report to IT Leadership in Canada Experience and skills: Minimum of 5 years... ...other, share knowledge, and leverage AI to solve complex technical challenges....Suggested
- Government Employees Insurance Company is seeking an Engineer II in Solutions Engineering to design, build, and maintain automated processes... ...cycle times. The ideal candidate will have programming experience, knowledge of cloud architectures, and a strong understanding...SuggestedWork from homeFlexible hours
- ...Inference Infrastructure Engineer At Rhoda AI, we're building the next generation of generalist intelligent robots. We own the full robotics stack... ...footprint grow What We're Looking For ~3+ years of experience in ML infrastructure, MLOps, or distributed systems ~...Suggested
- ...Skill 1 - Client Architect with Experience on Data Bricks Development... ...on working on Azure AI services including Data Bricks... ...with a team of researchers and engineers to build state-of-the-art language... ...Development, Data Analytics Infrastructure & Cloud Solutions, Cyber...Suggested
$140k - $360k
...What to Expect We'reseeking asystemsoftware engineer to join our AI Platforms team, working on system software that powers our autonomous... ...AI platforms and ship software that improves the driving experience for millions of customers. What You'll Do ~ Design...Hourly payFull timeTemporary workFlexible hours$148k - $224.25k
...tapping into the unlimited potential of AI to define the next era of computing. An... ...drive NVCF platform features and developer experience. You'll build what NVCF can do — and how... ...into the roadmap. Collaborate with engineering on feature design, prioritization, execution...- ...Job Description Job Description About the Role We are seeking an AI Infrastructure & Experience Engineer to help build next-generation AI-powered experiences. This role is ideal for engineers who enjoy working at the intersection of AI infrastructure, systems...Local area
$147.4k - $272.1k
...Software Development Engineer In Test, Swift Platform Experience Join Apple's Quality Engineering organization... ...to testing non-deterministic AI features. You will collaborate closely... ...and maintain sophisticated testing infrastructure that validates the reliability,...Relocation$132.1k - $279.8k
About Groq Groq delivers fast, efficient AI inference. Our LPU-based system powers... ...is possible. Build fast. Senior Infrastructure Engineer Mission At Groq, we’re building a custom... ...repeatable. Ideal candidates have/are Experience with Linux / Kubernetes systems and...- At Rhoda AI, we're building the full-stack foundation for the next generation of... .... We're looking for an Inference Infrastructure Engineer to help build and operate the systems... ...What We're Looking For 3+ years of experience in ML infrastructure, MLOps, or distributed...
$200k - $340k
...Infrastructure Security Engineer Palo Alto, CA About XAI XAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit... ..., or a related field ~3-5 years of experience in cloud security or related roles ~ Strong...Temporary work$139k - $204k
...Senior Engineer, Network Observability Livingston, NJ / New York... ...is The Essential Cloud for AI™. Built for pioneers by... ...CoreWeave combines superior infrastructure performance with deep technical... ...can bring their diversified experiences to our teams. Here are some...Temporary workCasual workWork at officeRemote workFlexible hours- ...Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our... ...increase in speed is transforming the user experience of AI applications, unlocking real-time iteration... ...are seeking a highly skilled WAN Network Engineer to design, implement, manage, and...
- ...Get AI-powered advice on this job and more exclusive features... ...We are hiring an IP Network Engineer in the Palo Alto area to join... ...engineering teams to optimize infrastructure. Document designs, configurations... ...field Minimum of 2 years' experience in IP network design and...Full timeRemote work
- ...Job role : Network Engineer Duration : 6+ months, can extend Location : Palo... ...hands-on enterprise network engineering experience. ~ Proficiency with Palo Alto Networks... ...ClearPass, or equivalent). Comfort using AI tools (e.g., Claude, Copilot) for network...
$300 per month
...Staff Software Infrastructure Engineer Crusoe is on a mission to accelerate the abundance of energy... .... As the only vertically integrated AI infrastructure company built from the... ...Bring to the Team Solid hardware experience and GPU troubleshooting expertise....Temporary work$158.9k - $238.3k
...business processes, employee experience, and technologies to scale... ...centralizing the management of Infrastructure, Technology, and Data. The... ...the first customers of the Engineering teams at Rubrik. Rubrik Corp... ...Accelerating the World's AI Transformation Rubrik (RBRK...Local area$180k
...Network Engineer - ML Infrastructure (High-Speed Interconnects) Palo Alto, CA About xAI xAI's mission is to create AI systems that can accurately understand the universe and aid humanity... ...At least 8+ years of hands-on experience in designing, deploying and...Temporary work- ...Network Engineer - AI/HPC Memphis, TN; Palo Alto, CA About XAI XAI's mission is... ...and all. We need an engineer with deep experience in RoCEv2 that can develop at hyper scale... ...us to seamlessly build-out new GPU infrastructure with little to no engineering assistance...
- ...fast finality. Social games and community AI can use our onchain tokens for micro-... ...Because the invincible summer awaits! For engineers, we value your deep understanding of how... ...creatives, we approve your obsession with user experience. You are a product designer, a brand...Full timeWork experience placementSummer workWork at office
$173k - $237.95k
...tissue tests, real-world data and AI analytics. Guardant tests help improve... ...and improve the computational infrastructure. You are dedicated to engineering excellence yet pragmatic and flexible... ...• 4+ years of TCP/IP networking experience • 2+ years of RDMA networking...Work at officeRemote workWork from homeFlexible hours$108k - $162k
...seeking a highly skilled Sr. Systems & Infrastructure Engineer to join a dynamic, security-first IT... ...CloudOps), Microsoft 365 administration, AI-augmented tooling, and endpoint... ...systems administration or engineering experience. ~ Expertise with VMware vSphere/ESXi...Permanent employment$153.2k - $234.1k
...process, we create vehicles and experiences that are designed not just... ...driving? Join the Embodied AI team at General Motors. Our... ...scenarios. As a Senior ML Infra Engineer, you will work on the core... ..., applications, or ML infrastructure. ~ Experience designing robust...Local areaRemote workWork from homeRelocation packageFlexible hours- ...Description About the Team: The AI Validation Platform team... ...We’re proud to serve as the infrastructure platform for teams developing... ...a Senior ML Infrastructure engineer to help build and scale... ...successful candidate will have experience building and running scalable...Local areaWork from home
$60k - $80k
...demand for skilled Network Engineers continues to grow as organizations... ...and require robust network infrastructure to support increasing... ...Engineer salaries vary based on experience, specialization, and the... ...intent‑based networking and AI‑driven network operations is...Local areaRemote work$200k - $400k
...dedicated research lab is seeking a Network Engineer to design and optimize low-latency, high-bandwidth networking solutions for AI supercomputing clusters. You will work on... .... The ideal candidate has strong experience with NVIDIA RDMA technologies, networking...$202.5k - $274k
...procedure/documentation to help level1/level2 engineers to perform their job efficiently Must... ...50 pounds for short periods of time. Experience working with Cisco routers and LAN... ...architecture and Cisco DNA Familiarity with AI/ML concepts for network operations,...Local areaShift work$225k - $275k
...Senior Staff Network Deployment Engineer Crusoe Cloud is seeking a... ...of how we deploy network infrastructure across our global fleet. As we... ...compute (HPC) and GPU-based AI infrastructure, you will define... ...years of network engineering experience with a heavy focus on large-scale...Temporary workRemote work$153.2k - $234.1k
...process, we create vehicles and experiences that are designed not just to be... ...autonomous driving? Join the Embodied AI Infra Foundation team at General... ..., where we build the critical infrastructure that powers every machine learning engineer working on our cutting-edge...Work at officeLocal areaRemote workWork from homeRelocationRelocation packageFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to AI Infrastructure & Experience Engineer. Be the first to apply!
- machine learning ai engineer Mountain View, CA
- ai engineer remote Mountain View, CA
- ai prompt engineer Mountain View, CA
- ai developer Mountain View, CA
- ai engineer Mountain View, CA
- ai ml engineer Mountain View, CA
- senior ai engineer Mountain View, CA
- principal infrastructure engineer Mountain View, CA
- remote infrastructure engineer Mountain View, CA
- data infrastructure engineer Mountain View, CA

