Lead AI Inference Engineer 100% Remote
Framework Ventures
- Remote job
About the job You will own the inference backbone behind QVAC's local AI stack: the C++ systems layer that makes models run fast, reliably, and predictably on real user hardware. The role is centered on engineering quality at runtime level, including startup behavior, memory pressure, throughput/latency balance, and long‑session stability. You will define and evolve the core abstractions that inference features depend on, so new capabilities can be added without sacrificing performance or maintainability. This is a role for someone who enjoys low‑level problem solving, clear technical ownership, and building infrastructure that other teams trust in production. Your work directly enables private, on‑device AI experiences and helps set the technical foundation for QVAC's next generation of peer‑to‑peer AI products. Responsibilities Work on deploying machine learning models to edge devices using the frameworks: llama.cpp, ggml, onnx. Collaborate closely with researchers to assist in coding, training and transitioning models from research to production environments. Integrate AI features into existing products, enriching them with the latest advancements in machine learning. Manage a cross‑functional team (pod) made of middleware (JS), foundation (C++), QA and documentation engineers to produce high‑quality deliverables. Regularly assess, both qualitatively and quantitatively, our position in the market with regards to similar products or platforms. Leverage the expertise of technical architects to ensure robust architectural choices and code quality. Ensure stable releases by following precise internal release processes. Qualifications Excellent programming skills in C++. Strong experience with Llama.cpp and ggml inference engines, which facilitates the deployment of models to specific GPU architectures. Good understanding of deep learning concepts and model architectures. Experience with transformers, LLMs, Diffusion Models. Demonstrated ability to rapidly assimilate new technologies and techniques. Experience managing a small, specialized, cross‑functional team (pod) of 3‑5 people. Genuine passion for building good products that improve people's lives. Degree in Computer Science, AI, Machine Learning, or a related field, complemented by a solid track record in AI R&D. Bonus points if Extensive experience with Javascript/Typescript. Understand the difficulties, nuances and importance of p2p technology. Experience with any of Vulkan, Metal and OpenCL. Have productionized models. #J-18808-Ljbffr Framework Ventures
- ...transitioning models from research to production environments. Integrate AI features into existing products, enriching them with the... ...is a bonus. Strong experience with Llama.cpp and ggml inference engines, facilitating the deployment of models to specific GPU architectures...Remote job
$175k - $225k
...veteran operators and engineers, alumni of Sonos, Paypal... ...from other leading venture capital firms.... ...We're looking for an AI Inference Engineer who lives at... ...navigation. Exposure to remote logging, log ingestion... ...role, but do not meet 100% of the qualifications...Remote workLocal area- ...BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion,... ...us and help build the platform engineers turn to to ship AI products. THE ROLE... ...compensation, including meaningful equity. - 100% coverage of medical, dental, and...Remote workWork experience placementFlexible hours
- ...A leading technology company in California is seeking an AI Inference Engineer to bridge AI models with unique platforms. Key responsibilities include model optimization, deployment, and performance profiling. Candidates should have a Bachelor’s or Master’s degree, 5+...Remote workWork from home
- A cloud technology company is looking for a Senior Engineer 2 to enhance their AI Inference Optimization team. In this role, you will drive architectural... ...position offers competitive compensation and is fully remote, promoting a collaborative and innovative work...Remote work
- ...Akamai is seeking a Senior Security Engineer to protect AI inference environments from emerging threats. In this role, you will define and implement... ...security and AI/ML architectures. The position supports remote work arrangements, allowing flexibility in your work location...Remote work
- A leading cloud infrastructure company is seeking a Senior Engineer 2 to join their AI Inference Optimization team. The role involves leading the technical strategy for performance... ...a competitive salary range along with remote work flexibility and numerous professional...Remote job
$152k - $241.5k
A leading technology company in Austin is seeking a Senior Compiler Engineer for their AI Inference Platforms team. The role involves analyzing deep learning networks and developing optimization algorithms, requiring expertise in compiler technologies. Ideal candidates...Remote job- Rula is seeking a Staff AI Engineer to lead significant AI endeavors within a fully remote work environment. This role involves influencing product innovation and engineering... ...employees enjoy comprehensive health benefits, 100% remote work, and generous leave policies. #J-188...Remote jobFull time
- Nerdy is hiring a Staff Engineer in the United States to drive technical direction and support AI-powered customer experiences. This role involves mentoring engineers... ...familiarity with AI-native tools. The position offers 100% remote work within the home country and competitive...Remote workWork from home
- ...A remote-first Voice AI startup is looking for an AI Engineer to join their early team. The role involves building and honing major capabilities for their voice... ...as well as infrastructure for model training and inference. Ideal candidates have a Bachelor's degree in a technical...Remote work
$167.2k - $209k
...pioneering cloud service provider in Seattle seeks a Senior Engineer 2 for its AI Inference Data Plane team. This role requires designing and... ...GoLang or Python. Competitive salary range from $167,200 to $209,000 with remote work options. #J-18808-Ljbffr DigitalOceanRemote job$167.2k - $209k
A leading cloud service provider is seeking a Senior Engineer 2 for their AI Inference Data Plane team. This remote role focuses on designing and developing high-scale, resilient data plane services that enhance AI-driven applications. The ideal candidate will have strong...Remote job$100k
...Software engineering is undergoing its biggest transformation... ..., Cloud, and DevOps. AI is changing how software... ...Engineering Director to lead that transformation. This... ...within a global, fully remote organization. You will directly... ...company. We are 100% remote and global. Live...Remote workHome officeFlexible hoursShift work- About the Job We are looking for an experienced AI Model Engineer with deep expertise in kernel development, model optimization, fine‑tuning, and GPU acceleration. The engineer will extend the inference framework to support inference and fine‑tuning for language models...Remote job
- ...Our Client which is a large Payroll Co. is looking to hire Lead Full Stack Gen AI Engineers. Lead Full Stack Gen AI Engineers. Location - Montvale, N J3 Roles. Long Term Contract .100 % Remote. Lead-Level GenAI Engineer ~ s Expert in...Remote workLong term contract
$11.5k - $15.5k
...Job Title: Lead Instructor: MLOps / AI Platform Engineering Client: Confidential – Customer Success Reskilling Location: Remote – Must work West Coast / Pacific Time hours Duration: 2 Weeks... ...Credentials: AZ‑900, AI‑900, and DP‑100 are required; AI‑102 is preferred. The...Remote workWork at office- ...A financial solutions company is seeking a Lead Generation Representative to work on a 100% commission basis. The role involves taking cold calls to develop leads, working flexible hours from home. Candidates must possess excellent English skills, experience in telemarketing...Remote workWork from homeFlexible hours
$144.2k - $288.4k
...POSITION SUMMARY CVS Health is hiring a Lead Director, AI/ML Solutions Engineering & Delivery to deliver better health... ...adoption. This is a U.S.-based remote position; candidates must reside... ...workloads, supporting high-throughput inference, low-latency APIs, and multi-tenant...Remote workHourly payFull timeTemporary workWork experience placementLocal area$75 per hour
Overview Senior AI Platform Engineer with Agentic AI & Model Context Protocol (MCP) Location: Preferred local candidates but open to 100% remote working EST HOURS This is a long term contract. Open to W2 and CTC Contractors. This role supports AthenaHealth’s Internal...Remote jobLong term contractContract workFor contractorsLocal area$185.1k - $335.3k
...A leading automotive company is seeking a Staff Compiler Engineer to enhance the model compilation stack for autonomous vehicles... ...optimizing high-level AI models into inference artifacts, defining technical... ...Competitive salary range is $185,100 to $335,300, alongside...$197.3k - $225.1k
...Lead AI Engineer (FM Hosting, LLM Inference) Overview At Capital One, we are creating responsible and reliable AI systems, changing banking for good... ...to be regularly worked. McLean, VA: $197,300 - $225,100 for Lead AI Engineer New York, NY: $215,200 - $245,...Full timePart timeLocal area- ..., Arizona is seeking an experienced Inside Sales Consultant who will make 100+ outbound calls daily to generate leads. Candidates must have at least 2-3 years in outbound calling within a remote environment and strong CRM skills. Responsibilities include initiating the...Remote job
- ...Navsan is seeking a skilled SR AI Developer to join their Technology team. This role involves... ...designing and deploying AI solutions, leading projects, and mentoring junior developers.... ...contribute to innovative solutions in a remote work environment, making a significant impact...Remote work
- ...Lead IT Security Administrator - 100% Remote (Fulltime) Day to day security administration of systems and user ids, which include creation, deletion, modification, password resets for all technologies supported. Ensure adherence to all Security Administration Standard...Remote workFull time
- ...Salesforce, Inc. is looking for a Senior Agentforce Forward Deployed Engineer in New York City. This role emphasizes leading technical delivery of AI solutions, ensuring integration with the Salesforce platform and mentoring junior team members. The ideal candidate will...Remote work
- A leading technical consulting firm is seeking an experienced Data Scientist for a remote position. This role involves implementing advanced Machine Learning solutions, managing MLOps practices, and engaging directly with clients. A minimum of 4 years of experience in ML...Remote work
- ...A leading tech firm in Boston seeks an experienced NLP and AI professional to lead the development of strategic AI agents. Key responsibilities include implementing advanced algorithms, managing AI solutions, and collaborating with diverse teams. The ideal candidate has...Remote work
$197.3k - $225.1k
...responsible and reliable AI systems, changing banking... ...class applied science and engineering teams to deliver our industry leading capabilities with... ...training, large language model inference, similarity search, guardrails... ..., VA: $197,300 - $225,100 for Lead AI Engineer New...Full timePart timeLocal area$197.3k - $225.1k
Lead AI Engineer (FM Hosting, LLM Inference) Overview At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For... ...Salary ranges by location: Cambridge, MA: $197,300 - $225,100; McLean, VA: $197,300 - $225,100; New York, NY: $215,2...Local area
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Lead AI Inference Engineer 100% Remote. Be the first to apply!
- lead automation engineer New York, NY
- lead support engineer New York, NY
- lead project engineer New York, NY
- lead engineer New York, NY
- lead quality engineer New York, NY
- lead security engineer New York, NY
- lead industrial engineer New York, NY
- lead algorithm engineer New York, NY
- lead backend developer New York, NY
- lead maintenance engineer New York, NY


