Lead AI Inference Engineer 100% Remote
Framework Ventures
- Remote job
About the job You will own the inference backbone behind QVAC's local AI stack: the C++ systems layer that makes models run fast, reliably, and predictably on real user hardware. The role is centered on engineering quality at runtime level, including startup behavior, memory pressure, throughput/latency balance, and long‑session stability. You will define and evolve the core abstractions that inference features depend on, so new capabilities can be added without sacrificing performance or maintainability. This is a role for someone who enjoys low‑level problem solving, clear technical ownership, and building infrastructure that other teams trust in production. Your work directly enables private, on‑device AI experiences and helps set the technical foundation for QVAC's next generation of peer‑to‑peer AI products. Responsibilities Work on deploying machine learning models to edge devices using the frameworks: llama.cpp, ggml, onnx. Collaborate closely with researchers to assist in coding, training and transitioning models from research to production environments. Integrate AI features into existing products, enriching them with the latest advancements in machine learning. Manage a cross‑functional team (pod) made of middleware (JS), foundation (C++), QA and documentation engineers to produce high‑quality deliverables. Regularly assess, both qualitatively and quantitatively, our position in the market with regards to similar products or platforms. Leverage the expertise of technical architects to ensure robust architectural choices and code quality. Ensure stable releases by following precise internal release processes. Qualifications Excellent programming skills in C++. Strong experience with Llama.cpp and ggml inference engines, which facilitates the deployment of models to specific GPU architectures. Good understanding of deep learning concepts and model architectures. Experience with transformers, LLMs, Diffusion Models. Demonstrated ability to rapidly assimilate new technologies and techniques. Experience managing a small, specialized, cross‑functional team (pod) of 3‑5 people. Genuine passion for building good products that improve people's lives. Degree in Computer Science, AI, Machine Learning, or a related field, complemented by a solid track record in AI R&D. Bonus points if Extensive experience with Javascript/Typescript. Understand the difficulties, nuances and importance of p2p technology. Experience with any of Vulkan, Metal and OpenCL. Have productionized models. #J-18808-Ljbffr Framework Ventures
- ...transitioning models from research to production environments. Integrate AI features into existing products, enriching them with the... ...is a bonus. Strong experience with Llama.cpp and ggml inference engines, facilitating the deployment of models to specific GPU architectures...Remote work
$175k - $225k
...veteran operators and engineers, alumni of Sonos, Paypal... ...from other leading venture capital firms.... ...We're looking for an AI Inference Engineer who lives at... ...navigation. Exposure to remote logging, log ingestion... ...role, but do not meet 100% of the qualifications...Remote workLocal area- ...Applied AI Inference Engineer Baseten powers mission-critical inference for the world's most dynamic AI companies. By uniting applied AI research... ...~ Competitive compensation, including meaningful equity. ~100% coverage of medical, dental, and vision insurance for...Remote workWork experience placementFlexible hours
$151.8k
...What you can expect We are looking for an AI Inference Engineer with a solid background in speech recognition and model inference. In this... ...hybrid approach is centered around our offices and remote work environments. The work style of each role, Hybrid, Remote...Remote workWork at office$167.2k - $209k
A leading cloud service provider is seeking a Senior Engineer 2 for their AI Inference Data Plane team. This remote role focuses on designing and developing high-scale, resilient data plane services that enhance AI-driven applications. The ideal candidate will have strong...Remote work$167.2k - $209k
...pioneering cloud service provider in Seattle seeks a Senior Engineer 2 for its AI Inference Data Plane team. This role requires designing and... ...particularly in GoLang or Python. Competitive salary range from $167,200 to $209,000 with remote work options. #J-18808-LjbffrRemote work- ...TypeScript-Entwickler zur Entwicklung moderner Webapplikationen und zur aktiven Mitgestaltung an KI-Features. Die Position bietet 100% Remote-Arbeit, flexible Arbeitszeiten und überdurchschnittliche Vergütung. Geübte Kommunikation in Deutsch ist Voraussetzung, da die...Remote workFlexible hours
$242k - $290k
...As a Model Optimization & Deployment Engineer, you will focus on bringing highly efficient... ...CUDA kernels, and build highly concurrent inference code to ensure real-time, deterministic execution... ...latency and maximize memory bandwidth on AI accelerators. Write production-level,...Remote workTemporary workRelocation package- A leading tech consulting firm in New York is seeking an experienced AI/ML Engineer to innovate AI solutions for complex business challenges. You will design and implement... ...production-ready solutions. This position offers a flexible, 100% remote work model. #J-18808-LjbffrRemote workFlexible hours
- Jobgether is seeking an AI Research Engineer (Kernel & Inference Optimization) to enhance model serving architectures and deployment efficiencies. This fully remote role involves designing scalable inference pipelines and collaborating with international teams. Ideal candidates...Remote job
$152k - $241.5k
A leading technology company in Austin is seeking a Senior Compiler Engineer for their AI Inference Platforms team. The role involves analyzing deep learning networks and developing optimization algorithms, requiring expertise in compiler technologies. Ideal candidates...Remote job- ...About the job We’re seeking experienced AI infrastructure Engineers to design and implement robust, scalable pipelines for massive data workloads... ...of data and model workflows from prototyping to inference. Qualifications Proficient in Python with strong programming...Remote work
- Rula is seeking a Staff AI Engineer to lead significant AI endeavors within a fully remote work environment. This role involves influencing product innovation and engineering... ...employees enjoy comprehensive health benefits, 100% remote work, and generous leave policies. #J-188...Remote jobFull time
$152k - $241.5k
Senior AI Inference Compiler Engineer page is loaded## Senior AI Inference Compiler Engineerlocations: US... ...Santa Clara: US, TX, Austin: US, TX, Remote: US, WA, Remote: US, CA, Remotetime type... ...robotics. The compiler must deliver leading inference performance, fast build...Remote work- ...A leading ERP software reseller is seeking a Sage 100 Application Consultant to lead client software implementations primarily for construction industry clients. The ideal candidate will have 3-5 years of experience with Sage 100 ERP, demonstrated accounting skills, and...Remote workWork from home
$197.3k - $225.1k
...Lead AI Engineer (FM Hosting, LLM Inference) Overview At Capital One, we are creating responsible and reliable AI systems, changing banking for good... ...to be regularly worked. McLean, VA: $197,300 - $225,100 for Lead AI Engineer New York, NY: $215,200 - $245,...Full timePart timeLocal area- POSITION SUMMARY CVS Health is hiring a Lead Director, AI/ML Solutions Engineering & Delivery to deliver better health... ...adoption. This is a U.S.-based remote position; candidates must reside... ...and batch workloads, high‑throughput inference, low‑latency APIs, and multi‑tenant...Remote workWork experience placementLocal area
$185.1k - $335.3k
...A leading automotive company is seeking a Staff Compiler Engineer to enhance the model compilation stack for autonomous vehicles... ...optimizing high-level AI models into inference artifacts, defining technical... ...Competitive salary range is $185,100 to $335,300, alongside...- A financial solutions company is seeking a Lead Generation Representative to work on a 100% commission basis. The role involves taking cold calls to develop leads, working flexible hours from home. Candidates must possess excellent English skills, experience in telemarketing...Remote jobWork from homeFlexible hours
$244.7k - $279.2k
...Capital One is looking for a Distinguished AI Engineer to lead the development of AI-powered products in Nashville, TN. This remote position involves partnering with cross-functional teams to design and implement scalable AI solutions, leveraging cutting-edge technologies...Remote work$244.7k - $279.2k
...Capital One is looking for a Distinguished AI Engineer to develop responsible and reliable AI systems. This remote role requires deep technical expertise and at least 8 years of experience in AI and ML technologies. The ideal candidate will partner with a diverse team...Remote work$244.7k - $279.2k
...Capital One is seeking a Distinguished AI Engineer to revolutionize banking through responsible AI. This fully remote role involves developing AI software, partnering with cross-functional teams, and improving large-scale AI systems. Candidates should have substantial...Remote work$244.7k - $279.2k
...Capital One is seeking a Distinguished AI Engineer to join their remote team. In this role, you will collaborate with cross-functional teams to design and deploy AI-powered products, leveraging cutting-edge technologies. You should have at least 8 years of experience...Remote work- ...Capital One is seeking a Distinguished AI Engineer to help build and optimize AI systems. The ideal candidate will have at least 8 years... ...programming skills, and expertise in cloud platforms like AWS. This remote role involves partnering with cross-functional teams and...Remote work
- ...Lead IT Security Administrator - 100% Remote (Fulltime) Day to day security administration of systems and user ids, which include creation, deletion, modification, password resets for all technologies supported. Ensure adherence to all Security Administration Standard...Remote workFull time
- ..., Arizona is seeking an experienced Inside Sales Consultant who will make 100+ outbound calls daily to generate leads. Candidates must have at least 2-3 years in outbound calling within a remote environment and strong CRM skills. Responsibilities include initiating the...Remote job
- ...Job Title: Generative AI Engineer (Senior / Lead / Principal)- Multiple openings Experience Level: 8+ to 13+ Years Location: Hybrid - Remote (India-based) with onsite every Thursday in Chennai Industry: AI/ML, Enterprise Applications, Healthcare...Remote workWork at office
$120k - $200k
...want to hire a highly skilled AI Software Engineer to join their rapidly... ...depending on experience) Fully remote in the U.S. Competitive equity... ...packages offered Healthcare: 100% employee coverage, ~75–80%... ...at D33P Search Group by 2x Inferred from the description for this...Remote workFull timeFreelance$180k - $300k
...talent at innovative companies. Our team is 100% remote and we work with teams across the United States to help them hire. AI Engineer (Full-stack) Location: San Francisco,... ...systems for visual datasets Build inference serving systems Develop APIs powering...Remote workH1bWork at officeVisa sponsorship$87.12k - $181.5k
...AI Engineer - Insurance Domain - HYBRID NTT DATA Services... ...as REST APIs or batch inference services on Azure... ...Scientist Associate (DP-100) preferred. Exposure... ...starting pay range for this remote role is $87,120 - $181,... ...are one of the world's leading AI and digital...Remote workWork at officeFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Lead AI Inference Engineer 100% Remote. Be the first to apply!
- lead support engineer New York, NY
- lead ios developer New York, NY
- lead solutions engineer New York, NY
- lead mobile developer New York, NY
- lead quality engineer New York, NY
- lead project engineer New York, NY
- lead network engineer New York, NY
- lead product engineer New York, NY
- lead web developer New York, NY
- lead software test engineer New York, NY

