Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Lead AI Inference Engineer 100% Remote

Framework Ventures

New York, NY
  • Remote job

About the job You will own the inference backbone behind QVAC's local AI stack: the C++ systems layer that makes models run fast, reliably, and predictably on real user hardware. The role is centered on engineering quality at runtime level, including startup behavior, memory pressure, throughput/latency balance, and long‑session stability. You will define and evolve the core abstractions that inference features depend on, so new capabilities can be added without sacrificing performance or maintainability. This is a role for someone who enjoys low‑level problem solving, clear technical ownership, and building infrastructure that other teams trust in production. Your work directly enables private, on‑device AI experiences and helps set the technical foundation for QVAC's next generation of peer‑to‑peer AI products. Responsibilities Work on deploying machine learning models to edge devices using the frameworks: llama.cpp, ggml, onnx. Collaborate closely with researchers to assist in coding, training and transitioning models from research to production environments. Integrate AI features into existing products, enriching them with the latest advancements in machine learning. Manage a cross‑functional team (pod) made of middleware (JS), foundation (C++), QA and documentation engineers to produce high‑quality deliverables. Regularly assess, both qualitatively and quantitatively, our position in the market with regards to similar products or platforms. Leverage the expertise of technical architects to ensure robust architectural choices and code quality. Ensure stable releases by following precise internal release processes. Qualifications Excellent programming skills in C++. Strong experience with Llama.cpp and ggml inference engines, which facilitates the deployment of models to specific GPU architectures. Good understanding of deep learning concepts and model architectures. Experience with transformers, LLMs, Diffusion Models. Demonstrated ability to rapidly assimilate new technologies and techniques. Experience managing a small, specialized, cross‑functional team (pod) of 3‑5 people. Genuine passion for building good products that improve people's lives. Degree in Computer Science, AI, Machine Learning, or a related field, complemented by a solid track record in AI R&D. Bonus points if Extensive experience with Javascript/Typescript. Understand the difficulties, nuances and importance of p2p technology. Experience with any of Vulkan, Metal and OpenCL. Have productionized models. #J-18808-Ljbffr Framework Ventures

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Lead AI Inference Engineer 100% Remote in New York, NY vacancy
  •  ...transitioning models from research to production environments. Integrate AI features into existing products, enriching them with the...  ...is a bonus. Strong experience with Llama.cpp and ggml inference engines, facilitating the deployment of models to specific GPU architectures... 
    Remote work

    Framework Ventures

    United States
    3 days ago
  • $175k - $225k

     ...veteran operators and engineers, alumni of Sonos, Paypal...  ...from other leading venture capital firms....  ...We're looking for an AI Inference Engineer who lives at...  ...navigation. Exposure to remote logging, log ingestion...  ...role, but do not meet 100% of the qualifications... 
    Remote work
    Local area

    Sauron

    San Francisco, CA
    2 days ago
  •  ...Applied AI Inference Engineer Baseten powers mission-critical inference for the world's most dynamic AI companies. By uniting applied AI research...  ...~ Competitive compensation, including meaningful equity. ~100% coverage of medical, dental, and vision insurance for... 
    Remote work
    Work experience placement
    Flexible hours

    BaseTen

    United States
    2 days ago
  • $151.8k

     ...What you can expect We are looking for an AI Inference Engineer with a solid background in speech recognition and model inference. In this...  ...hybrid approach is centered around our offices and remote work environments. The work style of each role, Hybrid, Remote... 
    Remote work
    Work at office

    Zoom Video Communications

    San Jose, CA
    4 days ago
  • $167.2k - $209k

    A leading cloud service provider is seeking a Senior Engineer 2 for their AI Inference Data Plane team. This remote role focuses on designing and developing high-scale, resilient data plane services that enhance AI-driven applications. The ideal candidate will have strong... 
    Remote work

    DigitalOcean

    San Francisco, CA
    1 day ago
  • $167.2k - $209k

     ...pioneering cloud service provider in Seattle seeks a Senior Engineer 2 for its AI Inference Data Plane team. This role requires designing and...  ...particularly in GoLang or Python. Competitive salary range from $167,200 to $209,000 with remote work options. #J-18808-Ljbffr
    Remote work

    DigitalOcean

    Seattle, WA
    1 day ago
  •  ...TypeScript-Entwickler zur Entwicklung moderner Webapplikationen und zur aktiven Mitgestaltung an KI-Features. Die Position bietet 100% Remote-Arbeit, flexible Arbeitszeiten und überdurchschnittliche Vergütung. Geübte Kommunikation in Deutsch ist Voraussetzung, da die... 
    Remote work
    Flexible hours

    dreifach.ai

    United States
    4 days ago
  • $242k - $290k

     ...As a Model Optimization & Deployment Engineer, you will focus on bringing highly efficient...  ...CUDA kernels, and build highly concurrent inference code to ensure real-time, deterministic execution...  ...latency and maximize memory bandwidth on AI accelerators. Write production-level,... 
    Remote work
    Temporary work
    Relocation package

    Zoox

    San Diego, CA
    3 days ago
  • A leading tech consulting firm in New York is seeking an experienced AI/ML Engineer to innovate AI solutions for complex business challenges. You will design and implement...  ...production-ready solutions. This position offers a flexible, 100% remote work model. #J-18808-Ljbffr
    Remote work
    Flexible hours

    Experis ManpowerGroup Sp. z o.o.

    Poland, NY
    3 days ago
  • Jobgether is seeking an AI Research Engineer (Kernel & Inference Optimization) to enhance model serving architectures and deployment efficiencies. This fully remote role involves designing scalable inference pipelines and collaborating with international teams. Ideal candidates... 
    Remote job

    Jobgether

    New Bremen, OH
    22 hours ago
  • $152k - $241.5k

    A leading technology company in Austin is seeking a Senior Compiler Engineer for their AI Inference Platforms team. The role involves analyzing deep learning networks and developing optimization algorithms, requiring expertise in compiler technologies. Ideal candidates... 
    Remote job

    NVIDIA Corporation

    Austin, TX
    1 day ago
  •  ...About the job We’re seeking experienced AI infrastructure Engineers to design and implement robust, scalable pipelines for massive data workloads...  ...of data and model workflows from prototyping to inference. Qualifications Proficient in Python with strong programming... 
    Remote work

    Framework Ventures

    United States
    3 days ago
  • Rula is seeking a Staff AI Engineer to lead significant AI endeavors within a fully remote work environment. This role involves influencing product innovation and engineering...  ...employees enjoy comprehensive health benefits, 100% remote work, and generous leave policies. #J-188... 
    Remote job
    Full time

    Rula

    Los Angeles, CA
    1 day ago
  • $152k - $241.5k

    Senior AI Inference Compiler Engineer page is loaded## Senior AI Inference Compiler Engineerlocations: US...  ...Santa Clara: US, TX, Austin: US, TX, Remote: US, WA, Remote: US, CA, Remotetime type...  ...robotics. The compiler must deliver leading inference performance, fast build... 
    Remote work

    NVIDIA Corporation

    Austin, TX
    1 day ago
  •  ...A leading ERP software reseller is seeking a Sage 100 Application Consultant to lead client software implementations primarily for construction industry clients. The ideal candidate will have 3-5 years of experience with Sage 100 ERP, demonstrated accounting skills, and... 
    Remote work
    Work from home

    Aktion

    Maumee, OH
    1 day ago
  • $197.3k - $225.1k

     ...Lead AI Engineer (FM Hosting, LLM Inference) Overview At Capital One, we are creating responsible and reliable AI systems, changing banking for good...  ...to be regularly worked. McLean, VA: $197,300 - $225,100 for Lead AI Engineer New York, NY: $215,200 - $245,... 
    Full time
    Part time
    Local area

    Capital One Financial Corp

    McLean, VA
    7 days ago
  • POSITION SUMMARY CVS Health is hiring a Lead Director, AI/ML Solutions Engineering & Delivery to deliver better health...  ...adoption. This is a U.S.-based remote position; candidates must reside...  ...and batch workloads, high‑throughput inference, low‑latency APIs, and multi‑tenant... 
    Remote work
    Work experience placement
    Local area

    Koitecc Solutions

    Plano, TX
    22 hours ago
  • $185.1k - $335.3k

     ...A leading automotive company is seeking a Staff Compiler Engineer to enhance the model compilation stack for autonomous vehicles...  ...optimizing high-level AI models into inference artifacts, defining technical...  ...Competitive salary range is $185,100 to $335,300, alongside... 

    General Motors

    New York, NY
    4 days ago
  • A financial solutions company is seeking a Lead Generation Representative to work on a 100% commission basis. The role involves taking cold calls to develop leads, working flexible hours from home. Candidates must possess excellent English skills, experience in telemarketing... 
    Remote job
    Work from home
    Flexible hours

    Insurance Protection Specialists

    Cleveland, OH
    22 hours ago
  • $244.7k - $279.2k

     ...Capital One is looking for a Distinguished AI Engineer to lead the development of AI-powered products in Nashville, TN. This remote position involves partnering with cross-functional teams to design and implement scalable AI solutions, leveraging cutting-edge technologies... 
    Remote work

    Capital One

    Nashville, TN
    13 days ago
  • $244.7k - $279.2k

     ...Capital One is looking for a Distinguished AI Engineer to develop responsible and reliable AI systems. This remote role requires deep technical expertise and at least 8 years of experience in AI and ML technologies. The ideal candidate will partner with a diverse team... 
    Remote work

    Capital One

    Raleigh, NC
    13 days ago
  • $244.7k - $279.2k

     ...Capital One is seeking a Distinguished AI Engineer to revolutionize banking through responsible AI. This fully remote role involves developing AI software, partnering with cross-functional teams, and improving large-scale AI systems. Candidates should have substantial... 
    Remote work

    Capital One

    Santa Fe, NM
    13 days ago
  • $244.7k - $279.2k

     ...Capital One is seeking a Distinguished AI Engineer to join their remote team. In this role, you will collaborate with cross-functional teams to design and deploy AI-powered products, leveraging cutting-edge technologies. You should have at least 8 years of experience... 
    Remote work

    Capital One

    Saint Paul, MN
    3 days ago
  •  ...Capital One is seeking a Distinguished AI Engineer to help build and optimize AI systems. The ideal candidate will have at least 8 years...  ...programming skills, and expertise in cloud platforms like AWS. This remote role involves partnering with cross-functional teams and... 
    Remote work

    Capital One

    Austin, TX
    13 days ago
  •  ...Lead IT Security Administrator - 100% Remote (Fulltime) Day to day security administration of systems and user ids, which include creation, deletion, modification, password resets for all technologies supported. Ensure adherence to all Security Administration Standard... 
    Remote work
    Full time

    The Dignify Solutions, LLC

    United States
    1 day ago
  •  ..., Arizona is seeking an experienced Inside Sales Consultant who will make 100+ outbound calls daily to generate leads. Candidates must have at least 2-3 years in outbound calling within a remote environment and strong CRM skills. Responsibilities include initiating the... 
    Remote job

    Coverall

    Phoenix, AZ
    22 hours ago
  •  ...Job Title: Generative AI Engineer (Senior / Lead / Principal)- Multiple openings Experience Level: 8+ to 13+ Years Location: Hybrid - Remote (India-based) with onsite every Thursday in Chennai Industry: AI/ML, Enterprise Applications, Healthcare... 
    Remote work
    Work at office

    Saviance

    Boston, MA
    2 days ago
  • $120k - $200k

     ...want to hire a highly skilled AI Software Engineer to join their rapidly...  ...depending on experience) Fully remote in the U.S. Competitive equity...  ...packages offered Healthcare: 100% employee coverage, ~75–80%...  ...at D33P Search Group by 2x Inferred from the description for this... 
    Remote work
    Full time
    Freelance

    D33P Search Group

    New York, NY
    4 days ago
  • $180k - $300k

     ...talent at innovative companies. Our team is 100% remote and we work with teams across the United States to help them hire. AI Engineer (Full-stack) Location: San Francisco,...  ...systems for visual datasets Build inference serving systems Develop APIs powering... 
    Remote work
    H1b
    Work at office
    Visa sponsorship

    Recruiting from Scratch

    San Francisco, CA
    22 hours ago
  • $87.12k - $181.5k

     ...AI Engineer - Insurance Domain - HYBRID NTT DATA Services...  ...as REST APIs or batch inference services on Azure...  ...Scientist Associate (DP-100) preferred. Exposure...  ...starting pay range for this remote role is $87,120 - $181,...  ...are one of the world's leading AI and digital... 
    Remote work
    Work at office
    Flexible hours

    Sierra Systems, An Ntt Data Company

    Warren, NJ
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Lead AI Inference Engineer 100% Remote. Be the first to apply!