Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Lead AI Inference Engineer 100% Remote

Framework Ventures

New York, NY
  • Remote job

About the job You will own the inference backbone behind QVAC's local AI stack: the C++ systems layer that makes models run fast, reliably, and predictably on real user hardware. The role is centered on engineering quality at runtime level, including startup behavior, memory pressure, throughput/latency balance, and long‑session stability. You will define and evolve the core abstractions that inference features depend on, so new capabilities can be added without sacrificing performance or maintainability. This is a role for someone who enjoys low‑level problem solving, clear technical ownership, and building infrastructure that other teams trust in production. Your work directly enables private, on‑device AI experiences and helps set the technical foundation for QVAC's next generation of peer‑to‑peer AI products. Responsibilities Work on deploying machine learning models to edge devices using the frameworks: llama.cpp, ggml, onnx. Collaborate closely with researchers to assist in coding, training and transitioning models from research to production environments. Integrate AI features into existing products, enriching them with the latest advancements in machine learning. Manage a cross‑functional team (pod) made of middleware (JS), foundation (C++), QA and documentation engineers to produce high‑quality deliverables. Regularly assess, both qualitatively and quantitatively, our position in the market with regards to similar products or platforms. Leverage the expertise of technical architects to ensure robust architectural choices and code quality. Ensure stable releases by following precise internal release processes. Qualifications Excellent programming skills in C++. Strong experience with Llama.cpp and ggml inference engines, which facilitates the deployment of models to specific GPU architectures. Good understanding of deep learning concepts and model architectures. Experience with transformers, LLMs, Diffusion Models. Demonstrated ability to rapidly assimilate new technologies and techniques. Experience managing a small, specialized, cross‑functional team (pod) of 3‑5 people. Genuine passion for building good products that improve people's lives. Degree in Computer Science, AI, Machine Learning, or a related field, complemented by a solid track record in AI R&D. Bonus points if Extensive experience with Javascript/Typescript. Understand the difficulties, nuances and importance of p2p technology. Experience with any of Vulkan, Metal and OpenCL. Have productionized models. #J-18808-Ljbffr Framework Ventures

Vacancy posted 5 days ago
Similar jobs that could be interesting for youBased on the Lead AI Inference Engineer 100% Remote in New York, NY vacancy
  •  ...transitioning models from research to production environments. Integrate AI features into existing products, enriching them with the...  ...is a bonus. Strong experience with Llama.cpp and ggml inference engines, facilitating the deployment of models to specific GPU architectures... 
    Remote job

    Framework Ventures

    New York, NY
    1 day ago
  • $175k - $225k

     ...veteran operators and engineers, alumni of Sonos, Paypal...  ...from other leading venture capital firms....  ...We're looking for an AI Inference Engineer who lives at...  ...navigation. Exposure to remote logging, log ingestion...  ...role, but do not meet 100% of the qualifications... 
    Remote work
    Local area

    Sauron

    San Francisco, CA
    18 hours ago
  •  ...BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion,...  ...us and help build the platform engineers turn to to ship AI products. THE ROLE...  ...compensation, including meaningful equity. - 100% coverage of medical, dental, and... 
    Remote work
    Work experience placement
    Flexible hours

    Baseten

    United States
    4 days ago
  •  ...A leading technology company in California is seeking an AI Inference Engineer to bridge AI models with unique platforms. Key responsibilities include model optimization, deployment, and performance profiling. Candidates should have a Bachelor’s or Master’s degree, 5+... 
    Remote work
    Work from home

    quadric.io

    Burlingame, CA
    4 days ago
  • A cloud technology company is looking for a Senior Engineer 2 to enhance their AI Inference Optimization team. In this role, you will drive architectural...  ...position offers competitive compensation and is fully remote, promoting a collaborative and innovative work... 
    Remote work

    DigitalOcean

    Seattle, WA
    2 days ago
  •  ...Akamai is seeking a Senior Security Engineer to protect AI inference environments from emerging threats. In this role, you will define and implement...  ...security and AI/ML architectures. The position supports remote work arrangements, allowing flexibility in your work location... 
    Remote work

    Akamai

    Poland, NY
    3 days ago
  • A leading cloud infrastructure company is seeking a Senior Engineer 2 to join their AI Inference Optimization team. The role involves leading the technical strategy for performance...  ...a competitive salary range along with remote work flexibility and numerous professional... 
    Remote job

    DigitalOcean

    San Francisco, CA
    4 days ago
  • $152k - $241.5k

    A leading technology company in Austin is seeking a Senior Compiler Engineer for their AI Inference Platforms team. The role involves analyzing deep learning networks and developing optimization algorithms, requiring expertise in compiler technologies. Ideal candidates... 
    Remote job

    NVIDIA Corporation

    Austin, TX
    4 days ago
  • Rula is seeking a Staff AI Engineer to lead significant AI endeavors within a fully remote work environment. This role involves influencing product innovation and engineering...  ...employees enjoy comprehensive health benefits, 100% remote work, and generous leave policies. #J-188... 
    Remote job
    Full time

    Rula

    Los Angeles, CA
    4 days ago
  • Nerdy is hiring a Staff Engineer in the United States to drive technical direction and support AI-powered customer experiences. This role involves mentoring engineers...  ...familiarity with AI-native tools. The position offers 100% remote work within the home country and competitive... 
    Remote work
    Work from home

    Nerdy

    New York, NY
    2 days ago
  •  ...A remote-first Voice AI startup is looking for an AI Engineer to join their early team. The role involves building and honing major capabilities for their voice...  ...as well as infrastructure for model training and inference. Ideal candidates have a Bachelor's degree in a technical... 
    Remote work

    Incept AI

    New York, NY
    1 day ago
  • $167.2k - $209k

     ...pioneering cloud service provider in Seattle seeks a Senior Engineer 2 for its AI Inference Data Plane team. This role requires designing and...  ...GoLang or Python. Competitive salary range from $167,200 to $209,000 with remote work options. #J-18808-Ljbffr DigitalOcean
    Remote job

    DigitalOcean

    Seattle, WA
    5 days ago
  • $167.2k - $209k

    A leading cloud service provider is seeking a Senior Engineer 2 for their AI Inference Data Plane team. This remote role focuses on designing and developing high-scale, resilient data plane services that enhance AI-driven applications. The ideal candidate will have strong... 
    Remote job

    DigitalOcean

    San Francisco, CA
    8 days ago
  • $100k

     ...Software engineering is undergoing its biggest transformation...  ..., Cloud, and DevOps. AI is changing how software...  ...Engineering Director to lead that transformation. This...  ...within a global, fully remote organization. You will directly...  ...company. We are 100% remote and global. Live... 
    Remote work
    Home office
    Flexible hours
    Shift work

    Ring

    Peru, IL
    3 days ago
  • About the Job We are looking for an experienced AI Model Engineer with deep expertise in kernel development, model optimization, fine‑tuning, and GPU acceleration. The engineer will extend the inference framework to support inference and fine‑tuning for language models... 
    Remote job

    Framework Ventures

    New York, NY
    2 days ago
  •  ...Our Client which is a large Payroll Co. is looking to hire Lead Full Stack Gen AI Engineers. Lead Full Stack Gen AI Engineers. Location - Montvale, N J3 Roles. Long Term Contract .100 % Remote. Lead-Level GenAI Engineer ~ s Expert in... 
    Remote work
    Long term contract

    Iris Software Inc.

    Montvale, NJ
    3 days ago
  • $11.5k - $15.5k

     ...Job Title: Lead Instructor: MLOps / AI Platform Engineering Client: Confidential – Customer Success Reskilling Location: Remote – Must work West Coast / Pacific Time hours Duration: 2 Weeks...  ...Credentials: AZ‑900, AI‑900, and DP‑100 are required; AI‑102 is preferred. The... 
    Remote work
    Work at office

    General Assembly

    California, MO
    4 days ago
  •  ...A financial solutions company is seeking a Lead Generation Representative to work on a 100% commission basis. The role involves taking cold calls to develop leads, working flexible hours from home. Candidates must possess excellent English skills, experience in telemarketing... 
    Remote work
    Work from home
    Flexible hours

    Insurance Protection Specialists

    Cleveland, OH
    4 days ago
  • $144.2k - $288.4k

     ...POSITION SUMMARY CVS Health is hiring a Lead Director, AI/ML Solutions Engineering & Delivery to deliver better health...  ...adoption. This is a U.S.-based remote position; candidates must reside...  ...workloads, supporting high-throughput inference, low-latency APIs, and multi-tenant... 
    Remote work
    Hourly pay
    Full time
    Temporary work
    Work experience placement
    Local area

    9025 CVS Shared Services Resources LLC

    Hartford, CT
    4 days ago
  • $75 per hour

    Overview Senior AI Platform Engineer with Agentic AI & Model Context Protocol (MCP) Location: Preferred local candidates but open to 100% remote working EST HOURS This is a long term contract. Open to W2 and CTC Contractors. This role supports AthenaHealth’s Internal... 
    Remote job
    Long term contract
    Contract work
    For contractors
    Local area

    Yoh, A Day & Zimmermann Company

    Boston, MA
    5 days ago
  • $185.1k - $335.3k

     ...A leading automotive company is seeking a Staff Compiler Engineer to enhance the model compilation stack for autonomous vehicles...  ...optimizing high-level AI models into inference artifacts, defining technical...  ...Competitive salary range is $185,100 to $335,300, alongside... 

    General Motors

    New York, NY
    2 days ago
  • $197.3k - $225.1k

     ...Lead AI Engineer (FM Hosting, LLM Inference) Overview At Capital One, we are creating responsible and reliable AI systems, changing banking for good...  ...to be regularly worked. McLean, VA: $197,300 - $225,100 for Lead AI Engineer New York, NY: $215,200 - $245,... 
    Full time
    Part time
    Local area

    Capital One Financial Corp

    New York, NY
    1 day ago
  •  ..., Arizona is seeking an experienced Inside Sales Consultant who will make 100+ outbound calls daily to generate leads. Candidates must have at least 2-3 years in outbound calling within a remote environment and strong CRM skills. Responsibilities include initiating the... 
    Remote job

    Coverall

    Phoenix, AZ
    3 days ago
  •  ...Navsan is seeking a skilled SR AI Developer to join their Technology team. This role involves...  ...designing and deploying AI solutions, leading projects, and mentoring junior developers....  ...contribute to innovative solutions in a remote work environment, making a significant impact... 
    Remote work

    Navsan

    New York, NY
    3 days ago
  •  ...Lead IT Security Administrator - 100% Remote (Fulltime) Day to day security administration of systems and user ids, which include creation, deletion, modification, password resets for all technologies supported. Ensure adherence to all Security Administration Standard... 
    Remote work
    Full time

    The Dignify Solutions, LLC

    United States
    4 days ago
  •  ...Salesforce, Inc. is looking for a Senior Agentforce Forward Deployed Engineer in New York City. This role emphasizes leading technical delivery of AI solutions, ensuring integration with the Salesforce platform and mentoring junior team members. The ideal candidate will... 
    Remote work

    Salesforce

    New York, NY
    3 days ago
  • A leading technical consulting firm is seeking an experienced Data Scientist for a remote position. This role involves implementing advanced Machine Learning solutions, managing MLOps practices, and engaging directly with clients. A minimum of 4 years of experience in ML... 
    Remote work

    Koantek LLC

    Wildwood, MO
    4 days ago
  •  ...A leading tech firm in Boston seeks an experienced NLP and AI professional to lead the development of strategic AI agents. Key responsibilities include implementing advanced algorithms, managing AI solutions, and collaborating with diverse teams. The ideal candidate has... 
    Remote work

    Red Hat

    Boston, MA
    4 days ago
  • $197.3k - $225.1k

     ...responsible and reliable AI systems, changing banking...  ...class applied science and engineering teams to deliver our industry leading capabilities with...  ...training, large language model inference, similarity search, guardrails...  ..., VA: $197,300 - $225,100 for Lead AI Engineer New... 
    Full time
    Part time
    Local area

    Capital One National Association

    Mc Lean, VA
    3 days ago
  • $197.3k - $225.1k

    Lead AI Engineer (FM Hosting, LLM Inference) Overview At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For...  ...Salary ranges by location: Cambridge, MA: $197,300 - $225,100; McLean, VA: $197,300 - $225,100; New York, NY: $215,2... 
    Local area

    Capital One National Association

    New York, NY
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Lead AI Inference Engineer 100% Remote. Be the first to apply!