Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Edge AI Inference Engineer — Deploy LLMs to Edge Devices

Framework Ventures

Framework Ventures is seeking a skilled Machine Learning Engineer to deploy models to edge devices using frameworks like Llama.cpp and ggml. The role involves collaborating with researchers to transition models from research to production and integrating AI features into current products. Candidates should possess excellent programming skills in C++, a strong grasp of deep learning concepts, and a degree in AI or a related field. Experience with P2P technology and productionized models is a plus. #J-18808-Ljbffr Framework Ventures

Vacancy posted 15 hours ago
Similar jobs that could be interesting for youBased on the Edge AI Inference Engineer — Deploy LLMs to Edge Devices in New York, NY vacancy
  • A technology company in Georgia is seeking a C++ Engineer to own the inference backbone of its AI stack, focusing on deploying models to edge devices. You'll collaborate closely with researchers and manage a cross-functional team to enhance existing products with AI features... 
    Suggested

    Framework Ventures

    New York, NY
    4 days ago
  • $250k

    Edge AI is a production requirement across automotive, robotics, and industrial...  ...underneath it doesn't exist. Every team deploying models on edge devices rebuilds memory management, platform...  ...models are doing in the field. Inference latency, memory pressure, thermal headroom... 
    Suggested

    Forum Ventures

    New York, NY
    2 days ago
  •  ...Ditto is redefining how data moves at the edge. Our mission is to make it seamless...  ...base, Ditto's peer-to-peer sync engine ensures devices stay connected and data stays consistent...  ...problems. About the Position As a Forward Deployed Engineer (FDE), you are the technical... 
    Suggested
    Fixed term contract
    Local area
    Immediate start
    Remote work
    Flexible hours

    Ditto

    New York, NY
    1 day ago
  • $148.75k - $201.25k

     ...Architect, build, deploy, and operate agentic Generative AI systems: APIs, data...  ...Generation, and context engineering. Work in cloud, on-prem, and edge/air-gapped...  .... Familiarity with LLMs or generative AI systems...  ...Gemma, Phi) including inference frameworks, structured... 
    Suggested
    Remote job
    Full time
    For contractors
    Local area
    Home office
    Flexible hours

    Defense Unicorns, Inc

    New York, NY
    15 hours ago
  • $135k - $200k

     ...Forward Deployed Software Engineer - Edge Autonomous Systems Title of Role: Forward Deployed Software Engineer - Edge Autonomous Systems Location: Washington D.C., hybrid Company Stage of Funding: Secondary Market — Software Development Office Type: Hybrid... 
    Suggested
    Work at office

    Recruiting from Scratch

    New York, NY
    4 days ago
  • $123.5k - $197.6k

    Cerence Inc. is seeking experts in Generative AI to design and implement cutting-edge models, collaborating with various teams to advance LLM technology...  ...candidates will have extensive experience in developing LLMs, strong Python skills, and a Ph.D. in a related field. The... 
    Flexible hours

    Cerence Inc.

    New York, NY
    15 hours ago
  • $160k

     ...technology firm in the U.S. seeks a Senior Implementation Engineer for a remote position. You will lead customer deployments and integrations of computer vision systems while...  ...interpersonal skills and proven experience in AI/ML solutions are essential. This role offers a... 
    Remote work

    Franklin Fitch

    New York, NY
    1 day ago
  • A cutting-edge tech startup in the United States is seeking a Forward Deployed Engineer to lead the integration of software with military avionics systems. The ideal candidate will have over 5 years’ experience in software engineering, must possess an active TS/SCI clearance... 

    Ditto

    New York, NY
    1 day ago
  •  ...healthcare. Our AI sensing...  ...building demos. We're deployed in real...  ...an Applied AI Engineer to take our growing...  ...across cloud and edge deployments, and...  ...(VLMs, LLMs, ASR/TTS, detection...  ...Deploy across our inference surfaces: third...  ...implementations Medical devices, SaMD, or other... 

    Norbert Health

    Brooklyn, NY
    1 day ago
  • Feedinkoo is seeking a talented Software AI Engineer to join our customer’s team in...  ...role involves working on cutting-edge AI/ML projects, utilizing LLMs and RAG methodologies. You'll design...  ...expertise in AI methodologies and cloud deployment. Exceptional communication skills... 

    Feedinkoo

    New York, NY
    15 hours ago
  • $200k - $240k

    Traba is the AI operating layer for the industrial...  ...relationships. But our edge is more than data: by...  ...entrepreneurial Senior Applied Agent Engineer to join as a founding...  ...0-to-1 ideas to scaled deployments. About You Builder with...  ...real product on top of LLMs—not just chat wrappers.... 
    Temporary work
    Local area
    Flexible hours
    Shift work
    Day shift

    Traba

    New York, NY
    1 day ago
  • $130.4k - $195.6k

    Qualcomm is looking for a Technical Pre-Sales Engineer to collaborate with North American customers in developing Edge AI solutions. This role focuses on designing data management and machine learning workflows while working alongside engineering and account teams. The... 

    Nutanix

    New York, NY
    4 days ago
  •  ...the fastest-growing AI-native patent intelligence...  ...scale. Our advanced LLMs and generative AI engine-custom-built for...  ...Develop and deploy robust, scalable AI/ML...  ...algorithms for cutting-edge IP applications Design...  ...tradeoffs for production inference: prompt caching strategies... 
    Immediate start
    Remote work

    Patlytics

    New York, NY
    2 days ago
  •  ...leading executive search firm is currently seeking an FAE for a cutting-edge semiconductor company. This remote position will allow you to leverage your deep knowledge of embedded systems and AI/ML to assist customers and deliver technical training. The ideal candidate... 
    Remote job

    Mackenzie Stuart

    New York, NY
    1 day ago
  • Scale AI, Inc. is seeking a Forward Deployed AI Engineer to serve as a technical bridge between the company and its enterprise customers. This role involves...  .... This position offers a chance to work with cutting-edge AI technologies in a dynamic environment. #J-18808-Ljbffr... 

    Scale AI, Inc.

    New York, NY
    15 hours ago
  • Summary Join a rapidly growing AI team as an AI Engineer, playing a pivotal role in building and deploying cutting-edge generative AI systems....  ...agentic AI systems integrating LLMs and GenAI models...  ...orchestration and automation Optimize inference pipelines for scalability,... 
    Remote work

    Prestige Staffing

    New York, NY
    1 day ago
  •  ...Senior AI Engineer Washington D.C. / New York Senior...  ...the cutting-edge development of credit...  ...Large Language Models (LLMs) to advance our use of...  ...development, evaluation, and deployment of machine learning (ML...  ...back-testing, rejection inference, and performance... 
    Flexible hours

    VantageScore®

    New York, NY
    1 day ago
  •  ...Distyl AI develops AI-native technologies that enable...  ...delivering AI deployments with $100M+ in impact....  ...functional teams of AI Engineers, Researchers, and Strategists...  ...Large Language Models (LLMs). At the most senior...  ...problems using cutting-edge AI techniques to deliver... 
    Work at office
    3 days per week

    Distyl AI

    New York, NY
    more than 2 months ago
  • $175k - $230k

     ...this Role As a Senior/Staff Backend IoT Engineer, you will be a technical leader...  ...define the architectural direction for our edge computing systems and drive technical decisions...  ...that will run on thousands of on-premise deployed systems and design the cloud systems that... 
    Apprenticeship
    Work at office
    Local area
    Immediate start
    Remote work
    2 days per week

    Sage Group plc

    New York, NY
    15 hours ago
  •  ...Join the Future of AI at Tessera Labs Tessera...  ...looking for a top-quality AI engineer with a strong focus on...  ...knows how to leverage LLMs, both open-source and...  ...AWS, Azure, or GCP. Deploy inference endpoints and serve AI...  ...take pride in cutting-edge AI, value clear ownership... 

    Tessera Labs

    New York, NY
    1 day ago
  • $200k - $250k

     ...its clients an Applied AI Engineer - this is a hybrid position...  ...and asset managers. We deploy software and custom...  ...knowledge and cutting-edge AI engineering - closing...  ...access, analytics, and AI inference Frontend Development...  ...possible - integrating LLMs, agentic workflows, and... 

    Career Renew

    New York, NY
    1 day ago
  • $135k - $175k

     ...quality and customer satisfaction in manufacturing. Our cutting-edge AI-powered platform empowers manufacturers to swiftly identify,...  .... About the Role: Axion Ray is looking for a Forward Deployed Engineer to join our Solutions team and help drive high-impact delivery... 
    Full time
    Local area
    Flexible hours

    Axion Ray

    New York, NY
    1 day ago
  • Capital One National Association is looking for a Lead AI Engineer in New York, NY, to drive the development of...  ...collaborating with cross-functional teams, designing, and deploying AI products using cutting-edge technologies. The ideal candidate will possess a Bachelor... 

    Capital One National Association

    New York, NY
    2 days ago
  • $300k - $400k

     ...Principal AI/ML Engineer - AdTech New York, New York, United...  ...design, build, and deploy ML solutions for...  ...large language models (LLMs) and intelligent agent...  ...training to real-time inference, for our real-time bidding...  ...capabilities on the cutting edge. AI & Agentic... 

    Zeta Global

    New York, NY
    15 hours ago
  • $100k - $200k

     ...Client Our Client is building AI agents that automate and...  ..., they are seeking a Forward Deployed AI Engineer to work directly with customers...  ...~ Hands-on experience with; LLMs, Agent frameworks, Prompt engineering...  ...Exposure to cutting-edge AI technologies Fast-moving... 
    Remote job
    Full time

    C&D Talent Advisory

    New York, NY
    10 days ago
  • $200k - $250k

     ...Founding AI Engineer New York City (Union Square). In-person. $200...  ...help us keep the system at the edge of what's possible, beyond...  ...our clients can dream of: new inference strategies, new model capabilities...  .... You have worked with LLMs, agents, tool use, and the standard... 
    Temporary work
    Flexible hours

    VoiceOps

    New York, NY
    2 days ago
  •  ...AI Engineers - USA New York About Indicium AI Indicium AI is trusted...  ...to design, build, and deploy production-grade AI systems powered...  .... You'll work with cutting-edge LLM technologies, building...  ...production AI systems integrating LLMs, RAG pipelines, vector... 

    Indicium

    New York, NY
    1 day ago
  •  ...Job Description Job Title: AI Engineer Location: United States (can...  ...Engineer to help design, build, and deploy innovative digital and AI-...  ...large language models (LLMs), to develop intelligent applications...  ...to work on cutting-edge AI and digital transformation... 

    Inizio Partners Corp

    New York, NY
    26 days ago
  • $150k - $200k

     ...Voice AI Prompt Engineer CollectWise is a fast growing and well funded...  ...systems that power customer deployments. You'll work at the intersection...  ...-safe phrasing, and edge-case handling. Build and...  ...Experience with GPT-5 or other LLMs. Compensation ~... 
    Immediate start

    Latitude

    New York, NY
    3 days ago
  • $160k

     ...About the job AI Engineer Job Title: AI Engineer Agentic & RAG...  ...pipelines to ground LLMs in enterprise or domain-specific...  ...sensitive data, user content, and edge cases with clear escalation...  ...training, feature engineering, and deployment of ML models (scikit-learn,... 
    Remote work

    Inizio Partners

    New York, NY
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Edge AI Inference Engineer — Deploy LLMs to Edge Devices. Be the first to apply!