Edge AI Inference Engineer — Deploy LLMs to Edge Devices
Framework Ventures
Framework Ventures is seeking a skilled Machine Learning Engineer to deploy models to edge devices using frameworks like Llama.cpp and ggml. The role involves collaborating with researchers to transition models from research to production and integrating AI features into current products. Candidates should possess excellent programming skills in C++, a strong grasp of deep learning concepts, and a degree in AI or a related field. Experience with P2P technology and productionized models is a plus. #J-18808-Ljbffr Framework Ventures
- A technology company in Georgia is seeking a C++ Engineer to own the inference backbone of its AI stack, focusing on deploying models to edge devices. You'll collaborate closely with researchers and manage a cross-functional team to enhance existing products with AI features...Suggested
$250k
Edge AI is a production requirement across automotive, robotics, and industrial... ...underneath it doesn't exist. Every team deploying models on edge devices rebuilds memory management, platform... ...models are doing in the field. Inference latency, memory pressure, thermal headroom...Suggested- ...Ditto is redefining how data moves at the edge. Our mission is to make it seamless... ...base, Ditto's peer-to-peer sync engine ensures devices stay connected and data stays consistent... ...problems. About the Position As a Forward Deployed Engineer (FDE), you are the technical...SuggestedFixed term contractLocal areaImmediate startRemote workFlexible hours
$148.75k - $201.25k
...Architect, build, deploy, and operate agentic Generative AI systems: APIs, data... ...Generation, and context engineering. Work in cloud, on-prem, and edge/air-gapped... .... Familiarity with LLMs or generative AI systems... ...Gemma, Phi) including inference frameworks, structured...SuggestedRemote jobFull timeFor contractorsLocal areaHome officeFlexible hours$135k - $200k
...Forward Deployed Software Engineer - Edge Autonomous Systems Title of Role: Forward Deployed Software Engineer - Edge Autonomous Systems Location: Washington D.C., hybrid Company Stage of Funding: Secondary Market — Software Development Office Type: Hybrid...SuggestedWork at office$123.5k - $197.6k
Cerence Inc. is seeking experts in Generative AI to design and implement cutting-edge models, collaborating with various teams to advance LLM technology... ...candidates will have extensive experience in developing LLMs, strong Python skills, and a Ph.D. in a related field. The...Flexible hours$160k
...technology firm in the U.S. seeks a Senior Implementation Engineer for a remote position. You will lead customer deployments and integrations of computer vision systems while... ...interpersonal skills and proven experience in AI/ML solutions are essential. This role offers a...Remote work- A cutting-edge tech startup in the United States is seeking a Forward Deployed Engineer to lead the integration of software with military avionics systems. The ideal candidate will have over 5 years’ experience in software engineering, must possess an active TS/SCI clearance...
- ...healthcare. Our AI sensing... ...building demos. We're deployed in real... ...an Applied AI Engineer to take our growing... ...across cloud and edge deployments, and... ...(VLMs, LLMs, ASR/TTS, detection... ...Deploy across our inference surfaces: third... ...implementations Medical devices, SaMD, or other...
- Feedinkoo is seeking a talented Software AI Engineer to join our customer’s team in... ...role involves working on cutting-edge AI/ML projects, utilizing LLMs and RAG methodologies. You'll design... ...expertise in AI methodologies and cloud deployment. Exceptional communication skills...
$200k - $240k
Traba is the AI operating layer for the industrial... ...relationships. But our edge is more than data: by... ...entrepreneurial Senior Applied Agent Engineer to join as a founding... ...0-to-1 ideas to scaled deployments. About You Builder with... ...real product on top of LLMs—not just chat wrappers....Temporary workLocal areaFlexible hoursShift workDay shift$130.4k - $195.6k
Qualcomm is looking for a Technical Pre-Sales Engineer to collaborate with North American customers in developing Edge AI solutions. This role focuses on designing data management and machine learning workflows while working alongside engineering and account teams. The...- ...the fastest-growing AI-native patent intelligence... ...scale. Our advanced LLMs and generative AI engine-custom-built for... ...Develop and deploy robust, scalable AI/ML... ...algorithms for cutting-edge IP applications Design... ...tradeoffs for production inference: prompt caching strategies...Immediate startRemote work
- ...leading executive search firm is currently seeking an FAE for a cutting-edge semiconductor company. This remote position will allow you to leverage your deep knowledge of embedded systems and AI/ML to assist customers and deliver technical training. The ideal candidate...Remote job
- Scale AI, Inc. is seeking a Forward Deployed AI Engineer to serve as a technical bridge between the company and its enterprise customers. This role involves... .... This position offers a chance to work with cutting-edge AI technologies in a dynamic environment. #J-18808-Ljbffr...
- Summary Join a rapidly growing AI team as an AI Engineer, playing a pivotal role in building and deploying cutting-edge generative AI systems.... ...agentic AI systems integrating LLMs and GenAI models... ...orchestration and automation Optimize inference pipelines for scalability,...Remote work
- ...Senior AI Engineer Washington D.C. / New York Senior... ...the cutting-edge development of credit... ...Large Language Models (LLMs) to advance our use of... ...development, evaluation, and deployment of machine learning (ML... ...back-testing, rejection inference, and performance...Flexible hours
- ...Distyl AI develops AI-native technologies that enable... ...delivering AI deployments with $100M+ in impact.... ...functional teams of AI Engineers, Researchers, and Strategists... ...Large Language Models (LLMs). At the most senior... ...problems using cutting-edge AI techniques to deliver...Work at office3 days per week
$175k - $230k
...this Role As a Senior/Staff Backend IoT Engineer, you will be a technical leader... ...define the architectural direction for our edge computing systems and drive technical decisions... ...that will run on thousands of on-premise deployed systems and design the cloud systems that...ApprenticeshipWork at officeLocal areaImmediate startRemote work2 days per week- ...Join the Future of AI at Tessera Labs Tessera... ...looking for a top-quality AI engineer with a strong focus on... ...knows how to leverage LLMs, both open-source and... ...AWS, Azure, or GCP. Deploy inference endpoints and serve AI... ...take pride in cutting-edge AI, value clear ownership...
$200k - $250k
...its clients an Applied AI Engineer - this is a hybrid position... ...and asset managers. We deploy software and custom... ...knowledge and cutting-edge AI engineering - closing... ...access, analytics, and AI inference Frontend Development... ...possible - integrating LLMs, agentic workflows, and...$135k - $175k
...quality and customer satisfaction in manufacturing. Our cutting-edge AI-powered platform empowers manufacturers to swiftly identify,... .... About the Role: Axion Ray is looking for a Forward Deployed Engineer to join our Solutions team and help drive high-impact delivery...Full timeLocal areaFlexible hours- Capital One National Association is looking for a Lead AI Engineer in New York, NY, to drive the development of... ...collaborating with cross-functional teams, designing, and deploying AI products using cutting-edge technologies. The ideal candidate will possess a Bachelor...
$300k - $400k
...Principal AI/ML Engineer - AdTech New York, New York, United... ...design, build, and deploy ML solutions for... ...large language models (LLMs) and intelligent agent... ...training to real-time inference, for our real-time bidding... ...capabilities on the cutting edge. AI & Agentic...$100k - $200k
...Client Our Client is building AI agents that automate and... ..., they are seeking a Forward Deployed AI Engineer to work directly with customers... ...~ Hands-on experience with; LLMs, Agent frameworks, Prompt engineering... ...Exposure to cutting-edge AI technologies Fast-moving...Remote jobFull time$200k - $250k
...Founding AI Engineer New York City (Union Square). In-person. $200... ...help us keep the system at the edge of what's possible, beyond... ...our clients can dream of: new inference strategies, new model capabilities... .... You have worked with LLMs, agents, tool use, and the standard...Temporary workFlexible hours- ...AI Engineers - USA New York About Indicium AI Indicium AI is trusted... ...to design, build, and deploy production-grade AI systems powered... .... You'll work with cutting-edge LLM technologies, building... ...production AI systems integrating LLMs, RAG pipelines, vector...
- ...Job Description Job Title: AI Engineer Location: United States (can... ...Engineer to help design, build, and deploy innovative digital and AI-... ...large language models (LLMs), to develop intelligent applications... ...to work on cutting-edge AI and digital transformation...
$150k - $200k
...Voice AI Prompt Engineer CollectWise is a fast growing and well funded... ...systems that power customer deployments. You'll work at the intersection... ...-safe phrasing, and edge-case handling. Build and... ...Experience with GPT-5 or other LLMs. Compensation ~...Immediate start$160k
...About the job AI Engineer Job Title: AI Engineer Agentic & RAG... ...pipelines to ground LLMs in enterprise or domain-specific... ...sensitive data, user content, and edge cases with clear escalation... ...training, feature engineering, and deployment of ML models (scikit-learn,...Remote work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Edge AI Inference Engineer — Deploy LLMs to Edge Devices. Be the first to apply!


