Edge AI Inference Engineer — On-Device ML Systems
Framework Ventures
A technology company in Georgia is seeking a C++ Engineer to own the inference backbone of its AI stack, focusing on deploying models to edge devices. You'll collaborate closely with researchers and manage a cross-functional team to enhance existing products with AI features. The ideal candidate has excellent C++ skills, is familiar with llama.cpp and ggml, and possesses a solid background in AI and machine learning. This role offers an exciting opportunity to influence the development of next-generation peer-to-peer AI products.
- Framework Ventures is seeking a skilled Machine Learning Engineer to deploy models to edge devices using frameworks like Llama.cpp and ggml. The role... ...transition models from research to production and integrating AI features into current products. Candidates should... Suggested
- Senior AI Engineer — Inference & Agent Systems United States Title: Applied AI Engineer — Inference & Agent Systems Location: United States What We're... ...but haven't built the layer underneath Strong ML research background without systems exposure Stack familiarity... Suggested
- ...healthcare. Our AI sensing platform... ...an Applied AI Engineer to take our... ...foundation models and ML components from... ...cloud and edge deployments, and some of the systems you'll touch are... ...Deploy across our inference surfaces: third-... ...implementations Medical devices, SaMD, or other... Suggested
$197.3k - $225.1k
...Lead AI Engineer (FM Hosting, LLM Inference) Overview At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry... ...time, our applications of AI & ML are bringing humanity and... Suggested, Full time, Part time, Local area
- Responsibilities Deploy machine learning models to edge devices using the frameworks: llama.cpp, ggml,... ...to production environments. Integrate AI features into existing products,... ...experience with Llama.cpp and ggml inference engines, facilitating the deployment of models... Suggested, Remote job
$197.3k - $225.1k
Lead AI Engineer (FM Hosting, LLM Inference) Overview At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry... ...time, our applications of AI and ML are bringing humanity and simplicity... Local area
$300k
...Ventures is hiring for a role focused on building and maintaining systems for AI applications, optimizing request routing across diverse accelerators. The ideal candidate has strong software engineering skills, particularly in distributed systems, and a passion for advancing...
- ...Inference Engine Engineer We build and run the inference engine behind every Perplexity query... ...Triton, CUTLASS, or similar). Any other deep systems programming experience is a plus.... .... Good If You Touched Any Of ML Compilers and Framework Internals: PyTorch...
- ...leading executive search firm is currently seeking an FAE for a cutting-edge semiconductor company. This remote position will allow you to leverage your deep knowledge of embedded systems and AI/ML to assist customers and deliver technical training. The ideal candidate... Remote job
- ...Senior AI Engineer Washington D.C. / New York Senior... ...supporting the cutting-edge development of credit scoring... ...of machine learning (ML) models, including... ...back-testing, rejection inference, and performance analyses... ...models into production systems, ensuring scalability... Flexible hours
- ...the fastest-growing AI-native patent intelligence... ...and generative AI engine-custom-built for... ...robust, scalable AI/ML algorithms for cutting-edge IP applications Design... ...Develop retrieval systems (vector search, BM25... ...tradeoffs for production inference: prompt caching strategies... Immediate start, Remote work
- ...Join the Future of AI at Tessera Labs... ...build multi-agent AI systems that can automate complex... ...a top-quality AI engineer with a strong focus... ...about AI, ML, and AI agents, we... ...or GCP. Deploy inference endpoints and serve... ...take pride in cutting-edge AI, value clear ownership...
$150k - $180k
Ironclad is the leading AI contracting... ...company events. AI Engineering @ Ironclad Ironclad... ...re looking for an AI/ML Engineer to help shape... ...work with cutting‑edge tools such as... ...models and intelligent systems that extract structured... ..., evaluation & inference, with a focus on scalability... Full time, Contract work, Work at office
- ...specialized, efficient AI. Fastino's GLiNER... ...Innovate at the edge of efficiency by designing... ...agentic systems that leverage Fastino... ...collaborating with engineering teams to turn novel... ...and throughput of inference pipelines, proactively... ...on experience in AI/ML engineering roles.... Full time, Remote work
- ...in computer vision and AI-powered document processing... ...are seeking a Python engineer to join our team in the... ..., working on the AI inference pipeline. This powers our... ...closely with senior ML engineers to add new OCR... ...engineers on real-world ML systems PyTorch experience... Full time, Work at office, Remote work
- A leading financial technology firm is looking for a Senior Software Engineer to join its AI Group in New York. The role involves collaborating on and designing production machine learning systems and applications. The ideal candidate will have over 7 years of programming...
- ...Consultants is seeking an experienced Artificial Intelligence Engineer to design and deploy AI/ML and Generative AI solutions addressing real-world... ...-driven applications using production-grade LLM-powered systems. The ideal candidate has 5+ years in AI/ML with solid Python...
$250k
Edge AI is a production requirement across automotive, robotics, and... ...deploying models on edge devices rebuilds memory management, platform... ...are doing in the field. Inference latency, memory pressure, thermal... ...Application‑specific memory systems for edge workloads don't yet...
- ...you will do As our ML Engineer Intern, you'll be the... ...questions: How do we build ML systems that scale to millions... ...do we leverage cutting-edge models to enhance... ...datasets Implementing inference systems for content... ...and deploying multimodal AI systems using MLOps best... Contract work, Internship, Immediate start, Remote work, Worldwide
- ...Department: Engineering & Technology Function... ...tech company using AI, machine learning,... ...We run on cutting-edge tools, creative experimentation... ...you score, every system you deploy here... ..., and own AI and ML systems that... ...models for contextual inference, personalization, and... Price work, Full time, Casual work, Work at office, Remote work, Day shift
- An innovative AI startup in New York is looking for a Technical... ...performance across distributed systems. Candidates should ideally... ...generative AI. The role emphasizes engineering and research capabilities,... ...benefits and visa sponsorship available. Adaptive ML, Visa sponsorship
$216.42k - $324.63k
Color Employer, LLC is seeking a Staff AI Engineer to architect core ML infrastructure and establish engineering standards for intelligent systems that extract data from unstructured documents. The role requires a minimum of 10 years in AI/ML and includes mentoring AI engineers...
$175k - $250k
...is developing a cutting‑edge autonomous agent... ...market outcomes. The Staff AI Engineer will be responsible for... ...propagation of insights to a system where the fleet gets... ...strategies. Model & Inference Infrastructure Ownership... ...Production ML Engineering: Proven experience... Full time, Immediate start, Remote work, Shift work
$185.1k - $335.3k
...automotive company is seeking a Staff Compiler Engineer to enhance the model compilation stack... ...role involves optimizing high-level AI models into inference artifacts, defining technical visions,... .../C++ skills, and familiarity with ML frameworks. Competitive salary range...
$300k - $400k
...Principal AI/ML Engineer - AdTech New York, New York, United States... ...science teams to ensure our ML systems are highly performant, scalable... ...and training to real-time inference, for our real-time bidding, targeting... ...capabilities on the cutting edge. AI & Agentic Applications...
- ...powers mission-critical inference for the world's most dynamic AI companies, like... ...AI to bring cutting-edge models into production... ...build the platform engineers turn to to ship AI products... ...that ensure our systems are production-ready... ...to a variety of ML startups, offering unparalleled... Flexible hours
- Pfizer Belgium in New York is seeking an experienced AI Engineer to design and build enterprise-grade AI systems for Clinical Development & Operations. This hybrid role involves developing AI/ML models, automating processes, and collaborating across disciplines to drive...
- ...Mistral At Mistral AI, we believe in the... ...source and cutting-edge models, products... ...of Applied AI Engineers, ensuring the successful... ...of complex AI systems, including fine-... ...practices for fine-tuning, inference, and deployment.... ...experience in AI/ML, with at least 2+... Work at office, Visa sponsorship
- ...States (US). Job Duties Build agentic AI systems: Design and implement tool‑calling agents... ...policy enforcement) following MCP protocol. Engineer robust guardrails for safety, compliance,... ..., testing, and launching production ML systems, including model deployment/serving... Hourly pay, Remote work
$100k - $125k
...of recommendation systems powering our partnership... ..., applying cutting-edge techniques in... ...representation learning, graph ML, and retrieval. The... ...partnership with Engineering, Product, MLOps,... ...relentless user of AI coding agents to... ...), and low-latency inference patterns.... Work at office, Home office, Flexible hours