Edge AI Inference Engineer On-Device ML Systems
Framework Ventures
A technology company in Georgia is seeking a C++ Engineer to own the inference backbone of its AI stack, focusing on deploying models to edge devices. You'll collaborate closely with researchers and manage a cross-functional team to enhance existing products with AI features. The ideal candidate has excellent C++ skills, is familiar with Llama.cpp and ggml, and possesses a solid background in AI and machine learning. This role offers an exciting opportunity to influence the development of next-generation peer-to-peer AI products.
- A leading AI development firm in New York is looking for an AI/ML Systems Engineer to build and maintain on-device inference engines for local LLMs. The ideal candidate will have over 3 years of experience with C++ and Python, along with experience in machine learning... Suggested, Local area
- ...Framework Ventures is seeking a skilled Machine Learning Engineer to deploy models to edge devices using frameworks like Llama.cpp and ggml. The role... ...transition models from research to production and integrating AI features into current products. Candidates should... Suggested
- ...Senior AI Engineer - Inference & Agent Systems United States Title: Applied AI Engineer - Inference & Agent Systems Location: United States What We're... ...LlamaIndex but haven't built the layer underneath Strong ML research background without systems exposure Stack familiarity... Suggested
- Staff AI Engineer, Conversation Intelligence Systems (New York) Apply now Who we are Xenoss is an... ..., AI assistants, edge computer vision, fraud detection... ...the modern applied AI and ML ecosystem, including, but... ...systems or low‑latency inference Experience combining unstructured... Suggested, Long term contract
$12 per hour
...Unicorn, $1B+). 6 AI patents. Enterprise... .... Founding AI Engineer (Applied ML / Vision + LLM) Engineer... ...the nervous system for construction.... ...You'll turn cutting‑edge LLM and vision research... ...sites and mobile devices. About the Role... ...record of optimizing inference costs What You'll... Suggested, Full time, For contractors, Remote work, Flexible hours
$197.3k - $225.1k
Lead AI Engineer (FM Hosting, LLM Inference) Overview At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry... ...real time, our applications of AI & ML are bringing humanity and simplicity... Full time, Part time, Local area, Immediate start
- ...healthcare. Our AI sensing platform... ...an Applied AI Engineer to take our... ...foundation models and ML components from... ...cloud and edge deployments, and some of the systems you'll touch are... ...Deploy across our inference surfaces: third-... ...implementations Medical devices, SaMD, or other...
- * You love to build systems, take pride in the quality... ...a strong foundation in engineering and mathematics, and your... ..., software, and AI enable you to see and exploit... ...developing AI and ML algorithms or technologies... ...technologies (e.g. LLM Inference, Similarity Search and... Full time, Part time
$300k
...Ventures is hiring for a role focused on building and maintaining systems for AI applications, optimizing request routing across diverse accelerators. The ideal candidate has strong software engineering skills, particularly in distributed systems, and a passion for advancing...
- ...leading executive search firm is currently seeking an FAE for a cutting-edge semiconductor company. This remote position will allow you to leverage your deep knowledge of embedded systems and AI/ML to assist customers and deliver technical training. The ideal candidate... Remote work
$158.1k - $213.8k
...worldwide use their Amazon devices for entertainment, and... ...and talented Software Engineers with experience... ...latency, highly scalable systems enabling thousands of concurrent... ...ad Experiences (EDGE) team is charged with inventing... ...combine data science (AI/ML), device hardware,... Internship, Worldwide, Flexible hours
- ...the fastest-growing AI-native patent intelligence... ...and generative AI engine-custom-built for... ...robust, scalable AI/ML algorithms for cutting-edge IP applications Design... ...Develop retrieval systems (vector search, BM25... ...tradeoffs for production inference: prompt caching strategies... Immediate start, Remote work
- ...Consultants is seeking an experienced Artificial Intelligence Engineer to design and deploy AI/ML and Generative AI solutions addressing real-world... ...-driven applications using production-grade LLM-powered systems. The ideal candidate has 5+ years in AI/ML with solid Python...
$30 - $50 per hour
...A leading tech company is seeking an AI Engineer for a remote role. You will build, evaluate, and deploy production AI systems, focusing on NLP, LLMs, and computer vision. The ideal... ...have mid-senior experience in delivering ML systems, strong Python skills, and a collaborative... Hourly pay, Remote work
$60 per hour
...contribute to developing cutting-edge AI systems, while enjoying the... ...optimization, and statistical inference. Write clear technical explanations... ...Science, Mathematics, Engineering, or similar); a master's or... ...Competition ranking, AWS/GCP ML certifications, or equivalent... Hourly pay, Full time, Remote work, Flexible hours
$185.1k - $335.3k
...automotive company is seeking a Staff Compiler Engineer to enhance the model compilation stack... ...role involves optimizing high-level AI models into inference artifacts, defining technical visions,... .../C++ skills, and familiarity with ML frameworks. Competitive salary range...
$165k - $290k
...thrive with our open, AI-driven commerce... ...connect the tools and systems that power growth, enabling... ...-driven AI Lead Engineer to play a pivotal... ...results through cutting-edge AI technologies. In... ...tools, inference frameworks, cloud-native ML workflows) is required... Local area, Remote work
$170k - $210k
...growing NVIDIA‑backed AI company enabling AI... ...The AI Infrastructure Engineer is responsible for designing... ...Utilidata's AI and ML models across edge deployments, cloud... ...power data with AI inference software. This is Utilidata... ...serving, distributed systems, and GPU... Local area, Remote work, Flexible hours
- ...Join The Future Of AI At Tessera Labs... ...build multi-agent AI systems that can automate complex... ...a top-quality AI engineer with a strong focus... ...about AI, ML, and AI agents, we... ...or GCP. Deploy inference endpoints and serve... ...take pride in cutting-edge AI, value clear ownership...
$175k - $250k
...is developing a cutting‑edge autonomous agent... ...market outcomes. The Staff AI Engineer will be responsible for... ...propagation of insights to a system where the fleet gets... ...strategies. Model & Inference Infrastructure Ownership... ...Qualifications Production ML Engineering: Proven... Full time, Immediate start, Remote work, Shift work
$178k - $316k
...Applied AI Engineer At Quizlet, our mission is to help... ...retrieval and ranking systems that match learners with... ...components of end-to-end ML systems: candidate... ...improve latency/cost-aware inference; contribute to offline... ...interactions per week Cutting-edge tech: Generative AI,... Work at office, 3 days per week
$172k - $349k
...Enterprise is the global edge-to-cloud company... ...Job Description As an AI Solution Engineer your role will be to architect... ...). Competency writing ML code (for example,... ...Python, Unix‑like systems. Ability to quickly prototype... ...model training or inference, and how model attributes... Work experience placement, Remote work, Work from home
$215k - $230k
...Framework Ventures is seeking an AI Engineer for the Model Engineering Team. This role involves designing production-grade evaluation systems, optimizing machine learning models, and collaborating with a cross-functional team. Candidates should have advanced skills in...
- A leading financial technology firm is looking for a Senior Software Engineer to join its AI Group in New York. The role involves collaborating on and designing production machine learning systems and applications. The ideal candidate will have over 7 years of programming...
$180k - $220k
...A healthcare technology company is seeking an experienced AI systems engineer to design and operate production AI workflows. The successful candidate... ...have 8+ years in software engineering, specifically in AI/ML systems, and a proven track record of deploying production AI... Remote work, Flexible hours
- ...Design and implement end-to-end agentic AI systems for production environments, including planning... ...architectural design reviews and mentor engineers on agent design, evaluation, and safe... ...years designing and deploying production ML/AI systems, including deployment, monitoring...
- ..., Java, Go, or C/C++), with strong production systems exposure • 3+ years designing and deploying production ML/AI systems, including deployment, monitoring, and... ...using / function-calling agents o Prompt engineering and adaptation • Deep understanding of...
$50 - $60 per hour
...US). Job Duties Build agentic AI systems: Design and implement tool-calling agents... ...policy enforcement) following MCP protocol. Engineer robust guardrails for safety, compliance,... ..., testing, and launching production ML systems, including model deployment/serving... Hourly pay, Remote work
- ...A global technology firm seeks a Senior AI Engineer for remote work. This position requires 4... ...Responsibilities include designing and deploying AI systems, leading model monitoring, and developing... ..., with experience in cloud platforms and ML frameworks. The company values diversity... Remote work
- ...you will do As our ML Engineer Intern, you'll be the... ...questions: How do we build ML systems that scale to millions... ...do we leverage cutting-edge models to enhance... ...datasets Implementing inference systems for content... ...and deploying multimodal AI systems using MLOps best... Contract work, Internship, Immediate start, Remote work, Worldwide


