Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Edge AI Inference Engineer — On-Device ML Systems

Framework Ventures

A technology company in Georgia is seeking a C++ Engineer to own the inference backbone of its AI stack, focusing on deploying models to edge devices. You'll collaborate closely with researchers and manage a cross-functional team to enhance existing products with AI features. The ideal candidate has excellent C++ skills, is familiar with Llama.cpp and ggml, and possesses a solid background in AI and machine learning. This role offers an exciting opportunity to influence the development of next-generation peer-to-peer AI products. #J-18808-Ljbffr Framework Ventures

Vacancy posted 10 hours ago
Similar jobs that could be interesting for youBased on the Edge AI Inference Engineer — On-Device ML Systems in New York, NY vacancy
  • Framework Ventures is seeking a skilled Machine Learning Engineer to deploy models to edge devices using frameworks like Llama.cpp and ggml. The role...  ...transition models from research to production and integrating AI features into current products. Candidates should... 
    Suggested

    Framework Ventures

    New York, NY
    1 day ago
  • Senior AI Engineer — Inference & Agent Systems United States Title: Applied AI Engineer — Inference & Agent Systems Location: United States What We're...  ...but haven't built the layer underneath Strong ML research background without systems exposure Stack familiarity... 
    Suggested

    Arcana Analytics Inc.

    New York, NY
    1 day ago
  •  ...healthcare. Our AI sensing platform...  ...an Applied AI Engineer to take our...  ...foundation models and ML components from...  ...cloud and edge deployments, and some of the systems you'll touch are...  ...Deploy across our inference surfaces: third-...  ...implementations Medical devices, SaMD, or other... 
    Suggested

    Norbert Health

    Brooklyn, NY
    2 days ago
  • $197.3k - $225.1k

     ...Lead AI Engineer (FM Hosting, LLM Inference) Overview At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry...  ...time, our applications of AI & ML are bringing humanity and... 
    Suggested
    Full time
    Part time
    Local area

    Capital One Financial Corp

    New York, NY
    1 day ago
  • Responsibilities Deploy machine learning models to edge devices using the frameworks: llama.cpp, ggml,...  ...to production environments. Integrate AI features into existing products,...  ...experience with Llama.cpp and ggml inference engines, facilitating the deployment of models... 
    Suggested
    Remote job

    Framework Ventures

    New York, NY
    1 day ago
  • $197.3k - $225.1k

    Lead AI Engineer (FM Hosting, LLM Inference) Overview At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry...  ...time, our applications of AI and ML are bringing humanity and simplicity... 
    Local area

    Capital One National Association

    New York, NY
    3 days ago
  • $300k

     ...Ventures is hiring for a role focused on building and maintaining systems for AI applications, optimizing request routing across diverse accelerators. The ideal candidate has strong software engineering skills, particularly in distributed systems, and a passion for advancing... 

    Menlo Ventures

    New York, NY
    2 days ago
  •  ...Inference Engine Engineer We build and run the inference engine behind every Perplexity query...  ...Triton, CUTLASS, or similar). Any other deep systems programming experience is a plus....  .... Good If You Touched Any Of ML Compilers and Framework Internals: PyTorch... 

    Perplexity AI

    New York, NY
    11 days ago
  •  ...leading executive search firm is currently seeking an FAE for a cutting-edge semiconductor company. This remote position will allow you to leverage your deep knowledge of embedded systems and AI/ML to assist customers and deliver technical training. The ideal candidate... 
    Remote job

    Mackenzie Stuart

    New York, NY
    2 days ago
  •  ...Senior AI Engineer Washington D.C. / New York Senior...  ...supporting the cutting-edge development of credit scoring...  ...of machine learning (ML) models, including...  ...back-testing, rejection inference, and performance analyses...  ...models into production systems, ensuring scalability... 
    Flexible hours

    VantageScore®

    New York, NY
    2 days ago
  •  ...the fastest-growing AI-native patent intelligence...  ...and generative AI engine-custom-built for...  ...robust, scalable AI/ML algorithms for cutting-edge IP applications Design...  ...Develop retrieval systems (vector search, BM25...  ...tradeoffs for production inference: prompt caching strategies... 
    Immediate start
    Remote work

    Patlytics

    New York, NY
    3 days ago
  •  ...Join the Future of AI at Tessera Labs...  ...build multi-agent AI systems that can automate complex...  ...a top-quality AI engineer with a strong focus...  ...about AI, ML, and AI agents, we...  ...or GCP. Deploy inference endpoints and serve...  ...take pride in cutting-edge AI, value clear ownership... 

    Tessera Labs

    New York, NY
    2 days ago
  • $150k - $180k

    Ironclad is the leading AI contracting...  ...company events. AI Engineering @ Ironclad Ironclad...  ...re looking for a AI/ML Engineer to help shape...  ...work with cutting‑edge tools such as...  ...models and intelligent systems that extract structured...  ..., evaluation & inference, with a focus on scalability... 
    Full time
    Contract work
    Work at office

    Ironclad

    New York, NY
    2 days ago
  •  ...specialized, efficient AI. Fastino's GLiNER...  ...Innovate at the edge of efficiency by designing...  ...agentic systems that leverage Fastino...  ...collaborating with engineering teams to turn novel...  ...and throughput of inference pipelines, proactively...  ...on experience in AI/ML engineering roles.... 
    Full time
    Remote work

    Fastino Labs

    New York, NY
    2 days ago
  •  ...in computer vision and AI-powered document processing...  ...are seeking a Python engineer to join our team in the...  ..., working on the AI inference pipeline. This powers our...  ...closely with senior ML engineers to add new OCR...  ...engineers on real-world ML systems PyTorch experience... 
    Full time
    Work at office
    Remote work

    Mathpix

    New York, NY
    4 days ago
  • A leading financial technology firm is looking for a Senior Software Engineer to join its AI Group in New York. The role involves collaborating on and designing production machine learning systems and applications. The ideal candidate will have over 7 years of programming... 

    Bloomberg

    New York, NY
    3 days ago
  •  ...Consultants is seeking an experienced Artificial Intelligence Engineer to design and deploy AI/ML and Generative AI solutions addressing real-world...  ...-driven applications using production-grade LLM-powered systems. The ideal candidate has 5+ years in AI/ML with solid Python... 

    Bluetick Consultants

    New York, NY
    3 days ago
  • $250k

    Edge AI is a production requirement across automotive, robotics, and...  ...deploying models on edge devices rebuilds memory management, platform...  ...are doing in the field. Inference latency, memory pressure, thermal...  ...Application‑specific memory systems for edge workloads don't yet... 

    Forum Ventures

    New York, NY
    3 days ago
  •  ...you will do As our ML Engineer Intern, you'll be the...  ...questions: How do we build ML systems that scale to millions...  ...do we leverage cutting-edge models to enhance...  ...datasets Implementing inference systems for content...  ...and deploying multimodal AI systems using MLOps best... 
    Contract work
    Internship
    Immediate start
    Remote work
    Worldwide

    Melotech

    New York, NY
    1 day ago
  •  ...Department: Engineering & Technology Function...  ...tech company using AI, machine learning,...  ...We run on cutting-edge tools, creative experimentation...  ...you score, every system you deploy here...  ..., and own AI and ML systems that...  ...models for contextual inference, personalization, and... 
    Price work
    Full time
    Casual work
    Work at office
    Remote work
    Day shift

    Gesture

    New York, NY
    1 day ago
  • An innovative AI startup in New York is looking for a Technical...  ...performance across distributed systems. Candidates should ideally...  ...generative AI. The role emphasizes engineering and research capabilities,...  ...benefits and visa sponsorship available. #J-18808-Ljbffr Adaptive ML
    Visa sponsorship

    Adaptive ML

    New York, NY
    4 days ago
  • $216.42k - $324.63k

    Color Employer, LLC is seeking a Staff AI Engineer to architect core ML infrastructure and establish engineering standards for intelligent systems that extract data from unstructured documents. The role requires a minimum of 10 years in AI/ML and includes mentoring AI engineers... 

    Color Employer, LLC

    Brooklyn, NY
    2 days ago
  • $175k - $250k

     ...is developing a cutting‑edge autonomous agent...  ...market outcomes. The Staff AI Engineer will be responsible for...  ...propagation of insights to a system where the fleet gets...  ...strategies. Model & Inference Infrastructure Ownership...  ...Production ML Engineering: Proven experience... 
    Full time
    Immediate start
    Remote work
    Shift work

    MLabs Ltd

    New York, NY
    4 days ago
  • $185.1k - $335.3k

     ...automotive company is seeking a Staff Compiler Engineer to enhance the model compilation stack...  ...role involves optimizing high-level AI models into inference artifacts, defining technical visions,...  .../C++ skills, and familiarity with ML frameworks. Competitive salary range... 

    General Motors

    New York, NY
    2 days ago
  • $300k - $400k

     ...Principal AI/ML Engineer - AdTech New York, New York, United States...  ...science teams to ensure our ML systems are highly performant, scalable...  ...and training to real-time inference, for our real-time bidding, targeting...  ...capabilities on the cutting edge. AI & Agentic Applications... 

    Zeta Global

    New York, NY
    11 days ago
  •  ...powers mission-critical inference for the world's most dynamic AI companies, like...  ...AI to bring cutting-edge models into production...  ...build the platform engineers turn to to ship AI products...  ...that ensure our systems are production-ready...  ...to a variety of ML startups, offering unparalleled... 
    Flexible hours

    Baseten

    New York, NY
    2 days ago
  • Pfizer Belgium in New York is seeking an experienced AI Engineer to design and build enterprise-grade AI systems for Clinical Development & Operations. This hybrid role involves developing AI/ML models, automating processes, and collaborating across disciplines to drive... 

    Pfizer Belgium

    New York, NY
    4 days ago
  •  ...Mistral    At Mistral AI, we believe in the...  ...source and cutting-edge models, products...  ...of Applied AI Engineers, ensuring the successful...  ...of complex AI system s, including fine-...  ...practices for fine-tuning, inference, and deployment....  ...experience in AI/ML, with at least 2+... 
    Work at office
    Visa sponsorship

    Mistral AI

    New York, NY
    6 days ago
  •  ...States (US). Job Duties Build agentic AI systems: Design and implement tool‑calling agents...  ...policy enforcement) following MCP protocol. Engineer robust guardrails for safety, compliance,...  ..., testing, and launching production ML systems, including model deployment/serving... 
    Hourly pay
    Remote work

    NTT DATA North America

    New York, NY
    2 days ago
  • $100k - $125k

     ...of recommendation systems powering our partnership...  ..., applying cutting-edge techniques in...  ...representation learning, graph ML, and retrieval. The...  ...partnership with Engineering, Product, MLOps,...  ...relentless user of AI coding agents to...  ...), and low-latency inference patterns.... 
    Work at office
    Home office
    Flexible hours

    IMPACT

    New York, NY
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Edge AI Inference Engineer — On-Device ML Systems. Be the first to apply!