Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Edge AI Inference Engineer On-Device ML Systems

Framework Ventures

A technology company in Georgia is seeking a C++ Engineer to own the inference backbone of its AI stack, focusing on deploying models to edge devices. You'll collaborate closely with researchers and manage a cross-functional team to enhance existing products with AI features. The ideal candidate has excellent C++ skills, is familiar with Llama.cpp and ggml, and possesses a solid background in AI and machine learning. This role offers an exciting opportunity to influence the development of next-generation peer-to-peer AI products. #J-18808-Ljbffr

Vacancy posted 5 days ago
Similar jobs that could be interesting for youBased on the Edge AI Inference Engineer On-Device ML Systems in New York, NY vacancy
  • A leading AI development firm in New York is looking for an AI/ML Systems Engineer to build and maintain on-device inference engines for local LLMs. The ideal candidate will have over 3 years of experience with C++ and Python, along with experience in machine learning... 
    Suggested
    Local area

    LM Studio

    New York, NY
    3 days ago
  •  ...Framework Ventures is seeking a skilled Machine Learning Engineer to deploy models to edge devices using frameworks like Llama.cpp and ggml. The role...  ...transition models from research to production and integrating AI features into current products. Candidates should... 
    Suggested

    Framework Ventures

    New York, NY
    2 days ago
  •  ...Senior AI Engineer — Inference & Agent Systems United States Title: Applied AI Engineer — Inference & Agent Systems Location: United States What We're...  ...LlamaIndex but haven't built the layer underneath Strong ML research background without systems exposure Stack familiarity... 
    Suggested

    Arcana Analytics Inc.

    New York, NY
    2 days ago
  • Staff AI Engineer, Conversation Intelligence Systems (New York) Apply now Who we are Xenoss is an...  ..., AI assistants, edge computer vision, fraud detection...  ...the modern applied AI and ML ecosystem, including, but...  ...systems or low‑latency inference Experience combining unstructured... 
    Suggested
    Long term contract

    Xenoss

    New York, NY
    1 day ago
  • $12 per hour

     ...Unicorn, $1B+). 6 AI patents. Enterprise...  .... Founding AI Engineer (Applied ML / Vision + LLM) Engineer...  ...the nervous system for construction....  ...You'll turn cutting‑edge LLM and vision research...  ...sites and mobile devices. About the Role...  ...record of optimizing inference costs What You'll... 
    Suggested
    Full time
    For contractors
    Remote work
    Flexible hours

    Intellus Build

    New York, NY
    3 days ago
  • $197.3k - $225.1k

    Lead AI Engineer (FM Hosting, LLM Inference) Overview At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry...  ...real time, our applications of AI & ML are bringing humanity and simplicity... 
    Full time
    Part time
    Local area
    Immediate start

    Capital One

    New York, NY
    8 hours ago
  •  ...healthcare. Our AI sensing platform...  ...an Applied AI Engineer to take our...  ...foundation models and ML components from...  ...cloud and edge deployments, and some of the systems you'll touch are...  ...Deploy across our inference surfaces: third-...  ...implementations Medical devices, SaMD, or other... 

    Norbert Health

    Brooklyn, NY
    8 days ago
  • * You love to build systems, take pride in the quality...  ...a strong foundation in engineering and mathematics, and your...  ..., software, and AI enable you to see and exploit...  ...developing AI and ML algorithms or technologies...  ...technologies (e.g. LLM Inference, Similarity Search and... 
    Full time
    Part time

    Capital One

    New York, NY
    2 days ago
  • $300k

     ...Ventures is hiring for a role focused on building and maintaining systems for AI applications, optimizing request routing across diverse accelerators. The ideal candidate has strong software engineering skills, particularly in distributed systems, and a passion for advancing... 

    Menlo Ventures

    New York, NY
    3 days ago
  •  ...leading executive search firm is currently seeking an FAE for a cutting-edge semiconductor company. This remote position will allow you to leverage your deep knowledge of embedded systems and AI/ML to assist customers and deliver technical training. The ideal candidate... 
    Remote work

    Mackenzie Stuart

    New York, NY
    3 days ago
  • $158.1k - $213.8k

     ...worldwide use their Amazon devices for entertainment, and...  ...and talented Software Engineers with experience...  ...latency, highly scalable systems enabling thousands of concurrent...  ...ad Experiences (EDGE) team is charged with inventing...  ...combine data science (AI/ML), device hardware,... 
    Internship
    Worldwide
    Flexible hours

    Amazon

    New York, NY
    4 days ago
  •  ...the fastest-growing AI-native patent intelligence...  ...and generative AI engine-custom-built for...  ...robust, scalable AI/ML algorithms for cutting-edge IP applications Design...  ...Develop retrieval systems (vector search, BM25...  ...tradeoffs for production inference: prompt caching strategies... 
    Immediate start
    Remote work

    Patlytics

    New York, NY
    4 days ago
  •  ...Consultants is seeking an experienced Artificial Intelligence Engineer to design and deploy AI/ML and Generative AI solutions addressing real-world...  ...-driven applications using production-grade LLM-powered systems. The ideal candidate has 5+ years in AI/ML with solid Python... 

    Bluetick Consultants

    New York, NY
    4 days ago
  • $30 - $50 per hour

     ...A leading tech company is seeking an AI Engineer for a remote role. You will build, evaluate, and deploy production AI systems, focusing on NLP, LLMs, and computer vision. The ideal...  ...have mid-senior experience in delivering ML systems, strong Python skills, and a collaborative... 
    Hourly pay
    Remote work

    Rex USA

    New York, NY
    3 days ago
  • $60 per hour

     ...contribute to developing cutting-edge AI systems, while enjoying the...  ...optimization, and statistical inference. Write clear technical explanations...  ...Science, Mathematics, Engineering, or similar); a master's or...  ...Competition ranking, AWS/GCP ML certifications, or equivalent... 
    Hourly pay
    Full time
    Remote work
    Flexible hours

    DataAnnotation

    Brooklyn, NY
    1 day ago
  • $185.1k - $335.3k

     ...automotive company is seeking a Staff Compiler Engineer to enhance the model compilation stack...  ...role involves optimizing high-level AI models into inference artifacts, defining technical visions,...  .../C++ skills, and familiarity with ML frameworks. Competitive salary range... 

    General Motors

    New York, NY
    3 days ago
  • $165k - $290k

     ...thrive with our open, AI-driven commerce...  ...connect the tools and systems that power growth, enabling...  ...-driven AI Lead Engineer to play a pivotal...  ...results through cutting-edge AI technologies.In...  ...tools, inference frameworks, cloud-native ML workflows) is required... 
    Local area
    Remote work

    BigCommerce

    New York, NY
    3 days ago
  • $170k - $210k

     ...growing NVIDIA‑backed AI company enabling AI...  ...The AI Infrastructure Engineer is responsible for designing...  ...Utilidata's AI and ML models across edge deployments, cloud...  ...power data with AI inference software. This is Utilidata...  ...serving, distributed systems, and GPU... 
    Local area
    Remote work
    Flexible hours

    Utilidata

    New York, NY
    3 days ago
  •  ...Join The Future Of AI At Tessera Labs...  ...build multi-agent AI systems that can automate complex...  ...a top-quality AI engineer with a strong focus...  ...about AI, ML, and AI agents, we...  ...or GCP. Deploy inference endpoints and serve...  ...take pride in cutting-edge AI, value clear ownership... 

    Tessera Labs

    New York, NY
    3 days ago
  • $175k - $250k

     ...is developing a cutting‑edge autonomous agent...  ...market outcomes. The Staff AI Engineer will be responsible for...  ...propagation of insights to a system where the fleet gets...  ...strategies. Model & Inference Infrastructure Ownership...  ...Qualifications Production ML Engineering: Proven... 
    Full time
    Immediate start
    Remote work
    Shift work

    MLabs

    New York, NY
    3 days ago
  • $178k - $316k

     ...Applied AI Engineer At Quizlet, our mission is to help...  ...retrieval and ranking systems that match learners with...  ...components of end-to-end ML systems: candidate...  ...improve latency/cost-aware inference; contribute to offline...  ...interactions per week Cutting-edge tech: Generative AI,... 
    Work at office
    3 days per week

    Quizlet

    New York, NY
    2 days ago
  • $172k - $349k

     ...Enterprise is the global edge-to-cloud company...  ...Job Description As an AI Solution Engineer your role will be to architect...  ...). Competency writing ML code (for example,...  ...Python, Unix‑like systems. Ability to quickly prototype...  ...model training or inference, and how model attributes... 
    Work experience placement
    Remote work
    Work from home

    Hewlett Packard Enterprise

    New York, NY
    3 days ago
  • $215k - $230k

     ...Framework Ventures is seeking an AI Engineer for the Model Engineering Team. This role involves designing production-grade evaluation systems, optimizing machine learning models, and collaborating with a cross-functional team. Candidates should have advanced skills in... 

    Framework Ventures

    New York, NY
    3 days ago
  • A leading financial technology firm is looking for a Senior Software Engineer to join its AI Group in New York. The role involves collaborating on and designing production machine learning systems and applications. The ideal candidate will have over 7 years of programming... 

    Bloomberg

    New York, NY
    4 days ago
  • $180k - $220k

     ...A healthcare technology company is seeking an experienced AI systems engineer to design and operate production AI workflows. The successful candidate...  ...have 8+ years in software engineering, specifically in AI/ML systems, and a proven track record of deploying production AI... 
    Remote work
    Flexible hours

    Atlas Health

    New York, NY
    3 days ago
  •  ...Design and implement end-to-end agentic AI systems for production environments, including planning...  ...architectural design reviews and mentor engineers on agent design, evaluation, and safe...  ...years designing and deploying production ML/AI systems, including deployment, monitoring... 

    VBeyond

    Jersey City, NJ
    3 days ago
  •  ..., Java, Go, or C/C++), with strong production systems exposure • 3+ years designing and deploying production ML/AI systems, including deployment, monitoring, and...  ...using / function-calling agents o Prompt engineering and adaptation • Deep understanding of... 

    VBeyond

    New York, NY
    1 day ago
  • $50 - $60 per hour

     ...US). Job Duties Build agentic AI systems: Design and implement tool-calling agents...  ...policy enforcement) following MCP protocol. Engineer robust guardrails for safety, compliance,...  ..., testing, and launching production ML systems, including model deployment/serving... 
    Hourly pay
    Remote work

    The Nippon Telegraph and Telephone Corporation (NTT)

    New York, NY
    3 days ago
  •  ...A global technology firm seeks a Senior AI Engineer for remote work. This position requires 4...  ...Responsibilities include designing and deploying AI systems, leading model monitoring, and developing...  ..., with experience in cloud platforms and ML frameworks. The company values diversity... 
    Remote work

    Traackr

    New York, NY
    3 days ago
  •  ...you will do As our ML Engineer Intern, you'll be the...  ...questions: How do we build ML systems that scale to millions...  ...do we leverage cutting-edge models to enhance...  ...datasets Implementing inference systems for content...  ...and deploying multimodal AI systems using MLOps best... 
    Contract work
    Internship
    Immediate start
    Remote work
    Worldwide

    Melotech

    New York, NY
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Edge AI Inference Engineer On-Device ML Systems. Be the first to apply!