Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Remote AI Inference Engineer Edge Model Deployment & Optimization

quadric.io

A leading technology company in California is seeking an AI Inference Engineer to bridge AI models with unique platforms. Key responsibilities include model optimization, deployment, and performance profiling. Candidates should have a Bachelor’s or Master’s degree, 5+ years' experience in AI frameworks, and proficiency in C/C++ and Python. Competitive benefits included, such as health care, retirement plans, and work from home options. #J-18808-Ljbffr

Vacancy posted 15 hours ago
Similar jobs that could be interesting for youBased on the Remote AI Inference Engineer Edge Model Deployment & Optimization in Burlingame, CA vacancy
  • $242k - $290k

     ...-modality foundation model to drive the next generation...  .... As a Model Optimization & Deployment Engineer, you will focus on...  ...build highly concurrent inference code to ensure real-...  ...execution on edge devices. In this role...  ...memory bandwidth on AI accelerators. Write... 
    Remote work
    Temporary work
    Relocation package

    Zoox

    San Diego, CA
    3 days ago
  •  ...looking for an experienced AI Model Engineer with deep expertise in...  ...development, model optimization, fine‑tuning, and GPU acceleration...  ...will extend the inference framework to support inference...  ...language model deployment for mobile and edge use cases. Work closely... 
    Remote work

    Framework Ventures

    New York, NY
    4 days ago
  • $70k - $300k

     ...Staff AI Software Engineer - Edge Model Optimization & Deployment FieldAI is transforming how robots interact with the real world. Our growing ML team in Seattle...  ...platforms. In this role, you will own the edge inference stack end to end, profiling and accelerating... 
    Suggested

    Field AI

    Seattle, WA
    3 days ago
  •  ...seeking a highly motivated and technically skilled Edge AI/Model Optimization Engineer to support the deployment, optimization, and sustainment of AI and agentic...  ...Language Models (LLMs), embedding models, and AI inference services for constrained hardware platforms,... 
    Suggested
    Local area

    NextGen Federal Systems

    Aberdeen, MD
    19 days ago
  •  ...Bright Vision Technologies is looking for a remote Edge AI Engineer to design and deploy machine learning models on resource-constrained devices. The ideal candidate...  ...in Python and C++, and strong skills in model optimization. Responsibilities include collaborating with... 
    Remote work
    Full time

    Bright Vision Technologies

    Alpharetta, GA
    1 day ago
  •  ...enhance the performance of large-scale models through advanced optimization techniques in Santa Clara,...  ...background in DL model training and deployment, ideally with a PhD or equivalent experience...  ...to work closely with cutting-edge technologies and a collaborative team... 

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • $45 - $60 per hour

     ...architecture. Quadric's co-optimized software and hardware is...  ...run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging...  ...site. Responsibilities Model pruning: Prune the model...  ...industry experts in AI and semiconductor technology... 
    Hourly pay
    Temporary work
    Internship
    Work at office
    Relocation

    quadric.io, Inc

    Burlingame, CA
    1 day ago
  • $110k - $300k

     ...redefining the future of AI with our...  ...applications ranging from edge devices to data...  ...talented team of engineers and industry‑...  ...available. Develop, optimize, and deploy lightweight machine learning models for edge AI applications...  .... Improve inference efficiency and model... 

    TETRAMEM INC

    San Jose, CA
    15 hours ago
  • $176k - $420k

     ...AIissolvingrobust, real-world AI through humanoid...  ....As a Software Engineer for the Optimus...  ...exporting and deploying neural networks toTesla...  ...You'll Do ~ Optimize ML models for latency, memory usage, and inference speed ~...  ...platforms (cloud, edge, mobile) ~... 
    Hourly pay
    Full time
    Temporary work
    Flexible hours

    Tesla

    Palo Alto, CA
    2 days ago
  • $184k - $287.5k

     ...state‑of‑the‑art model optimization techniques—speculative...  ...for production deployments. Implement...  ...optimization strategies for inference, such as...  ...across diverse NVIDIA edge architectures,...  ...Science, Computer Engineering, or a related...  ...control, embodied AI, and autonomous decision... 

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $158.4k - $237.6k

     ...Technologies, Inc.Job Area:Engineering Group,...  ...is building the AI first stack and platform...  ...a Physical AI Model Optimization Engineer to help bring cutting‑edge robotic AI models...  ...’s internal deployment and optimization...  ...building Qualcomm’s inference engines, compilers... 
    Work experience placement
    Immediate start
    Work from home

    Nutanix

    San Diego, CA
    3 days ago
  • $212.8k

     ...Convert and compile ML models for execution on edge NPUs, and apply...  ...Apply hardware-aware optimization strategies, such as...  ..., Electrical Engineering, Computer Engineering...  ...engineering, model deployment, or ML systems for...  ...Understanding of model inference constraints on edge... 
    Temporary work
    Local area

    ByteDance

    San Jose, CA
    3 days ago
  • Jobgether is seeking an AI Research Engineer (Kernel & Inference Optimization) to enhance model serving architectures and deployment efficiencies. This fully remote role involves designing scalable inference...  .... Join us to work on cutting-edge technologies in a fast-paced,... 
    Remote job

    Jobgether

    New Bremen, OH
    14 hours ago
  •  ...that powers local AI, porting and enhancing inference engines like llama.cpp, ONNX...  ...run efficiently on edge devices. Your focus...  ...the runtime: making models load faster, run...  ...inference layer is stable, optimized, and ready for...  ...Work on deploying machine learning models... 
    Remote work
    Local area

    Framework Ventures

    United States
    4 days ago
  • $244.8k

     ...research in Generative AI and CV/Multimodal...  ...dedicated to generative models for content creation,...  ...Model Training and Inference Optimization Engineer with expertise in optimizing...  ...work at the cutting edge of AI efficiency,...  ..., scalability, and deployment of large-scale generative... 
    Temporary work
    Local area

    Tik Tok

    San Jose, CA
    1 day ago
  • $100k - $150k

     ...businesses automate and optimize their operations...  ...cutting-edge technologies to...  ...a skilled Edge AI Engineer to join our dynamic...  ...Location: 100% Remote (Continental United...  ..., optimize, and deploy machine learning models that run...  ...cross‑platform inference runtimes leveraging... 
    Remote work
    Full time
    H1b
    Local area
    Immediate start
    Visa sponsorship

    Bright Vision Technologies

    Rockville, MD
    3 days ago
  • $101.66k - $200.02k

     ...Lead Edge AI / Machine Learning Engineer Strategic Technology Consulting...  ...lead the design, optimization, and deployment of advanced AI/ML...  ..., IMU drift modeling, anomaly detection...  ..., and efficient inference, as well as the ability...  ...location (For Remote Opportunities),... 
    Remote work
    Hourly pay
    Contract work
    Temporary work
    For contractors
    Work experience placement

    Arcfield

    Richmond, VA
    2 days ago
  •  ...A technology company is seeking an AI Kernel Engineer to develop and optimize AI kernel libraries for efficient inference on their platform. Ideal candidates will have over 5 years of experience in kernel development, strong C/C++ and Python skills, and the ability to... 

    quadric.io, Inc

    Burlingame, CA
    15 hours ago
  •  ...Bright Vision Technologies is seeking an experienced Edge AI Engineer to design, optimize, and deploy machine learning models on resource-constrained edge devices. This full-time position is remote, suitable for skilled candidates with over six years of relevant experience... 
    Remote work
    Full time

    Bright Vision Technologies

    Bellevue, WA
    15 hours ago
  •  ...Bright Vision Technologies is seeking a skilled Edge AI Engineer to design and optimize machine learning models for resource-constrained edge devices. This full-time, remote position requires deep expertise in model compression, quantization, and hardware-aware optimization... 
    Remote work
    Full time

    Bright Vision Technologies

    Edison, NJ
    1 day ago
  • $100k - $150k

     ...Bright Vision Technologies is seeking an Edge AI Engineer to design, optimize, and deploy machine learning models on resource-constrained devices. The role requires 6...  ...degree in a related field. This is a full-time remote position with a salary range of $100k to $150k.... 
    Remote work
    Full time

    Bright Vision Technologies

    Plano, TX
    1 day ago
  •  ...company is looking for a Senior Engineer 2 to enhance their AI Inference Optimization team. In this role, you will drive...  ...throughput and reduce latency in large models. Candidates should have over 5...  ...compensation and is fully remote, promoting a collaborative and innovative... 
    Remote work

    DigitalOcean

    Seattle, WA
    14 hours ago
  •  ...Bright Vision Technologies is seeking an Edge AI Engineer to design, optimize, and deploy machine learning models on edge devices. The role focuses on model compression...  ...in edge or mobile AI. This is a full-time remote position with competitive compensation and benefits... 
    Remote work
    Full time

    Bright Vision Technologies

    Johns Creek, GA
    15 hours ago
  •  ...Responsibilities Deploy machine learning models to edge devices using the frameworks: llama.cpp, ggml, onnx....  ...production environments. Integrate AI features into existing products, enriching...  ...experience with Llama.cpp and ggml inference engines, facilitating the deployment of... 
    Remote work

    Framework Ventures

    United States
    3 days ago
  • $250k

     ...Edge Ai Infrastructure Edge AI is a production requirement across...  ...doesn't exist. Every team deploying models on edge devices rebuilds...  ...platforms, memory managers that optimize dynamically, observability...  ...models are doing in the field. Inference latency, memory pressure,... 
    Remote work

    Forum Ventures

    United States
    7 days ago
  • $180k - $210k

     ...Overview The Principal AI/ML Engineer will support the...  ...large language models. We offer generous...  ...applications within remote sensing such as...  ...as LangChain, DSPy Deploy LLM solutions across...  ...and apply cutting edge concepts to...  ...engineering techniques / Inference time techniques (e... 
    Remote work
    Full time
    Temporary work
    Work at office
    Local area
    Visa sponsorship
    Relocation package
    Flexible hours

    TSG

    Aurora, CO
    15 hours ago
  •  ...The Principal AI/ML Engineer will support the development...  ...large language models. We offer...  ...applications within remote sensing such as tasking...  ..., DSPy Deploy LLM solutions across...  ...and apply cutting edge concepts to defense...  ...engineering techniques / Inference time techniques (e... 
    Remote work
    Temporary work
    Work at office
    Local area
    Visa sponsorship
    Relocation package
    Flexible hours

    ARKA Group

    King of Prussia, PA
    2 days ago
  • $250k - $350k

     ...photorealistic, real-time AI avatars with emotional...  ...foundation models designed for it from the...  ...quantization, KV cache optimization, kernel-level acceleration...  ...than a standard LLM deployment: we're serving a full...  ...DO * Own end-to-end inference optimization across our... 

    Nuance Labs, Inc.

    Seattle, WA
    1 day ago
  • $114.6k - $252.1k

    ## Principal AI/ML Engineer (Large Language Model)Aurora, Colorado, United States...  ...applications within remote sensing such as...  ...as LangChain, DSPy* Deploy LLM solutions across...  ...understand and apply cutting edge concepts to defense...  ...techniques / Inference time techniques (e.g... 
    Remote work
    Contract work
    Work experience placement
    Local area
    Flexible hours

    CACI International Inc.

    Aurora, CO
    14 hours ago
  • $180k - $210k

     ...Overview The Principal AI/ML Engineer will support the...  ...large language models. We offer generous...  ...within remote sensing such as tasking...  ...LangChain, DSPy Deploy LLM solutions across...  ...and apply cutting edge concepts to defense...  ...engineering techniques / Inference time techniques (e... 
    Remote work
    Full time
    Temporary work
    Work at office
    Local area
    Visa sponsorship
    Relocation package
    Flexible hours

    TSG

    Aurora, CO
    14 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Remote AI Inference Engineer Edge Model Deployment & Optimization. Be the first to apply!