Remote AI Inference Engineer Edge Model Deployment & Optimization
quadric.io
A leading technology company in California is seeking an AI Inference Engineer to bridge AI models with unique platforms. Key responsibilities include model optimization, deployment, and performance profiling. Candidates should have a Bachelor’s or Master’s degree, 5+ years' experience in AI frameworks, and proficiency in C/C++ and Python. Competitive benefits are included, such as health care, retirement plans, and work-from-home options.
$242k - $290k
...-modality foundation model to drive the next generation... .... As a Model Optimization & Deployment Engineer, you will focus on... ...build highly concurrent inference code to ensure real-... ...execution on edge devices. In this role... ...memory bandwidth on AI accelerators. Write...
Remote work, Temporary work, Relocation package
- ...looking for an experienced AI Model Engineer with deep expertise in... ...development, model optimization, fine‑tuning, and GPU acceleration... ...will extend the inference framework to support inference... ...language model deployment for mobile and edge use cases. Work closely...
Remote work
$70k - $300k
...Staff AI Software Engineer - Edge Model Optimization & Deployment FieldAI is transforming how robots interact with the real world. Our growing ML team in Seattle... ...platforms. In this role, you will own the edge inference stack end to end, profiling and accelerating...
Suggested
- ...seeking a highly motivated and technically skilled Edge AI/Model Optimization Engineer to support the deployment, optimization, and sustainment of AI and agentic... ...Language Models (LLMs), embedding models, and AI inference services for constrained hardware platforms,...
Suggested, Local area
- ...Bright Vision Technologies is looking for a remote Edge AI Engineer to design and deploy machine learning models on resource-constrained devices. The ideal candidate... ...in Python and C++, and strong skills in model optimization. Responsibilities include collaborating with...
Remote work, Full time
- ...enhance the performance of large-scale models through advanced optimization techniques in Santa Clara,... ...background in DL model training and deployment, ideally with a PhD or equivalent experience... ...to work closely with cutting-edge technologies and a collaborative team...
$45 - $60 per hour
...architecture. Quadric's co-optimized software and hardware is... ...run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging... ...site. Responsibilities Model pruning: Prune the model... ...industry experts in AI and semiconductor technology...
Hourly pay, Temporary work, Internship, Work at office, Relocation
$110k - $300k
...redefining the future of AI with our... ...applications ranging from edge devices to data... ...talented team of engineers and industry‑... ...available. Develop, optimize, and deploy lightweight machine learning models for edge AI applications... .... Improve inference efficiency and model...
$176k - $420k
...AI is solving robust, real-world AI through humanoid... .... As a Software Engineer for the Optimus... ...exporting and deploying neural networks to Tesla... ...You'll Do: Optimize ML models for latency, memory usage, and inference speed... ...platforms (cloud, edge, mobile)...
Hourly pay, Full time, Temporary work, Flexible hours
$184k - $287.5k
...state‑of‑the‑art model optimization techniques—speculative... ...for production deployments. Implement... ...optimization strategies for inference, such as... ...across diverse NVIDIA edge architectures,... ...Science, Computer Engineering, or a related... ...control, embodied AI, and autonomous decision...
$158.4k - $237.6k
...Technologies, Inc. Job Area: Engineering Group,... ...is building the AI first stack and platform... ...a Physical AI Model Optimization Engineer to help bring cutting‑edge robotic AI models... ...’s internal deployment and optimization... ...building Qualcomm’s inference engines, compilers...
Work experience placement, Immediate start, Work from home
$212.8k
...Convert and compile ML models for execution on edge NPUs, and apply... ...Apply hardware-aware optimization strategies, such as... ..., Electrical Engineering, Computer Engineering... ...engineering, model deployment, or ML systems for... ...Understanding of model inference constraints on edge...
Temporary work, Local area
- Jobgether is seeking an AI Research Engineer (Kernel & Inference Optimization) to enhance model serving architectures and deployment efficiencies. This fully remote role involves designing scalable inference... .... Join us to work on cutting-edge technologies in a fast-paced,...
Remote job
- ...that powers local AI, porting and enhancing inference engines like llama.cpp, ONNX... ...run efficiently on edge devices. Your focus... ...the runtime: making models load faster, run... ...inference layer is stable, optimized, and ready for... ...Work on deploying machine learning models...
Remote work, Local area
$244.8k
...research in Generative AI and CV/Multimodal... ...dedicated to generative models for content creation,... ...Model Training and Inference Optimization Engineer with expertise in optimizing... ...work at the cutting edge of AI efficiency,... ..., scalability, and deployment of large-scale generative...
Temporary work, Local area
$100k - $150k
...businesses automate and optimize their operations... ...cutting-edge technologies to... ...a skilled Edge AI Engineer to join our dynamic... ...Location: 100% Remote (Continental United... ..., optimize, and deploy machine learning models that run... ...cross‑platform inference runtimes leveraging...
Remote work, Full time, H1b, Local area, Immediate start, Visa sponsorship
$101.66k - $200.02k
...Lead Edge AI / Machine Learning Engineer Strategic Technology Consulting... ...lead the design, optimization, and deployment of advanced AI/ML... ..., IMU drift modeling, anomaly detection... ..., and efficient inference, as well as the ability... ...location (For Remote Opportunities),...
Remote work, Hourly pay, Contract work, Temporary work, For contractors, Work experience placement
- ...A technology company is seeking an AI Kernel Engineer to develop and optimize AI kernel libraries for efficient inference on their platform. Ideal candidates will have over 5 years of experience in kernel development, strong C/C++ and Python skills, and the ability to...
- ...Bright Vision Technologies is seeking an experienced Edge AI Engineer to design, optimize, and deploy machine learning models on resource-constrained edge devices. This full-time position is remote, suitable for skilled candidates with over six years of relevant experience...
Remote work, Full time
- ...Bright Vision Technologies is seeking a skilled Edge AI Engineer to design and optimize machine learning models for resource-constrained edge devices. This full-time, remote position requires deep expertise in model compression, quantization, and hardware-aware optimization...
Remote work, Full time
$100k - $150k
...Bright Vision Technologies is seeking an Edge AI Engineer to design, optimize, and deploy machine learning models on resource-constrained devices. The role requires 6... ...degree in a related field. This is a full-time remote position with a salary range of $100k to $150k....
Remote work, Full time
- ...company is looking for a Senior Engineer 2 to enhance their AI Inference Optimization team. In this role, you will drive... ...throughput and reduce latency in large models. Candidates should have over 5... ...compensation and is fully remote, promoting a collaborative and innovative...
Remote work
- ...Bright Vision Technologies is seeking an Edge AI Engineer to design, optimize, and deploy machine learning models on edge devices. The role focuses on model compression... ...in edge or mobile AI. This is a full-time remote position with competitive compensation and benefits...
Remote work, Full time
- ...Responsibilities Deploy machine learning models to edge devices using the frameworks: llama.cpp, ggml, onnx.... ...production environments. Integrate AI features into existing products, enriching... ...experience with Llama.cpp and ggml inference engines, facilitating the deployment of...
Remote work
$250k
...Edge Ai Infrastructure Edge AI is a production requirement across... ...doesn't exist. Every team deploying models on edge devices rebuilds... ...platforms, memory managers that optimize dynamically, observability... ...models are doing in the field. Inference latency, memory pressure,...
Remote work
$180k - $210k
...Overview The Principal AI/ML Engineer will support the... ...large language models. We offer generous... ...applications within remote sensing such as... ...as LangChain, DSPy Deploy LLM solutions across... ...and apply cutting edge concepts to... ...engineering techniques / Inference time techniques (e...
Remote work, Full time, Temporary work, Work at office, Local area, Visa sponsorship, Relocation package, Flexible hours
- ...The Principal AI/ML Engineer will support the development... ...large language models. We offer... ...applications within remote sensing such as tasking... ..., DSPy Deploy LLM solutions across... ...and apply cutting edge concepts to defense... ...engineering techniques / Inference time techniques (e...
Remote work, Temporary work, Work at office, Local area, Visa sponsorship, Relocation package, Flexible hours
$250k - $350k
...photorealistic, real-time AI avatars with emotional... ...foundation models designed for it from the... ...quantization, KV cache optimization, kernel-level acceleration... ...than a standard LLM deployment: we're serving a full... ...DO: Own end-to-end inference optimization across our...
$114.6k - $252.1k
Principal AI/ML Engineer (Large Language Model), Aurora, Colorado, United States... ...applications within remote sensing such as... ...as LangChain, DSPy Deploy LLM solutions across... ...understand and apply cutting edge concepts to defense... ...techniques / Inference time techniques (e.g...
Remote work, Contract work, Work experience placement, Local area, Flexible hours
$180k - $210k
...Overview The Principal AI/ML Engineer will support the... ...large language models. We offer generous... ...within remote sensing such as tasking... ...LangChain, DSPy Deploy LLM solutions across... ...and apply cutting edge concepts to defense... ...engineering techniques / Inference time techniques (e...
Remote work, Full time, Temporary work, Work at office, Local area, Visa sponsorship, Relocation package, Flexible hours


