Remote AI Inference Engineer Edge Model Deployment & Optimization

quadric.io

A leading technology company in California is seeking an AI Inference Engineer to bridge AI models with unique platforms. Key responsibilities include model optimization, deployment, and performance profiling. Candidates should have a Bachelor’s or Master’s degree, 5+ years' experience in AI frameworks, and proficiency in C/C++ and Python. Competitive benefits included, such as health care, retirement plans, and work from home options. #J-18808-Ljbffr

Apply

Vacancy posted 15 hours ago

Similar jobs that could be interesting for youBased on the Remote AI Inference Engineer Edge Model Deployment & Optimization in Burlingame, CA vacancy

Senior AI Inference Engineer - Model Optimization & Deployment
$242k - $290k
...-modality foundation model to drive the next generation... .... As a Model Optimization & Deployment Engineer, you will focus on... ...build highly concurrent inference code to ensure real-... ...execution on edge devices. In this role... ...memory bandwidth on AI accelerators. Write...
Remote work
Temporary work
Relocation package
Zoox
San Diego, CA
3 days ago
Senior AI Research Engineer Model Inference Remote
...looking for an experienced AI Model Engineer with deep expertise in... ...development, model optimization, fine‑tuning, and GPU acceleration... ...will extend the inference framework to support inference... ...language model deployment for mobile and edge use cases. Work closely...
Remote work
Framework Ventures
New York, NY
4 days ago
Staff AI Software Engineer, Edge Model Optimization & Deployment
$70k - $300k
...Staff AI Software Engineer - Edge Model Optimization & Deployment FieldAI is transforming how robots interact with the real world. Our growing ML team in Seattle... ...platforms. In this role, you will own the edge inference stack end to end, profiling and accelerating...
Suggested
Field AI
Seattle, WA
3 days ago
Edge AI/Model Optimization Engineer
...seeking a highly motivated and technically skilled Edge AI/Model Optimization Engineer to support the deployment, optimization, and sustainment of AI and agentic... ...Language Models (LLMs), embedding models, and AI inference services for constrained hardware platforms,...
Suggested
Local area
NextGen Federal Systems
Aberdeen, MD
19 days ago
Remote Edge AI Engineer On-Device ML & Optimizations
...Bright Vision Technologies is looking for a remote Edge AI Engineer to design and deploy machine learning models on resource-constrained devices. The ideal candidate... ...in Python and C++, and strong skills in model optimization. Responsibilities include collaborating with...
Remote work
Full time
Bright Vision Technologies
Alpharetta, GA
1 day ago
Senior DL Engineer: Edge Model Optimization & Inference
...enhance the performance of large-scale models through advanced optimization techniques in Santa Clara,... ...background in DL model training and deployment, ideally with a PhD or equivalent experience... ...to work closely with cutting-edge technologies and a collaborative team...
NVIDIA Gruppe
Santa Clara, CA
1 day ago
Edge AI Inference Engineer Intern: Model Pruning
$45 - $60 per hour
...architecture. Quadric's co-optimized software and hardware is... ...run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging... ...site. Responsibilities Model pruning: Prune the model... ...industry experts in AI and semiconductor technology...
Hourly pay
Temporary work
Internship
Work at office
Relocation
quadric.io, Inc
Burlingame, CA
1 day ago
Edge AI Engineer for Embedded ML & Inference
$110k - $300k
...redefining the future of AI with our... ...applications ranging from edge devices to data... ...talented team of engineers and industry‑... ...available. Develop, optimize, and deploy lightweight machine learning models for edge AI applications... .... Improve inference efficiency and model...
TETRAMEM INC
San Jose, CA
15 hours ago
AI Infrastructure Engineer, Model Optimization & Deployment, Optimus
$176k - $420k
...AIissolvingrobust, real-world AI through humanoid... ....As a Software Engineer for the Optimus... ...exporting and deploying neural networks toTesla... ...You'll Do ~ Optimize ML models for latency, memory usage, and inference speed ~... ...platforms (cloud, edge, mobile) ~...
Hourly pay
Full time
Temporary work
Flexible hours
Tesla
Palo Alto, CA
2 days ago
Senior DL Software Engineer, Model Optimization and Edge Deployment - Autonomous Vehicles
$184k - $287.5k
...state‑of‑the‑art model optimization techniques—speculative... ...for production deployments. Implement... ...optimization strategies for inference, such as... ...across diverse NVIDIA edge architectures,... ...Science, Computer Engineering, or a related... ...control, embodied AI, and autonomous decision...
NVIDIA Gruppe
Santa Clara, CA
4 days ago
Physical AI Model Optimization Engineer - Qualcomm Advanced Robotics Team
$158.4k - $237.6k
...Technologies, Inc.Job Area:Engineering Group,... ...is building the AI first stack and platform... ...a Physical AI Model Optimization Engineer to help bring cutting‑edge robotic AI models... ...’s internal deployment and optimization... ...building Qualcomm’s inference engines, compilers...
Work experience placement
Immediate start
Work from home
Nutanix
San Diego, CA
3 days ago
Edge ML Software Engineer (Model Optimization-PICO) - San Jose
$212.8k
...Convert and compile ML models for execution on edge NPUs, and apply... ...Apply hardware-aware optimization strategies, such as... ..., Electrical Engineering, Computer Engineering... ...engineering, model deployment, or ML systems for... ...Understanding of model inference constraints on edge...
Temporary work
Local area
ByteDance
San Jose, CA
3 days ago
Remote AI Inference Engineer: Kernel & Performance
Jobgether is seeking an AI Research Engineer (Kernel & Inference Optimization) to enhance model serving architectures and deployment efficiencies. This fully remote role involves designing scalable inference... .... Join us to work on cutting-edge technologies in a fast-paced,...
Remote job
Jobgether
New Bremen, OH
14 hours ago
Senior AI Inference Engineer llama.cpp specialist 100% Remote
...that powers local AI, porting and enhancing inference engines like llama.cpp, ONNX... ...run efficiently on edge devices. Your focus... ...the runtime: making models load faster, run... ...inference layer is stable, optimized, and ready for... ...Work on deploying machine learning models...
Remote work
Local area
Framework Ventures
United States
4 days ago
Sr. Multimodal Model Training and Inference Optimization Engineer
$244.8k
...research in Generative AI and CV/Multimodal... ...dedicated to generative models for content creation,... ...Model Training and Inference Optimization Engineer with expertise in optimizing... ...work at the cutting edge of AI efficiency,... ..., scalability, and deployment of large-scale generative...
Temporary work
Local area
Tik Tok
San Jose, CA
1 day ago
Edge AI Engineer
$100k - $150k
...businesses automate and optimize their operations... ...cutting-edge technologies to... ...a skilled Edge AI Engineer to join our dynamic... ...Location: 100% Remote (Continental United... ..., optimize, and deploy machine learning models that run... ...cross‑platform inference runtimes leveraging...
Remote work
Full time
H1b
Local area
Immediate start
Visa sponsorship
Bright Vision Technologies
Rockville, MD
3 days ago
Lead Edge AI/ML Engineer
$101.66k - $200.02k
...Lead Edge AI / Machine Learning Engineer Strategic Technology Consulting... ...lead the design, optimization, and deployment of advanced AI/ML... ..., IMU drift modeling, anomaly detection... ..., and efficient inference, as well as the ability... ...location (For Remote Opportunities),...
Remote work
Hourly pay
Contract work
Temporary work
For contractors
Work experience placement
Arcfield
Richmond, VA
2 days ago
Senior AI Kernel Engineer Edge Inference & Optimization
...A technology company is seeking an AI Kernel Engineer to develop and optimize AI kernel libraries for efficient inference on their platform. Ideal candidates will have over 5 years of experience in kernel development, strong C/C++ and Python skills, and the ability to...
quadric.io, Inc
Burlingame, CA
15 hours ago
Remote Edge AI Engineer - On-Device ML & Optimization
...Bright Vision Technologies is seeking an experienced Edge AI Engineer to design, optimize, and deploy machine learning models on resource-constrained edge devices. This full-time position is remote, suitable for skilled candidates with over six years of relevant experience...
Remote work
Full time
Bright Vision Technologies
Bellevue, WA
15 hours ago
Remote Edge AI Engineer On-Device ML & Edge Optimization
...Bright Vision Technologies is seeking a skilled Edge AI Engineer to design and optimize machine learning models for resource-constrained edge devices. This full-time, remote position requires deep expertise in model compression, quantization, and hardware-aware optimization...
Remote work
Full time
Bright Vision Technologies
Edison, NJ
1 day ago
Remote Edge AI Engineer: On-Device ML & Optimization
$100k - $150k
...Bright Vision Technologies is seeking an Edge AI Engineer to design, optimize, and deploy machine learning models on resource-constrained devices. The role requires 6... ...degree in a related field. This is a full-time remote position with a salary range of $100k to $150k....
Remote work
Full time
Bright Vision Technologies
Plano, TX
1 day ago
Senior AI Inference Optimizations Engineer - Remote
...company is looking for a Senior Engineer 2 to enhance their AI Inference Optimization team. In this role, you will drive... ...throughput and reduce latency in large models. Candidates should have over 5... ...compensation and is fully remote, promoting a collaborative and innovative...
Remote work
DigitalOcean
Seattle, WA
14 hours ago
Remote Edge AI Engineer: On-Device ML & Optimization
...Bright Vision Technologies is seeking an Edge AI Engineer to design, optimize, and deploy machine learning models on edge devices. The role focuses on model compression... ...in edge or mobile AI. This is a full-time remote position with competitive compensation and benefits...
Remote work
Full time
Bright Vision Technologies
Johns Creek, GA
15 hours ago
Senior AI Inference Engineer 100% Remote
...Responsibilities Deploy machine learning models to edge devices using the frameworks: llama.cpp, ggml, onnx.... ...production environments. Integrate AI features into existing products, enriching... ...experience with Llama.cpp and ggml inference engines, facilitating the deployment of...
Remote work
Framework Ventures
United States
3 days ago
Edge Inference Developer Tooling Founder
$250k
...Edge Ai Infrastructure Edge AI is a production requirement across... ...doesn't exist. Every team deploying models on edge devices rebuilds... ...platforms, memory managers that optimize dynamically, observability... ...models are doing in the field. Inference latency, memory pressure,...
Remote work
Forum Ventures
United States
7 days ago
Principal AI/ML Engineer (Large Language Model) (TS/SCI) {S}
$180k - $210k
...Overview The Principal AI/ML Engineer will support the... ...large language models. We offer generous... ...applications within remote sensing such as... ...as LangChain, DSPy Deploy LLM solutions across... ...and apply cutting edge concepts to... ...engineering techniques / Inference time techniques (e...
Remote work
Full time
Temporary work
Work at office
Local area
Visa sponsorship
Relocation package
Flexible hours
TSG
Aurora, CO
15 hours ago
Principal AI/ML Engineer (Large Language Model) (TS/SCI) {S}
...The Principal AI/ML Engineer will support the development... ...large language models. We offer... ...applications within remote sensing such as tasking... ..., DSPy Deploy LLM solutions across... ...and apply cutting edge concepts to defense... ...engineering techniques / Inference time techniques (e...
Remote work
Temporary work
Work at office
Local area
Visa sponsorship
Relocation package
Flexible hours
ARKA Group
King of Prussia, PA
2 days ago
Member of Technical Staff - Model Optimization and Inference
$250k - $350k
...photorealistic, real-time AI avatars with emotional... ...foundation models designed for it from the... ...quantization, KV cache optimization, kernel-level acceleration... ...than a standard LLM deployment: we're serving a full... ...DO * Own end-to-end inference optimization across our...
Nuance Labs, Inc.
Seattle, WA
1 day ago
Principal AI/ML Engineer (Large Language Model) Aurora, CO, US + 1 more
$114.6k - $252.1k
## Principal AI/ML Engineer (Large Language Model)Aurora, Colorado, United States... ...applications within remote sensing such as... ...as LangChain, DSPy* Deploy LLM solutions across... ...understand and apply cutting edge concepts to defense... ...techniques / Inference time techniques (e.g...
Remote work
Contract work
Work experience placement
Local area
Flexible hours
CACI International Inc.
Aurora, CO
14 hours ago
Principal AI/ML Engineer (Large Language Model) (TS/SCI) {S}
$180k - $210k
...Overview The Principal AI/ML Engineer will support the... ...large language models. We offer generous... ...within remote sensing such as tasking... ...LangChain, DSPy Deploy LLM solutions across... ...and apply cutting edge concepts to defense... ...engineering techniques / Inference time techniques (e...
Remote work
Full time
Temporary work
Work at office
Local area
Visa sponsorship
Relocation package
Flexible hours
TSG
Aurora, CO
14 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Remote AI Inference Engineer Edge Model Deployment & Optimization. Be the first to apply!