AI Inference Engineer Intern - Model Pruning

$45 - $60 per hour

quadric, Inc

Quadric has created an innovative general purpose neural processing unit (GPNPU) architecture. Quadric's co-optimized software and hardware is targeted to run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging from battery operated smart-sensor systems to high-performance automotive or autonomous vehicle systems. Unlike other NPUs or neural network accelerators in the industry today that can only accelerate a portion of a machine learning graph, the Quadric GPNPU executes both NN graph code and conventional C++ DSP and control code.

Note: Our preference is for this internship to be based out of our Burlingame, California office. Candidates should be based in the Bay Area or able to relocate for the internship period and available to work on site.

Model pruning: Prune the model to speed up inference with re-training to maintain accuracy. MS student in CS or related fields. Proficiency in Python Experience with model pruning and training in PyTorch Experience in quantization, and vision model accuracy metrics. We are a collaborative team focused on building something extraordinary in the edge computing space. The hourly rate for this temporary internship position is $45.00/hour to $60.00/hour. Quadric interns receive hands-on experience working alongside industry experts in AI and semiconductor technology, with access to mentorship and meaningful project ownership from day one. Founded in 2016 and based in downtown Burlingame, California, Quadric is building the world’s first supercomputer designed for the real-time needs of edge devices. We consider all qualified applicants without regard to race, color, religion, sex, gender identity or expression, sexual orientation, national origin, age, disability, veteran status, or any other protected characteristic under applicable law. By submitting an application, you acknowledge that Quadric will collect and process your personal information as part of the hiring process. Please review our Privacy Policy to understand how we handle your data.

Apply

Vacancy posted 2 days ago

Similar jobs that could be interesting for youBased on the AI Inference Engineer Intern - Model Pruning in Burlingame, CA vacancy

AI Inference Engineer Intern - Model Pruning
$45 - $60 per hour
...targeted to run neural network (NN) inference workloads in a wide variety... .... Responsibilities: Model pruning: Prune the model to speed up... ...work location. Quadric interns receive hands-on experience working... ...industry experts in AI and semiconductor technology,...
Internship
Hourly pay
Temporary work
Work at office
Relocation
quadric, Inc
Burlingame, CA
9 days ago
Senior AI Inference Engineer - Model Optimization & Deployment
$242k - $290k
...Model Optimization & Deployment Engineer The Perception team is pioneering the development... ...and build highly concurrent inference code to ensure real-time,... ...quantization (PTQ, QAT), pruning, mixed-precision inference... ...maximize memory bandwidth on AI accelerators. Write...
Suggested
Temporary work
Relocation package
Zoox
San Mateo, CA
4 days ago
AI Inference Engineer
$110k - $270k
...neural network (NN) inference workloads in a wide variety... .... Role The AI Inference Engineer in Quadric is the key... ...the world of AI/LLM models and Quadric unique... ...Quantize, prune and convert models for... ...role to the business, internal equity, and work location...
Suggested
Work at office
Local area
Immediate start
Flexible hours
2 days per week
quadric.io
Burlingame, CA
7 days ago
Data Science Intern - Model Optimization
$45 - $60 per hour
...targeted to run neural network (NN) inference workloads in a wide variety of... ...for internship focused on model optimization for Quadric's... ...and work location. Quadric interns receive hands-on experience working... ...alongside industry experts in AI and semiconductor technology,...
Internship
Hourly pay
Temporary work
quadric.io
Burlingame, CA
3 days ago
AI Software Engineering Intern - Fall 2026
$55 - $65 per hour
...AI Software Engineering Intern - Fall 2026 San Mateo, CA United States Who We Are Verkada is transforming how organizations protect their... ...exploring and applying cutting-edge large video and audio models to solve real-world security challenges. We are committed...
Internship
Hourly pay
Work at office
Work visa
Shift work
Verkada
San Mateo, CA
1 day ago
AI Inference Engineer
$110k - $270k
...to run neural network (NN) inference workloads in a wide variety... ...control code. Role: The AI Inference Engineer in Quadric is the key bridge... ...between the world of AI/LLM models and Quadric unique platforms... ...: Quantize, prune and convert models for deployment...
Full time
Temporary work
Work from home
quadric, Inc
Burlingame, CA
more than 2 months ago
Applied AI Engineering Intern
$5,000 - $8,000 per month
...Aegis is a YC-backed AI startup building agents for health insurance denial management... ..., and we are now hiring an Applied AI Engineering Intern to work closely with the founding team... ..., evaluate, fine-tune, and improve AI models in production. Maintaining and building...
Internship
Immediate start
Aegis (YC X25)
San Mateo, CA
3 days ago
AI Engineer / Senior AI Engineer
$100k - $150k
...integrates strategy, data, and AI to deliver scalable,... ...portfolio. As an AI Engineer / Senior AI Engineer, you... ...powered solutions for both internal teams and clients. You... ...processes to support AI model training, fine-tuning, and inference workflows Investigate...
Temporary work
Immediate start
Flexible hours
Blue Matter
South San Francisco, CA
4 days ago
Data Scientist - Model Optimization
$110k - $270k
...targeted to run neural network (NN) inference workloads in a wide variety... ...data science team focused on model optimization for Quadric's... ...California Bay Area based engineering role is intended to be primarily... ...configs. Publish internal white papers, external benchmarks...
Work at office
Local area
Immediate start
Flexible hours
quadric.io, Inc
Burlingame, CA
5 days ago
AI Applications Engineer
$110k - $270k
...to run neural network (NN) inference workloads in a wide variety... ...control code. Role The AI Applications Engineer is the key bridge between... ...Experience with quantization and model accuracy analysis a plus... ...the role to the business, internal equity, and work location....
Work at office
Local area
Immediate start
Worldwide
Flexible hours
quadric.io
Burlingame, CA
8 days ago
Software Engineer Intern - Compiler
$45 - $60 per hour
...targeted to run neural network (NN) inference workloads in a wide variety... ...The Role As a Software Engineer Intern - Compiler, you will work... ...compiler features with real model requirements and hardware constraints... ...industry experts in AI and semiconductor technology,...
Internship
Hourly pay
Temporary work
Work at office
Relocation
quadric.io, Inc
Burlingame, CA
5 hours ago
AI-Powered UX Engineer Intern (Frontend)
Alation is seeking a UX Engineer Intern in Redwood City, CA, to contribute to frontend code across the product. You'll bring design sensibility... ..., and HTML/CSS. This intern position offers a hybrid work model with local candidates preferred, as relocation is not available...
Internship
Local area
Relocation
Alation
Redwood City, CA
3 days ago
PhD Data Scientist, Intern
$143k - $156k
...PhD Data Scientist, Intern Stripe is a financial infrastructure platform for businesses... ...our products, and our business have the models, data products, and insights needed to make... ...Apply machine learning, causal inference, or advanced analytics on large datasets...
Internship
Summer work
Work at office
Immediate start
Stripe
South San Francisco, CA
5 hours ago
Software Engineer Intern
...We are particularly interested in candidates with experience in AI, programming languages, compilers, static and dynamic analysis,... ...seed stage company with big ambitions. Work with experienced engineers who are experts in DevEx Work in an environment that values...
Internship
Summer internship
Worldwide
Gitar, Inc.
San Mateo, CA
5 hours ago
Senior AI/ML Platform Engineer
$148k - $247k
...is at the forefront of AI, cloud, and data platform... ...Senior AI/ML Platform Engineer, you will architect and... ...from data ingestion to model monitoring. Design... ...Experience with real-time model inference and streaming ML... ...development and internal career growth opportunities...
Full time
Part time
Immediate start
Flexible hours
Guidewire
San Mateo, CA
4 days ago
Software Engineer, Inference
$187.5k - $395k
...About Luma AI Luma's mission is to build multimodal AI... ...intelligence. To go beyond language models and build more aware, capable... ...by integrating them into our inference engine Collaborate closely across... ...and deployments Build internal tooling to measure, profile,...
Luma AI
Redwood City, CA
2 days ago
Software Engineer, ML Performance Optimization
$192k - $257k
...large-scale Foundation models, VLMs, and VLAs to make... ...quantization, distillation, and pruning, among other things,... ...of strong software engineers and act as a force multiplier for our internal customers. This team... ...cutting-edge ML Training OR Inference performance...
Temporary work
Relocation package
Zoox
San Mateo, CA
1 day ago
Autonomy Engineer - Deep Learning Model Acceleration
$170k - $277.5k
...deep learning infrastructure engineer, you will be responsible for building... ...'s Deep Learning (DL) and AI efforts. You will be working... ...high-performance deep learning inference for CV workloads that can... ...Profile CV and Vision Language Models (VLMs) to analyze performance,...
Full time
Local area
Relocation package
Skydio
San Mateo, CA
5 hours ago
Founding AI Engineer
$160k - $250k
...Title: Founding AI Engineer (Research & Systems) Target: PhDs & Research Masters from Stanford, MIT, Berkeley, CMU focused on AI, ML,... ...mission is to move beyond simple RAG and chain-of-thought, creating models that can dynamically plan, execute, and learn in complex...
H1b
Immediate start
Visa sponsorship
AimHire
San Mateo, CA
5 days ago
Senior AI Developer Productivity Engineer
$192k - $300k
...define and enforce the best practices for engineering across the company. Our approach involves... ...our own. We're leveraging Large Language Models (LLMs) to improve development velocity... ...establishing best practices for responsible AI integration in our development pipeline...
Temporary work
Zoox
San Mateo, CA
4 days ago
Senior AI Engineer
...to drive life-changing impact to ZS. AI Engineer We are seeking an AI Engineer with experience... ...layer from data all-the-way to the AI model output • Design, develop and deploy... ...to career progression opportunities Internal mobility paths that empower growth via s...
Work experience placement
Work at office
Local area
Remote work
Work from home
Worldwide
Flexible hours
2 days per week
3 days per week
ZS
South San Francisco, CA
2 days ago
AI Engineer, Computer Vision
$240k - $280k
...AI Engineer, Computer Vision San Bruno, California Mill is a waste prevention technology company reimagining what it means to eliminate... ...compute directly into our high-capacity food recycler; models running on the edge identify, classify, and quantify food scraps...
Mill
San Bruno, CA
3 days ago
Senior AI Engineer
$152.7k - $249.2k
...We're looking for a Senior AI Engineer to help bring pragmatic, production... ...into production to improve internal workflows (e.g., knowledge... ...core ML/LLM infrastructure (model gateways, prompt/agent orchestration... ...ML, reproducible training/inference pipelines. Experience...
Temporary work
Joby Aviation
San Carlos, CA
5 hours ago
Senior Software Engineer, Data Model
$242.1k - $293.8k
...experiences for everyone. As a Senior Software Engineer on the Engine DataModel team, you will... ...of our HQ in San Mateo, CA in a hybrid model 3 days a week (Tuesdays to Thursdays).... ...experience working on game engine internals Responsibility for building and maintaining...
Full time
Work experience placement
H1b
Work at office
Local area
Visa sponsorship
Monday to Friday
3 days per week
Roblox
San Mateo, CA
1 day ago
Principal AI Engineer
$250k - $350k
...About the Company A high-growth AI startup backed by over $100M in funding, serving some of the world's largest enterprises.... ...AI innovation. You'll architect AI systems, lead a team of top engineers, and drive technical strategy across the company's core AI initiatives...
Flexible hours
remoti
San Mateo, CA
2 days ago
Applied AI Engineer
...Software Engineer Healthcare operations have always depended on people... ...work to carry out critical internal processes, yet most health... .... By delegating to autonomous AI systems those mission-critical... ...and optimize machine learning models. Perform rigorous evaluation and...
Work at office
Worldwide
3 days per week
Luminai
San Mateo, CA
4 days ago
Software Engineer Intern (Summer 2026)
...Software Engineer Intern We're looking for a Software Engineer Intern to join our team in San Mateo, CA for eight weeks this summer (June... ...Runners are self-driving CI - fast GitHub Actions runners with AI agents that continuously analyze and ship optimization PRs....
Internship
Summer work
Immediate start
Starsling (yc X25)
San Mateo, CA
2 days ago
Senior AI Software Engineer — Full-Stack & GenAI
$80 - $85 per hour
A leading clinical development solutions provider seeks a Software Developer to innovate and develop applications that integrate AI capabilities. You will design user-centric interfaces, write maintainable code, and work with cross-functional teams. Candidates should have...
Hourly pay
Contract work
Integrated Resources, Inc ( IRI )
South San Francisco, CA
4 days ago
Sr AI Fullstack Engineer (GC/Citizens, W2 Only)
...leader in biotechnology and life sciences, is looking for a "Sr AI Fullstack Engineer" based out of South San Francisco, CA. Job Duration:... ...software which interacts with cutting-edge generative AI models and applications in collaboration with AI scientists, full stack...
Long term contract
Work at office
Dawar Consulting
South San Francisco, CA
5 hours ago
Senior AI/ML Engineer
$188k - $250k
...and LLM systems that analyze AI Answering engine outputs and public web... ...and monitoring frameworks for model quality (factuality, coverage... .../recall, latency, and total inference spend (model selection, prompt... ...fairness, consistency, and internal equity across teams and geographies...
Local area
Meltwater
Redwood City, CA
26 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Inference Engineer Intern - Model Pruning. Be the first to apply!