AI Inference Engineer Intern - Model Pruning
$45 - $60 per hour
Quadric, Inc
Quadric has created an innovative general purpose neural processing unit (GPNPU) architecture. Quadric's co-optimized software and hardware are targeted to run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging from battery-operated smart-sensor systems to high-performance automotive or autonomous vehicle systems. Unlike other NPUs or neural network accelerators in the industry today, which can only accelerate a portion of a machine learning graph, the Quadric GPNPU executes both NN graph code and conventional C++ DSP and control code.

Note: Our preference is for this internship to be based out of our Burlingame, California office. Candidates should be based in the Bay Area, or able to relocate for the internship period, and available to work on site.

Responsibilities:
Model pruning: Prune the model to speed up inference, with re-training to maintain accuracy.

Requirements:
MS student in CS or a related field
Proficiency in Python
Experience with model pruning and training in PyTorch
Experience with quantization and vision model accuracy metrics
We are a collaborative team focused on building something extraordinary in the edge computing space.
The hourly rate for this temporary internship position is $45.00/hour to $60.00/hour. Quadric interns receive hands-on experience working alongside industry experts in AI and semiconductor technology, with access to mentorship and meaningful project ownership from day one.
Founded in 2016 and based in downtown Burlingame, California, Quadric is building the world’s first supercomputer designed for the real-time needs of edge devices.

We consider all qualified applicants without regard to race, color, religion, sex, gender identity or expression, sexual orientation, national origin, age, disability, veteran status, or any other protected characteristic under applicable law.
By submitting an application, you acknowledge that Quadric will collect and process your personal information as part of the hiring process. Please review our Privacy Policy to understand how we handle your data.
Vacancy posted 2 days ago



