AI Inference Engineer Intern - Model Pruning

$45 - $60 per hour

quadric, Inc

Quadric has created an innovative general purpose neural processing unit (GPNPU) architecture. Quadric's co-optimized software and hardware are targeted to run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging from battery-operated smart-sensor systems to high-performance automotive or autonomous vehicle systems. Unlike other NPUs or neural network accelerators in the industry today, which can only accelerate a portion of a machine learning graph, the Quadric GPNPU executes both NN graph code and conventional C++ DSP and control code.

Note: Our preference is for this internship to be based out of our Burlingame, California office. Candidates should be based in the Bay Area or able to relocate for the internship period and available to work on site.

Responsibilities:
  • Model pruning: prune the model to speed up inference, with re-training to maintain accuracy

Requirements:
  • MS student in CS or a related field
  • Proficiency in Python
  • Experience with model pruning and training in PyTorch
  • Experience in quantization and vision model accuracy metrics

We are a collaborative team focused on building something extraordinary in the edge computing space.

The hourly rate for this temporary internship position is $45.00/hour to $60.00/hour.

Quadric interns receive hands-on experience working alongside industry experts in AI and semiconductor technology, with access to mentorship and meaningful project ownership from day one.

Founded in 2016 and based in downtown Burlingame, California, Quadric is building the world’s first supercomputer designed for the real-time needs of edge devices.

We consider all qualified applicants without regard to race, color, religion, sex, gender identity or expression, sexual orientation, national origin, age, disability, veteran status, or any other protected characteristic under applicable law.

By submitting an application, you acknowledge that Quadric will collect and process your personal information as part of the hiring process. Please review our Privacy Policy to understand how we handle your data.
Vacancy posted 2 days ago