A 3-D BANK MEMORY SYSTEM FOR LOW-POWER NEURAL NETWORK PROCESSING ACHIEVED BY INSTANT CONTEXT SWITCHING AND EXTENDED POWER GATING TIME

A 3-D Bank Memory System for Low-Power Neural Network Processing Achieved by Instant Context Switching and Extended Power Gating Time

A 3-D Bank Memory System for Low-Power Neural Network Processing Achieved by Instant Context Switching and Extended Power Gating Time

Blog Article

Using a 3-D monolithic stacking memory technology of crystalline oxide semiconductor (OS) transistors, we fabricated a test chip having AI accelerator (ACC) memory for weight data of a neural network (NN), backup incredibleindiatourtravels.com memory of flip-flops (FF), and CPU memory storing instructions and data.These memories are composed of two-layer OS transistors on Si CMOS, where memories in each layer correspond to a bank.In this structure, bank switching of the ACC memory and the FF backup memory work together, and thus inference of different NNs is switched with low latency and low power so that the power gating standby time can be extended.

Consequently, here a 92% reduction in power consumption is achieved in inference at a frame rate of 60 fps as compared with a chip using static random access memory (SRAM) as the ACC memory.

Report this page