Paper 9172 Detail Information

Information for Paper ID 9172

Paper Information:
Paper Title:	An Efficient Sparse CNN Architecture with Index-Based Kernel Transformation
Student Contest:	No
Affiliation Type:	Academia
Keywords:	Convolutional Neural Networks, Sparse Network, Index-based Kernel Transformation
Abstract:	This paper proposes an index-based kernel transformation to reduce the memory requirements and energy consumption of convolutional neural networks (CNNs). This transformation is implemented on networks with quantized weights represented by an index while maintaining fixed-point precision. The proposed algorithm eliminates redundant operations by extracting common index patterns from different kernels, performing identical operations only once. A specifically designed hardware is implemented to perform the convolution and rebuild the correct results from the extracted patterns in parallel. The experiment shows that deploying an extensive network like VGG-16 on the proposed hardware only requires 2.620KB of on-chip memory, and importantly, it exhibits superior energy efficiency compared to the state-of-the-art. This promising result suggests a practical solution to the growing demands on memory and operations in large CNNs.
Track ID:	9.6
Track Name:	Efficient and Reliable AI Accelerator Design and Optimization for Deep Neural Networks
Final Decision:	Accept as Lecture
Session Name:	Emerging Computing Techniques for Machine Learning, Bio-sensing, & Security (Lecture)
Author Questions:
Demo:	No