Journal article
Fast Inner-Product Algorithms and Architectures for Deep Neural Network Accelerators
Abstract
We introduce a new algorithm called the Free-pipeline Fast Inner Product (FFIP) and its hardware architecture that improve an under-explored fast inner-product algorithm (FIP) proposed by Winograd in 1968. Unlike the unrelated Winograd minimal filtering algorithms for convolutional layers, FIP is applicable to all machine learning (ML) model layers that can mainly decompose to matrix multiplication, including fully-connected, convolutional, …
Authors
Pogue TE; Nicolici N
Journal
IEEE Transactions on Computers, Vol. 73, No. 2, pp. 495–509
Publisher
Institute of Electrical and Electronics Engineers (IEEE)
DOI
10.1109/tc.2023.3334140
ISSN
0018-9340