Review and Benchmarking of Precision-Scalable Multiply-Accumulate Unit Architectures for Embedded Neural-Network Processing

Camus, Vincent; Mei, Linyan; Enz, Christian; Verhelst, Marian

doi:10.1109/JETCAS.2019.2950386

research article

October 30, 2019

IEEE Journal on Emerging and Selected Topics in Circuits and Systems (JETCAS)

The current trend for deep learning has come with an enormous computational need for billions of Multiply-Accumulate (MAC) operations per inference. Fortunately, reduced precision has demonstrated large benefits with low impact on accuracy, paving the way towards processing in mobile devices and IoT nodes. To this end, various precision-scalable MAC architectures optimized for neural networks have recently been proposed. Yet it has been hard to understand their differences and to judge their relative benefits fairly, as they have been implemented with different technologies and performance targets. To address this, this work exhaustively reviews the state-of-the-art precision-scalable MAC architectures and unifies them in a new taxonomy. These topologies are then thoroughly benchmarked in a 28 nm commercial CMOS process, across a wide range of performance targets, and with precision ranging from 2 to 8 bits. Circuits are analyzed for each precision as well as jointly in practical use cases. The analysis highlights the impact of architecture and scalability on energy, throughput, area and bandwidth, with the aim of understanding the key trends for reducing computation costs in neural-network processing.

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/164321

Name: camus-mei_jetcas19.pdf

Type: Publisher's version

Access type: openaccess

License Condition: CC BY

Size: 13.06 MB

Format: Adobe PDF

Checksum (MD5): 3d0e1130c97345c57ea87751f63486f0