TiC-SAT: Tightly-coupled Systolic Accelerator for Transformers

Amirshahi, Alireza; Klein, Joshua Alexander Harrison; Ansaloni, Giovanni; Atienza Alonso, David

doi:10.1145/3566097.3567867

conference paper not in proceedings

ASP-DAC 2023

Transformer models have achieved impressive results in various AI scenarios, ranging from vision to natural language processing. However, their computational complexity and vast number of parameters hinder their implementation on resource-constrained platforms. Furthermore, while loosely-coupled hardware accelerators have been proposed in the literature, data transfer costs limit their speed-up potential. We address this challenge along two axes. First, we introduce tightly-coupled, small-scale systolic arrays (TiC-SATs), governed by dedicated ISA extensions, as functional units to speed up execution. Then, thanks to the tightly-coupled architecture, we employ software optimizations to maximize data reuse, thus lowering miss rates across cache hierarchies. Full system simulations across various BERT and Vision Transformer models are employed to validate our strategy, resulting in substantial application-wide speed-ups (e.g., up to 89.5x for BERT-large). TiC-SAT is available as an open-source framework.
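As a rough illustration of the data-reuse idea in the abstract, the sketch below multiplies matrices tile by tile, with each tile-level call standing in for one operation on a small systolic array. It is not taken from the paper or the TiC-SAT framework: the tile size `TILE`, the helper `tile_mac`, the function `tiled_matmul`, and the loop order are all assumptions chosen for illustration.

```python
# Illustrative sketch only: tile size, names, and loop order are assumptions,
# not taken from the TiC-SAT paper or its open-source framework.

TILE = 4  # hypothetical systolic-array dimension (TILE x TILE)


def tile_mac(a, b, acc, i0, j0, k0, n):
    """Stand-in for one accelerator call: acc tile += A tile @ B tile."""
    for i in range(i0, min(i0 + TILE, n)):
        for j in range(j0, min(j0 + TILE, n)):
            s = acc[i][j]
            for k in range(k0, min(k0 + TILE, n)):
                s += a[i][k] * b[k][j]
            acc[i][j] = s


def tiled_matmul(a, b):
    """Multiply two n x n matrices (lists of lists) tile by tile."""
    n = len(a)
    c = [[0] * n for _ in range(n)]
    for i0 in range(0, n, TILE):
        for j0 in range(0, n, TILE):
            # The same output tile is accumulated while k0 sweeps; inside
            # each tile_mac call every input element is used up to TILE times.
            for k0 in range(0, n, TILE):
                tile_mac(a, b, c, i0, j0, k0, n)
    return c
```

Working on small tiles keeps the operands of each call within a compact region of memory, which is the kind of locality that can lower cache miss rates. The actual instruction set extensions and tiling strategy used by TiC-SAT are described in the paper itself.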

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/192304

Name: TiC_SAT_ASPDAC-preprint.pdf
Type: postprint
Access type: openaccess
License Condition: copyright
Size: 999.07 KB
Format: Adobe PDF
Checksum (MD5): 13a2553936b9dbb3d6c4e5bedd06f8d0