Infoscience

conference poster

Systolic Arrays and Structured Pruning Co-design for Efficient Transformers in Edge Systems

Palacios Almendros, Pedro • Medina Morillas, Rafael • Rouas, Jean-Luc • et al.
2025
GLSVLSI '25: Proceedings of the Great Lakes Symposium on VLSI 2025
35th Great Lakes Symposium on VLSI

Efficient deployment of resource-intensive transformers on edge devices necessitates cross-stack optimization. We therefore study the interrelation between structured pruning and systolic acceleration, matching the size of pruned blocks to the systolic array dimensions. In this setting, computations on pruned weight blocks can be skipped, which reduces run-time and energy consumption but may affect quality of service (QoS). To evaluate the trade-offs between systolic array size and sparsity opportunities, we present a novel co-design framework that integrates algorithmic optimization, system simulation, and hardware design. Using transformers for speech recognition and machine translation as a case study, we analyze how configuration choices across the stack affect performance metrics. Results demonstrate that structured pruning on systems featuring systolic array acceleration can effectively increase performance while maintaining high QoS levels. We measured system-wide speedups of up to 44% from structured pruning and quantization, with only 1.4% word error rate degradation on the standard LibriSpeech dataset.
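
The block-skipping idea described in the abstract can be illustrated with a small sketch. This is not the authors' framework: it is a minimal pure-Python illustration that assumes square tiles whose side equals a hypothetical systolic array dimension (`tile`), uses L1-norm magnitude as an assumed pruning criterion, and counts executed tiles as a rough stand-in for accelerator invocations. The function names `prune_blocks` and `tiled_matvec` are invented for this example.

```python
import random


def prune_blocks(weights, tile, sparsity):
    """Zero the tile x tile blocks with the smallest L1 norm (assumed criterion).

    Returns a pruned copy of the weights and the set of (block_row, block_col)
    coordinates that were kept.
    """
    rows, cols = len(weights), len(weights[0])
    assert rows % tile == 0 and cols % tile == 0, "matrix must tile evenly"
    blocks = []
    for bi in range(rows // tile):
        for bj in range(cols // tile):
            norm = sum(
                abs(weights[bi * tile + i][bj * tile + j])
                for i in range(tile)
                for j in range(tile)
            )
            blocks.append((norm, bi, bj))
    blocks.sort()  # smallest-norm blocks first
    n_pruned = int(len(blocks) * sparsity)
    pruned = [row[:] for row in weights]
    for _, bi, bj in blocks[:n_pruned]:
        for i in range(tile):
            for j in range(tile):
                pruned[bi * tile + i][bj * tile + j] = 0.0
    kept = {(bi, bj) for _, bi, bj in blocks[n_pruned:]}
    return pruned, kept


def tiled_matvec(weights, x, tile, kept):
    """Compute y = W x one tile at a time, skipping tiles that were pruned.

    Returns y and the number of tiles actually executed.
    """
    rows = len(weights)
    y = [0.0] * rows
    executed = 0
    for bi in range(rows // tile):
        for bj in range(len(x) // tile):
            if (bi, bj) not in kept:
                continue  # pruned block: no work is issued for this tile
            executed += 1
            for i in range(tile):
                r = bi * tile + i
                y[r] += sum(
                    weights[r][bj * tile + j] * x[bj * tile + j]
                    for j in range(tile)
                )
    return y, executed


# Demo: an 8x8 weight matrix, 4x4 tiles, half of the blocks pruned.
random.seed(0)
W = [[random.uniform(-1, 1) for _ in range(8)] for _ in range(8)]
x = [random.uniform(-1, 1) for _ in range(8)]
W_pruned, kept = prune_blocks(W, tile=4, sparsity=0.5)
y, executed = tiled_matvec(W_pruned, x, tile=4, kept=kept)
total_tiles = (8 // 4) * (8 // 4)
```

In this toy setting the skipped tiles contribute exactly zero to the result, so the tiled product matches a dense product over the pruned weights while issuing fewer tile computations. How much of that reduction turns into system-wide speedup, and at what QoS cost, is what the co-design framework in the abstract evaluates.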

Name: GLSVLSI_2025_Pedro.pdf
Type: Main Document
Version: http://purl.org/coar/version/c_ab4af688f83e57aa
Access type: openaccess
License Condition: N/A
Size: 1.43 MB
Format: Adobe PDF
Checksum (MD5): f37f12391f7ab433f96ee1639bb456d0

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, all rights reserved