Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Structured pruning for efficient systolic array accelerated cascade Speech-to-Text Translation
 
conference paper not in proceedings

Structured pruning for efficient systolic array accelerated cascade Speech-to-Text Translation

Rouas, Jean-Luc
•
Brazier, Charles
•
Letaifa, Leila
Show more
2025
26th Interspeech Conference 2025

We present in this paper a simple method for pruning tiles of weights in sparse matrices, that do not require fine-tuning or retraining. This method is applied here to the feed-forward layers of transformers. We assess in a first experiment the impact of such pruning on the performances of speech recognition, machine translation, and the cascaded speech-to-text translation, on the MuST-C database, for the English to French direction. Depending on the size of the pruned tiles (from 4x4 to 32x32), we observe that pruning rates from 15 to 40% for speech recognition and from 40 to 70% for machine translation are feasible for a performance degradation of 10%. Applying this pruning method to the systolic array accelerated version of the cascade speech-to-text translation system results in speedups up to 74x compared to the non-accelerated system. Energy consumption also benefits from structured pruning with a maximum reduction of 35%.

  • Files
  • Details
  • Metrics
Loading...
Thumbnail Image
Name

Structured pruning for efficient systolic array accelerated cascade Speech-to-Text Translation.pdf

Type

Main Document

Version

Accepted version

Access type

openaccess

License Condition

N/A

Size

1.34 MB

Format

Adobe PDF

Checksum (MD5)

7978ae8d65bc1ee06ab01cc621cb37cf

Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés