Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Datasets and Code
  4. Out with LSQs: Custom Circuits for Memory Access Reordering in Dynamic HLS
 
software

Out with LSQs: Custom Circuits for Memory Access Reordering in Dynamic HLS

Pirayadi, Rouzbeh  
•
Elakhras, Ayatallah  
•
Stojilović, Mirjana  
Show more
December 18, 2025
Zenodo

This repository contains the source code, benchmarks, and experimental results associated with the paper "Out with LSQs: Custom Circuits for Memory Access Reordering in Dynamic HLS" by Rouzbeh Pirayadi (EPFL), Ayatallah Elakhras (EPFL), Mirjana Stojilović (EPFL), and Paolo Ienne (EPFL). The paper has been accepted for publication in the Proceedings of the 34th International Symposium on Field-Programmable Gate Arrays (ISFPGA 2026).

Abstract:

Circuits generated by dynamically scheduled high-level synthesis (HLS) outperform static counterparts when execution-dependent operation reordering is possible. A key opportunity are potential memory dependencies that static analysis cannot rule out: Statically scheduled circuits must assume a worst-case scenario, enforcing conservative in-order execution even when unnecessary. In contrast, dynamically scheduled circuits use load-store queues (LSQs) to detect access collisions at runtime and reorder accesses whenever possible. While effective, LSQs are costly in both area and timing due to their size and critical path. We argue that LSQs in application-specific circuits are overly generic, checking all ordering relationships instead of only the necessary ones. Instead of relying on LSQs, our approach generates dataflow circuits that enforce only essential dependencies, reducing resource usage while preserving execution time. We implemented our methodology in an open-source, state-of-the-art dynamic HLS compiler and compared it against optimized LSQs. Our solution exploits the same reordering opportunities to achieve consistently Pareto-optimal solutions in terms of wall-clock time and resources. On average, we obtain similar execution times (2% better) while reducing resources by 37%, offering a superior alternative for high-performance dataflow circuit synthesis.

  • Details
  • Metrics
Type
software
DOI
10.5281/zenodo.17975150
Author(s)
Pirayadi, Rouzbeh  

École Polytechnique Fédérale de Lausanne

Elakhras, Ayatallah  

École Polytechnique Fédérale de Lausanne

Stojilović, Mirjana  

École Polytechnique Fédérale de Lausanne

Ienne, Paolo  

École Polytechnique Fédérale de Lausanne

Date Issued

2025-12-18

Version

v1

Publisher

Zenodo

License

CC BY

Subjects

High-level synthesis

•

Dataflow circuits

•

Load-store queue

Additional link

Code Repository URL

https://github.com/EPFL-LAP/dynamatic/tree/replace-lsq
EPFL units
LAP  
PARSA  
Event nameEvent acronymEvent placeEvent date
International Symposium on Field-Programmable Gate Arrays

ISFPGA

Seaside, CA, USA

2026-02-24 - 2026-02-24

RelationRelated workURL/DOI

Continues

From C/C plus plus Code to High-Performance Dataflow Circuits

https://infoscience.epfl.ch/handle/20.500.14299/188830

IsVersionOf

https://doi.org/10.5281/zenodo.17975149
Available on Infoscience
December 31, 2025
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/257430
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés