Abstract

Resistive switching memory (RRAM) technologies are seen by much of the scientific community as an enabler for Edge-level applications such as embedded deep learning, AI, or audio and video signal processing. However, going beyond a "simple" replacement of eFlash in microcontrollers and introducing RRAM into the memory hierarchy is not a straightforward move. Indeed, integrating an RRAM technology inside the cache hierarchy imposes higher endurance requirements than eFlash replacement does, and thus necessitates relaxed programming conditions. By doing so, the reliability bottleneck moves from the programming to the read operations (i.e., the read margin is reduced and the risk of read failure is increased). Based on this observation, in this work we propose to explore how Edge-level applications running on an RRAM-based Edge device could fail because of Bias Temperature Instability (BTI). BTI causes threshold voltage (Vt) degradation in the transistors along the memory WordLines (WL), leading to a reduction of the read margin along regularly used WLs. We therefore propose a three-step methodology consisting of (i) characterizing the RRAM bitcell and identifying beyond which Vt shift the read operation fails; (ii) characterizing applications and extracting their memory traces; and (iii) running a long-term BTI simulation to extract the actual Vt shift of the bitcells sharing the same array WordLine. Based on this, we show that for a 1T1R bitcell featuring a 250% High/Low Resistance State (HRS/LRS) ratio, read failures tend to happen in less than a month in the case of a constantly running convolution kernel. These simulations highlight the fact that transistor-level reliability can be critical for embedded RRAM and that specific workload-aware simulation frameworks are required to assess such effects.
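To make the three-step flow concrete, the sketch below mocks it up in Python. Everything in it is an illustrative assumption rather than the paper's actual framework: the 30 mV failure threshold standing in for the step (i) bitcell characterization, the 10 ns read pulse, the per-WordLine read counts standing in for the step (ii) memory traces, and the generic power-law BTI aging model dVt = A * t^n with placeholder A and n values for step (iii).

```python
# Hypothetical sketch of the three-step methodology; all parameter
# values and model choices below are illustrative assumptions.

# Step (i): assumed outcome of bitcell characterization -- the Vt
# shift beyond which the read margin of a 1T1R bitcell with a 250%
# HRS/LRS ratio is taken to vanish.
VT_SHIFT_FAIL_V = 0.030  # hypothetical 30 mV failure threshold

def stress_time_s(read_counts, read_pulse_s=10e-9):
    """Step (ii): turn per-WordLine read counts (extracted from an
    application memory trace) into accumulated bias-stress time on
    that WL's access transistors (one read = one assumed 10 ns pulse)."""
    return {wl: n * read_pulse_s for wl, n in read_counts.items()}

def bti_vt_shift_v(t_stress_s, a=0.005, n=0.2):
    """Step (iii): generic power-law BTI aging model, dVt = A * t^n.
    A and n are placeholder fitting parameters, not silicon data."""
    return a * t_stress_s ** n

def failing_wordlines(read_counts):
    """Flag the WLs whose simulated Vt shift exceeds the step (i)
    failure threshold."""
    return [wl for wl, t in stress_time_s(read_counts).items()
            if bti_vt_shift_v(t) > VT_SHIFT_FAIL_V]

# Example: a convolution kernel hammering WL 3 with back-to-back
# 10 ns reads for one month, versus a rarely accessed WL 7.
month_of_reads = int(30 * 24 * 3600 / 10e-9)
print(failing_wordlines({3: month_of_reads, 7: 1_000}))  # -> [3]
```

Under these placeholder parameters, the constantly read WordLine accumulates roughly a 96 mV shift over the month and crosses the threshold, while the rarely accessed one does not, mirroring the qualitative, workload-dependent failure behavior described above.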
