000266022 001__ 266022
000266022 005__ 20190812204802.0
000266022 037__ $$aCONF
000266022 245__ $$aToward a Dynamic Threshold for Quality-Score Distortion in Reference-Based Alignment
000266022 260__ $$c2019
000266022 269__ $$a2019
000266022 336__ $$aConference Papers
000266022 520__ $$aThe intrinsically high-entropy metadata known as quality scores are largely responsible for the substantial size of sequence data files. Yet there is no consensus on a viable reduction of the resolution of the quality score scale, arguably because of collateral side effects. In this paper we leverage the penalty functions of the HISAT2 aligner to rebin the quality score scale in such a way as to avoid any impact on sequence alignment, and in doing so identify a distortion threshold. We tested our findings on whole-genome sequence and RNA sequence data, and contrasted the results with three methods for lossy distortion of the quality scores.
000266022 6531_ $$aQuality scores
000266022 6531_ $$aReference-based alignment
000266022 6531_ $$aQuality score distortion
000266022 6531_ $$aHISAT2
000266022 6531_ $$aLossy compression
000266022 700__ $$aHernandez-Lopez, Ana A.$$g248328
000266022 700__ $$aAlberti, C.$$g123574
000266022 700__ $$aMattavelli, M.$$g102553
000266022 7112_ $$a15th International Symposium on Bioinformatics Research and Applications (ISBRA)$$cBarcelona, Spain$$dJune 3–6, 2019
000266022 8560_ $$fana.hernandezlopez@epfl.ch
000266022 8564_ $$uhttps://infoscience.epfl.ch/record/266022/files/QS-distortion-threshold_ISBRA2019.pdf$$s1654181
000266022 909C0 $$mdaniela.vallat@epfl.ch$$mclaudio.alberti@epfl.ch$$0252288$$zMarselli, Béatrice$$xU12149$$pSCI-STI-MM
000266022 909CO $$pconf$$pSTI$$ooai:infoscience.epfl.ch:266022
000266022 960__ $$aana.hernandezlopez@epfl.ch
000266022 961__ $$apierre.devaud@epfl.ch
000266022 973__ $$aEPFL$$rREVIEWED
000266022 980__ $$aCONF
000266022 981__ $$aoverwrite