The Codebook/GRECO-BIT ConsortiumVorontsov, Ilya E.Kozin, IvanAbramov, SergeyBoytsov, AlexandrJolma, ArttuAlbu, MihaiAmbrosini, GiovannaFaltejsková, KateřinaGralak, Antoni J.Gryzunov, NikitaInukai, SachiKolmykov, SemyonKravchenko, PavelKribelbauer, Judith F.Laverty, Kaitlin U.Nozdrin, VladimirPatel, Z.Penzar, DmitryPlescher, Marie-LuisePour, Sara E.Razavi, RozitaYang, AllyYevshin, IvanZinkevich, ArseniiWeirauch, Matthew T.Bücher, PhilippDeplancke, BartFornés, OriolGrau, JanGroße, IvoKolpakov, FedorBarazandeh, MarjanBrechalov, AlexanderDeng, ZhenfengFathi, AliHu, ChunLambert, Samuel A.Salnikov, MikhailYellan, IsaacID, Deleted AuthorMeshcheryakov, G. A.Nikonov, MikhailKamenets, VasiliiВласов, А ПHernández-Corchado, AldoNajafabadi, Hamed S.Morris, QuaidChen, XiaotingMakeev, Vsevolod J.Hughes, Timothy R.Kulakovskiy, Ivan V.2025-11-112025-11-112025-11-102025-11-0710.1038/s42003-025-08909-9https://infoscience.epfl.ch/handle/20.500.14299/255727A sequence motif representing the DNA-binding specificity of a transcription factor (TF) is commonly modelled with a positional weight matrix (PWM). Focusing on understudied human TFs, we processed results of 4,237 experiments for 394 TFs, assayed using five different experimental platforms. By human curation, we approved a subset of experiments that yielded consistent motifs across platforms and replicates, and evaluated quantitatively the cross-platform performance of PWMs obtained with ten motif discovery tools. Notably, nucleotide composition and information content are not correlated with motif performance and do not help in detecting underperformers, while motifs with low information content, in many cases, describe well the binding specificity assessed across different experimental platforms. By combining multiple PMWs into a random forest, we demonstrate the potential of accounting for multiple modes of TF binding. Finally, we present the Codebook Motif Explorer ( https://mex.autosome.org ), cataloguing motifs, benchmarking results, and the underlying experimental data.enCross-platform motif discovery and benchmarking to explore binding specificities of poorly studied human transcription factorstext::journal::journal article::research article