000114888 001__ 114888
000114888 005__ 20181203021032.0
000114888 0247_ $$2doi$$a10.1093/bib/3.3.225
000114888 037__ $$aARTICLE
000114888 245__ $$aInterPro: an integrated documentation resource for protein families, domains and functional sites
000114888 269__ $$a2002
000114888 260__ $$c2002
000114888 336__ $$aJournal Articles
000114888 500__ $$aEMBL Outstation, European Bioinformatics Institute, Hinxton, Cambridge, UK. mulder@ebi.ac.uk
000114888 520__ $$aThe exponential increase in the submission of nucleotide sequences to the nucleotide sequence database by genome sequencing centres has resulted in a need for rapid, automatic methods for classification of the resulting protein sequences. There are several signature and sequence cluster-based methods for protein classification, each resource having distinct areas of optimum application owing to the differences in the underlying analysis methods. In recognition of this, InterPro was developed as an integrated documentation resource for protein families, domains and functional sites, to rationalise the complementary efforts of the individual protein signature database projects. The member databases - PRINTS, PROSITE, Pfam, ProDom, SMART and TIGRFAMs - form the InterPro core. Related signatures from each member database are unified into single InterPro entries. Each InterPro entry includes a unique accession number, functional descriptions and literature references, and links are made back to the relevant member database(s). Release 4.0 of InterPro (November 2001) contains 4,691 entries, representing 3,532 families, 1,068 domains, 74 repeats and 15 sites of post-translational modification (PTMs) encoded by different regular expressions, profiles, fingerprints and hidden Markov models (HMMs). Each InterPro entry lists all the matches against SWISS-PROT and TrEMBL (2,141,621 InterPro hits from 586,124 SWISS-PROT and TrEMBL protein sequences). The database is freely accessible for text- and sequence-based searches.
000114888 700__ $$aMulder, N. J.
000114888 700__ $$aApweiler, R.
000114888 700__ $$aAttwood, T. K.
000114888 700__ $$aBairoch, A.
000114888 700__ $$aBateman, A.
000114888 700__ $$aBinns, D.
000114888 700__ $$aBiswas, M.
000114888 700__ $$aBradley, P.
000114888 700__ $$aBork, P.
000114888 700__ $$0244404$$g113607$$aBucher, P.
000114888 700__ $$aCopley, R.
000114888 700__ $$aCourcelle, E.
000114888 700__ $$aDurbin, R.
000114888 700__ $$aFalquet, L.
000114888 700__ $$aFleischmann, W.
000114888 700__ $$aGouzy, J.
000114888 700__ $$aGriffith-Jones, S.
000114888 700__ $$aHaft, D.
000114888 700__ $$aHermjakob, H.
000114888 700__ $$aHulo, N.
000114888 700__ $$aKahn, D.
000114888 700__ $$aKanapin, A.
000114888 700__ $$aKrestyaninova, M.
000114888 700__ $$aLopez, R.
000114888 700__ $$aLetunic, I.
000114888 700__ $$aOrchard, S.
000114888 700__ $$aPagni, M.
000114888 700__ $$aPeyruc, D.
000114888 700__ $$aPonting, C. P.
000114888 700__ $$aServant, F.
000114888 700__ $$aSigrist, C. J.
000114888 773__ $$j3$$tBrief Bioinform$$k3$$q225-35
000114888 909C0 $$xU11780$$0252244$$pGR-BUCHER
000114888 909CO $$pSV$$particle$$ooai:infoscience.tind.io:114888
000114888 937__ $$aGR-BUCHER-ARTICLE-2002-004
000114888 973__ $$rREVIEWED$$sPUBLISHED$$aOTHER
000114888 980__ $$aARTICLE