001     893374
005     20230815122843.0
024 7 _ |a 10.1021/acs.jctc.1c00129
|2 doi
024 7 _ |a 1549-9618
|2 ISSN
024 7 _ |a 1549-9626
|2 ISSN
024 7 _ |a 2128/28154
|2 Handle
024 7 _ |a altmetric:108134031
|2 altmetric
024 7 _ |a 34161735
|2 pmid
024 7 _ |a WOS:000674289800059
|2 WOS
037 _ _ |a FZJ-2021-02715
082 _ _ |a 610
100 1 _ |a Mulnaes, Daniel
|0 P:(DE-HGF)0
|b 0
245 _ _ |a TopDomain: Exhaustive Protein Domain Boundary Metaprediction Combining Multisource Information and Deep Learning
260 _ _ |a Washington, DC
|c 2021
336 7 _ |a article
|2 DRIVER
336 7 _ |a Output Types/Journal article
|2 DataCite
336 7 _ |a Journal Article
|b journal
|m journal
|0 PUB:(DE-HGF)16
|s 1626172944_14697
|2 PUB:(DE-HGF)
336 7 _ |a ARTICLE
|2 BibTeX
336 7 _ |a JOURNAL_ARTICLE
|2 ORCID
336 7 _ |a Journal Article
|0 0
|2 EndNote
520 _ _ |a Protein domains are independent, functional, and stable structural units of proteins. Accurate protein domain boundary prediction plays an important role in understanding protein structure and evolution, as well as for protein structure prediction. Current domain boundary prediction methods differ in terms of boundary definition, methodology, and training databases resulting in disparate performance for different proteins. We developed TopDomain, an exhaustive metapredictor, that uses deep neural networks to combine multisource information from sequence- and homology-based features of over 50 primary predictors. For this purpose, we developed a new domain boundary data set termed the TopDomain data set, in which the true annotations are informed by SCOPe annotations, structural domain parsers, human inspection, and deep learning. We benchmark TopDomain against 2484 targets with 3354 boundaries from the TopDomain test set and achieve F1 scores of 78.4% and 73.8% for multidomain boundary prediction within ±20 residues and ±10 residues of the true boundary, respectively. When examined on targets from CASP11-13 competitions, TopDomain achieves F1 scores of 47.5% and 42.8% for multidomain proteins. TopDomain significantly outperforms 15 widely used, state-of-the-art ab initio and homology-based domain boundary predictors. Finally, we implemented TopDomainTMC, which accurately predicts whether domain parsing is necessary for the target protein.
536 _ _ |a 5111 - Domain-Specific Simulation Data Life Cycle Labs (SDLs) and Research Groups (POF4-511)
|0 G:(DE-HGF)POF4-5111
|c POF4-511
|x 0
|f POF IV
536 _ _ |a 2171 - Biological and environmental resources for sustainable use (POF4-217)
|0 G:(DE-HGF)POF4-2171
|c POF4-217
|x 1
|f POF IV
536 _ _ |a 2172 - Utilization of renewable carbon and energy sources and engineering of ecosystem functions (POF4-217)
|0 G:(DE-HGF)POF4-2172
|c POF4-217
|x 2
|f POF IV
536 _ _ |a Forschergruppe Gohlke (hkf7_20200501)
|0 G:(DE-Juel1)hkf7_20200501
|c hkf7_20200501
|x 3
|f Forschergruppe Gohlke
536 _ _ |a DFG project 267205415 - SFB 1208: Identität und Dynamik von Membransystemen - von Molekülen bis zu zellulären Funktionen
|0 G:(GEPRIS)267205415
|c 267205415
|x 4
588 _ _ |a Dataset connected to CrossRef, Journals: juser.fz-juelich.de
700 1 _ |a Golchin, Pegah
|0 P:(DE-HGF)0
|b 1
700 1 _ |a Koenig, Filip
|0 P:(DE-HGF)0
|b 2
700 1 _ |a Gohlke, Holger
|0 P:(DE-Juel1)172663
|b 3
|e Corresponding author
773 _ _ |a 10.1021/acs.jctc.1c00129
|g p. acs.jctc.1c00129
|0 PERI:(DE-600)2166976-4
|n 7
|p 4599–4613
|t Journal of chemical theory and computation
|v 17
|y 2021
|x 1549-9626
856 4 _ |u https://juser.fz-juelich.de/record/893374/files/acs.jctc.1c00129.pdf
856 4 _ |y Published on 2021-06-23. Available in OpenAccess from 2022-06-23.
|u https://juser.fz-juelich.de/record/893374/files/TopDomain_Ms_JCTC_rev_final.pdf
909 C O |o oai:juser.fz-juelich.de:893374
|p openaire
|p open_access
|p VDB
|p driver
|p dnbdelivery
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 3
|6 P:(DE-Juel1)172663
913 1 _ |a DE-HGF
|b Key Technologies
|l Engineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action
|1 G:(DE-HGF)POF4-510
|0 G:(DE-HGF)POF4-511
|3 G:(DE-HGF)POF4
|2 G:(DE-HGF)POF4-500
|4 G:(DE-HGF)POF
|v Enabling Computational- & Data-Intensive Science and Engineering
|9 G:(DE-HGF)POF4-5111
|x 0
913 1 _ |a DE-HGF
|b Forschungsbereich Erde und Umwelt
|l Erde im Wandel – Unsere Zukunft nachhaltig gestalten
|1 G:(DE-HGF)POF4-210
|0 G:(DE-HGF)POF4-217
|3 G:(DE-HGF)POF4
|2 G:(DE-HGF)POF4-200
|4 G:(DE-HGF)POF
|v Für eine nachhaltige Bio-Ökonomie – von Ressourcen zu Produkten
|9 G:(DE-HGF)POF4-2171
|x 1
913 1 _ |a DE-HGF
|b Forschungsbereich Erde und Umwelt
|l Erde im Wandel – Unsere Zukunft nachhaltig gestalten
|1 G:(DE-HGF)POF4-210
|0 G:(DE-HGF)POF4-217
|3 G:(DE-HGF)POF4
|2 G:(DE-HGF)POF4-200
|4 G:(DE-HGF)POF
|v Für eine nachhaltige Bio-Ökonomie – von Ressourcen zu Produkten
|9 G:(DE-HGF)POF4-2172
|x 2
914 1 _ |y 2021
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0200
|2 StatID
|b SCOPUS
|d 2021-02-02
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0300
|2 StatID
|b Medline
|d 2021-02-02
915 _ _ |a Embargoed OpenAccess
|0 StatID:(DE-HGF)0530
|2 StatID
915 _ _ |a JCR
|0 StatID:(DE-HGF)0100
|2 StatID
|b J CHEM THEORY COMPUT : 2019
|d 2021-02-02
915 _ _ |a IF >= 5
|0 StatID:(DE-HGF)9905
|2 StatID
|b J CHEM THEORY COMPUT : 2019
|d 2021-02-02
915 _ _ |a WoS
|0 StatID:(DE-HGF)0113
|2 StatID
|b Science Citation Index Expanded
|d 2021-02-02
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0150
|2 StatID
|b Web of Science Core Collection
|d 2021-02-02
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)1150
|2 StatID
|b Current Contents - Physical, Chemical and Earth Sciences
|d 2021-02-02
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0160
|2 StatID
|b Essential Science Indicators
|d 2021-02-02
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0199
|2 StatID
|b Clarivate Analytics Master Journal List
|d 2021-02-02
920 _ _ |l yes
920 1 _ |0 I:(DE-Juel1)JSC-20090406
|k JSC
|l Jülich Supercomputing Center
|x 0
920 1 _ |0 I:(DE-Juel1)NIC-20090406
|k NIC
|l John von Neumann - Institut für Computing
|x 1
920 1 _ |0 I:(DE-Juel1)IBI-7-20200312
|k IBI-7
|l Strukturbiochemie
|x 2
920 1 _ |0 I:(DE-Juel1)IBG-4-20200403
|k IBG-4
|l Bioinformatik
|x 3
980 _ _ |a journal
980 _ _ |a VDB
980 _ _ |a UNRESTRICTED
980 _ _ |a I:(DE-Juel1)JSC-20090406
980 _ _ |a I:(DE-Juel1)NIC-20090406
980 _ _ |a I:(DE-Juel1)IBI-7-20200312
980 _ _ |a I:(DE-Juel1)IBG-4-20200403
980 1 _ |a FullTexts


LibraryCollectionCLSMajorCLSMinorLanguageAuthor
Marc 21