000908264 001__ 908264
000908264 005__ 20230711160605.0
000908264 0247_ $$2doi$$a10.12688/f1000research.109080.2
000908264 0247_ $$2Handle$$a2128/31527
000908264 0247_ $$2altmetric$$aaltmetric:128566175
000908264 037__ $$aFZJ-2022-02498
000908264 041__ $$aEnglish
000908264 082__ $$a610
000908264 1001_ $$0P:(DE-Juel1)191149$$aBeier, Sebastian$$b0
000908264 245__ $$aRecommendations for the formatting of Variant Call Format (VCF) files to make plant genotyping data FAIR
000908264 260__ $$aLondon$$bF1000 Research Ltd$$c2022
000908264 3367_ $$2DRIVER$$aarticle
000908264 3367_ $$2DataCite$$aOutput Types/Journal article
000908264 3367_ $$0PUB:(DE-HGF)16$$2PUB:(DE-HGF)$$aJournal Article$$bjournal$$mjournal$$s1673254625_17708
000908264 3367_ $$2BibTeX$$aARTICLE
000908264 3367_ $$2ORCID$$aJOURNAL_ARTICLE
000908264 3367_ $$00$$2EndNote$$aJournal Article
000908264 520__ $$aIn this opinion article, we discuss the formatting of files from (plant) genotyping studies, in particular the formatting of metadata in Variant Call Format (VCF) files. The flexibility of the VCF format specification facilitates its use as a generic interchange format across domains but can lead to inconsistency between files in the presentation of metadata. To enable fully autonomous machine actionable data flow, generic elements need to be further specified.We strongly support the merits of the FAIR principles and see the need to facilitate them also through technical implementation specifications. They form a basis for the proposed VCF extensions here. We have learned from the existing application of VCF that the definition of relevant metadata using controlled standards, vocabulary and the consistent use of cross-references via resolvable identifiers (machine-readable) are particularly necessary and propose their encoding.VCF is an established standard for the exchange and publication of genotyping data. Other data formats are also used to capture variant data (for example, the HapMap and the gVCF formats), but none currently have the reach of VCF. For the sake of simplicity, we will only discuss VCF and our recommendations for its use, but these recommendations could also be applied to gVCF. However, the part of the VCF standard relating to metadata (as opposed to the actual variant calls) defines a syntactic format but no vocabulary, unique identifier or recommended content. In practice, often only sparse descriptive metadata is included. When descriptive metadata is provided, proprietary metadata fields are frequently added that have not been agreed upon within the community which may limit long-term and comprehensive interoperability. To address this, we propose recommendations for supplying and encoding metadata, focusing on use cases from plant sciences. We expect there to be overlap, but also divergence, with the needs of other domains.
000908264 536__ $$0G:(DE-HGF)POF4-2171$$a2171 - Biological and environmental resources for sustainable use (POF4-217)$$cPOF4-217$$fPOF IV$$x0
000908264 536__ $$0G:(EU-Grant)862613$$aAGENT - Activated GEnebank NeTwork (862613)$$c862613$$fH2020-SFS-2019-2$$x1
000908264 536__ $$0G:(BMBF)031A536C$$ade.NBI - Etablierungsphase - Leistungszentrum - GCBN - German Crop BioGreenformatics Network (031A536C)$$c031A536C$$x2
000908264 588__ $$aDataset connected to CrossRef, Journals: juser.fz-juelich.de
000908264 7001_ $$00000-0003-3159-3593$$aFiebig, Anne$$b1
000908264 7001_ $$00000-0002-9040-8733$$aPommier, Cyril$$b2
000908264 7001_ $$00000-0002-4839-5158$$aLiyanage, Isuru$$b3
000908264 7001_ $$00000-0002-4316-078X$$aLange, Matthias$$b4
000908264 7001_ $$0P:(DE-HGF)0$$aKersey, Paul J.$$b5
000908264 7001_ $$00000-0003-4031-9131$$aWeise, Stephan$$b6
000908264 7001_ $$00000-0002-4368-8058$$aFinkers, Richard$$b7
000908264 7001_ $$00000-0002-1187-8148$$aKoylass, Baron$$b8
000908264 7001_ $$00000-0002-5626-270X$$aCezard, Timothee$$b9
000908264 7001_ $$00000-0002-9551-6370$$aCourtot, Mélanie$$b10
000908264 7001_ $$00000-0002-5462-907X$$aContreras-Moreira, Bruno$$b11
000908264 7001_ $$0P:(DE-HGF)0$$aNaamati, Guy$$b12
000908264 7001_ $$0P:(DE-HGF)0$$aDyer, Sarah$$b13
000908264 7001_ $$00000-0001-6113-3518$$aScholz, Uwe$$b14$$eCorresponding author
000908264 773__ $$0PERI:(DE-600)2699932-8$$a10.12688/f1000research.109080.2$$gVol. 11, p. 231 -$$p231 -$$tF1000Research$$v11$$x2046-1402$$y2022
000908264 8564_ $$uhttps://f1000research.com/articles/11-231/v2
000908264 8564_ $$uhttps://juser.fz-juelich.de/record/908264/files/d93007ae-53fd-4438-b1a2-4fec2532da0b_109080_-_sebastian_beier.pdf$$yOpenAccess
000908264 909CO $$ooai:juser.fz-juelich.de:908264$$pdnbdelivery$$pec_fundedresources$$pVDB$$pdriver$$popen_access$$popenaire
000908264 915__ $$0LIC:(DE-HGF)CCBYNV$$2V:(DE-HGF)$$aCreative Commons Attribution CC BY (No Version)$$bDOAJ$$d2020-09-04
000908264 915__ $$0StatID:(DE-HGF)0510$$2StatID$$aOpenAccess
000908264 915__ $$0StatID:(DE-HGF)0561$$2StatID$$aArticle Processing Charges$$d2020-09-04
000908264 915__ $$0StatID:(DE-HGF)0700$$2StatID$$aFees$$d2020-09-04
000908264 915__ $$0StatID:(DE-HGF)0200$$2StatID$$aDBCoverage$$bSCOPUS$$d2022-11-16
000908264 915__ $$0StatID:(DE-HGF)0300$$2StatID$$aDBCoverage$$bMedline$$d2022-11-16
000908264 915__ $$0StatID:(DE-HGF)0501$$2StatID$$aDBCoverage$$bDOAJ Seal$$d2020-10-14T09:38:47Z
000908264 915__ $$0StatID:(DE-HGF)0500$$2StatID$$aDBCoverage$$bDOAJ$$d2020-10-14T09:38:47Z
000908264 915__ $$0StatID:(DE-HGF)0030$$2StatID$$aPeer Review$$bDOAJ : Open peer review$$d2020-10-14T09:38:47Z
000908264 9141_ $$y2022
000908264 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)191149$$aForschungszentrum Jülich$$b0$$kFZJ
000908264 9101_ $$0I:(DE-HGF)0$$6P:(DE-Juel1)191149$$aIPK Gatersleben$$b0
000908264 9101_ $$0I:(DE-HGF)0$$60000-0001-6113-3518$$aIPK Gatersleben$$b14
000908264 9131_ $$0G:(DE-HGF)POF4-217$$1G:(DE-HGF)POF4-210$$2G:(DE-HGF)POF4-200$$3G:(DE-HGF)POF4$$4G:(DE-HGF)POF$$9G:(DE-HGF)POF4-2171$$aDE-HGF$$bForschungsbereich Erde und Umwelt$$lErde im Wandel – Unsere Zukunft nachhaltig gestalten$$vFür eine nachhaltige Bio-Ökonomie – von Ressourcen zu Produkten$$x0
000908264 920__ $$lyes
000908264 9201_ $$0I:(DE-Juel1)IBG-4-20200403$$kIBG-4$$lBioinformatik$$x0
000908264 980__ $$ajournal
000908264 980__ $$aVDB
000908264 980__ $$aI:(DE-Juel1)IBG-4-20200403
000908264 980__ $$aUNRESTRICTED
000908264 980__ $$aOPENSCIENCE
000908264 9801_ $$aFullTexts