001     1038382
005     20250203103330.0
024 7 _ |2 doi
|a 10.37044/osf.io/7y2jh
037 _ _ |a FZJ-2025-01379
100 1 _ |0 P:(DE-Juel1)191149
|a Beier, Sebastian
|b 0
245 _ _ |a BioHackEU23 report: Enabling continuous RDM using Annotated Research Contexts with RO-Crate profiles for ISA
260 _ _ |c 2024
336 7 _ |0 PUB:(DE-HGF)25
|2 PUB:(DE-HGF)
|a Preprint
|b preprint
|m preprint
|s 1738222063_11107
336 7 _ |2 ORCID
|a WORKING_PAPER
336 7 _ |0 28
|2 EndNote
|a Electronic Article
336 7 _ |2 DRIVER
|a preprint
336 7 _ |2 BibTeX
|a ARTICLE
336 7 _ |2 DataCite
|a Output Types/Working Paper
520 _ _ |a A prevailing paradigm in Research Data Management (RDM) is to publish research datasets in designated archives upon conclusion of a research process. However, it is beneficial to abandon the notion of final or static data artifacts and instead adopt a continuous approach towards working with research data, where data is constantly shared, versioned, and updated. This immutable yet evolving perspective allows for the application of existing technologies and processes from software engineering, such as continuous integration, release practices, and version management, which are backed by decades of experience and adaptable to RDM. To facilitate this, we propose the Annotated Research Context (ARC), a data and metadata layout convention based on the well-established ISA model for metadata annotation and implemented using Git repositories. ARCs are amenable to frequent, lightweight data management operations, such as (meta)data validation and transformation. The Omnipy Python library is designed to help develop stepwise validated (meta)data transformations as scalable data flows that can be incrementally designed, updated, and rerun as requirements or data evolve. To demonstrate the concept of continuous RDM, we will use Omnipy to define and orchestrate Git-backed CI/CD (Continuous Integration/Continuous Delivery) data flows to convert ISA metadata present in ARCs into validated RO-Crate representations adhering to the Bioschemas convention. An RO-Crate package combines the actual research data with its metadata description. Downstream, this allows semantic interpretation by Galaxy, e.g. for workflow execution, as well as machine-readable data access and data harvesting for search engines such as FAIDARE.
536 _ _ |0 G:(DE-HGF)POF4-2171
|a 2171 - Biological and environmental resources for sustainable use (POF4-217)
|c POF4-217
|f POF IV
|x 0
536 _ _ |0 G:(GEPRIS)442077441
|a DFG project G:(GEPRIS)442077441 - DataPLANT – Daten in Pflanzen-Grundlagenforschung (442077441)
|c 442077441
|x 1
588 _ _ |a Dataset connected to CrossRef
700 1 _ |0 P:(DE-HGF)0
|a Mühlhaus, Timo
|b 1
700 1 _ |0 0000-0002-9040-8733
|a Pommier, Cyril
|b 2
700 1 _ |0 P:(DE-HGF)0
|a Owen, Stuart
|b 3
700 1 _ |0 0000-0001-9021-3197
|a Brilhaus, Dominik
|b 4
700 1 _ |0 P:(DE-HGF)0
|a Weil, Heinrich Lukas
|b 5
700 1 _ |0 P:(DE-HGF)0
|a Wetzels, Florian
|b 6
700 1 _ |0 P:(DE-HGF)0
|a Chait, Gavin
|b 7
700 1 _ |0 0000-0002-2455-5938
|a Arend, Daniel
|b 8
700 1 _ |0 0000-0001-6546-1818
|a Feser, Manuel
|b 9
700 1 _ |0 P:(DE-HGF)0
|a Doniparthi, Gajendra
|b 10
700 1 _ |0 P:(DE-HGF)0
|a Bauer, Jonathan
|b 11
700 1 _ |0 0000-0001-9888-7954
|a Gundersen, Sveinung
|b 12
700 1 _ |0 P:(DE-HGF)0
|a Vázquez, Pável
|b 13
773 _ _ |t BioHackrXiv
|y 2024
909 C O |o oai:juser.fz-juelich.de:1038382
|p VDB
910 1 _ |0 I:(DE-588b)5008462-8
|6 P:(DE-Juel1)191149
|a Forschungszentrum Jülich
|b 0
|k FZJ
913 1 _ |0 G:(DE-HGF)POF4-217
|1 G:(DE-HGF)POF4-210
|2 G:(DE-HGF)POF4-200
|3 G:(DE-HGF)POF4
|4 G:(DE-HGF)POF
|9 G:(DE-HGF)POF4-2171
|a DE-HGF
|b Forschungsbereich Erde und Umwelt
|l Erde im Wandel – Unsere Zukunft nachhaltig gestalten
|v Für eine nachhaltige Bio-Ökonomie – von Ressourcen zu Produkten
|x 0
914 1 _ |y 2024
920 _ _ |l yes
920 1 _ |0 I:(DE-Juel1)IBG-4-20200403
|k IBG-4
|l Bioinformatik
|x 0
980 _ _ |a preprint
980 _ _ |a VDB
980 _ _ |a I:(DE-Juel1)IBG-4-20200403
980 _ _ |a UNRESTRICTED

