TY  - CONF
AU  - Szczepanik, Michał
AU  - Heunis, Stephan
AU  - Mönch, Christian
AU  - Wagner, Adina
AU  - Waite, Alexander Q.
AU  - Waite, Laura
AU  - Hanke, Michael
TI  - Distributed data management for large collaborative projects: DataLad ecosystem in Collaborative Research Center 1451
M1  - FZJ-2023-05235
PY  - 2023
AB  - Multi-site research projects offer a unique opportunity for scientific insight based on data collected across different modalities, paradigms, and species. Yet, they also pose unique research data management challenges. Here, we present software developments and lessons learned from the information management project of CRC1451. Given the large variability of RDM demands across over 20 CRC member projects, we opted for a decentralized approach: Projects retain full control over key data management decisions (standards, storage, sharing), and the findability, accessibility, interoperability, and reusability of their data is achieved with DataLad as an overlay structure for all distributed datasets. We use DataLad Catalog to generate an online data portal based on metadata. Metadata extraction is done using MetaLad, based on the 'capture immediately, curate perpetually' iterative approach. To mitigate DataLad’s limited adoption outside central projects, we are developing two solutions. First, DataLad Gooey is a graphical user interface for basic data management operations. Second, DataLad Tabby is a format specification and a collection of tools for dataset descriptions which can be created and provided as a spreadsheet, using well-defined terms, translatable to catalog records and linked data objects.
T2  - INCF Neuroinformatics Assembly 2023
CY  - 18 Sep 2023 - 20 Sep 2023, online (Sweden)
Y2  - 18 Sep 2023 - 20 Sep 2023
M2  - online, Sweden
LB  - PUB:(DE-HGF)24
DO  - DOI:10.5281/ZENODO.8355962
UR  - https://juser.fz-juelich.de/record/1019189
ER  -