Hauptseite > Publikationsdatenbank > Metadata practices for simulation workflows |
Journal Article | FZJ-2025-02769 |
; ; ; ; ; ;
2025
Nature Publ. Group
London
This record in other databases:
Please use a persistent id in citations: this repositories doi:10.34734/FZJ-2024-05343 doi:10.1038/s41597-025-05126-1 doi:10.34734/FZJ-2025-02769
Abstract: Computer simulations are an essential pillar of knowledge generation in science. Exploring, understanding, reproducing, and sharing the results of simulations relies on tracking and organizing the metadata describing the numerical experiments. The models used to understand real-world systems, and the computational machinery required to simulate them, are typically complex, and produce large amounts of heterogeneous metadata. Here, we present general practices for acquiring and handling metadata that are agnostic to software and hardware, and highly flexible for the user. These consist of two steps: 1) recording and storing raw metadata, and 2) selecting and structuring metadata. As a proof of concept, we develop the Archivist, a Python tool to help with the second step, and use it to apply our practices to distinct high-performance computing use cases from neuroscience and hydrology. Our practices and the Archivist can readily be applied to existing workflows without the need for substantial restructuring. They support sustainable numerical workflows, fostering replicability, reproducibility, data exploration, and data sharing in simulation-based research.
Keyword(s): Information Retrieval (cs.IR) ; FOS: Computer and information sciences
![]() |
The record appears in these collections: |