001050382 001__ 1050382
001050382 005__ 20260112202639.0
001050382 0247_ $$2doi$$a10.5281/ZENODO.17419899
001050382 037__ $$aFZJ-2026-00155
001050382 041__ $$aEnglish
001050382 1001_ $$0P:(DE-Juel1)198812$$aLoup, Ulrich$$b0$$eCorresponding author
001050382 1112_ $$aDistribits 2025$$cDüsseldorf$$d2025-10-23 - 2025-10-24$$wGermany
001050382 245__ $$aThe Helmholtz Earth and Environment DataHub - Highly Distributed Data That Thrives on Metadata
001050382 260__ $$c2025
001050382 3367_ $$033$$2EndNote$$aConference Paper
001050382 3367_ $$2DataCite$$aOther
001050382 3367_ $$2BibTeX$$aINPROCEEDINGS
001050382 3367_ $$2DRIVER$$aconferenceObject
001050382 3367_ $$2ORCID$$aLECTURE_SPEECH
001050382 3367_ $$0PUB:(DE-HGF)6$$2PUB:(DE-HGF)$$aConference Presentation$$bconf$$mconf$$s1768209299_29455$$xPlenary/Keynote
001050382 520__ $$aIn the environmental sciences, time-series data is key to, for example, monitoring environmental processes, validating earth system models and remote sensing products, training data-driven methods, and better understanding climate processes. A major issue is the lack of a consistent data availability standard aligned with the FAIR (findable, accessible, interoperable, reusable) principles. The DataHub initiative, which is part of the Helmholtz Research Field Earth and Environment, addresses this shortcoming by establishing a large-scale infrastructure around common data standards and interfaces such as the Open Geospatial Consortium’s SensorThings API (STA). Closely related to the DataHub is the STAMPLATE project, whose challenging task was to harmonize the extremely heterogeneous metadata formats stemming from different observation domains such as the earth, atmosphere and ocean. Moreover, even within a single domain, different metadata formats developed historically because of diverging system architectures and missing guidelines. In DataHub, the research data, whether it is collected by measurement devices or acquired through manual processes, is distributed among the seven participating research centers. Each of these centers is responsible for operating its own time series management system, which ingests the observational data. In addition to these data ingest systems, sensor and device management systems such as the Helmholtz Sensor Management System (https://helmholtz.software/software/sensor-management-system) or the O2A Registry (https://registry.o2a-data.de/) provide easy-to-use self-services for entering metadata. Each center operates a data/metadata synchronization service that ultimately makes the data available through STA, which integrates both data and metadata. Quality checking tools such as SaQC (https://helmholtz.software/software/saqc) facilitate data quality control.
The powerful and modern Earth Data Portal (www.earth-data.de) with highly customizable thematic viewers is the central portal for data exploration. To ensure that metadata entered in any user self-service is also displayed in the Earth Data Portal along with the ingested data, custom semantic metadata profiles developed in STAMPLATE augment STA’s core data model with domain-specific information. In summary, the data that is accessible on the Earth Data Portal and available from the STA endpoints is distributed in two distinct ways. First, observation data and its metadata are acquired by separate systems. Second, each center operates its own data and metadata infrastructure, with all centers ultimately connecting to STA endpoints. The operationalization of the framework and its subsequent integration into research data workflows are imminent, presenting us with a number of challenges as our research data management processes undergo a transformative shift from manual, human-based workflows to self-organized, digitally enabled workflows. For example, new ways of downloading data need to be found that meet the needs of researchers while addressing issues such as copyright and avoiding infrastructure overload. This talk addresses the fundamental elements of our initiative and the associated challenges.
001050382 536__ $$0G:(DE-HGF)POF4-2173$$a2173 - Agro-biogeosystems: controls, feedbacks and impact (POF4-217)$$cPOF4-217$$fPOF IV$$x0
001050382 588__ $$aDataset connected to DataCite
001050382 7001_ $$00000-0001-8159-3888$$aBrinckmann, Nils$$b1
001050382 7001_ $$00000-0002-4861-3338$$aFaber, Claas$$b2$$eProject Member
001050382 7001_ $$0P:(DE-HGF)0$$aIngenbeek, Martin$$b3$$eProject Member
001050382 7001_ $$00000-0002-2826-3932$$aKoppe, Roland$$b4$$eProject Member
001050382 7001_ $$00000-0001-5590-5470$$aLorenz, Christof$$b5$$eProject Member
001050382 7001_ $$00000-0003-4517-6459$$aSchäfer, David$$b6$$eProject Member
001050382 7001_ $$0P:(DE-Juel1)129537$$aSorg, Jürgen$$b7$$eProject Member$$ufzj
001050382 7001_ $$0P:(DE-HGF)0$$aRambhia, Mihir$$b8$$eProject Member
001050382 773__ $$a10.5281/ZENODO.17419899
001050382 909CO $$ooai:juser.fz-juelich.de:1050382$$pVDB
001050382 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)198812$$aForschungszentrum Jülich$$b0$$kFZJ
001050382 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)129537$$aForschungszentrum Jülich$$b7$$kFZJ
001050382 9131_ $$0G:(DE-HGF)POF4-217$$1G:(DE-HGF)POF4-210$$2G:(DE-HGF)POF4-200$$3G:(DE-HGF)POF4$$4G:(DE-HGF)POF$$9G:(DE-HGF)POF4-2173$$aDE-HGF$$bForschungsbereich Erde und Umwelt$$lErde im Wandel – Unsere Zukunft nachhaltig gestalten$$vFür eine nachhaltige Bio-Ökonomie – von Ressourcen zu Produkten$$x0
001050382 920__ $$lyes
001050382 9201_ $$0I:(DE-Juel1)IBG-3-20101118$$kIBG-3$$lAgrosphäre$$x0
001050382 980__ $$aconf
001050382 980__ $$aVDB
001050382 980__ $$aI:(DE-Juel1)IBG-3-20101118
001050382 980__ $$aUNRESTRICTED