| Hauptseite > Online First > Bridging the Gap Between Storage and Applications: A Modular Concept for Large Image Data Access > print |
| 001 | 1048800 | ||
| 005 | 20251203104034.0 | ||
| 037 | _ | _ | |a FZJ-2025-04913 |
| 041 | _ | _ | |a English |
| 100 | 1 | _ | |a Thönnißen, Julia |0 P:(DE-Juel1)171151 |b 0 |u fzj |
| 111 | 2 | _ | |a Distribits 2025 |c Düsseldorf |d 2025-10-23 - 2025-10-25 |w Germany |
| 245 | _ | _ | |a Bridging the Gap Between Storage and Applications: A Modular Concept for Large Image Data Access |
| 260 | _ | _ | |c 2025 |
| 336 | 7 | _ | |a Conference Paper |0 33 |2 EndNote |
| 336 | 7 | _ | |a Other |2 DataCite |
| 336 | 7 | _ | |a INPROCEEDINGS |2 BibTeX |
| 336 | 7 | _ | |a conferenceObject |2 DRIVER |
| 336 | 7 | _ | |a LECTURE_SPEECH |2 ORCID |
| 336 | 7 | _ | |a Conference Presentation |b conf |m conf |0 PUB:(DE-HGF)6 |s 1764754667_2681 |2 PUB:(DE-HGF) |x After Call |
| 520 | _ | _ | |a Recent advances in imaging technologies—particularly high-throughput methods—have led to an unprecedented growth of image datasets, reaching Terabytes to Petabytes in scale. While such massive datasets offer transformative potential for scientific discovery, they also introduce significant challenges for visualization and analysis due to the sheer size of the data and its continuous growth. Visualizing, annotating, and analyzing large-scale image datasets raises a fundamental dilemma of balancing computational efficiency and memory requirements. Many existing tools fail to manage large datasets effectively due to memory constraints, often forcing lossy methods like downsampling. Conversely, solutions optimized for large data volumes frequently depend on specialized or proprietary formats, reducing interoperability with other ecosystems. This highlights diverging requirements: storage systems favour compression for compactness, analysis tools require fast data access, and visualization tools benefit from tiled, multi-resolution formats. Without a unified strategy, institutions often resort to inefficient workflows involving repeated format conversions and costly data duplication to support diverse applications. Ongoing standardization efforts within the bioimaging community [1-4] represent important developments towards more efficient and standardized use of bioimaging data. However, the conversion of data into a single (and yet evolving) standard is not feasible for rapidly growing large-scale datasets, especially given very diverging needs for parallel processing on HPC systems. To address these issues, we present a concept for a modular cloud-native image delivery service designed to act as a flexible middleware layer between large-scale image repositories and consuming applications. The system supports heterogeneous input formats and delivers transformed data views on demand. It performs real-time operations such as coordinate transformations, filtering, and multi-resolution tiling, eliminating the need for pre-processing or intermediate storage. The service offers an extensible set of access points: RESTful APIs for web-based visualization (e.g., Neuroglancer, OpenSeadragon), virtual file system mounts for file-oriented tools (e.g., OMERO, ImageJ), and programmatic interfaces compatible with customizable environments (e.g., Napari, datalad). Additionally, it can dynamically present standard-conformant data views—such as those aligned with the Brain Imaging Data Structure (BIDS) [4]—from arbitrarily organized datasets. By decoupling data access from physical storage layout, the service facilitates scalable, multi-tool interoperability in distributed environments without data duplication. In summary, we propose a flexible and extensible approach to image data access that supports dynamic transformations, minimizes redundancy, and bridges the gap between diverse storage backends and modern, distributed applications. It aligns with the FAIR data principles and builds upon community standards while enabling efficient workflows for managing and exploiting large-scale image datasets. [1] S. Besson et al., “Bringing Open Data to Whole Slide Imaging”, Digital Pathology ECDP 2019, Lecture Notes in Computer Science, vol. 11435, pp. 3–10, Jul. 2019, DOI: 10.1007/978-3-030-23937-4_1 [2] J. Moore et al., “OME-NGFF: A next-generation file format for expanding bioimaging data-access strategies”, Nature Methods, vol. 18, no. 12, pp. 1496–1498, Dec. 2021. DOI: 10.1038/s41592-021-01326-w. [3] C. Allan et al., “OMERO: Flexible, model-driven data management for experimental biology”, Nature Methods, vol. 9, no. 3, pp. 245–253, Mar. 2012. DOI: 10.1038/nmeth.1896. [4] K. J. Gorgolewski et al., “The brain imaging data structure, a format for organizing and describing outputs of neuroimaging experiments”, Scientific Data, vol. 3, no. 1, p. 160 044, Jun. 2016. DOI: 10.1038/sdata.2016.44. |
| 536 | _ | _ | |a 5251 - Multilevel Brain Organization and Variability (POF4-525) |0 G:(DE-HGF)POF4-5251 |c POF4-525 |f POF IV |x 0 |
| 536 | _ | _ | |a 5254 - Neuroscientific Data Analytics and AI (POF4-525) |0 G:(DE-HGF)POF4-5254 |c POF4-525 |f POF IV |x 1 |
| 536 | _ | _ | |a HIBALL - Helmholtz International BigBrain Analytics and Learning Laboratory (HIBALL) (InterLabs-0015) |0 G:(DE-HGF)InterLabs-0015 |c InterLabs-0015 |x 2 |
| 536 | _ | _ | |a EBRAINS 2.0 - EBRAINS 2.0: A Research Infrastructure to Advance Neuroscience and Brain Health (101147319) |0 G:(EU-Grant)101147319 |c 101147319 |f HORIZON-INFRA-2022-SERV-B-01 |x 3 |
| 536 | _ | _ | |a DFG project G:(GEPRIS)501864659 - NFDI4BIOIMAGE - Nationale Forschungsdateninfrastruktur für Mikroskopie und Bildanalyse (501864659) |0 G:(GEPRIS)501864659 |c 501864659 |x 4 |
| 856 | 4 | _ | |u https://www.distribits.live/events/2025-distribits/schedule/#2025-10-24 |
| 910 | 1 | _ | |a Forschungszentrum Jülich |0 I:(DE-588b)5008462-8 |k FZJ |b 0 |6 P:(DE-Juel1)171151 |
| 913 | 1 | _ | |a DE-HGF |b Key Technologies |l Natural, Artificial and Cognitive Information Processing |1 G:(DE-HGF)POF4-520 |0 G:(DE-HGF)POF4-525 |3 G:(DE-HGF)POF4 |2 G:(DE-HGF)POF4-500 |4 G:(DE-HGF)POF |v Decoding Brain Organization and Dysfunction |9 G:(DE-HGF)POF4-5251 |x 0 |
| 913 | 1 | _ | |a DE-HGF |b Key Technologies |l Natural, Artificial and Cognitive Information Processing |1 G:(DE-HGF)POF4-520 |0 G:(DE-HGF)POF4-525 |3 G:(DE-HGF)POF4 |2 G:(DE-HGF)POF4-500 |4 G:(DE-HGF)POF |v Decoding Brain Organization and Dysfunction |9 G:(DE-HGF)POF4-5254 |x 1 |
| 914 | 1 | _ | |y 2025 |
| 920 | _ | _ | |l yes |
| 920 | 1 | _ | |0 I:(DE-Juel1)INM-1-20090406 |k INM-1 |l Strukturelle und funktionelle Organisation des Gehirns |x 0 |
| 980 | _ | _ | |a conf |
| 980 | _ | _ | |a EDITORS |
| 980 | _ | _ | |a VDBINPRINT |
| 980 | _ | _ | |a I:(DE-Juel1)INM-1-20090406 |
| 980 | _ | _ | |a UNRESTRICTED |
| Library | Collection | CLSMajor | CLSMinor | Language | Author |
|---|