Accelerating massive data processing in Python with Heat

Comito, Claudia; Rüttgers, Alexander; Hoppe, Fabian; Hagemeier, Björn; Götz, Markus; Streit, Achim; Gutiérrez Hermosillo Muriedas, Juan Pedro; Krajsek, Kai; Tarnawa, Michael; Knechtges, Philipp

Poster (After Call)

FZJ-2023-05810

Accelerating massive data processing in Python with Heat

Comito, C. (Corresponding author)FZJ* ; Götz, M. ; Gutiérrez Hermosillo Muriedas, J. P. ; Hagemeier, B.FZJ* ; Hoppe, F. ; Knechtges, P. ; Krajsek, K.FZJ* ; Rüttgers, A. ; Streit, A. ; Tarnawa, M.FZJ*

2023

Artificial Intelligence Symposium on Theory, Application and Research 2023, AI STAR#2023, ESOC, Darmstadt, Germany, 27 Sep 2023 - 28 Sep 2023

Abstract: Heat [1, 2] is an open-source Python library designed to address the challenges of working with massive data sets and harnessing the power of machine learning across disciplines. Developed collaboratively by within the Helmholtz Association (FZJ, KIT, and DLR), Heat offers cutting-edge capabilities for high-performance data analytics, machine learning, and deep learning.Heat provides a Numpy-like API that simplifies the development of scalable, GPU-accelerated applications. What sets Heat apart is its underlying data-parallelism, implemented on top of MPI, which significantly enhances efficiency and performance of data processing compared to traditional task-parallel frameworks.By exploring practical use cases in space science (materials engineering, atmospheric modeling, anomaly detection) and its potential as a backend for diverse data processing pipelines, we will illustrate how Heat can accelerate AI research and applications.[1] Götz, M., Debus, C., Coquelin, et al.: "HeAT - a Distributed and GPU-accelerated Tensor Framework for Data Analytics" [2] https://github.com/helmholtz-analytics/heat

Contributing Institute(s):

Jülich Supercomputing Center (JSC)

Research Program(s):

Appears in the scientific report 2023

Click to display QR Code for this record

The record appears in these collections:
Document types > Presentations > Poster
Workflow collections > Public records
Institute Collections > JSC
Publications database

Record created 2023-12-21, last modified 2024-01-05

Similar records

Rate this document:

(Not yet reviewed)

Add to personal basket
Export as Author List with IDs BibTeX (UTF-8), EndNote XML, EndNote Text, RIS, MARC, Print MARC, MARCXML, DC,
Request correction
Submit fulltext

guest :: login JuSER
		Search		Submit		Personalize Your alerts Your baskets Your searches		Help