Poster (After Call) FZJ-2024-07444

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
Scaling data-intensive analytics with Heat: a Python library for massively-parallel array computing and machine learning

 ;  ;  ;  ;  ;  ;  ;  ;

2024

Helmholtz AI Conference, DüsseldorfDüsseldorf, Germany, 12 Jun 2024 - 14 Jun 20242024-06-122024-06-14

Abstract: Handling and analyzing massive data sets is highly important for the vast majority of research communities, but it is also challenging, especially for those communities without a background in HPC. The Helmholtz Analytics Toolkit (Heat) library offers a solution to this problem by providing memory-distributed and hardware-accelerated array manipulation, data analytics, and machine learning algorithms in Python, targeting the usage by non-experts in HPC. In short: Heats objective is to make array computing and machine learning as easy on a CPU/GPU-cluster as it is on a workstation.Our poster provides an overview of Heats design principles, its current features and capabilities, and discusses its role in the ecosystem of distributed array computing and machine learning in Python.


Contributing Institute(s):
  1. Jülich Supercomputing Center (JSC)
Research Program(s):
  1. 5111 - Domain-Specific Simulation & Data Life Cycle Labs (SDLs) and Research Groups (POF4-511) (POF4-511)
  2. 5112 - Cross-Domain Algorithms, Tools, Methods Labs (ATMLs) and Research Groups (POF4-511) (POF4-511)
  3. SLNS - SimLab Neuroscience (Helmholtz-SLNS) (Helmholtz-SLNS)

Appears in the scientific report 2024
Click to display QR Code for this record

The record appears in these collections:
Document types > Presentations > Poster
Workflow collections > Public records
Institute Collections > JSC
Publications database

 Record created 2024-12-19, last modified 2025-01-13



Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)