EuroEXA
Co-designed Innovation and System for Resilient Exascale Computing in Europe: From Applications to Silicon
Coordinator | IDRYMA TECHNOLOGIAS KAI EREVNAS ; INTERUNIVERSITAIR MICRO-ELECTRONICA CENTRUM ; FRAUNHOFER GESELLSCHAFT ZUR FOERDERUNG DER ANGEWANDTEN FORSCHUNG E.V. ; SCIENCE AND TECHNOLOGY FACILITIES COUNCIL ; SYNELIXIS LYSEIS PLIROFORIKIS AUTOMATISMOU & TILEPIKOINONION ANONIMI ETAIRIA ; ZEROPOINT TECHNOLOGIES AB ; ICEOTOPE RESEARCH & DEVELOPMENT LTD ; NEURASMUS BV ; Institute of Communication and Computer Systems ; National Institute for Astrophysics ; Maxeler Technologies (United Kingdom) ; Technology Strategy Board ; ARM LIMITED ; European Centre for Medium-Range Weather Forecasts ; National Institute for Nuclear Physics ; University of Manchester ; BARCELONA SUPERCOMPUTING CENTER - CENTRO NACIONAL DE SUPERCOMPUTACION |
Grant period | 2017-09-01 - 2021-12-31 |
Funding body | European Union |
Call number | H2020-FETHPC-2016 |
Grant number | 754337 |
Identifier | G:(EU-Grant)754337 |
Note: To meet the demands of extreme scale and the delivery of exascale, we embrace the computing platform as a whole, not just component optimization or fault resilience. EuroEXA brings together a holistic foundation from multiple European HPC projects and partners with the industrial SME focus of MAX for FPGA data-flow, ICE for infrastructure, ALLIN for HPC tooling, and ZPT to collapse the memory bottleneck, in order to co-design a ground-breaking platform capable of scaling peak performance to 400 PFLOP in a peak system power envelope of 30 MW: over four times the performance at four times the energy efficiency of today’s HPC platforms. Further, we target a PUE parity rating of 1.0 through the use of renewables and immersion-based cooling.
We co-design a balanced architecture for both compute- and data-intensive applications using a cost-efficient, modular-integration approach enabled by novel inter-die links and the investigation of a custom EuroEXA processing unit with integration of FPGA for data-flow acceleration. We provide a homogenised software platform offering heterogeneous acceleration with scalable shared memory access, and create a unique hybrid geographically-addressed, switching and topology interconnect within the rack while enabling the adoption of low-cost Ethernet switches offering low latency and high switching bandwidth.
Working together with a rich mix of key HPC applications from across the climate/weather, physics/energy and life-science/bioinformatics domains, we will demonstrate the results of the project through the deployment of an integrated and operational petaflop-level prototype hosted at STFC. Supported by run-to-completion platform-wide resilience mechanisms, components will manage local failures while communicating with higher levels of the stack. Monitored and controlled by advanced runtime capabilities, EuroEXA will demonstrate its co-design solution supporting both existing pre-exascale and project-developed exascale applications.
Recent Publications
Contribution to a conference proceedings
Vardas, I. ; Ploumidis, M. ; Marazakis, M.
Exploring the impact of node failures on the resource allocation for parallel jobs
14th Workshop on Resiliency in High Performance Computing (Resilience) in Clusters, Clouds, and Grids, Lisbon, Portugal, 30 Aug 2021
12 p. (2021)