Contribution to a conference proceedings/Contribution to a book FZJ-2014-06521

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
High productivity data processing analytics methods with applications

 ;  ;

2014
IEEE

2014 37th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO) : [Proceedings] - IEEE, 2014. - ISBN 978-953-233-077-9978-953-233-081-6 - doi:10.1109/MIPRO.2014.6859579
2014 37th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO), OpatijaOpatija, Croatia, 26 May 2014 - 30 May 20142014-05-262014-05-30
IEEE 289 - 294 () [10.1109/MIPRO.2014.6859579]

This record in other databases:

Please use a persistent id in citations: doi:

Abstract: The term `big data analytics' emerged in order to engage in the ever increasing amount of scientific and engineering data with general analytics techniques that support the often more domain-specific data analysis process. It is recognized that the big data challenge can only be adequately addressed when knowledge of various different fields such as data mining, machine learning algorithms, parallel processing, and data management practices are effectively combined. This paper thus describes some of the `smart data analytics methods' that enable a high productivity data processing of large quantities of scientific data in order to enhance the data analysis efficiency. The paper thus aims to provide new insights how various fields can be successfully combined. Contributions of this paper include the concretization of the cross-industry standard process for data mining (CRISP-DM) process model in scientific environments using concrete machine learning algorithms (e.g. support vector machines that enable data classification) or data mining mechanisms (e.g. outlier detection in measurements). Serial and parallel approaches to specific data analysis challenges are discussed in the context of concrete earth science application data sets. Solutions also include various data visualizations that enable a better insight in the corresponding data analytics and analysis process.


Contributing Institute(s):
  1. Jülich Supercomputing Center (JSC)
Research Program(s):
  1. 412 - Grid Technologies and Infrastructures (POF2-412) (POF2-412)

Appears in the scientific report 2014
Click to display QR Code for this record

The record appears in these collections:
Document types > Events > Contributions to a conference proceedings
Document types > Books > Contribution to a book
Workflow collections > Public records
Institute Collections > JSC
Publications database

 Record created 2014-12-04, last modified 2021-01-29



Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)