001     173108
005     20210129214552.0
024 7 _ |a 10.1109/MIPRO.2014.6859579
|2 doi
037 _ _ |a FZJ-2014-06521
100 1 _ |a Riedel, Morris
|0 P:(DE-Juel1)132239
|b 0
|e Corresponding Author
|u fzj
111 2 _ |a 2014 37th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO)
|c Opatija
|d 2014-05-26 - 2014-05-30
|w Croatia
245 _ _ |a High productivity data processing analytics methods with applications
260 _ _ |c 2014
|b IEEE
295 1 0 |a 2014 37th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO) : [Proceedings] - IEEE, 2014. - ISBN 978-953-233-077-9978-953-233-081-6 - doi:10.1109/MIPRO.2014.6859579
300 _ _ |a 289 - 294
336 7 _ |a Contribution to a conference proceedings
|b contrib
|m contrib
|0 PUB:(DE-HGF)8
|s 1417706347_21915
|2 PUB:(DE-HGF)
336 7 _ |a Contribution to a book
|0 PUB:(DE-HGF)7
|2 PUB:(DE-HGF)
|m contb
336 7 _ |a Conference Paper
|0 33
|2 EndNote
336 7 _ |a CONFERENCE_PAPER
|2 ORCID
336 7 _ |a Output Types/Conference Paper
|2 DataCite
336 7 _ |a conferenceObject
|2 DRIVER
336 7 _ |a INPROCEEDINGS
|2 BibTeX
520 _ _ |a The term `big data analytics' emerged in order to engage in the ever increasing amount of scientific and engineering data with general analytics techniques that support the often more domain-specific data analysis process. It is recognized that the big data challenge can only be adequately addressed when knowledge of various different fields such as data mining, machine learning algorithms, parallel processing, and data management practices are effectively combined. This paper thus describes some of the `smart data analytics methods' that enable a high productivity data processing of large quantities of scientific data in order to enhance the data analysis efficiency. The paper thus aims to provide new insights how various fields can be successfully combined. Contributions of this paper include the concretization of the cross-industry standard process for data mining (CRISP-DM) process model in scientific environments using concrete machine learning algorithms (e.g. support vector machines that enable data classification) or data mining mechanisms (e.g. outlier detection in measurements). Serial and parallel approaches to specific data analysis challenges are discussed in the context of concrete earth science application data sets. Solutions also include various data visualizations that enable a better insight in the corresponding data analytics and analysis process.
536 _ _ |a 412 - Grid Technologies and Infrastructures (POF2-412)
|0 G:(DE-HGF)POF2-412
|c POF2-412
|f POF II
|x 0
588 _ _ |a Dataset connected to CrossRef Conference
700 1 _ |a Memon, Mohammad Shahbaz
|0 P:(DE-Juel1)132190
|b 1
|u fzj
700 1 _ |a Memon, Ahmed
|0 P:(DE-Juel1)132191
|b 2
|u fzj
773 _ _ |a 10.1109/MIPRO.2014.6859579
909 C O |o oai:juser.fz-juelich.de:173108
|p VDB
910 1 _ |a Forschungszentrum Jülich GmbH
|0 I:(DE-588b)5008462-8
|k FZJ
|b 0
|6 P:(DE-Juel1)132239
910 1 _ |a Forschungszentrum Jülich GmbH
|0 I:(DE-588b)5008462-8
|k FZJ
|b 1
|6 P:(DE-Juel1)132190
910 1 _ |a Forschungszentrum Jülich GmbH
|0 I:(DE-588b)5008462-8
|k FZJ
|b 2
|6 P:(DE-Juel1)132191
913 2 _ |a DE-HGF
|b Key Technologies
|l Supercomputing & Big Data
|1 G:(DE-HGF)POF3-510
|0 G:(DE-HGF)POF3-512
|2 G:(DE-HGF)POF3-500
|v Data-Intensive Science and Federated Computing
|x 0
913 1 _ |a DE-HGF
|b Schlüsseltechnologien
|l Supercomputing
|1 G:(DE-HGF)POF2-410
|0 G:(DE-HGF)POF2-412
|2 G:(DE-HGF)POF2-400
|v Grid Technologies and Infrastructures
|x 0
|4 G:(DE-HGF)POF
|3 G:(DE-HGF)POF2
914 1 _ |y 2014
920 _ _ |l yes
920 1 _ |0 I:(DE-Juel1)JSC-20090406
|k JSC
|l Jülich Supercomputing Center
|x 0
980 _ _ |a contrib
980 _ _ |a VDB
980 _ _ |a contb
980 _ _ |a I:(DE-Juel1)JSC-20090406
980 _ _ |a UNRESTRICTED


LibraryCollectionCLSMajorCLSMinorLanguageAuthor
Marc 21