Contribution to a conference proceedings/Contribution to a book FZJ-2018-05194

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
Best Practices in Structuring Data Science Projects



2019
Springer International Publishing Cham
ISBN: 978-3-319-99992-0 (print), 978-3-319-99993-7 (electronic)

Information Systems Architecture and Technology: Proceedings of 39th International Conference on Information Systems Architecture and Technology – ISAT 2018 / Wilimowska, Zofia (Editor) ; Cham : Springer International Publishing, 2019, Chapter 31 ; ISSN: 2194-5357=2194-5365 ; ISBN: 978-3-319-99992-0=978-3-319-99993-7 ; doi:10.1007/978-3-319-99993-7
39th International Conference on Information Systems Architecture and Technology – ISAT 2018, NysaNysa, Poland, 16 Sep 2018 - 19 Sep 20182018-09-162018-09-19
Cham : Springer International Publishing, Advances in Intelligent Systems and Computing 854, 348 - 357 () [10.1007/978-3-319-99993-7_31]

This record in other databases:  

Please use a persistent id in citations: doi:

Abstract: The goal of Data Science projects is to extract knowledge and insights from collected data. The focus is put on the novelty and usability of the obtained insights. However, the impact of a project can be seriously reduced if the results are not communicated well. In this paper, we describe a means of managing and describing the outcomes of the Data Science projects in such a way that they optimally convey the insights gained. We focus on the main artifact of the non-verbal communication, namely project structure. In particular, we surveyed three sources of information on how to structure projects: common management methodologies, community best practices, and data sharing platforms. The survey resulted in a list of recommendations on how to build the project artifacts to make them clear, intuitive, and logical. We also provide hints on tools that can be helpful for managing such structures in an efficient manner. The paper is intended to motivate and support an informed decision on how to structure a Data Science project to facilitate better communication of the outcomes.


Contributing Institute(s):
  1. Jülich Supercomputing Center (JSC)
Research Program(s):
  1. 512 - Data-Intensive Science and Federated Computing (POF3-512) (POF3-512)

Appears in the scientific report 2018
Click to display QR Code for this record

The record appears in these collections:
Document types > Events > Contributions to a conference proceedings
Document types > Books > Contribution to a book
Workflow collections > Public records
Institute Collections > JSC
Publications database

 Record created 2018-09-06, last modified 2021-01-29



Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)