001     153150
005     20230711154229.0
020 _ _ |a 978-3-89336-949-2
024 7 _ |2 URN
|a urn:nbn:de:0001-2014022000
024 7 _ |2 Handle
|a 2128/6698
024 7 _ |2 ISSN
|a 1868-8489
037 _ _ |a FZJ-2014-02813
041 _ _ |a English
100 1 _ |0 P:(DE-Juel1)132139
|a Holl, Sonja
|b 0
|e Corresponding Author
|g female
|u fzj
245 _ _ |a Automated Optimization Methods for Scientific Workflows in e-Science Infrastructures
|f 2014-01-27
260 _ _ |a Jülich
|b Forschungszentrum Jülich GmbH Zentralbibliothek, Verlag
|c 2014
300 _ _ |a xvi, 182 S.
336 7 _ |0 PUB:(DE-HGF)11
|2 PUB:(DE-HGF)
|a Dissertation / PhD Thesis
|b phd
|m phd
|s 153150
336 7 _ |0 2
|2 EndNote
|a Thesis
336 7 _ |2 DRIVER
|a doctoralThesis
336 7 _ |2 BibTeX
|a PHDTHESIS
336 7 _ |2 DataCite
|a Output Types/Dissertation
336 7 _ |2 ORCID
|a DISSERTATION
490 0 _ |a Schriften des Forschungszentrums Jülich. IAS Series
|v 24
502 _ _ |a Universität Bonn, Diss., 2014
|b Dr.
|c Universität Bonn
|d 2014
520 _ _ |a Scientific workflows have emerged as a key technology that assists scientists with the design, management, execution, sharing and reuse of in silico experiments. Workflow management systems simplify the management of scientific workflows by providing graphical interfaces for their development, monitoring and analysis. Nowadays, e-Science combines such workflow management systems with large-scale data and computing resources into complex research infrastructures. For instance, e-Science allows the conveyance of best practice research in collaborations by providing workflow repositories, which facilitate the sharing and reuse of scientific workflows. However, scientists are still faced with different limitations while reusing workflows. One of the most common challenges they meet is the need to select appropriate applications and their individual execution parameters. If scientists do not want to rely on default or experience-based parameters, the best-effort option is to test different workflow set-ups using either trial and error approaches or parameter sweeps. Both methods may be inefficient or time consuming respectively, especially when tuning a large number of parameters. Therefore, scientists require an effective and efficient mechanism that automatically tests different workflow set-ups in an intelligent way and will help them to improve their scientific results. This thesis addresses the limitation described above by defining and implementing an approach for the optimization of scientific workflows. In the course of this work, scientists’ needs are investigated and requirements are formulated resulting in an appropriate optimization concept. In a following step, this concept is prototypically implemented by extending a workflow management system with an optimization framework, including general mechanisms required to conduct workflow optimization. As optimization is an ongoing research topic, different algorithms are provided by pluggable extensions (plugins) that can be loosely coupled with the framework, resulting in a generic and quickly extendable system. In this thesis, an exemplary plugin is introduced which applies a Genetic Algorithm for parameter optimization. In order to accelerate and therefore make workflow optimization feasible at all, e-Science infrastructures are utilized for the parallel execution of scientific workflows. This is empowered by additional extensions enabling the execution of applications and workflows on distributed computing resources. The actual implementation and therewith the general approach of workflow optimization is experimentally verified by four use cases in the life science domain. All workflows were significantly improved, which demonstrates the advantage of the proposed workflow optimization. Finally, a new collaboration-based approach is introduced that harnesses optimization provenance to make optimization faster and more robust in the future.
536 _ _ |0 G:(DE-HGF)POF2-412
|a 412 - Grid Technologies and Infrastructures (POF2-412)
|c POF2-412
|f POF II
|x 0
650 _ 7 |0 V:(DE-588b)4012494-0
|2 GND
|a Dissertation
|x Diss.
773 _ _ |y 2014
856 4 _ |u https://juser.fz-juelich.de/record/153150/files/IAS_Series_24_A.pdf
|y OpenAccess
856 4 _ |u https://juser.fz-juelich.de/record/153150/files/IAS_Series_24_A.ps.gz
|y OpenAccess
856 4 _ |u https://juser.fz-juelich.de/record/153150/files/IAS_Series_24_A.jpg?subformat=icon-144
|x icon-144
|y OpenAccess
856 4 _ |u https://juser.fz-juelich.de/record/153150/files/IAS_Series_24_A.jpg?subformat=icon-180
|x icon-180
|y OpenAccess
856 4 _ |u https://juser.fz-juelich.de/record/153150/files/IAS_Series_24_A.jpg?subformat=icon-640
|x icon-640
|y OpenAccess
909 C O |o oai:juser.fz-juelich.de:153150
|p openaire
|p open_access
|p urn
|p driver
|p VDB
|p dnbdelivery
910 1 _ |0 I:(DE-588b)5008462-8
|6 P:(DE-Juel1)132139
|a Forschungszentrum Jülich GmbH
|b 0
|k FZJ
913 2 _ |0 G:(DE-HGF)POF3-512
|1 G:(DE-HGF)POF3-510
|2 G:(DE-HGF)POF3-500
|a DE-HGF
|b Key Technologies
|l Supercomputing & Big Data
|v Data-Intensive Science and Federated Computing
|x 0
913 1 _ |0 G:(DE-HGF)POF2-412
|1 G:(DE-HGF)POF2-410
|2 G:(DE-HGF)POF2-400
|3 G:(DE-HGF)POF2
|4 G:(DE-HGF)POF
|a DE-HGF
|b Schlüsseltechnologien
|l Supercomputing
|v Grid Technologies and Infrastructures
|x 0
914 1 _ |y 2014
915 _ _ |0 StatID:(DE-HGF)0510
|2 StatID
|a OpenAccess
920 _ _ |l yes
920 1 _ |0 I:(DE-Juel1)JSC-20090406
|k JSC
|l Jülich Supercomputing Center
|x 0
980 _ _ |a phd
980 _ _ |a UNRESTRICTED
980 _ _ |a FullTexts
980 _ _ |a I:(DE-Juel1)JSC-20090406
980 _ _ |a VDB
980 _ _ |a OPENSCIENCE
980 1 _ |a FullTexts


LibraryCollectionCLSMajorCLSMinorLanguageAuthor
Marc 21