000153150 001__ 153150
000153150 005__ 20230711154229.0
000153150 0247_ $$2URN$$aurn:nbn:de:0001-2014022000
000153150 0247_ $$2Handle$$a2128/6698
000153150 0247_ $$2ISSN$$a1868-8489
000153150 020__ $$a978-3-89336-949-2
000153150 037__ $$aFZJ-2014-02813
000153150 041__ $$aEnglish
000153150 1001_ $$0P:(DE-Juel1)132139$$aHoll, Sonja$$b0$$eCorresponding Author$$gfemale$$ufzj
000153150 245__ $$aAutomated Optimization Methods for Scientific Workflows in e-Science Infrastructures$$f2014-01-27
000153150 260__ $$aJülich$$bForschungszentrum Jülich GmbH Zentralbibliothek, Verlag$$c2014
000153150 300__ $$axvi, 182 S.
000153150 3367_ $$0PUB:(DE-HGF)11$$2PUB:(DE-HGF)$$aDissertation / PhD Thesis$$bphd$$mphd$$s153150
000153150 3367_ $$02$$2EndNote$$aThesis
000153150 3367_ $$2DRIVER$$adoctoralThesis
000153150 3367_ $$2BibTeX$$aPHDTHESIS
000153150 3367_ $$2DataCite$$aOutput Types/Dissertation
000153150 3367_ $$2ORCID$$aDISSERTATION
000153150 4900_ $$aSchriften des Forschungszentrums Jülich. IAS Series$$v24
000153150 502__ $$aUniversität Bonn, Diss., 2014$$bDr.$$cUniversität Bonn$$d2014
000153150 520__ $$aScientific workflows have emerged as a key technology that assists scientists with the design, management, execution, sharing and reuse of in silico experiments. Workflow management systems simplify the management of scientific workflows by providing graphical interfaces for their development, monitoring and analysis. Nowadays, e-Science combines such workflow management systems with large-scale data and computing resources into complex research infrastructures. For instance, e-Science allows the conveyance of best practice research in collaborations by providing workflow repositories, which facilitate the sharing and reuse of scientific workflows. However, scientists still face several limitations when reusing workflows. One of the most common challenges is the need to select appropriate applications and their individual execution parameters. If scientists do not want to rely on default or experience-based parameters, the best-effort option is to test different workflow set-ups using either trial-and-error approaches or parameter sweeps. Trial and error may be inefficient and parameter sweeps time-consuming, especially when tuning a large number of parameters. Therefore, scientists require an effective and efficient mechanism that automatically tests different workflow set-ups in an intelligent way and helps them to improve their scientific results. This thesis addresses the limitation described above by defining and implementing an approach for the optimization of scientific workflows. In the course of this work, scientists’ needs are investigated and requirements are formulated, resulting in an appropriate optimization concept. In a subsequent step, this concept is prototypically implemented by extending a workflow management system with an optimization framework, including the general mechanisms required to conduct workflow optimization.
As optimization is an ongoing research topic, different algorithms are provided by pluggable extensions (plugins) that can be loosely coupled with the framework, resulting in a generic and easily extensible system. In this thesis, an exemplary plugin is introduced which applies a Genetic Algorithm for parameter optimization. To accelerate workflow optimization, and thereby make it feasible at all, e-Science infrastructures are utilized for the parallel execution of scientific workflows. This is enabled by additional extensions that allow applications and workflows to be executed on distributed computing resources. The implementation, and with it the general approach to workflow optimization, is experimentally verified by four use cases in the life science domain. All workflows were significantly improved, which demonstrates the advantage of the proposed workflow optimization. Finally, a new collaboration-based approach is introduced that harnesses optimization provenance to make optimization faster and more robust in the future.
000153150 536__ $$0G:(DE-HGF)POF2-412$$a412 - Grid Technologies and Infrastructures (POF2-412)$$cPOF2-412$$fPOF II$$x0
000153150 650_7 $$0V:(DE-588b)4012494-0$$2GND$$aDissertation$$xDiss.
000153150 773__ $$y2014
000153150 8564_ $$uhttps://juser.fz-juelich.de/record/153150/files/IAS_Series_24_A.pdf$$yOpenAccess
000153150 8564_ $$uhttps://juser.fz-juelich.de/record/153150/files/IAS_Series_24_A.ps.gz$$yOpenAccess
000153150 8564_ $$uhttps://juser.fz-juelich.de/record/153150/files/IAS_Series_24_A.jpg?subformat=icon-144$$xicon-144$$yOpenAccess
000153150 8564_ $$uhttps://juser.fz-juelich.de/record/153150/files/IAS_Series_24_A.jpg?subformat=icon-180$$xicon-180$$yOpenAccess
000153150 8564_ $$uhttps://juser.fz-juelich.de/record/153150/files/IAS_Series_24_A.jpg?subformat=icon-640$$xicon-640$$yOpenAccess
000153150 909CO $$ooai:juser.fz-juelich.de:153150$$pdnbdelivery$$pVDB$$pdriver$$purn$$popen_access$$popenaire
000153150 915__ $$0StatID:(DE-HGF)0510$$2StatID$$aOpenAccess
000153150 9141_ $$y2014
000153150 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)132139$$aForschungszentrum Jülich GmbH$$b0$$kFZJ
000153150 9132_ $$0G:(DE-HGF)POF3-512$$1G:(DE-HGF)POF3-510$$2G:(DE-HGF)POF3-500$$aDE-HGF$$bKey Technologies$$lSupercomputing & Big Data$$vData-Intensive Science and Federated Computing$$x0
000153150 9131_ $$0G:(DE-HGF)POF2-412$$1G:(DE-HGF)POF2-410$$2G:(DE-HGF)POF2-400$$3G:(DE-HGF)POF2$$4G:(DE-HGF)POF$$aDE-HGF$$bSchlüsseltechnologien$$lSupercomputing$$vGrid Technologies and Infrastructures$$x0
000153150 920__ $$lyes
000153150 9201_ $$0I:(DE-Juel1)JSC-20090406$$kJSC$$lJülich Supercomputing Center$$x0
000153150 980__ $$aphd
000153150 980__ $$aUNRESTRICTED
000153150 980__ $$aFullTexts
000153150 980__ $$aI:(DE-Juel1)JSC-20090406
000153150 980__ $$aVDB
000153150 980__ $$aOPENSCIENCE
000153150 9801_ $$aFullTexts