Hauptseite > Publikationsdatenbank > Hearts Gym: Learning Reinforcement Learning as a Team Event > print |
001 | 1020524 | ||
005 | 20240712112858.0 | ||
024 | 7 | _ | |a 10.34734/FZJ-2024-00243 |2 datacite_doi |
037 | _ | _ | |a FZJ-2024-00243 |
041 | _ | _ | |a English |
100 | 1 | _ | |a Ebert, Jan |0 P:(DE-Juel1)187002 |b 0 |
111 | 2 | _ | |a Third Teaching Machine Learning and Artificial Intelligence Workshop at ECML |c Grenoble |d 2022-09-19 - 2022-09-23 |w France |
245 | _ | _ | |a Hearts Gym: Learning Reinforcement Learning as a Team Event |
250 | _ | _ | |a 3rd ed. |
260 | _ | _ | |c 2023 |b PMLR |
300 | _ | _ | |a 16-21 |
336 | 7 | _ | |a CONFERENCE_PAPER |2 ORCID |
336 | 7 | _ | |a Conference Paper |0 33 |2 EndNote |
336 | 7 | _ | |a INPROCEEDINGS |2 BibTeX |
336 | 7 | _ | |a conferenceObject |2 DRIVER |
336 | 7 | _ | |a Output Types/Conference Paper |2 DataCite |
336 | 7 | _ | |a Contribution to a conference proceedings |b contrib |m contrib |0 PUB:(DE-HGF)8 |s 1704810885_18680 |2 PUB:(DE-HGF) |
490 | 0 | _ | |v 207 |
520 | _ | _ | |a Amidst the COVID-19 pandemic, the authors of this paper organized a Reinforcement Learning (RL) course for a graduate school in the field of data science. We describe the strategy and materials for creating an exciting learning experience despite the ubiquitous Zoom fatigue and evaluate the course qualitatively. The key organizational features are a focus on a competitive hands-on setting in teams, supported by a minimum of lectures providing the essential background on RL. The practical part of the course revolved around Hearts Gym, an RL environment for the card game Hearts that we developed as an entry-level tutorial to RL. Participants were tasked with training agents to explore reward shaping and other RL hyperparameters. For a final evaluation, the agents of the participants competed against each other. |
536 | _ | _ | |a 5112 - Cross-Domain Algorithms, Tools, Methods Labs (ATMLs) and Research Groups (POF4-511) |0 G:(DE-HGF)POF4-5112 |c POF4-511 |f POF IV |x 0 |
536 | _ | _ | |a HDS LEE - Helmholtz School for Data Science in Life, Earth and Energy (HDS LEE) (HDS-LEE-20190612) |0 G:(DE-Juel1)HDS-LEE-20190612 |c HDS-LEE-20190612 |x 1 |
588 | _ | _ | |a Dataset connected to DataCite |
700 | 1 | _ | |a Doncevic, Danimir T. |0 P:(DE-Juel1)180221 |b 1 |
700 | 1 | _ | |a Kloß, Ramona |0 P:(DE-Juel1)162489 |b 2 |
700 | 1 | _ | |a Kesselheim, Stefan |0 P:(DE-Juel1)185654 |b 3 |e Corresponding author |u fzj |
773 | _ | _ | |y 2023 |
856 | 4 | _ | |u https://proceedings.mlr.press/v207/ebert23a.html |
856 | 4 | _ | |u https://juser.fz-juelich.de/record/1020524/files/ebert23a.pdf |y OpenAccess |
856 | 4 | _ | |u https://juser.fz-juelich.de/record/1020524/files/ebert23a.gif?subformat=icon |x icon |y OpenAccess |
856 | 4 | _ | |u https://juser.fz-juelich.de/record/1020524/files/ebert23a.jpg?subformat=icon-1440 |x icon-1440 |y OpenAccess |
856 | 4 | _ | |u https://juser.fz-juelich.de/record/1020524/files/ebert23a.jpg?subformat=icon-180 |x icon-180 |y OpenAccess |
856 | 4 | _ | |u https://juser.fz-juelich.de/record/1020524/files/ebert23a.jpg?subformat=icon-640 |x icon-640 |y OpenAccess |
909 | C | O | |o oai:juser.fz-juelich.de:1020524 |p openaire |p open_access |p VDB |p driver |p dnbdelivery |
910 | 1 | _ | |a Forschungszentrum Jülich |0 I:(DE-588b)5008462-8 |k FZJ |b 0 |6 P:(DE-Juel1)187002 |
910 | 1 | _ | |a Forschungszentrum Jülich |0 I:(DE-588b)5008462-8 |k FZJ |b 1 |6 P:(DE-Juel1)180221 |
910 | 1 | _ | |a Forschungszentrum Jülich |0 I:(DE-588b)5008462-8 |k FZJ |b 2 |6 P:(DE-Juel1)162489 |
910 | 1 | _ | |a Forschungszentrum Jülich |0 I:(DE-588b)5008462-8 |k FZJ |b 3 |6 P:(DE-Juel1)185654 |
913 | 1 | _ | |a DE-HGF |b Key Technologies |l Engineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action |1 G:(DE-HGF)POF4-510 |0 G:(DE-HGF)POF4-511 |3 G:(DE-HGF)POF4 |2 G:(DE-HGF)POF4-500 |4 G:(DE-HGF)POF |v Enabling Computational- & Data-Intensive Science and Engineering |9 G:(DE-HGF)POF4-5112 |x 0 |
914 | 1 | _ | |y 2023 |
915 | _ | _ | |a OpenAccess |0 StatID:(DE-HGF)0510 |2 StatID |
920 | 1 | _ | |0 I:(DE-Juel1)JSC-20090406 |k JSC |l Jülich Supercomputing Center |x 0 |
920 | 1 | _ | |0 I:(DE-Juel1)IEK-10-20170217 |k IEK-10 |l Modellierung von Energiesystemen |x 1 |
920 | 1 | _ | |0 I:(DE-Juel1)IBG-1-20101118 |k IBG-1 |l Biotechnologie |x 2 |
980 | 1 | _ | |a FullTexts |
980 | _ | _ | |a contrib |
980 | _ | _ | |a VDB |
980 | _ | _ | |a UNRESTRICTED |
980 | _ | _ | |a I:(DE-Juel1)JSC-20090406 |
980 | _ | _ | |a I:(DE-Juel1)IEK-10-20170217 |
980 | _ | _ | |a I:(DE-Juel1)IBG-1-20101118 |
981 | _ | _ | |a I:(DE-Juel1)ICE-1-20170217 |
Library | Collection | CLSMajor | CLSMinor | Language | Author |
---|