| Automatic Supervision of CRAY UNICOS Systems |
| Contribution to a conference proceedings/Contribution to a book | FZJ-2015-01737 |
1991
Abstract: This paper describes automated strategies for recovering from UNICOS hardware and software errors that normally required human intervention. The first major step towards automation is an automatic reboot of the UNICOS system, which can be performed by a Motorola operator workstation (OWS) directly connected to the IOS of the CRAY. The shell scripts on the OWS that control and perform this reboot are discussed, and the changes to inittab and to the UNICOS startup scripts on the CRAY are described. Furthermore, the development of this supervising system and its present state are described, along with some typical problems that can now be handled automatically.