Setting Up A Shadow Master In Grid Engine |
To set up master shadowing in Grid Engine, the following steps must
be taken.
1) Create the shadow_masters file
2) Verify correct permissions
3) Start the shadowd daemon(s)
1) Create a shadow_masters file The file needs to be created in $SGE_ROOT/default/common. This file should contain the name of the primary master host as the first line. Other hosts that are chosen to assume master responsibility should then be listed in the order desired. For example: >cat shadow_masters
Here, host1 is the primary master host. Should host1 fail, host2 will take over as the master server after a period of approximately 10 minutes. Further, if host2 should then fail, host3 will take over. 2) Verify correct permissions All master shadow hosts must have read/write permissions to the qmaster spool directory. 3) Start the shadow daemons The shadow daemon must be started on all shadow master hosts. This is done via the startup script, rcsge. As root on each host, run the following: $SGE_ROOT/default/common/rcsge -shadowd [Version 5.3 and its patches] $SGE_ROOT/default/common/sgemaster -shadowd [Version 6 or later]After these steps are successfully completed, master shadowing for the Grid Engine cluster is active. Refer to the Shadow Master Documentation and Shadow Master Man Page for more information about shadowd failover delay (SGE_DELAY_TIME) and check interval (SGE_CHECK_INTERVAL). |
When using this shadow master feature with the master hosts with multiple network interfaces, the following things have to be addressed.
|