Installing and Managing SPIDER's PubSub System for Distributed Processing
Create environment variables PUBSUB_DIR for the location of the
PubSub installation directory and PUBSUB_MASTER for name of the host
where PubSub master is run. These environment variables must be set in
each users startup file (i.e. for csh users in their .cshrc file) e.g.
setenv PUBSUB_DIR /net/bali/usr1/spider/pubsub
setenv PUBSUB_MASTER bali
Following steps should be done when logged in on PUBSUB_MASTER as member of group that is planning on using PubSub NOT as: root
cd PUBSUB_DIRECTORY e.g.
mkdir $PUBSUB_DIR ; cd $PUBSUB_DIR
Copy the PubSub files normally distributed in:
SPIDER_DIR/pubsub to your PUBSUB_DIRECTORY e.g.
cp SPIDER_DIR/pubsub/* $PUBSUB_DIR
Edit pubsub.permit e.g.
xedit pubsub.permit
Set machine specific permissions. Currently contains: machine name, limit for
number of simultaneous jobs, permitted run days, permitted start-time, permitted end-time, and que check
frequency (seconds), and comments. The machine names here determine where jobs can
be run.
Create an empty que file e.g.
touch pubsub.que
Tune NFS (if master node responds slowly).
If your master and compute nodes node will be accessing lots of data from a NFS mounted disk
you may want to speed up the process by altering the /etc/fstab
mount for the data disks to increase the read and write buffersize e.g.:
tonga2:/usr6 /usr6 nfs rsize=8192,wsize=8192 0 0
See: NFS tuning
for discussion.
You also may want to increase the number of nfs threads on the master node
and any other machines where the data is located using:
/usr/sbin/rpc.nfsd nproc
This should be placed in your init file so that it will be
preserved on reboot. On redhat GNU/Linux this is set in:
/etc/rc.d/init.d/nfs. Both changes will require root access
to the machine. See your Unix manual pages for: fstab & nfsd
Source: spider/pubsub/pubsub_inst.html Last page update: 29 May 2009 ArDean Leith