Help:Rocks cluster

From CECS wiki

This is general documentation for all Rocks-based clusters at UCF. Some of it may not apply to your specific cluster.

You can view the status of all known clusters in CECS at http://newton.i2lab.ucf.edu/ganglia2/

See also the general cluster notes.


Additional documentation

The cluster software is composed of various components, each of which has its own documentation.

See also Steve's cluster research page.

Tutorial

This information is used for a 10-minute short course on cluster use (longer if necessary), which can be tailored to fit your needs and expected use.

General info

  • Monitor cluster status with Ganglia.
  • Use ssh to log in to the cluster.
  • Use ssh or rsh to move between nodes if necessary.
  • Use MPI to program and start your job if appropriate.
  • Use Xming or a similar X server to access the system from Windows if you need graphics.
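The login and job-start steps above can be sketched as command lines. This is only an illustration: the user name `jdoe` and the program `./my_mpi_program` are hypothetical, and the MPI launcher name and options (`mpirun -np` here) depend on which MPI implementation your cluster has installed. The helpers only build the commands; they do not run them.

```python
import shlex


def ssh_command(user, host, x11=False):
    """Return the argv list for logging in to a cluster front end."""
    argv = ["ssh"]
    if x11:
        # -X asks ssh to forward X11 so graphical programs can display
        # on a local X server (for example Xming on Windows).
        argv.append("-X")
    argv.append(f"{user}@{host}")
    return argv


def mpirun_command(nprocs, program, *args):
    """Return the argv list for starting an MPI program on nprocs processes."""
    return ["mpirun", "-np", str(nprocs), program, *args]


# Hypothetical user and program names, for illustration only.
print(shlex.join(ssh_command("jdoe", "hilbert.i2lab.ucf.edu", x11=True)))
print(shlex.join(mpirun_command(8, "./my_mpi_program")))
```

Keeping each command as a list of arguments rather than one string avoids quoting problems if you later pass it to `subprocess.run`.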

Specific info

(These should probably be expanded--ask if you need details.)

  • Use the batch queue system appropriate for your cluster, such as Sun Grid Engine, to schedule your jobs.
  • Watch out for hidden performance penalties if hyperthreading is enabled.
  • Your job can be optimized in various ways.
  • Please use Ganglia to make sure your job is not left running accidentally and does not interfere with others' jobs.
  • Be aware of disk quotas; move files to an appropriate storage location (or delete them) when you are done with them.
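As a rough illustration of scheduling through the batch queue, the sketch below generates a minimal Sun Grid Engine job script. It assumes your cluster uses SGE. The job name, the command, and the parallel environment name `mpi` are hypothetical; parallel environment names are site-specific, and `qconf -spl` lists the ones defined on a given cluster.

```python
def sge_job_script(job_name, command, slots=1, parallel_env=None):
    """Return the text of a minimal Sun Grid Engine job script."""
    lines = [
        "#!/bin/bash",
        f"#$ -N {job_name}",  # job name shown by qstat
        "#$ -cwd",            # run the job from the submission directory
        "#$ -j y",            # merge stderr into the stdout file
        "#$ -S /bin/bash",    # shell used to interpret the script
    ]
    if parallel_env is not None:
        # Request `slots` slots in a parallel environment (name is site-specific).
        lines.append(f"#$ -pe {parallel_env} {slots}")
    lines.append(command)
    return "\n".join(lines) + "\n"


# Hypothetical job: 8 MPI processes in an assumed parallel environment "mpi".
script = sge_job_script("demo", "mpirun -np 8 ./my_mpi_program",
                        slots=8, parallel_env="mpi")
print(script, end="")
```

Save the output to a file such as `job.sh`, submit it with `qsub job.sh`, check on it with `qstat`, and remove it with `qdel` followed by the job ID if it should not keep running.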

I2Lab clusters

The general-access hosts in the I2Lab clusters are:

  • newton.i2lab.ucf.edu -- archive server for large datasets
  • hilbert.i2lab.ucf.edu (64 nodes / 4 CPUs per node)
  • euler.i2lab.ucf.edu (64 nodes / 2 CPUs per node)
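When sizing a parallel job it helps to know the total CPU count of each cluster. The short sketch below computes it from the node and per-node figures listed above; the dictionary layout and function name are only for illustration, and the number of CPUs actually available to you at any time depends on the queue and on other users' jobs.

```python
# Node counts and CPUs per node as listed for each compute cluster.
CLUSTERS = {
    "hilbert.i2lab.ucf.edu": (64, 4),
    "euler.i2lab.ucf.edu": (64, 2),
}


def total_cpus(nodes, cpus_per_node):
    """Return the total number of CPUs in a cluster."""
    return nodes * cpus_per_node


for host, (nodes, cpus_per_node) in CLUSTERS.items():
    print(f"{host}: {total_cpus(nodes, cpus_per_node)} CPUs")
# hilbert.i2lab.ucf.edu: 256 CPUs
# euler.i2lab.ucf.edu: 128 CPUs
```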

Cluster programming guides