Grid Engine System Components

Searching Sun Grid Engine 6.2

Sun Grid Engine Information Center
Sun Grid Engine Overview
Index


Grid Engine System Components

Hosts

Four types of hosts are fundamental to the Grid Engine system:

  • Master host – The master host is central to the overall cluster activity. The master host runs the master daemon sge_qmaster. This daemon controls all Grid Engine system scheduling and components, such as queues and jobs. The daemon maintains tables about the status of the components, user access permissions, and the like. By default, the master host is also an administration host and a submit host.
  • Execution hosts – Execution hosts are systems that have permission to execute jobs. Therefore, queue instances are attached to the execution hosts. Execution hosts run the execution daemon sge_execd.
  • Administration hosts – Administration hosts are hosts that have permission to carry out any kind of administrative activity for the Grid Engine system.
  • Submit hosts – Submit hosts enable users to submit and control batch jobs only. In particular, a user who is logged in to a submit host can submit jobs with the qsub command, can monitor the job status with the qstat command, and can use the Grid Engine system OSF/1 Motif graphical user interface QMON, which is described in QMON, the Grid Engine System's Graphical User Interface.
Note
A system can act as more than one type of host.

Daemons

Two daemons provide the functionality of the Grid Engine system.

sge_qmaster - The Master Daemon

The center of the cluster's management and scheduling activities, sge_qmaster maintains tables about hosts, queues, jobs, system load, and user permissions. sge_qmaster also performs scheduling functions and requests actions from sge_execd on the appropriate execution hosts. sge_qmaster makes the following scheduling decisions:

  • Which jobs are dispatched to which queues
  • How to reorder and reprioritize jobs to maintain share, priority, or deadline

sge_execd - The Execution Daemon

The execution daemon is responsible for the queue instances on its host and for the running of jobs in these queue instances. Periodically, the execution daemon forwards information such as job status or load on its host to sge_qmaster.

Queues

A queue is a container for a class of jobs that are allowed to run on one or more hosts concurrently. A queue determines certain job attributes, for example, whether the job can be migrated. Throughout its lifetime, a running job is associated with its queue. Association with a queue affects some of the things that can happen to a job. For example, if a queue is suspended, all jobs associated with that queue are also suspended.

Jobs do not need to be submitted directly to a queue. If you submit a job to a specified queue, the job is bound to this queue. As a result, the Grid Engine system daemons are unable to select a better-suited device or a device that has a lighter load.

You only need to specify the requirement profile of the job. A profile might include requirements such as memory, operating system, available software, and so forth. The Grid Engine software automatically dispatches the job to a suitable queue and a suitable host with a light execution load.

A queue can reside on a single host, or a queue can extend across multiple hosts. For this reason, Grid Engine system queues are also referred to as cluster queues. Cluster queues enable users and administrators to work with a cluster of execution hosts by means of a single queue configuration. Each host that is attached to a cluster queue receives its own queue instance from the cluster queue.

Client Commands

The command-line user interface is a set of ancillary programs (commands) that enable you to do the following tasks:

  • Manage queues
  • Submit and delete jobs
  • Check job status
  • Suspend or enable queues and jobs

For a complete list of ancillary programs, see Client Commands. To view detailed information about each command, see the Grid Engine man pages, which are available in your $SGE_ROOT/man directory or on the Open Grid Engine site.


Enter labels to add to this page:
Please wait 
Looking for a label? Just start typing.

Sign up or Log in to add a comment or watch this page.


The individuals who post here are part of the extended Sun Microsystems community and they might not be employed or in any way formally affiliated with Sun Microsystems. The opinions expressed here are their own, are not necessarily reviewed in advance by anyone but the individual authors, and neither Sun nor any other party necessarily agrees with them.

Copyright 1994-2009 Sun Microsystems, Inc.
Powered by Atlassian Confluence
Sun Guidelines on Public Discourse Privacy Policy Terms of Use Trademarks Site Map Employment Investor Relations Contact