|
Sun Grid Engine Information Center Other Sun Grid Engine Installation IssuesAdditional considerations for installing Sun Grid Engine software are identified in this section. These include the following topics: Verifying and Installing Linux Motif LibrariesOn newer Linux systems, the libXm.so.2 Motif libraries are not always installed, which results in the inability to run the precompiled Linux qmon binary. To correct this problem, follow these steps:
Installing the Grid Engine on a System With IPMPThis section describes how to install the Grid Engine software on hosts with the Solaris Operating Environment IP Multipathing (IPMP) technology. What Is IP Multipathing?IP Multipathing is a technology that allows TCP/IP interfaces to be grouped for failover and load balancing purposes. If an interface within an IP Multipathing group fails, the interface is disabled and its IP address is relocated to another interface in the group. Outbound IP traffic is distributed across the interfaces of a group. For further details on IP Multipathing, refer to the Solaris Operating Environment documentation at http://docs.sun.com/app/docs/doc/816-4554/ipmptm-1. Issues Between IPMP and Grid EngineWhen starting the Grid Engine daemons on a machine where the main interface is part of an IPMP group, error messages appear. When the IPMP load balancing distributes the connections across the interfaces in the group, the IP packets show up at the receiving end as coming from a different host from the one associated with the main interface. For example, on a machine with three interfaces named qfe0, qfe1, and qfe3, where the IP addresses for these interfaces are 10.1.1.1, 10.1.1.2, and 10.1.1.3 respectively, IPMP would need an extra address for each interface for testing. However, that requirement is ignored in this example. Each of these addresses has a host name associated with it. The hosts table looks like the following example: 10.1.1.1 sge 10.1.1.2 sge-qfe1 10.1.1.3 sge-qfe2 The machine's host name is sge. When a connection is established from sge to another machine, it might go through sge, sge-qfe1, or sge-qfe2. Upon installation, Grid Engine will only recognize sge. When Grid Engine receives a connection request from sge-qfe2, it closes the connection because the request is not from one of the authorized (or known) nodes. To solve this problem, use the host_aliases files to "tell" Grid Engine that sge, sge1, and sge-qfe2 are all from the same machine. See the sge_h_aliases man page for details. The host_aliases file in this case would look like this: sge sge-qfe1 sge-qfe2
Installing the Grid Engine Master Node With IPMPThere are two ways that you can fix this problem:
Ignoring the Error MessagesTo ignore the error messages, follow these steps:
Temporarily Disabling IPMPTo temporarily disable IPMP, follow these steps:
Installing a Grid Engine on an Execution Host With IPMPOnce the host_aliases file is installed and the Grid Engine daemons are restarted, you can simply start the execution host installation without further problems. Enabling Administrative and Submit Hosts With IPMPYou have two choices when enabling these hosts with IPMP:
|