Knowledge Center         Previous  Next    Index  
Platform Computing Corp.

Installing LSF in a Mixed Cluster

This chapter describes how to set up LSF in a mixed environment.

Contents

Setting up a Linux cluster with Windows compute nodes

Complete the following steps to set up a Linux cluster with Windows compute nodes.

  1. Install the Linux cluster on a shared file system.
  2. Configure the Linux cluster.
  3. Restart the LSF cluster.
  4. Install the Windows compute node.
  5. Start the Windows compute node.

Install the Linux cluster on a shared file system

  1. Install the Linux cluster as described in the Linux install guide, with exceptions to allow for Windows compute nodes.
  2. Edit install.conf and specify the following:

    1. Enable dynamic hosts.
    2. Enable or add the following line:

      ENABLE_DYNAMIC_HOSTS=Y 
      
    3. Optional. Allow EGO to control the LSF daemons.
    4. Enable or add the following line:

      EGO_DAEMON_CONTROL=Y 
      
    5. Specify the cluster administrator.
    6. LSF_ADMIN=user_account 
       

      For example:

      LSF_ADMIN=lsfadmin
    7. Specify the installation directory.
    8. LSF_TOP=directory 
       

      For example:

      LSF_TOP=$SHARE/LSF_7.0

Configure the Linux cluster

  1. If you allowed EGO to control the LSF daemons, add Windows compute node information to the LSF service configuration files.
    1. Edit LSF_TOP/conf/ego/cluster_name/eservice/esc/conf/services
      /res.xml
    2. Navigate to the section with the correct Windows host type.
    3. For 64-bit compute hosts, navigate to NTX64. For IA hosts, navigate to NTIA64. For other Windows compute hosts, navigate to NTX86.

    4. Add the proper Windows compute node information for the Command and ExecutionUser tags.
    5. For example:

      <sc:ActivityDescription> 
        <ego:Attribute name="hostType" type="xsd:string">NTX86</ego:Attribute> 
        <ego:ActivitySpecification> 
        <ego:Command>C:\LSF_7.0\7.0\etc\res.exe -3</ego:Command> 
        <ego:ExecutionUser>LSF\lsfadmin</ego:ExecutionUser> 
        <ego:EnvironmentVariable name="LSF_ENVDIR">c:\LSF_7.0\
      conf</ego:EnvironmentVariable> 
        ... 
      
    6. Edit LSF_TOP/conf/ego/cluster_name/eservice/esc/conf/services/
      sbatchd.xml
    7. Navigate to the section with the correct Windows host type.
    8. For 64-bit compute hosts, navigate to NTX64. For IA hosts, navigate to NTIA64. For other Windows compute hosts, navigate to NTX86.

    9. Add the proper Windows compute node information for the Command and ExecutionUser tags.
    10. For example:

      <sc:ActivityDescription> 
        <ego:Attribute name="hostType" type="xsd:string">NTX86</ego:Attribute> 
        <ego:ActivitySpecification> 
        <ego:Command>C:\LSF_7.0\7.0\etc\sbatchd.exe -3</ego:Command> 
        <ego:ExecutionUser>LSF\lsfadmin</ego:ExecutionUser> 
        ... 
      
  2. Add the Windows cluster administrator account to the your cluster file.
    1. Edit LSF_CONFDIR/lsf.cluster.cluster_name
    2. In the ClusterAdmins section, add LSF\lsfadmin to the Administrators list.
    3. For example:

      Begin   ClusterAdmins 
      Administrators = lsfadmin LSF\lsfadmin 
      End    ClusterAdmins 
      
  3. Add the LSF user domain to the lsf.conf file.
    1. Edit LSF_TOP/LSF7.0/conf/lsf.conf
    2. Add the LSF user domain.
    3. LSF_USER_DOMAIN=lsf_user_domain 
       

      For example:

      LSF_USER_DOMAIN=LSF
  4. Register the Windows execution password to your Linux cluster.
    1. Log on to any host in the cluster as egoadmin.
    2. Log on to EGO as the cluster administrator.
    3. For example:

      egosh user logon -u Admin -x mypasswd 
      
    4. Register the password of the Windows user account.
    5. egosh ego execpasswd -u "domain\admin" -x password 
       

      For example:

      egosh ego execpasswd -u "LSF\lsfadmin" -x lsfpasswd

      The password must be 31 characters or less.

      important:  
      If you run egosh ego execpasswd from a UNIX or Linux host, you must run with the -noverify option. Only a Windows host can verify the password for a Windows user. For example:
      egosh ego execpasswd -u "LSF\lsfadmin" -x password -noverify

Restart the LSF cluster

  1. Restart the LSF cluster.
  2. lsfstartup

Install the Windows compute node

  1. Install the Windows compute node as described in the Windows install guide, with exceptions to be part of a Linux cluster.
  2. Specify the following options during installation:

    1. Specify the Linux master host as your master host name.
    2. Master_Name=linux_master_name 
      
    3. If you allowed EGO to control the LSF daemons in your cluster master host, allow EGO to control the LSF daemons in your compute node.
    4. EGO_DAEMON_CONTROL=Y 
      
    5. Specify the same port number as that of the Linux master host.
    6. Port_Number=base_port_number 
      
    7. Specify the cluster administrator to be the same as on your master host.
    8. LSF_ADMIN=domain\user_account 
       

      For example:

      LSF_ADMIN=LSF\lsfadmin
    9. Specify the installation directory.
    10. LSF_TOP=directory 
       

      For example:

      LSF_TOP=C:\LSF_7.0
  3. Register the Windows execution user password to your cluster password file.
  4. lspasswd -u "domain\admin" -p password

    For example:

    lspasswd -u "LSF\lsfadmin" -p lsfpasswd

    The password must be 31 characters or less.

Start the Windows compute node

  1. Start the Windows compute node
  2. lsadmin limstartup

  3. If you did not allow EGO to control the LSF daemons, manually start the LSF services.
  4. lsfadmin resstartup

    badmin hstartup

Setting up a Windows cluster with Linux compute nodes

Complete the following steps to set up a Windows cluster with Linux compute nodes.

  1. Install the Windows cluster master host as described in the Windows installation guide and allow EGO to control the LSF daemons.
  2. Specify the following option:

    EGO_DAEMON_CONTROL=Y 
    
  3. Install the Linux compute node as described in the Linux installation guide and allow EGO to control the LSF daemons.
  4. Edit install.conf and add the following line:

    EGO_DAEMON_CONTROL=Y 
    
  5. Add the LSF user domain to the lsf.conf file.
    1. Edit C:\LSF7.0\conf\lsf.conf
    2. Add the LSF user domain.
    3. LSF_USER_DOMAIN=lsf_user_domain 
       

      For example:

      LSF_USER_DOMAIN=LSF
  6. Restart the Windows cluster.
  7. egosh ego restart 
    

Platform Computing Inc.
www.platform.com
Knowledge Center         Previous  Next    Index