IBM WebSphere Information Integrator

Installation Requirements for Enterprise Search

Version 8.3

Before using this information and the product it supports, be sure to read the general information under "Notices."

This document contains proprietary information of IBM. It is provided under a license agreement and is protected by copyright law. The information contained in this publication does not include any product warranties, and any statements provided in this manual should not be interpreted as such.

You can order IBM publications online or through your local IBM representative.

When you send information to IBM, you grant IBM a nonexclusive right to use or distribute the information in any way it believes appropriate without incurring any obligation to you.

Copyright International Business Machines Corporation 2004, 2005. All rights reserved.
US Government Users Restricted Rights -- Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp.

Contents

About the installation requirements
Required software and supported data sources
Hardware and disk space requirements
Notices
Trademarks

About the installation requirements

The Installation Requirements document describes supported operating system levels, prerequisite software, hardware requirements, and supported data sources for WebSphere(R) Information Integrator OmniFind(TM) Edition.

WebSphere Information Integrator OmniFind Edition provides a technology called enterprise search. To install the enterprise search solution, ensure that you have the correct prerequisite software. The installation program for enterprise search will help you install the prerequisite software except for the WebSphere Application Server fix packs.

Required software and supported data sources

Before you install WebSphere Information Integrator OmniFind Edition, ensure that you have a supported operating system, the required hardware and software, and the required software for your data sources.

Supported operating systems

WebSphere Information Integrator OmniFind Edition is supported on the following operating systems:

IBM(R) AIX 5L(TM) (32-bit and 64-bit systems)
Linux(R)
Microsoft(R) Windows(R)
Solaris operating environment
Solaris 9, kernel SunOS 5.9 Generic 112233-12 Mar 2004

To download the AIX PTF and other fixes (maintenance levels):

  1. Go to the IBM AIX product support site: August 2004 C++ Runtime for AIX PTF.
  2. Download the xlc.rte.60.aug2004.ptf.tar.Z file.
  3. Follow the instructions on the Web page to install the PTF.
  4. Apply the appropriate maintenance levels for your version of AIX. Go to the following Web site to download AIX fixes: www.ibm.com/servers/eserver/support/pseries/aixfixes.html.
  5. Follow the instructions on the Web page to install the maintenance level (fixes).

To run enterprise search on AIX, you can set EXTSHM=ON. The WebSphere Information Integrator OmniFind Edition installation program sets the environment variable DB2ENVLIST=EXTSHM for the DB2 Universal Database(TM) profile variable, and the enterprise search administrator user sets the environment variable EXTSHM=ON. To allow another DB2 Universal Database user, such as db2instance user, to start DB2 Universal Database, you can set the environment variable EXTSHM=ON for the user. This environment variable setting might be necessary to run some enterprise search crawlers, such as the Domino(R) crawler, the DB2(R) crawler, and the Content Manager crawler.
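As a minimal sketch (not part of the product), the following Python snippet checks whether EXTSHM is set to ON in the environment of the user who runs it. The function name `extshm_enabled` is hypothetical, and the check assumes that the value must be exactly `ON`. It only reads the variable; it does not change any DB2 Universal Database settings.

```python
import os


def extshm_enabled(environ=None):
    """Return True if EXTSHM is set to exactly ON in the given environment.

    environ is any mapping of variable names to values; it defaults to the
    environment of the current process. (Hypothetical helper, not a product API.)
    """
    env = os.environ if environ is None else environ
    return env.get("EXTSHM") == "ON"


if __name__ == "__main__":
    if extshm_enabled():
        print("EXTSHM=ON is set for this user.")
    else:
        print("EXTSHM is not set to ON for this user.")
```

Running the script as the enterprise search administrator, and again as any other user who starts DB2 Universal Database, shows whether each user has the setting.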

Required software for WebSphere Information Integrator OmniFind Edition

The WebSphere Information Integrator OmniFind Edition installation program can install the required software. Alternatively, you can install the required software manually, or you can use existing installations of the required software. Enterprise search requires the following software:

IBM DB2 Universal Database Enterprise Server Edition, Version 8.2 (8.1 with Fix Pack 7)
Serves as a repository for collected data.
IBM DB2 Run-Time Client, Version 8.2 (8.1 with Fix Pack 7)
Required if you install WebSphere Information Integrator OmniFind Edition on multiple servers.
IBM WebSphere Application Server Network Deployment, Version 5.1.1 and 5.1.1.3 or IBM WebSphere Application Server, Version 6.0.2
Includes a Web application server and the IBM HTTP server. Both servers must be installed.

Optional software for WebSphere Information Integrator OmniFind Edition

The IBM WebSphere Information Integrator Information Center, Version 8.3, provides information for WebSphere Information Integrator OmniFind Edition and WebSphere Information Integrator Content Edition. The WebSphere Information Integrator OmniFind Edition installation program automatically installs the information center as part of the product installation. If you do not use the installation program to install the information center, when you click a help topic, you are connected to an IBM Web site that hosts the information center. The information center does not include PDF files.

Required software for data sources

You can manually install the required software for data sources or allow the WebSphere Information Integrator OmniFind Edition installation program to automatically install most of the required software as part of the product installation.

For current information about software requirements and supported data sources for WebSphere Information Integrator OmniFind Edition, go to WebSphere Information Integrator OmniFind Edition System Requirements.

To crawl Lotus(R) Domino or Notes(R) databases, DB2 Content Manager databases, federated relational databases, WebSphere Information Integrator Content Edition sources, or DB2 Universal Database by using Event Publishing, you must install the following versions of these products:

IBM Lotus Domino Server 6.0.2 or later for Linux, AIX, and Solaris or Lotus Notes(R) 6.0.2 or later for Windows
This software is required if you plan to collect data from Lotus Notes or Domino sources, Domino Document Manager documents, and QuickPlace(R) documents. The Notes crawler for Notes Remote Procedure Call (NRPC) uses Domino libraries as a Lotus Notes client. You install these libraries by installing Lotus Domino Server on the enterprise search crawler server. To ensure that the Notes crawler can work with the Domino libraries, you run a setup script that WebSphere Information Integrator OmniFind Edition provides on the crawler server after you install the Domino libraries. To use Domino Native Security, install and configure Lotus Domino 6.0.2 CF2 or later as a server on the crawler server for all supported operating systems.
IBM DB2 Information Integrator for Content, Version 8.2 and Version 8.3 for Windows and AIX, or IBM DB2 Content Manager Toolkit, Version 8.2 for Linux

For enterprise search on AIX and Windows, the Content Manager crawler uses the Java(TM) connector for Content Manager, Version 8, to access DB2 Content Manager servers. You install this connector by installing IBM DB2 Information Integrator for Content, Version 8.2 for Windows and AIX on the crawler server. To ensure that the Content Manager crawler can work with DB2 Content Manager, you run a setup script that WebSphere Information Integrator OmniFind Edition provides on the crawler server after you install the connector.

For enterprise search on Linux, the Content Manager crawler uses the Java(TM) connector for Content Manager, Version 8, to access DB2 Content Manager servers. You install this connector by installing the IBM DB2 Content Manager Linux Toolkit, Version 8.2, on the crawler server. To ensure that the Content Manager crawler can work with DB2 Content Manager, you run a setup script that WebSphere Information Integrator OmniFind Edition provides on the crawler server after you install the connector.

WebSphere Information Integrator Content Edition, Version 8.3 connectors
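The Content Edition crawler uses the Java libraries of WebSphere Information Integrator Content Edition as a Java client. To ensure that the Content Edition crawler can work with WebSphere Information Integrator Content Edition, you run a setup script that WebSphere Information Integrator OmniFind Edition provides on the crawler server.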
IBM DB2 Information Integrator, Version 8.2 or later
DB2 Information Integrator, Version 8.2, is shipped with WebSphere Information Integrator OmniFind Edition. You can use DB2 Information Integrator to crawl relational databases in DB2 Universal Database for z/OS(R), IBM Informix(R) IDS, Oracle 9i and Oracle 10g, IBM DB2 Universal Database for iSeries(TM), Microsoft SQL Server 2000, and Sybase (Version 11.9.2, Version 12.0, and Version 12.5 or later).
WebSphere MQ Version 5.3 Java Messaging libraries
To crawl DB2 Universal Database databases through the event publishing feature of DB2 Information Integrator Event Publisher Edition, the DB2 crawler needs the Java Messaging libraries of WebSphere MQ. You can install these libraries from the WebSphere MQ installer. To ensure that the DB2 crawler that uses event publishing works with the libraries, you run a setup script after you install the WebSphere MQ libraries on the crawler server. If the DB2 crawler does not use event publishing to crawl DB2 databases, the WebSphere MQ libraries are not required.

Required levels of Java

WebSphere Information Integrator OmniFind Edition requires the following levels of Java.

IBM Software Development Kit for Java 1.4.x (SDK for Java 1.5 is not supported)
The SDK for Java is required to compile the Java search applications that are created with the enterprise search application programming interfaces (APIs). The SDK for Java is not required to install WebSphere Information Integrator OmniFind Edition. You can compile the enterprise search ESSearchApplication sample application, the search and index API applications, and data listener applications and samples with the SDK for Java 1.4.x.

The ESSearchApplication sample application in the ES_INSTALL_ROOT/samples directory must run in a JRE Version 1.4 environment. WebSphere Application Server and WebSphere Portal both provide the JRE Version 1.4.
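The level rule above (1.4.x is supported, 1.5 is not) can be sketched as a small check. This is only an illustration: the function name `is_supported_java_level` is hypothetical, and the sketch assumes that the level is supplied as a dotted version string such as `1.4.2`.

```python
import re


def is_supported_java_level(version):
    """Return True for 1.4.x version strings and False for anything else.

    Only the first two numeric components are inspected, so a suffix such
    as an update number does not affect the result.
    """
    match = re.match(r"^(\d+)\.(\d+)", version.strip())
    if match is None:
        return False
    major, minor = int(match.group(1)), int(match.group(2))
    return (major, minor) == (1, 4)


if __name__ == "__main__":
    for candidate in ("1.4.2", "1.5.0"):
        print(candidate, is_supported_java_level(candidate))
```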

Supported data sources

You can use enterprise search to create searchable collections from the following data sources. Some of these data sources require additional software.

Documentum 4.2.x, 5.2.5, and 5.3
Accessed with the Content Edition crawler (WebSphere Information Integrator Content Edition, Version 8.3).
FileNet CS 5.3 and 5.4
Accessed with the Content Edition crawler (WebSphere Information Integrator Content Edition, Version 8.3).
FileNet P8 CM 3.0 and 3.5
Accessed with the Content Edition crawler (WebSphere Information Integrator Content Edition, Version 8.3).
Hummingbird(R) DM 5.1.0.5 with SR4
Accessed with the Content Edition crawler (WebSphere Information Integrator Content Edition, Version 8.3).
Open Text Livelink Enterprise Server 9.2 and 9.5
Accessed with the Content Edition crawler (WebSphere Information Integrator Content Edition, Version 8.3). An Open Text Livelink patch and server parameter changes are required to access Open Text Livelink Enterprise Server 9.2 through WebSphere Information Integrator Content Edition with the Content Edition crawler. See the WebSphere Information Integrator Content Edition support Web site at www.ibm.com/software/data/integration/db2ii/supportcontent.html for information about the latest updates and patches.
IBM DB2 Content Manager, Version 8.2 or Version 8.3
Accessed with the Content Manager crawler.
IBM Lotus Domino Document Manager Version 6.5.1 (formerly Domino.Doc(R))
Accessed with the Domino Document Manager crawler. If the Domino Document Manager crawler uses NRPC (Notes Remote Procedure Call), Lotus Domino Server 6.0.2 CF2 or later (AIX, Linux, or Solaris) or Lotus Notes 6.0.2 CF2 (Windows) must be installed on the crawler server. You must also run the appropriate setup script for your operating system: escrnote.sh for AIX, Linux, or Solaris or escrnote.vbs for Windows.
IBM Lotus Domino, Version 5.0 or later and Version 6.0 or later
Accessed with the Notes/Domino crawler. Lotus Domino Server 5.0.9a or later is supported. If you use the native security feature, Lotus Domino Server 6.0.2 CF2 or later is supported. If the Notes/Domino crawler uses Notes Remote Procedure Call (NRPC), Lotus Domino Server 6.0.2 CF2 or later (AIX, Linux, or Solaris) or Lotus Notes 6.0.2 CF2 (Windows) must be installed on the crawler server. You must also run the appropriate setup script for your operating system: escrnote.sh for AIX, Linux, or Solaris or escrnote.vbs for Windows.

IBM Lotus QuickPlace, Version 6.5.1 (formerly called Team Workplace(TM) and QuickPlace)
Accessed with the QuickPlace crawler. If the QuickPlace crawler uses Notes Remote Procedure Call (NRPC), Lotus Domino Server 6.0.2 CF2 or later (AIX, Linux, or Solaris) or Lotus Notes 6.0.2 CF2 (Windows) must be installed on the crawler server. You must also run the appropriate setup script for your operating system: escrnote.sh for AIX, Linux, or Solaris or escrnote.vbs for Windows.
IBM DB2 Universal Database for iSeries, Version 5.3
Accessed through DB2 Information Integrator, Version 8.2 or later with the DB2 crawler.
IBM DB2 Universal Database for Linux, UNIX(R), and Windows, Version 8.1 and Version 8.2
Accessed with the DB2 crawler.
IBM DB2 Universal Database for z/OS, Version 7 or later and Version 8 or later
Accessed through DB2 Information Integrator, Version 8.2 or later with the DB2 crawler.
IBM Informix IDS, Version 9 or later
Accessed through DB2 Information Integrator, Version 8.2 or later with the DB2 crawler.
IBM WebSphere Portal, Version 5.1.1 Web sites
Accessed with the WebSphere Portal crawler. The WebSphere Portal crawler can crawl sites that are created with WebSphere Portal, Version 5.1.
IBM WebSphere Portal Document Manager, Version 5.1.0.1
Accessed with the Content Edition crawler (WebSphere Information Integrator Content Edition, Version 8.3).
Microsoft SQL Server 2000
Accessed through DB2 Information Integrator, Version 8.2 or later with the DB2 crawler.
Microsoft Exchange Server 2000 or 2003
Accessed with the Exchange Server crawler.
Oracle 9i and Oracle 10g
Accessed through DB2 Information Integrator, Version 8.2 or later with the DB2 crawler.
Sybase Version 11.9.2, 12.0, and 12.5 or later
Accessed through DB2 Information Integrator, Version 8.2 or later with the DB2 crawler.
NewsGroup (NNTP)
Accessed with the Network News Transfer Protocol (NNTP) crawler.
UNIX file system
Accessed with the UNIX file system crawler.
Windows file system
Accessed with the Windows file system crawler.
Web (HTTP or HTTPS)
Accessed with the Web crawler.
IBM Workplace Web Content Manager, Version 2.5 and Version 5.1
Accessed with the Web crawler.
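Several entries in the list above name a setup script that must be run when a crawler uses NRPC: escrnote.sh for AIX, Linux, or Solaris, and escrnote.vbs for Windows. A minimal sketch of that selection rule follows. The function name `notes_setup_script` and the lowercase operating system keys are hypothetical, and the sketch returns only the script name because the location of the script is not given here.

```python
def notes_setup_script(os_name):
    """Return the name of the setup script for the given operating system.

    os_name is one of "AIX", "Linux", "Solaris", or "Windows"
    (case-insensitive). Any other value raises ValueError.
    """
    key = os_name.strip().lower()
    if key in ("aix", "linux", "solaris"):
        return "escrnote.sh"
    if key == "windows":
        return "escrnote.vbs"
    raise ValueError("unsupported operating system: " + os_name)


if __name__ == "__main__":
    for name in ("AIX", "Linux", "Solaris", "Windows"):
        print(name, "->", notes_setup_script(name))
```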

Hardware and disk space requirements

Hardware and disk space requirements depend on your operating system and your intended use for enterprise search.

Disk space requirements can vary depending on the number of documents that you want to crawl and the types of data sources that you crawl. These requirements assume that you build indexes regularly, which means that documents are added to, removed from, or updated in the index. For a multiple server configuration, the space requirements affect the index server. The ES_NODE_ROOT directory requires the most disk space on your system.
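Because the ES_NODE_ROOT directory needs the most space, it can be useful to check the free space on its file system before you install. The following is a minimal sketch under stated assumptions: the function name `free_space_ok` is hypothetical, the directory path and the required number of bytes are supplied by the caller, and no particular threshold is implied.

```python
import shutil


def free_space_ok(path, required_bytes):
    """Return True if the file system that holds path has at least
    required_bytes of free space."""
    usage = shutil.disk_usage(path)
    return usage.free >= required_bytes


if __name__ == "__main__":
    # "." stands in for the ES_NODE_ROOT directory; the 1 GB figure is
    # only a placeholder, not a product requirement.
    print(free_space_ok(".", 1024 ** 3))
```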

For current information about hardware requirements for enterprise search, go to the Capacity Planner spreadsheet for WebSphere Information Integrator OmniFind Edition.

The following list describes the minimum hardware requirements and minimum disk space requirements for a single server configuration and a multiple server configuration:

Small installation
Single server configuration:
Medium installation
Four-server configuration:
Large installation
Four-server configuration:

Notices

This information was developed for products and services offered in the U.S.A. IBM may not offer the products, services, or features discussed in this document in all countries. Consult your local IBM representative for information on the products and services currently available in your area. Any reference to an IBM product, program, or service is not intended to state or imply that only that IBM product, program, or service may be used. Any functionally equivalent product, program, or service that does not infringe any IBM intellectual property right may be used instead. However, it is the user's responsibility to evaluate and verify the operation of any non-IBM product, program, or service.

IBM may have patents or pending patent applications covering subject matter described in this document. The furnishing of this document does not give you any license to these patents. You can send license inquiries, in writing, to: IBM Director of Licensing IBM Corporation North Castle Drive Armonk, NY 10504-1785 U.S.A.

For license inquiries regarding double-byte (DBCS) information, contact the IBM Intellectual Property Department in your country/region or send inquiries, in writing, to: IBM World Trade Asia Corporation Licensing 2-31 Roppongi 3-chome, Minato-ku Tokyo 106-0032, Japan

The following paragraph does not apply to the United Kingdom or any other country/region where such provisions are inconsistent with local law: INTERNATIONAL BUSINESS MACHINES CORPORATION PROVIDES THIS PUBLICATION "AS IS" WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESS OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF NON-INFRINGEMENT, MERCHANTABILITY, OR FITNESS FOR A PARTICULAR PURPOSE. Some states do not allow disclaimer of express or implied warranties in certain transactions; therefore, this statement may not apply to you.

This information could include technical inaccuracies or typographical errors. Changes are periodically made to the information herein; these changes will be incorporated in new editions of the publication. IBM may make improvements and/or changes in the product(s) and/or the program(s) described in this publication at any time without notice.

Any references in this information to non-IBM Web sites are provided for convenience only and do not in any manner serve as an endorsement of those Web sites. The materials at those Web sites are not part of the materials for this IBM product, and use of those Web sites is at your own risk.

IBM may use or distribute any of the information you supply in any way it believes appropriate without incurring any obligation to you.

Licensees of this program who wish to have information about it for the purpose of enabling: (i) the exchange of information between independently created programs and other programs (including this one) and (ii) the mutual use of the information that has been exchanged, should contact:

IBM Corporation J46A/G4
555 Bailey Avenue
San Jose, CA 95141-1003 U.S.A.

Such information may be available, subject to appropriate terms and conditions, including in some cases payment of a fee.

The licensed program described in this document and all licensed material available for it are provided by IBM under terms of the IBM Customer Agreement, IBM International Program License Agreement, or any equivalent agreement between us.

Any performance data contained herein was determined in a controlled environment. Therefore, the results obtained in other operating environments may vary significantly. Some measurements may have been made on development-level systems, and there is no guarantee that these measurements will be the same on generally available systems. Furthermore, some measurements may have been estimated through extrapolation. Actual results may vary. Users of this document should verify the applicable data for their specific environment.

Information concerning non-IBM products was obtained from the suppliers of those products, their published announcements, or other publicly available sources. IBM has not tested those products and cannot confirm the accuracy of performance, compatibility, or any other claims related to non-IBM products. Questions on the capabilities of non-IBM products should be addressed to the suppliers of those products.

All statements regarding IBM's future direction or intent are subject to change or withdrawal without notice, and represent goals and objectives only.

This information contains examples of data and reports used in daily business operations. To illustrate them as completely as possible, the examples include the names of individuals, companies, brands, and products. All of these names are fictitious, and any similarity to the names and addresses used by an actual business enterprise is entirely coincidental.

COPYRIGHT LICENSE:

This information contains sample application programs, in source language, which illustrate programming techniques on various operating platforms. You may copy, modify, and distribute these sample programs in any form without payment to IBM for the purposes of developing, using, marketing, or distributing application programs conforming to the application programming interface for the operating platform for which the sample programs are written. These examples have not been thoroughly tested under all conditions. IBM, therefore, cannot guarantee or imply reliability, serviceability, or function of these programs. You may copy, modify, and distribute these sample programs in any form without payment to IBM for the purposes of developing, using, marketing, or distributing application programs conforming to IBM's application programming interfaces.

Each copy or any portion of these sample programs or any derivative work must include a copyright notice as follows:

Outside In (R) Viewer Technology, (C)1992-2004 Stellent, Chicago, IL., Inc. All Rights Reserved.

IBM XSLT Processor Licensed Materials - Property of IBM (C)Copyright IBM Corp., 1999-2004. All Rights Reserved.

Trademarks

This topic lists IBM trademarks and certain non-IBM trademarks.

See http://www.ibm.com/legal/copytrade.shtml for information about IBM trademarks.

The following terms are trademarks or registered trademarks of other companies:

Java and all Java-based trademarks and logos are trademarks or registered trademarks of Sun Microsystems, Inc. in the United States, other countries, or both.

Microsoft, Windows, Windows NT, and the Windows logo are trademarks of Microsoft Corporation in the United States, other countries, or both.

Intel, Intel Inside (logos), MMX and Pentium are trademarks of Intel Corporation in the United States, other countries, or both.

UNIX is a registered trademark of The Open Group in the United States and other countries.

Linux is a trademark of Linus Torvalds in the United States, other countries, or both.

Other company, product or service names might be trademarks or service marks of others.