Documents in HTML or XML format are examples of structured documents; they contain tags that identify text fields and document attributes. Text fields can contain information like the title, author, or a description of the document. Document attributes can contain a number.
The following is an extract from a structured plain-text document. It contains fields and attributes delimited by HTML-like tags.
[head]Handling structured documents [/head] [abstract]This document describes the concept of structured documents and the use of document models to... [/abstract] : :
When Net Search Extender indexes structured documents, it has to recognize the structure so that it can index the text field and the attributes, and store them together with a unique name that identifies the field or attribute that contains them. This enables Net Search Extender to selectively search in a particular text field or to find documents having a particular attribute.
To enable Net Search Extender to understand the structure of a particular document format, you must pass to Net Search Extender a definition of the structure in a document model.
You specify the name of the document model as an argument when you call the CREATE INDEX command to index the documents.
Before you can index documents using a document model, you must first define a document model and then add the document model to the index.