Document models

A document model primarily controls what parts of a document's structure need to be indexed and how they are indexed. Its purpose is to:

When the document model identifies text as belonging to a text field, the text is considered to be part of the textual content of the document, and terms are extracted and stored in the index.

The elements of a document model vary depending on the parser used for that document format:

See the relevant "Defining a Document Model" section for information.

For information on the document model syntax in the form of a Document Type Definition (DTD), and text field limitations, see Appendix G. Document model reference.