A document model lets you identify document parts or sections as either belonging to a particular text field or as being a document attribute, or both.
The text of a document is fully indexed regardless of whether or not it is part of a text field. When the document model identifies text as belonging to a text field, the text is considered to be part of the textual content of the document and is indexed in the normal way. That is, meaningful terms are extracted and stored in the index. This means that unrestricted text searches include a search of that text.
In this way, you can search for text selectively in a particular field. For example, you can search for documents that contain the word structure in the text field Abstract.
A text field can occur several times in a document. For instance, you can define a text field that contains all figure captions. A text field may also overlap another text field.
If you want to avoid indexing the content of certain text fields, you can specify a field definition that contains exclude="YES". You can find a list of limitations for text fields and document attributes in Limitations for text fields and document attributes.
Document attributes contain short, formatted information of type number. You can use value ranges to search documents containing such attributes.
Attributes are not stored with indexed text, but in a separate item index. So, to search for documents by content of an attribute, you must make an attribute search explicitly on the attribute.
Net Search Extender provides a parser that recognizes floating-point numbers. The following are examples of correct and incorrect formats for integer values.
Correct format | Incorrect format |
---|---|
1000 1 000 1.000 - where the period is a decimal character |
1,000 |
100 000 100 000.00123 |
1 000 000 - two spaces between 1 and 0 |
Note that space characters are not allowed in the decimal fraction of a number. For example, 1 000.000 100 is treated as two numbers, 1000.000 and 100.
Language-specific separators and language-specific monetary formats are not supported.