A B C D E F G I L M N O R S T U X

A

autoDetectParser - Variable in class org.apache.solr.handler.extraction.ExtractingDocumentLoader
 

B

BOOST_PREFIX - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
The boost value for the name of the field.

C

CAPTURE_ATTRIBUTES - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
Capture attributes separately according to the name of the element, instead of just adding them to the string buffer
CAPTURE_ELEMENTS - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
Capture the specified fields (and everything included below it that isn't capture by some other capture field) separately from the default.
characters(char[], int, int) - Method in class org.apache.solr.handler.extraction.SolrContentHandler
 
config - Variable in class org.apache.solr.handler.extraction.ExtractingDocumentLoader
 
config - Variable in class org.apache.solr.handler.extraction.ExtractingRequestHandler
 
CONFIG_LOCATION - Static variable in class org.apache.solr.handler.extraction.ExtractingRequestHandler
 
createFactory() - Method in class org.apache.solr.handler.extraction.ExtractingRequestHandler
 
createSolrContentHandler(Metadata, SolrParams, IndexSchema) - Method in class org.apache.solr.handler.extraction.SolrContentHandlerFactory
 

D

DATE_FORMATS - Static variable in class org.apache.solr.handler.extraction.ExtractingRequestHandler
 
dateFormats - Variable in class org.apache.solr.handler.extraction.ExtractingRequestHandler
 
dateFormats - Variable in class org.apache.solr.handler.extraction.SolrContentHandlerFactory
 
DEFAULT_FIELD - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
Optional.

E

endElement(String, String, String) - Method in class org.apache.solr.handler.extraction.SolrContentHandler
 
EXTRACT_FORMAT - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
Content output format if extractOnly is true.
EXTRACT_ONLY - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
Only extract and return the content, do not index it.
ExtractingDocumentLoader - Class in org.apache.solr.handler.extraction
The class responsible for loading extracted content into Solr.
ExtractingDocumentLoader(SolrQueryRequest, UpdateRequestProcessor, TikaConfig, SolrContentHandlerFactory) - Constructor for class org.apache.solr.handler.extraction.ExtractingDocumentLoader
 
ExtractingMetadataConstants - Interface in org.apache.solr.handler.extraction
Constants used internally by the ExtractingRequestHandler.
ExtractingParams - Interface in org.apache.solr.handler.extraction
The various Solr Parameters names to use when extracting content.
ExtractingRequestHandler - Class in org.apache.solr.handler.extraction
Handler for rich documents like PDF or Word or any other file format that Tika handles that need the text to be extracted first from the document.
ExtractingRequestHandler() - Constructor for class org.apache.solr.handler.extraction.ExtractingRequestHandler
 

F

factory - Variable in class org.apache.solr.handler.extraction.ExtractingDocumentLoader
 
factory - Variable in class org.apache.solr.handler.extraction.ExtractingRequestHandler
 
findMappedName(String) - Method in class org.apache.solr.handler.extraction.SolrContentHandler
Get the name mapping

G

getBoost(String) - Method in class org.apache.solr.handler.extraction.SolrContentHandler
Get the value of any boost factor for the mapped name.
getDescription() - Method in class org.apache.solr.handler.extraction.ExtractingRequestHandler
 
getSource() - Method in class org.apache.solr.handler.extraction.ExtractingRequestHandler
 
getSourceId() - Method in class org.apache.solr.handler.extraction.ExtractingRequestHandler
 
getVersion() - Method in class org.apache.solr.handler.extraction.ExtractingRequestHandler
 

I

inform(SolrCore) - Method in class org.apache.solr.handler.extraction.ExtractingRequestHandler
 
init(NamedList) - Method in class org.apache.solr.handler.extraction.ExtractingRequestHandler
 

L

LITERALS_PREFIX - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
Pass in literal values to be added to the document, as in
load(SolrQueryRequest, SolrQueryResponse, ContentStream) - Method in class org.apache.solr.handler.extraction.ExtractingDocumentLoader
 
LOWERNAMES - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
Map all generated attribute names to field names with lowercase and underscores.

M

MAP_PREFIX - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
The param prefix for mapping Tika metadata to Solr fields.

N

newDocument() - Method in class org.apache.solr.handler.extraction.SolrContentHandler
This is called by a consumer when it is ready to deal with a new SolrInputDocument.
newLoader(SolrQueryRequest, UpdateRequestProcessor) - Method in class org.apache.solr.handler.extraction.ExtractingRequestHandler
 

O

org.apache.solr.handler.extraction - package org.apache.solr.handler.extraction
 

R

RESOURCE_NAME - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
Optional.

S

SolrContentHandler - Class in org.apache.solr.handler.extraction
The class responsible for handling Tika events and translating them into SolrInputDocuments.
SolrContentHandler(Metadata, SolrParams, IndexSchema) - Constructor for class org.apache.solr.handler.extraction.SolrContentHandler
 
SolrContentHandler(Metadata, SolrParams, IndexSchema, Collection<String>) - Constructor for class org.apache.solr.handler.extraction.SolrContentHandler
 
SolrContentHandlerFactory - Class in org.apache.solr.handler.extraction
 
SolrContentHandlerFactory(Collection<String>) - Constructor for class org.apache.solr.handler.extraction.SolrContentHandlerFactory
 
startDocument() - Method in class org.apache.solr.handler.extraction.SolrContentHandler
 
startElement(String, String, String, Attributes) - Method in class org.apache.solr.handler.extraction.SolrContentHandler
 
STREAM_CONTENT_TYPE - Static variable in interface org.apache.solr.handler.extraction.ExtractingMetadataConstants
 
STREAM_NAME - Static variable in interface org.apache.solr.handler.extraction.ExtractingMetadataConstants
 
STREAM_SIZE - Static variable in interface org.apache.solr.handler.extraction.ExtractingMetadataConstants
 
STREAM_SOURCE_INFO - Static variable in interface org.apache.solr.handler.extraction.ExtractingMetadataConstants
 
STREAM_TYPE - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
The type of the stream.

T

TEXT_FORMAT - Static variable in class org.apache.solr.handler.extraction.ExtractingDocumentLoader
Extract Only supported format
transformValue(String, SchemaField) - Method in class org.apache.solr.handler.extraction.SolrContentHandler
Can be used to transform input values based on their SchemaField

This implementation only formats dates using the DateUtil.


U

UNKNOWN_FIELD_PREFIX - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
Optional.

X

XML_FORMAT - Static variable in class org.apache.solr.handler.extraction.ExtractingDocumentLoader
Extract Only supported format.
XPATH_EXPRESSION - Static variable in interface org.apache.solr.handler.extraction.ExtractingParams
Restrict the extracted parts of a document to be indexed by passing in an XPath expression.

A B C D E F G I L M N O R S T U X

Copyright © 2010 Apache Software Foundation. All Rights Reserved.