11.05.2016 Views

Apache Solr Reference Guide Covering Apache Solr 6.0

21SiXmO

21SiXmO

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

These processors are included in <strong>Solr</strong> releases as "contribs", and require additional jars loaded at runtime. See<br />

the README files associated with each contrib for details:<br />

The langid contrib provides :<br />

LangDetectLanguageIdentifierUpdateProcessorFactory : Identifies the language of a set of<br />

input fields using http://code.google.com/p/language-detection<br />

TikaLanguageIdentifierUpdateProcessorFactory : Identifies the language of a set of input fields<br />

using Tika's LanguageIdentifier.<br />

The uima contrib provides:<br />

UIMAUpdateRequestProcessorFactory : Update document(s) to be indexed with UIMA extracted<br />

information.<br />

Update Processor factories you should not modify or remove.<br />

These are listed for completeness, but are part of the <strong>Solr</strong> infrastructure, particularly <strong>Solr</strong>Cloud. Other than<br />

insuring you do not remove them when modifying the update request handlers (or any copies you make), you will<br />

rarely, if ever, need to change these.<br />

DistributedUpdateProcessorFactory:<br />

Used to distribute updates to all necessary nodes.<br />

NoOpDistributingUpdateProcessorFactory:<br />

An alternative No-Op implementation of<br />

DistributingUpdateProcessorFactory that always returns null. Designed for experts who want to<br />

bypass distributed updates and use their own custom update logic.<br />

LogUpdateProcessorFactory:<br />

A logging processor. This keeps track of all commands that have passed<br />

through the chain and prints them on finish().<br />

RunUpdateProcessorFactory : Executes the update commands using the underlying UpdateHandler.<br />

Almost all processor chains should end with an instance of RunUpdateProcessorFactory unless the user<br />

is explicitly executing the update commands in an alternative custom UpdateRequestProcessorFactory.<br />

Codec Factory<br />

A codecFactory can be specified in solrconfig.xml to determine which Lucene Codec is used when<br />

writing the index to disk.<br />

If not specified, Lucene's default codec is implicitly used, but a solr.SchemaCodecFactory is also available<br />

which supports 2 key features:<br />

Example:<br />

Schema based per-fieldtype configuration for docValuesFormat and postingsFormat - see the Field<br />

Type Definitions and Properties section for more details.<br />

A compressionMode option:<br />

BEST_SPEED (default) is optimized for search speed performance<br />

BEST_COMPRESSION is optimized for disk space usage<br />

<br />

BEST_COMPRESSION<br />

<br />

<strong>Solr</strong> Cores and solr.xml<br />

<strong>Apache</strong> <strong>Solr</strong> <strong>Reference</strong> <strong>Guide</strong> <strong>6.0</strong><br />

466

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!