11.05.2016 Views

Apache Solr Reference Guide Covering Apache Solr 6.0

21SiXmO

21SiXmO

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

In: "four score"<br />

Tokenizer to Filter: "four", "score"<br />

Out: "f", "fo", "fou", "four", "s", "sc", "sco", "scor"<br />

Example:<br />

A range of 4 to 6.<br />

<br />

<br />

<br />

<br />

In: "four score and twenty"<br />

Tokenizer to Filter: "four", "score", "and", "twenty"<br />

Out: "four", "scor", "score", "twen", "twent", "twenty"<br />

English Minimal Stem Filter<br />

This filter stems plural English words to their singular form.<br />

Factory class: solr.EnglishMinimalStemFilterFactory<br />

Arguments: None<br />

Example:<br />

<br />

<br />

<br />

<br />

In: "dogs cats"<br />

Tokenizer to Filter: "dogs", "cats"<br />

Out: "dog", "cat"<br />

Fingerprint Filter<br />

This filter outputs a single token which is a concatenation of the sorted and de-duplicated set of input tokens.<br />

This can be useful for clustering/linking use cases.<br />

Factory class: solr.FingerprintFilterFactory<br />

Arguments:<br />

separator : The character used to separate tokens combined into the single output token. Defaults to " " (a<br />

space character).<br />

maxOutputTokenSize : The maximum length of the summarized output token. If exceeded, no output token is<br />

emitted. Defaults to 1024.<br />

Example:<br />

<strong>Apache</strong> <strong>Solr</strong> <strong>Reference</strong> <strong>Guide</strong> <strong>6.0</strong><br />

122

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!