11.05.2016 Views

Apache Solr Reference Guide Covering Apache Solr 6.0

21SiXmO

21SiXmO

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

In: "four score and twenty"<br />

Tokenizer to Filter: "four"(1), "score"(2), "and"(3), "twenty"(4)<br />

Out: "four"(1), "FR"(1), "score"(2), "SKR"(2), "and"(3), "ANT"(3), "twenty"(4), "TNT"(4)<br />

The phonetic tokens have a position increment of 0, which indicates that they are at the same position as the<br />

token they were derived from (immediately preceding).<br />

Example:<br />

Discard original token.<br />

<br />

<br />

<br />

<br />

In: "four score and twenty"<br />

Tokenizer to Filter: "four"(1), "score"(2), "and"(3), "twenty"(4)<br />

Out: "FR"(1), "SKR"(2), "ANT"(3), "TWNT"(4)<br />

Example:<br />

Default Soundex encoder.<br />

<br />

<br />

<br />

<br />

In: "four score and twenty"<br />

Tokenizer to Filter: "four"(1), "score"(2), "and"(3), "twenty"(4)<br />

Out: "four"(1), "F600"(1), "score"(2), "S600"(2), "and"(3), "A530"(3), "twenty"(4), "T530"(4)<br />

Porter Stem Filter<br />

This filter applies the Porter Stemming Algorithm for English. The results are similar to using the Snowball Porter<br />

Stemmer with the language="English" argument. But this stemmer is coded directly in Java and is not based<br />

on Snowball. It does not accept a list of protected words and is only appropriate for English language text.<br />

However, it has been benchmarked as four times faster than the English Snowball stemmer, so can provide a<br />

performance enhancement.<br />

Factory class: solr.PorterStemFilterFactory<br />

Arguments: None<br />

Example:<br />

<br />

<br />

<br />

<br />

<strong>Apache</strong> <strong>Solr</strong> <strong>Reference</strong> <strong>Guide</strong> <strong>6.0</strong><br />

132

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!