11.05.2016 Views

Apache Solr Reference Guide Covering Apache Solr 6.0


Factory class: solr.HyphenatedWordsFilterFactory

Arguments: None

Example:
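A minimal sketch of an analyzer chain using this filter. The whitespace tokenizer is an assumption (it is consistent with the token stream in this example, where "hyphen-" keeps its trailing hyphen), as is restricting the chain to index time:

```xml
<!-- Assumed configuration: tokenizer and analyzer type are illustrative choices -->
<analyzer type="index">
  <!-- Splits on whitespace only, so the trailing hyphen stays attached to "hyphen-" -->
  <tokenizer class="solr.WhitespaceTokenizerFactory"/>
  <!-- Rejoins a token ending in a hyphen with the token that follows it -->
  <filter class="solr.HyphenatedWordsFilterFactory"/>
</analyzer>
```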

In: "A hyphen- ated word"

Tokenizer to Filter: "A", "hyphen-", "ated", "word"

Out: "A", "hyphenated", "word"

ICU Folding Filter

This filter is a custom Unicode normalization form that applies the foldings specified in Unicode Technical Report 30 in addition to the NFKC_Casefold normalization form as described in ICU Normalizer 2 Filter. This filter is a better substitute for the combined behavior of the ASCII Folding Filter, Lower Case Filter, and ICU Normalizer 2 Filter.

To use this filter, see solr/contrib/analysis-extras/README.txt for instructions on which jars you need to add to your solr_home/lib.

Factory class: solr.ICUFoldingFilterFactory

Arguments: None

Example:
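A minimal sketch of an analyzer chain using this filter. The standard tokenizer is an assumption; any tokenizer could precede the filter, and the analysis-extras jars mentioned above must be on the classpath:

```xml
<!-- Assumed configuration: the tokenizer choice is illustrative -->
<analyzer>
  <tokenizer class="solr.StandardTokenizerFactory"/>
  <!-- Applies the UTR #30 foldings on top of NFKC_Casefold; takes no arguments -->
  <filter class="solr.ICUFoldingFilterFactory"/>
</analyzer>
```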

For detailed information on this normalization form, see http://www.unicode.org/reports/tr30/tr30-4.html.

ICU Normalizer 2 Filter

This filter factory normalizes text according to one of five Unicode Normalization Forms as described in Unicode Standard Annex #15:

NFC: (name="nfc" mode="compose") Normalization Form C, canonical decomposition, followed by canonical composition

NFD: (name="nfc" mode="decompose") Normalization Form D, canonical decomposition

NFKC: (name="nfkc" mode="compose") Normalization Form KC, compatibility decomposition, followed by canonical composition

NFKD: (name="nfkc" mode="decompose") Normalization Form KD, compatibility decomposition

NFKC_Casefold: (name="nfkc_cf" mode="compose") Normalization Form KC, with additional Unicode case folding. Using the ICU Normalizer 2 Filter with this form is a better-performing substitute for combining the Lower Case Filter with NFKC normalization.

Factory class: solr.ICUNormalizer2FilterFactory

Arguments:
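As a minimal sketch, the name and mode values listed above for each form could be passed to this factory as filter attributes in an analyzer chain. The standard tokenizer and the choice of NFKC_Casefold here are assumptions made for illustration:

```xml
<!-- Assumed configuration: tokenizer and selected form are illustrative -->
<analyzer>
  <tokenizer class="solr.StandardTokenizerFactory"/>
  <!-- name="nfkc_cf" with mode="compose" selects NFKC_Casefold, per the list above -->
  <filter class="solr.ICUNormalizer2FilterFactory" name="nfkc_cf" mode="compose"/>
</analyzer>
```

Swapping in name="nfc" with mode="decompose", for example, would select NFD instead.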
