11.05.2016 Views

Apache Solr Reference Guide Covering Apache Solr 6.0

21SiXmO

21SiXmO

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

name: (string) The name of the normalization form; nfc, nfd, nfkc, nfkd,<br />

nfkc_cf<br />

mode: (string) The mode of Unicode character composition and decomposition; compose or decompose<br />

Example:<br />

<br />

<br />

<br />

<br />

For detailed information about these Unicode Normalization Forms, see http://unicode.org/reports/tr15/ .<br />

To use this filter, see solr/contrib/analysis-extras/README.txt for instructions on which jars you<br />

need to add to your solr_home/lib.<br />

ICU Transform Filter<br />

This filter applies ICU Tranforms to text. This filter supports only ICU System Transforms. Custom rule sets are<br />

not supported.<br />

Factory class: solr.ICUTransformFilterFactory<br />

Arguments:<br />

id: (string) The identifier for the ICU System Transform you wish to apply with this filter. For a full list of ICU<br />

System Transforms, see http://demo.icu-project.org/icu-bin/translit?TEMPLATE_FILE=data/translit_rule_main.ht<br />

ml.<br />

Example:<br />

<br />

<br />

<br />

<br />

For detailed information about ICU Transforms, see http://userguide.icu-project.org/transforms/general.<br />

To use this filter, see solr/contrib/analysis-extras/README.txt for instructions on which jars you<br />

need to add to your solr_home/lib.<br />

Keep Word Filter<br />

This filter discards all tokens except those that are listed in the given word list. This is the inverse of the Stop<br />

Words Filter. This filter can be useful for building specialized indices for a constrained set of terms.<br />

Factory class: solr.KeepWordFilterFactory<br />

Arguments:<br />

words: (required) Path of a text file containing the list of keep words, one per line. Blank lines and lines that<br />

begin with "#" are ignored. This may be an absolute path, or a simple filename in the <strong>Solr</strong> config directory.<br />

ignoreCase: (true/false) If true then comparisons are done case-insensitively. If this argument is true, then the<br />

words file is assumed to contain only lowercase words. The default is false.<br />

enablePositionIncrements: if luceneMatchVersion is 4.3 or earlier and enablePositionIncrement<br />

s="false" , no position holes will be left by this filter when it removes tokens. This argument is invalid if luc<br />

<strong>Apache</strong> <strong>Solr</strong> <strong>Reference</strong> <strong>Guide</strong> <strong>6.0</strong><br />

125

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!