11.05.2016 Views

Apache Solr Reference Guide Covering Apache Solr 6.0


Tokenizer to Filter: "XL-4000/ES"(1)

Out: "XL"(1), "4000"(2), "ES"(3), "XL4000ES"(3)

Example:

Using a protected words list that contains "AstroBlaster" and "XL-5000" (among others).

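A field type for this protected words example might be configured roughly as follows. This is a minimal sketch rather than the guide's own listing: it assumes the filter in question is solr.WordDelimiterFilterFactory, and the field type name `text_protected`, the whitespace tokenizer, and the file name `protwords.txt` are illustrative assumptions. The `protected` attribute names the file that holds the protected words.

```xml
<!-- Sketch only: names marked "assumed" are not taken from the guide text. -->
<fieldType name="text_protected" class="solr.TextField">  <!-- assumed name -->
  <analyzer>
    <!-- assumed tokenizer: splits on whitespace only, so "XL-5000" reaches the filter intact -->
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <!-- protwords.txt (assumed file name) lists AstroBlaster and XL-5000, one per line -->
    <filter class="solr.WordDelimiterFilterFactory" protected="protwords.txt"/>
  </analyzer>
</fieldType>
```

Tokens that match an entry in the protected words file pass through the filter without being split.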

In: "FooBar AstroBlaster XL-5000 ==ES-34-"

Tokenizer to Filter: "FooBar", "AstroBlaster", "XL-5000", "==ES-34-"

Out: "FooBar", "FooBar", "AstroBlaster", "XL-5000", "ES", "34"

Related Topics

TokenFilterFactories

CharFilterFactories

A Char Filter is a component that pre-processes input characters. Char Filters can be chained like Token Filters and placed in front of a Tokenizer. Char Filters can add, change, or remove characters while preserving the original character offsets to support features like highlighting.
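Chaining can be sketched as an analyzer with two charFilter elements ahead of the tokenizer. The sketch below is illustrative only: the choice of the two char filters, the mapping file name `mapping.txt`, and the standard tokenizer are assumptions, not a configuration taken from this guide.

```xml
<analyzer>
  <!-- Char filters run first, in the order listed, on the raw character stream -->
  <charFilter class="solr.HTMLStripCharFilterFactory"/>
  <!-- mapping.txt is a hypothetical file name -->
  <charFilter class="solr.MappingCharFilterFactory" mapping="mapping.txt"/>
  <!-- The tokenizer (assumed here) receives the already filtered characters -->
  <tokenizer class="solr.StandardTokenizerFactory"/>
</analyzer>
```

Because offsets are preserved, a highlighter can still point at the right positions in the original, unfiltered text.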

Topics discussed in this section:

solr.MappingCharFilterFactory

solr.HTMLStripCharFilterFactory

solr.ICUNormalizer2CharFilterFactory

solr.PatternReplaceCharFilterFactory

Related Topics

solr.MappingCharFilterFactory

This filter creates org.apache.lucene.analysis.charfilter.MappingCharFilter, which can be used for changing one string to another (for example, for normalizing é to e).

This filter requires specifying a mapping argument, which is the path and name of a file containing the mappings to perform.

Example:

[...]
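A configuration of this kind might look like the following minimal sketch. The mapping file name `mapping-FoldToASCII.txt` and the standard tokenizer are assumptions made for illustration; only the factory class and the required `mapping` argument come from the description above.

```xml
<analyzer>
  <!-- mapping is the required argument: the path and name of the mappings file (file name assumed) -->
  <charFilter class="solr.MappingCharFilterFactory" mapping="mapping-FoldToASCII.txt"/>
  <!-- assumed tokenizer; any tokenizer can follow the char filter -->
  <tokenizer class="solr.StandardTokenizerFactory"/>
</analyzer>
```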

Mapping file syntax:

Comment lines beginning with a hash mark (#), as well as blank lines, are ignored.
