11.05.2016 Views

Apache Solr Reference Guide Covering Apache Solr 6.0


Caverphone

To use this encoding in your analyzer, specify encoder="Caverphone" with the Phonetic Filter.

Caverphone is an algorithm created by the Caversham Project at the University of Otago. The algorithm is optimised for accents present in the southern part of the city of Dunedin, New Zealand.

See http://en.wikipedia.org/wiki/Caverphone and the Caverphone 2.0 specification at http://caversham.otago.ac.nz/files/working/ctp150804.pdf
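As a minimal sketch of how this might look in a schema, the field type below applies the Phonetic Filter with the Caverphone encoder. The type name text_caverphone and the choice of tokenizer are assumptions made for illustration. The attribute on solr.PhoneticFilterFactory that selects the algorithm is encoder, and inject="true" (the default) keeps the original token alongside its phonetic code.

```xml
<!-- Hypothetical field type; the name "text_caverphone" is illustrative only. -->
<fieldType name="text_caverphone" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <!-- Split the input into word tokens before encoding them. -->
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <!-- inject="true" adds the phonetic code as an extra token
         at the same position as the original token. -->
    <filter class="solr.PhoneticFilterFactory" encoder="Caverphone" inject="true"/>
  </analyzer>
</fieldType>
```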

Kölner Phonetik a.k.a. Cologne Phonetic

To use this encoding in your analyzer, specify encoder="ColognePhonetic" with the Phonetic Filter.

The Kölner Phonetik, an algorithm published by Hans Joachim Postel in 1969, is optimized for the German language.

See http://de.wikipedia.org/wiki/K%C3%B6lner_Phonetik
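One possible configuration, sketched under the assumption that a dedicated field holds only phonetic codes for German text: setting inject="false" replaces each token with its code instead of indexing both. The type name text_cologne is hypothetical.

```xml
<!-- Hypothetical field type for German names; "text_cologne" is illustrative only. -->
<fieldType name="text_cologne" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <!-- inject="false" replaces each token with its Cologne Phonetic code,
         so matching on this field is purely phonetic. -->
    <filter class="solr.PhoneticFilterFactory" encoder="ColognePhonetic" inject="false"/>
  </analyzer>
</fieldType>
```

Because the same analyzer runs at index and query time, a query term is encoded the same way before it is compared with the indexed codes.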

NYSIIS

To use this encoding in your analyzer, specify encoder="Nysiis" with the Phonetic Filter.

NYSIIS is an encoding used to relate similar names, but it can also be used as a general-purpose scheme to find words with similar phonemes.

See http://en.wikipedia.org/wiki/NYSIIS and http://www.dropby.com/NYSIIS.html
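Since NYSIIS is aimed at names, one way to use it is to keep the original field untouched and copy its content into a second, phonetically encoded field. The sketch below assumes a source field called name already exists in the schema; the names text_nysiis and name_phonetic are hypothetical.

```xml
<!-- Hypothetical field type; "text_nysiis" is illustrative only. -->
<fieldType name="text_nysiis" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.PhoneticFilterFactory" encoder="Nysiis" inject="false"/>
  </analyzer>
</fieldType>

<!-- Hypothetical search-only field that receives a copy of "name". -->
<field name="name_phonetic" type="text_nysiis" indexed="true" stored="false"/>
<copyField source="name" dest="name_phonetic"/>
```

Queries for sound-alike matches would then target name_phonetic, while exact matching continues to use name.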

Running Your Analyzer

Once you've defined a field type in your schema and specified the analysis steps that you want applied to it, you should test it to make sure that it behaves the way you expect. Luckily, the Solr admin interface has a very handy page that lets you do just that. You can invoke the analyzer for any text field, provide sample input, and display the resulting token stream.

For example, let's look at some of the "Text" field types available in the "bin/solr -e techproducts" example configuration, and use the Analysis Screen (http://localhost:8983/solr/#/techproducts/analysis) to compare how the tokens produced at index time for the sentence "Running an Analyzer" match up with a slightly different query text of "run my analyzers".

We can begin with "text_ws", one of the simplest Text field types available:
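For orientation, the text_ws type in the techproducts example schema looks roughly like the following. This is a sketch that assumes the stock configuration is unchanged; attribute values such as positionIncrementGap may differ in your copy.

```xml
<!-- Assumed definition of text_ws in the stock techproducts schema. -->
<fieldType name="text_ws" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <!-- Splits on whitespace only: no lowercasing, stemming, or stop-word removal. -->
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
  </analyzer>
</fieldType>
```

With only a whitespace tokenizer, "Running an Analyzer" yields the tokens Running, an, and Analyzer, while "run my analyzers" yields run, my, and analyzers. None of the query tokens is identical to an index token, so this field type produces no matches between the two.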
