11.05.2016 Views

Apache Solr Reference Guide Covering Apache Solr 6.0


Arguments:

maxTokenLength: (integer, default 255) Solr ignores tokens that exceed the number of characters specified by maxTokenLength.

Example:

In: "Please, email john.doe@foo.com by 03-09, re: m37-xq."

Out: "Please", "email", "john.doe", "foo.com", "by", "03", "09", "re", "m37", "xq"
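A minimal sketch of an analyzer definition that would produce the output above. It assumes the arguments in this section belong to the Standard Tokenizer, whose factory class solr.StandardTokenizerFactory is not named in the text above, and that maxTokenLength is left at its default:

```xml
<analyzer>
  <!-- Assumed factory: the In/Out pair above matches Standard Tokenizer behavior -->
  <tokenizer class="solr.StandardTokenizerFactory"/>
</analyzer>
```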

Classic Tokenizer

The Classic Tokenizer preserves the behavior of the Standard Tokenizer in Solr 3.1 and earlier versions. It does not use the Unicode Standard Annex #29 (UAX#29) word boundary rules that the Standard Tokenizer uses. This tokenizer splits the text field into tokens, treating whitespace and punctuation as delimiters. Delimiter characters are discarded, with the following exceptions:

Periods (dots) that are not followed by whitespace are kept as part of the token.

Words are split at hyphens, unless there is a number in the word, in which case the token is not split and the numbers and hyphen(s) are preserved.

Internet domain names and email addresses are recognized and preserved as a single token.

Factory class: solr.ClassicTokenizerFactory

Arguments:

maxTokenLength: (integer, default 255) Solr ignores tokens that exceed the number of characters specified by maxTokenLength.

Example:

In: "Please, email john.doe@foo.com by 03-09, re: m37-xq."

Out: "Please", "email", "john.doe@foo.com", "by", "03-09", "re", "m37-xq"
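The example above corresponds to an analyzer along these lines. This is a minimal sketch using the factory class named in this section, with maxTokenLength written out at its documented default of 255 purely for illustration:

```xml
<analyzer>
  <!-- maxTokenLength="255" is the documented default and could be omitted -->
  <tokenizer class="solr.ClassicTokenizerFactory" maxTokenLength="255"/>
</analyzer>
```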

Keyword Tokenizer

This tokenizer treats the entire text field as a single token.

Factory class: solr.KeywordTokenizerFactory

Arguments: None

Example:
