11.05.2016 Views

Apache Solr Reference Guide Covering Apache Solr 6.0

21SiXmO

21SiXmO

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

for the simplest keyword searching through to complex queries on multiple fields and faceted search results. Sea<br />

rching has more information about searching and queries.<br />

If <strong>Solr</strong>'s capabilities are not impressive enough, its ability to handle very high-volume applications should do the<br />

trick.<br />

A relatively common scenario is that you have so much data, or so many queries, that a single <strong>Solr</strong> server is<br />

unable to handle your entire workload. In this case, you can scale up the capabilities of your application using So<br />

lrCloud to better distribute the data, and the processing of requests, across many servers. Multiple options can<br />

be mixed and matched depending on the type of scalability you need.<br />

For example: "Sharding" is a scaling technique in which a collection is split into multiple logical pieces called<br />

"shards" in order to scale up the number of documents in a collection beyond what could physically fit on a single<br />

server. Incoming queries are distributed to every shard in the collection, which respond with merged results.<br />

Another technique available is to increase the "Replication Factor" of your collection, which allows you to add<br />

servers with additional copies of your collection to handle higher concurrent query load by spreading the<br />

requests around to multiple machines. Sharding and Replication are not mutually exclusive, and together make<br />

<strong>Solr</strong> an extremely powerful and scalable platform.<br />

Best of all, this talk about high-volume applications is not just hypothetical: some of the famous Internet sites that<br />

use <strong>Solr</strong> today are Macy's, EBay, and Zappo's.<br />

For more information, take a look at https://wiki.apache.org/solr/PublicServers.<br />

A Step Closer<br />

You already have some idea of <strong>Solr</strong>'s schema. This section describes <strong>Solr</strong>'s home directory and other<br />

configuration options.<br />

When <strong>Solr</strong> runs in an application server, it needs access to a home directory. The home directory contains<br />

important configuration information and is the place where <strong>Solr</strong> will store its index. The layout of the home<br />

directory will look a little different when you are running <strong>Solr</strong> in standalone mode vs when you are running in<br />

<strong>Solr</strong>Cloud mode.<br />

The crucial parts of the <strong>Solr</strong> home directory are shown in these examples:<br />

Standalone Mode<br />

/<br />

solr.xml<br />

core_name1/<br />

core.properties<br />

conf/<br />

solrconfig.xml<br />

managed-schema<br />

data/<br />

core_name2/<br />

core.properties<br />

conf/<br />

solrconfig.xml<br />

managed-schema<br />

data/<br />

<strong>Apache</strong> <strong>Solr</strong> <strong>Reference</strong> <strong>Guide</strong> <strong>6.0</strong><br />

13

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!