11.05.2016 Views

Apache Solr Reference Guide Covering Apache Solr 6.0


Updates and deletes are first written to the Source cluster, then forwarded to the Target cluster. The data flow sequence is:

1. A shard leader receives a new data update that is processed by its Update Processor.
2. The data update is first applied to the local index.
3. Upon successful application of the data update on the local index, the data update is added to the Updates Log queue.
4. After the data update is persisted to disk, the data update is sent to the replicas within the Data Center.
5. After Step 4 is successful, CDCR reads the data update from the Updates Log and pushes it to the corresponding collection in the Target Data Center. This is necessary in order to ensure consistency between the Source and Target Data Centers.
6. The leader on the Target Data Center writes the data locally and forwards it to all its followers.
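The ordering of the six steps can be illustrated with a small in-memory model. This is only a sketch under stated assumptions, not Solr code: the `Node` and `Leader` classes, the `forward_to_target` function, and the use of plain lists for the index and the Updates Log are all hypothetical stand-ins chosen for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """Toy stand-in for a Solr core: the index is just a list of updates."""
    name: str
    index: list = field(default_factory=list)

    def apply(self, update):
        self.index.append(update)


@dataclass
class Leader(Node):
    """Toy shard leader with replicas and an update log (hypothetical model)."""
    replicas: list = field(default_factory=list)
    update_log: list = field(default_factory=list)  # stand-in for the Updates Log

    def receive(self, update):
        # Steps 1-2: the leader processes the update and applies it locally.
        self.apply(update)
        # Step 3: after the local apply succeeds, queue it in the update log.
        self.update_log.append(update)
        # Step 4: send the update to the replicas in the same data center.
        for replica in self.replicas:
            replica.apply(update)


def forward_to_target(source, target, start):
    """Steps 5-6: read log entries from position `start` and push them to the
    target leader, which writes locally and forwards to its followers.
    Returns the new log position."""
    pending = source.update_log[start:]
    for update in pending:
        target.receive(update)
    return start + len(pending)
```

In this model, calling `source.receive(update)` completes Steps 1 to 4 before returning, while `forward_to_target` is a separate call that can run later, which mirrors the split between the synchronous and asynchronous parts of the flow.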

Steps 1, 2, 3 and 4 are performed synchronously by SolrCloud; Step 5 is performed asynchronously by a background thread. Given that CDCR replication is performed asynchronously, it becomes possible to push batch updates in order to minimize network communication overhead. Also, if CDCR is unable to push the update at a given time (for example, due to a degradation in connectivity), it can retry later without any impact on the Source Data Center.
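A minimal sketch of how batching and retry can coexist with an append-only log follows. It is an illustration under assumptions, not CDCR's implementation: `push_pending`, the `send_batch` callable, the `cursor` position, and the default `batch_size` are hypothetical names and values.

```python
def push_pending(update_log, cursor, send_batch, batch_size=2):
    """Push log entries from position `cursor` onward in batches.

    `send_batch` is a hypothetical callable that raises ConnectionError
    when the target is unreachable. Returns the new cursor position.
    """
    while cursor < len(update_log):
        batch = update_log[cursor:cursor + batch_size]
        try:
            send_batch(batch)
        except ConnectionError:
            # Keep the cursor unchanged so the next call retries this batch.
            # The log itself is untouched, so the source side is unaffected.
            break
        cursor += len(batch)
    return cursor
```

Because the function only reads from the log and advances a position on success, a failed push costs nothing on the source side: a later call simply resumes from the same position.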

One implication of the architecture is that the leaders in the Source cluster must be able to "see" the leaders in the Target cluster. Since leaders may change, this effectively means that all nodes in the Source cluster must be able to "see" all Solr nodes in the Target cluster, so firewalls, ACL rules, etc. must be configured with care.
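To make the scale of that requirement concrete, the following sketch enumerates every source-to-target route that would need to be permitted. The `required_routes` function and the host names in the usage note are hypothetical; the only point is that the rule set is the full cross product of the two node lists.

```python
from itertools import product


def required_routes(source_nodes, target_nodes):
    """Return every (source, target) pair that must be reachable.

    Any source node may become a leader and any target node may become a
    leader, so each source node needs a route to each target node.
    """
    return sorted(product(source_nodes, target_nodes))
```

For example, `required_routes(["src-a", "src-b"], ["tgt-a", "tgt-b"])` yields four pairs, and the count grows as the product of the two cluster sizes.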

Major Components

There are a number of key features and components in CDCR’s architecture:
