15.07.2016 Views

MARKLOGIC SERVER

Inside-MarkLogic-Server

Inside-MarkLogic-Server

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

LOW-LEVEL SYSTEM CONTROL<br />

When scaling a system, there are a few administrative settings you can adjust that<br />

control how MarkLogic operates and can, if used judiciously, improve your loading and<br />

query performance.<br />

1. "Last Modified" and "Directory Last Modified" options: These options<br />

track the last modified time of each document and each directory by updating<br />

its property sheet after each change. This is convenient information to have, but<br />

it incurs an extra fragment write for every document update. Turning off these<br />

options saves that bit of work. Starting with MarkLogic 7, these options are disabled<br />

by default, but in earlier versions and in systems upgraded from earlier versions they<br />

will likely be enabled.<br />

2. "Directory Creation" option: When set to "automatic," MarkLogic creates<br />

directory entries automatically, so if you create a document with the path /x/y/z.<br />

xml, MarkLogic will create the directory entries for /x/ and /x/y/ if they don't<br />

already exist. When it's set to "manual," you have to create those directories<br />

yourself if you want them to exist. Directories are important when accessing a<br />

database via WebDAV, where the database has to look and behave like a filesystem,<br />

and they're important if you're doing directory-based queries, but they otherwise<br />

aren't necessary. Setting directory creation to "manual" saves the ingestion overhead<br />

of checking if the directories mentioned in the path already exist and the work<br />

of creating them if they don't. This option is by default disabled starting with<br />

MarkLogic 7.<br />

3. "Locking" options: MarkLogic's locking strategy ensures that documents in a<br />

database have distinct URIs as they are ingested. What setting you need depends<br />

on your database's assignment policy, described in the "Rebalancing" section. The<br />

default bucket and legacy policies require "fast" locking, which means only the<br />

URIs for existing documents are checked for uniqueness. The statistical and range<br />

policies require "strict" locking, which checks the URIs for both new and existing<br />

documents. If you're absolutely certain that all of the documents you're loading<br />

have distinct URIs, as often happens with a bulk load, you can relax this restriction<br />

and set locking to "off."<br />

4. "Journaling" option: MarkLogic supports three journaling options. The "strict"<br />

option considers a transaction committed only after it's been journaled and the<br />

journal has been fully flushed to disk. This protects against MarkLogic Server<br />

process failures, host operating system kernel failures, and host hardware failures.<br />

This option is always used with forests configured for Shared-Disk Failover. The<br />

"fast" option considers a transaction committed after it's been journaled, but the<br />

119

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!