15.07.2016 Views

MARKLOGIC SERVER

Inside-MarkLogic-Server


CONNECTOR FOR SHAREPOINT

Microsoft SharePoint is a popular system for document management. MarkLogic offers (and supports) a Connector for SharePoint that integrates with SharePoint, providing more advanced access to the documents held within the system. The connector lets you mirror the SharePoint documents in MarkLogic for search, assembly, and reuse, or it lets MarkLogic act as a node in a SharePoint workflow.

DOCUMENT FILTERS

Built into MarkLogic behind the unassuming xdmp:document-filter() function is a robust system for extracting metadata and text from binary documents that handles hundreds of document formats. You can filter office documents, emails, database dumps, movies, images, and other multimedia formats, and even archive files. The filter process doesn't attempt to convert these documents to a rich XML format, but instead extracts the standard metadata and whatever text is within the files. It's great for search, classification, or other text-processing needs. For richer extraction (such as feature identification in an image or transcribing a movie) there are third-party tools.

LIBRARY SERVICES API

Library Services offers an interface for managing documents, letting you do check-in/check-out and versioning. You can combine the Library Services features with role-based security and the Search API to build a content management system on top of MarkLogic.

COMMUNITY-SUPPORTED TOOLS, LIBRARIES, AND PLUG-INS

The MarkLogic Developer Site (http://developer.marklogic.com) also hosts or references a number of highly useful projects. Many are built collaboratively on GitHub (https://github.com/marklogic), where you can contribute to their development if you are so inclined.

Converter for MongoDB

A Java-based tool for importing data from MongoDB into MarkLogic. It reads JSON data from MongoDB's mongodump tool and loads data into MarkLogic using an XDBC Server.

Corb2

A Java-based tool designed for bulk content reprocessing of documents stored in MarkLogic. It works off of a list of database documents and performs operations against them. Operations can include generating a report across all documents, manipulating individual documents, or a combination thereof.
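The pattern described here (build a list of documents, then run an operation against each one) can be illustrated with a small sketch. This is not Corb2 code and does not use any Corb2 or MarkLogic API: the in-memory `database` dictionary, the function names `select_uris`, `process`, and `run_batch`, and the sample URIs are all hypothetical stand-ins chosen for illustration.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical in-memory stand-in for a document database: URI -> content.
database = {
    "/docs/a.xml": "<doc><title>alpha</title></doc>",
    "/docs/b.xml": "<doc><title>beta</title></doc>",
    "/notes/c.txt": "plain text note",
}


def select_uris(db, prefix):
    """Selection step: build the list of document URIs to work from."""
    return sorted(uri for uri in db if uri.startswith(prefix))


def process(db, uri):
    """Per-document step: modify one document and return a report row."""
    updated = db[uri].replace("<doc>", '<doc reviewed="true">', 1)
    db[uri] = updated
    return uri, len(updated)


def run_batch(db, prefix, workers=4):
    """Apply process() to every selected URI and collect the report rows."""
    uris = select_uris(db, prefix)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map returns results in the same order as the input list.
        return list(pool.map(lambda uri: process(db, uri), uris))


if __name__ == "__main__":
    for uri, size in run_batch(database, "/docs/"):
        print(uri, size)
```

The sketch combines both kinds of operation mentioned above: each document is manipulated individually, and the returned rows form a simple report across all processed documents.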
