You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
CONNECTOR FOR SHAREPOINT<br />
Microsoft SharePoint is a popular system for document management. MarkLogic offers<br />
(and supports) a Connector for SharePoint that integrates with SharePoint, providing<br />
more advanced access to the documents held within the system. The connector lets you<br />
mirror the SharePoint documents in MarkLogic for search, assembly, and reuse, or it<br />
lets MarkLogic act as a node in a SharePoint workflow.<br />
DOCUMENT FILTERS<br />
Built into MarkLogic behind the unassuming xdmp:document-filter() function is a<br />
robust system for extracting metadata and text from binary documents that handles<br />
hundreds of document formats. You can filter office documents, emails, database<br />
dumps, movies, images, and other multimedia formats, and even archive files. The filter<br />
process doesn't attempt to convert these documents to a rich XML format, but instead<br />
extracts the standard metadata and whatever text is within the files. It's great for search,<br />
classification, or other text-processing needs. For richer extraction (such as feature<br />
identification in an image or transcribing a movie) there are third-party tools.<br />
LIBRARY SERVICES API<br />
Library Services offers an interface for managing documents, letting you do check-in/<br />
check-out and versioning. You can combine the Library Services features with rolebased<br />
security and the Search API to build a content management system on top<br />
of MarkLogic.<br />
COMMUNITY-SUPPORTED TOOLS, LIBRARIES,<br />
AND PLUG-INS<br />
The MarkLogic Developer Site (http://developer.marklogic.com) also hosts or<br />
references a number of highly useful projects. Many are built collaboratively on GitHub<br />
(https://github.com/marklogic), where you can contribute to their development if you<br />
are so inclined.<br />
Converter for MongoDB<br />
A Java-based tool for importing data from MongoDB into MarkLogic. It reads JSON<br />
data from MongoDB's mongodump tool and loads data into MarkLogic using an<br />
XDBC Server.<br />
Corb2<br />
A Java-based tool designed for bulk content reprocessing of documents stored in<br />
MarkLogic. It works off of a list of database documents and performs operations<br />
against them. Operations can include generating a report across all documents,<br />
manipulating individual documents, or a combination thereof.<br />
123