12.07.2015 Views

Matching Schemas From Disparate Data Sources

Matching Schemas From Disparate Data Sources

Matching Schemas From Disparate Data Sources

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

• Name Matcher (Exact, Levenshtein distance )• Histogram Matcher (Kullback-Leibler distance)• Bayesian Classifier (NN to be integrated)[http://classifier4j.sourceforge.net/usage.htm]• Value Set Matcher• Different Tokenizers (Qgrams, non-word break,white-space break)• Type Matcher (calendar, phone #, zip code,general strings, numbers, ..)

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!