12.07.2015 Views

Matching Schemas From Disparate Data Sources

Matching Schemas From Disparate Data Sources

Matching Schemas From Disparate Data Sources

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

• Information theory-based:– Column names can be opaque!– Construct dependency graph using mutualinformation between columns– Reduces to a graph matching problem[Kang and Naughton SIGMOD03]• Rule-based (iMAP):– Search space of all possible matches“efficiently”– Apply rules that reflect domain constraints[Dhamanker, Lee, Doan et al SIGMOD04]

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!