09.07.2015 Views

Breaking down Data Silos - SemTech 2011


Why is OData Needed?


"Today, the tools for capturing data both at the mega-scale and at the milli-scale are just dreadful. After you have captured the data, you need to curate it before you can start doing any kind of data analysis, and we lack good tools for both data curation and data analysis."
Quote - Jim Gray


What is OData?


Requirements
Web Friendly, just plain HTTP
Uniform Patterns
Focus on Data, not on Formats
Stay High-Level, Abstract the Store


Insight from data
“Self-service business intelligence”
Let people find and use their own data
…but where’s the data?
Broad direct database access is impractical
OData feeds expose data over application logic
Reports often represent the best data sources
All reports in Reporting Services are now OData feeds


Data Sharing at Large
Government data, e.g. OGDI
Scientific data, e.g. OSDI
Social data
Commercial public data, e.g. Netflix
Premium data, DataMarkets


Consistent, Up to Date


Scale to Large Datasets


Create Web API to be Consumed with Ease


Maximize reach, focus investments
Share libraries, browsers, applications


Monetize Data


Data in modern web applications


Where did OData come from?


Is OData RESTful?


OData
Uniform way of representing structured data: Atom, JSON formats
Uniform URL conventions: navigation, filtering, sorting, paging, etc.
Uniform operations: addressability; GET, POST, PUT, DELETE always mean the same
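The uniform URL conventions above can be illustrated with a small sketch that assembles a query URL from the OData system query options `$filter`, `$orderby`, `$top`, and `$skip`. The helper name `build_query_url`, the `Products` entity set, and the property names `Rating` and `Price` are assumptions made for illustration; the slides do not show them.

```python
from urllib.parse import quote, urlencode


def build_query_url(service_root, entity_set, filter_expr=None,
                    orderby=None, top=None, skip=None):
    """Build an OData query URL (hypothetical helper, not a library API)."""
    options = {}
    if filter_expr is not None:
        options["$filter"] = filter_expr   # filtering
    if orderby is not None:
        options["$orderby"] = orderby      # sorting
    if top is not None:
        options["$top"] = str(top)         # paging: page size
    if skip is not None:
        options["$skip"] = str(skip)       # paging: offset

    url = service_root.rstrip("/") + "/" + entity_set
    if options:
        # Keep "$" readable in option names; encode spaces as %20.
        url += "?" + urlencode(options, safe="$", quote_via=quote)
    return url


# Assumed entity set and property names, for illustration only.
print(build_query_url("http://services.odata.org/OData.svc", "Products",
                      filter_expr="Rating ge 3", orderby="Price desc",
                      top=10, skip=20))
```

Because every service follows the same conventions, a helper like this would work unchanged against any feed; only the service root and entity set differ.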


AtomPub
The name Atom applies to a pair of related standards. The Atom Syndication Format is an XML language used for web feeds, while the Atom Publishing Protocol (AtomPub) is a simple HTTP-based protocol for creating and updating web resources.


JSON


The Meta of OData


Let's dig into the OData Protocol


The Basics
Feeds, which are Collections of typed Entries
OData services can expose Service Operations
OData services expose all these constructs via URIs
An OData service may also expose a Service Metadata Document
OData has the ability for validation and server-side security
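Since all of these constructs are exposed via URIs, a short sketch can show how the addresses relate to one another. It assumes the usual OData conventions: the Service Metadata Document lives at `$metadata` under the service root, and a single Entry is addressed by its key in parentheses. The function name `service_uris` and the `Products` entity set are hypothetical.

```python
def service_uris(service_root, entity_set, key):
    """Return the URIs for the basic OData constructs (illustrative only)."""
    root = service_root.rstrip("/")
    return {
        # Service document: lists the feeds the service exposes.
        "service_document": root + "/",
        # Service Metadata Document: describes entity types and properties.
        "metadata": root + "/$metadata",
        # Feed: a collection of typed entries.
        "feed": f"{root}/{entity_set}",
        # Entry: one item in the feed, addressed by its key.
        "entry": f"{root}/{entity_set}({key})",
    }


# Assumed entity set and key value, for illustration only.
for name, uri in service_uris("http://services.odata.org/OData.svc",
                              "Products", 1).items():
    print(f"{name}: {uri}")
```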


Datatypes
Binary, Boolean, Byte, DateTime, DateTimeOffset, Time, Decimal, Double, Single, Guid, Int16, Int32, Int64, String


Full SQL like Query “Language”
HTTP Verb    SQL Command
GET          SELECT
PUT          UPDATE
POST         INSERT
DELETE       DELETE
             MERGE
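The verb-to-command analogy above can be written down as a simple lookup table. This is only a sketch of the analogy, not of how any OData server is implemented; the names `VERB_TO_SQL` and `sql_analogue` are made up for illustration, and MERGE is left out because the slide does not show what it pairs with.

```python
# HTTP verb -> analogous SQL command, as paired on the slide.
VERB_TO_SQL = {
    "GET": "SELECT",     # read entries
    "PUT": "UPDATE",     # replace an existing entry
    "POST": "INSERT",    # create a new entry
    "DELETE": "DELETE",  # remove an entry
}


def sql_analogue(http_verb):
    """Return the SQL command that plays the same role as the HTTP verb."""
    return VERB_TO_SQL[http_verb.upper()]


for verb in ("GET", "PUT", "POST", "DELETE"):
    print(f"{verb} is to HTTP what {sql_analogue(verb)} is to SQL")
```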


Query Navigation


Service Operations (Example)
http://services.odata.org/OData.svc/GetProductsByRating?rating=5
Identifies the "GetProductsByRating" Service Operation and specifies a value of 5 for the "rating" input parameter.
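To make the structure of that URL concrete, the sketch below splits it into the Service Operation name and its input parameters using only the Python standard library. The helper name `parse_service_operation` is an assumption for illustration, and parameter values are returned as plain strings with no OData type handling.

```python
from urllib.parse import parse_qs, urlsplit


def parse_service_operation(url):
    """Split a Service Operation URL into (operation name, parameters)."""
    parts = urlsplit(url)
    # The operation name is the last segment of the path.
    name = parts.path.rstrip("/").rsplit("/", 1)[-1]
    # parse_qs returns a list per key; keep the first value of each.
    params = {key: values[0] for key, values in parse_qs(parts.query).items()}
    return name, params


name, params = parse_service_operation(
    "http://services.odata.org/OData.svc/GetProductsByRating?rating=5"
)
print(name)    # GetProductsByRating
print(params)  # {'rating': '5'}
```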


Where is OData Used?


Who is using OData?


Where can you consume it?


Wrap Up
OData.org
ODataPrimer.com


The Fourth Paradigm: Data-Intensive Scientific Discovery


http://deepfriedbytes.com
