NUI Galway – UL Alliance First Annual ENGINEERING AND - ARAN ...

More documents

Recommendations

Info

Provenance in the Web of Data: a building block for user profiling and trust in online communities Fabrizio Orlandi, Alexandre Passant Digital Enterprise Research Institute National University of Ireland, Galway fabrizio.orlandi@deri.org - alexandre.passant@deri.org Abstract Online collaborative knowledge bases such as Wikipedia provide an extensive source of information, not only to their readers, but also to a wide range of applications and Web services. For example, DBpedia, one of the largest datasets on the Web of Data, is widely used as a reference for data interlinking and as a basis for applications employing Semantic Web technologies. Yet its dataset, directly derived from Wikipedia articles, could contain errors due to inexperience or anonymity of the contributors. By analysing the Wikipedia edit history and the users' contributions we provide detailed provenance information for DBpedia statements and we make this information publicly available on the Web of Data. The dataset we provide is then fundamental for analysing users' activities/interests and computing trust measures. Collaborative websites such as Wikipedia have recently shown the benefit of being able to create and manage very large public knowledge bases. However, one of the most common concerns about these types of information sources is the trustworthiness of their content which can be arbitrarily edited by everyone. The DBpedia project 1 , which aims at converting Wikipedia content into structured knowledge, is then not exempt from this concern. Especially considering that one of the main objectives of DBpedia is to build a dataset such that Semantic Web technologies can be employed against it. Hence this allows not only to formulate sophisticated queries against Wikipedia, but also to link it to other datasets on the Web, or create new applications or mashups. Thanks to its large dataset and its cross-domain nature DBpedia has become one of the most important and interlinked datasets on the Web of Data. Therefore providing information about where DBpedia data comes from and how it was extracted and processed is crucial. This type of information is called provenance and it describes the entire data life cycle, from its origin to its subsequent processing history. Having provenance information about Wikipedia data allows us to identify quality measures for Wikipedia articles and estimate the trustworthiness of their content. Then, since the DBpedia content is directly extracted from Wikipedia, the same trust and quality values can be propagated to the DBpedia dataset. We apply this process to DBpedia, but this is just one particular use-case, the same considerations about provenance are suitable for every dataset on the Web of Data. The benefits of using data provenance to develop trust on the Web, and the Semantic Web in particular, have been already widely described in the state of the art. Provenance data provides useful information such as timeliness and authorship of data. It can be used as a ground basis for various applications and use cases such as identifying trust values for pages or pages fragments, or measuring users' expertise by analysing their contributions and then personalize trust metrics based on the user profile of a person on a particular topic. Moreover, providing also provenance meta-data as RDF and making it available on the Web of Data offers more interchange possibilities and transparency. This would let people link to provenance information from other sources. It provides them the opportunity to compare these sources and choose the most appropriate one or the one with higher quality. In our specific context of DBpedia for example, by indicating by whom and when a RDF triple was created (or contributed by), it could let any application flag, reject or approve this statement based on particular criteria. In our work [1][2] we propose a modelling solution to semantically represent information about provenance of data in DBpedia and an extraction framework capable of computing provenance for DBpedia statements using Wikipedia edits. The framework consists of: (i) a lightweight modelling solution to semantically represent provenance of both DBpedia resources and Wikipedia content, (ii) an information extraction process and a provenance-computation system combining Wikipedia articles' history with DBpedia information, (iii) a set of scripts to make provenance information about DBpedia statements directly available when browsing this source, (iv) a publicly available web service that exposes in RDF as Linked Open Data our provenance dataset letting software agents and developers consume it. References [1] Orlandi F., Champin P-A., Passant A., “Semantic Representation of Provenance in Wikipedia,”, Semantic Web Provenance Management workshop at ISWC2010, CEUR- WS, Shanghai, 2010. [2] Orlandi F., Passant A., “Modelling Provenance of DBpedia Resources Using Wikipedia Contributions”, Journal of Web Semantics, (to be published), 2011. * This work has been funded in part by Science Foundation Ireland under Grant No. SFI/08/CE/I1380 (Lion-2) and by an IRCSET Scholarship. 1 http://dbpedia.org/ 97
Supporting Online Shopping with Opinion Mining Abstract Consumers often find product reviews very valuable. In online shopping, opinions that are expressed in product reviews are available in the form of unstructured text. Existing shopping websites offer search tools suited to structured product information, thus customers looking for product opinions are forced to perform time-consuming analyses manually. This work proposes a method for seamless integration of unstructured information available in product reviews with structured product descriptions using opinion mining. We demonstrate applicability of our approach with a used car product search tool using real data. 1. Introduction Many online shopping decisions are made after consulting other customers’ opinions. This effect is especially visible in travel bookings (97,7%) where 77.9% decision involve the use of customer reviews as a source of information [1]. Consulting reviews requires significant amount of additional effort from customers. This work proposes new method for extraction of valuable product information from customer reviews and its integration with structured product descriptions. 2. The Method An opinion mining system needs to fulfill three generic tasks [2]: identification of the product features, discovery of opinion phrases, and sentiment analysis. In our method (see [3] for details), the first of the tasks is performed using domain knowledge and data from popular websites offering semi-structured car reviews. We use a rule-based shallow-parsing method for extraction of potential opinion statements. The rules are constructed to extract a consistent fragment of the sentence that contains a feature and the sentiment about the feature. Opinion statements are further matched with lists of opinion words. In comparison to other approaches our method considers not only nouns as features and not only adjectives as opinions. Our approach deals with sentiment analysis on three levels: word level, chunk level, and context dependant chunk level. To assess the sentiment we use an approach similar to [4], where lists of adjectives, nouns, verbs and adverbs with positive and negative sentiment were created, combining to the total word sentiment. Opinion context is modeled with utility theory [5] as the features were divided in three classes: cost-type - with preference toward lower values (e.g. price); benefit-type - higher values are preferred (e.g. reliability); neutral – the character of a feature is context dependant. Maciej Dabrowski Digital Enterprise Research Institute National University of Ireland Galway, Ireland maciej.dabrowski@deri.org 98 Figure 1 An example of a used car shopping website presenting product offers extended with structured attributes extracted from free-text customer reviews. The discussed method is implemented in a shopping website (see Fig. 1) that demonstrates seamless integration of structured product information (e.g. price) with unstructured customer opinions. 3. Conclusions We presented an opinion mining system that extracts and integrates opinions about products and features from very informal, noisy text data (product reviews) using a hierarchy of features from a number of websites and domain knowledge. Our method is of value not only to shopping service providers and potential customers but also to product manufacturers. 4. References [1] U. Gretzel and K. H. Yoo, "Use and Impact of Online Travel Reviews " in Information and Communication Technologies in Tourism Innsbruck, Austria, 2008, pp. 35-46. [2] A. M. Popescu, B. Nguyen, and O. Etzioni, "OPINE: extracting product features and opinions from reviews," in Proceedings of HLT/EMNLP on Interactive Demonstrations, 2005, pp. Association for Computational Linguistics--33. [3] M. Dabrowski, P. Jarzebowski, T. Acton, and S. O'Riain, "Improving Customer Decisions Using Product Reviews: CROM - Car Review Opinion Miner," in 6th International Conference on Web Information Systems and Technologies Valencia, Spain: Springer, 2010. [4] X. Ding, B. Liu, and P. S. Yu, "A holistic lexicon-based approach to opinion mining," in WSDM '08: Proceedings of the international conference on Web search and web data mining, 2008, pp. ACM--240. [5] J. Butler, D. J. Morrice, and P. W. Mullarkey, "A multiple attribute utility theory approach to ranking and selection," Management Science, vol. 47, pp. 800-816, Jun 2001
Page 1 and 2:
NUI Galway - UL Alliance First Annu
Page 4 and 5:
FULL TABLE OF CONTENTS 1 GAMES, VIS
Page 6 and 7:
4 MECHANICAL AND BIOMEDICAL ENGINEE
Page 8 and 9:
5.21 Detecting Topics and Events in
Page 10 and 11:
8.7 Modelling Extreme Flood Events
Page 12 and 13:
GAMES, VISUALISATION & EDUCATION 1.
Page 14 and 15:
Generation and Analysis of Graph St
Page 16 and 17:
Evolution and Analysis of Strategie
Page 18 and 19:
Abstract The delivery of multimedia
Page 20 and 21:
Applications of Reinforcement Learn
Page 22 and 23:
Assessing the effects of interactiv
Page 24 and 25:
Real-time depth map generation usin
Page 26 and 27:
An analysis of the capability of pr
Page 28 and 29:
Building Information Modelling duri
Page 30 and 31:
Dwelling Energy Measurement Procedu
Page 32 and 33:
Numerical Modelling of Tidal Turbin
Page 34 and 35:
Energy Storage using Microencapsula
Page 36 and 37:
Data Centre Energy Efficiency Mark
Page 38 and 39:
An embodied energy and carbon asses
Page 40 and 41:
SmartOp - Smart Buildings Operation
Page 42 and 43:
Ocean Wave Energy Exploitation in D
Page 44 and 45:
Future Smart Grid Synchronization C
Page 46 and 47:
Web-Based Building Energy Usage Vis
Page 48 and 49:
Image Recognition and Classificatio
Page 50 and 51:
Android Based Multi-Feature Elderly
Page 52 and 53:
Determining Subjects’ Activities
Page 54 and 55:
New Analysis Techniques for ICU Dat
Page 56 and 57:
National E-Prescribing Systems in I
Page 58 and 59: Using Mashups to Satisfy Personalis
Page 60 and 61: 3D Computational Modeling of Blood
Page 62 and 63: Experimental and Computational Inve
Page 64 and 65: Experimental Analysis of the Therma
Page 66 and 67: Simulating Actin Cytoskeleton Remod
Page 68 and 69: Computational Analysis of Transcath
Page 70 and 71: An In vitro Shear Stress System for
Page 72 and 73: Development of a Micropipette Aspir
Page 74 and 75: A Computational Test-Bed to Examine
Page 76 and 77: Computational Modeling of Ceramic-b
Page 78 and 79: Multi-Scale Computational Modelling
Page 80 and 81: Development of a mixed-mode cohesiv
Page 82 and 83: Active Computational Modelling of C
Page 84 and 85: Modelling the Management of Medical
Page 86 and 87: SOCIAL MEDIA, SEARCH & RECOMMENDATI
Page 88 and 89: Improving Twitter Search by Removin
Page 90 and 91: Abstract The goal of this research
Page 92 and 93: Generalized Blockmodeling Samantha
Page 94 and 95: Life-Cycles and Mutual Effects of S
Page 96 and 97: dcat: Searching Public Sector Infor
Page 98 and 99: The Effect of User Features on Chur
Page 100 and 101: User Similarity and Interaction in
Page 102 and 103: Improving Categorisation in Social
Page 104 and 105: Natural Language Queries on Enterpr
Page 106 and 107: Studying Forum Dynamics from a User
Page 110 and 111: Towards Social Descriptions of Serv
Page 112 and 113: ENVIRONMENTAL ENGINEERING 6.1 Asses
Page 114 and 115: Novel Agri-engineering solutions fo
Page 116 and 117: Evaluation of amendments to control
Page 118 and 119: Determination of optimal applicatio
Page 120 and 121: Treatment of Piggery Wastewaters us
Page 122 and 123: NEXT GENERATION INTERNET 7.1 Extens
Page 124 and 125: Enabling Federation of Government M
Page 126 and 127: Curated Entities for Enterprise Uma
Page 128 and 129: Mobile Web + Social Web + Semantic
Page 130 and 131: Engaging Citizens in the Policy-Mak
Page 132 and 133: Preference-based Discovery of Dynam
Page 134 and 135: RDF On the Go: An RDF Storage and Q
Page 136 and 137: Policy Modeling meets Linked Open D
Page 138 and 139: A Contextualized Perspective for Li
Page 140 and 141: Improving discovery in Life Science
Page 142 and 143: The Semantic Public Service Portal
Page 144 and 145: Personalized Content Delivery on Mo
Page 146 and 147: A Framework to Describe Localisatio
Page 148 and 149: The influence of secondary settleme
Page 150 and 151: Analysis of Shear Transfer in Void-
Page 152 and 153: Cost-Effective Sustainable Construc
Page 154 and 155: Modelling Extreme Flood Events due
Page 156 and 157: Axial Load Capacity of a Driven Cas
Page 158 and 159:
Chemical amendment of dairy cattle
Page 160 and 161:
Seismic Design of Concentrically Br
Page 162 and 163:
MODELLING, ALGORITHMS & CONTROL 9.1
Page 164 and 165:
Eigen-based Approach for Leverage P
Page 166 and 167:
Evolutionary Modelling of Industria
Page 168 and 169:
Abstract: Graphical Semantic Wiki f
Page 170 and 171:
Low Coverage Genome Assembly Using
Page 172 and 173:
Evolving a Robust Open-Ended Langua
Page 174 and 175:
Context Stamp - A Topic-based Conte
Page 176 and 177:
DSP-Based Control of Multi-Rail DC-
Page 178 and 179:
Topographical Cues - Controlling Ce
Page 180 and 181:
Creep Relaxation and Crack Growth P
Page 182 and 183:
Finite Element Modelling of Failure
Page 184 and 185:
Influence of Fluorine and Nitrogen
Page 186 and 187:
Phase Decompositions of Bioceramic
Page 188 and 189:
High Resolution Microscopical Analy
Page 190 and 191:
An Experimental and Numerical Analy
Page 192 and 193:
Thermomechanical characterisation o
Page 194 and 195:
A multiaxial damage mechanics metho
Page 196:
The effect of citrate ester plastic
show all

NUI Galway – UL Alliance First Annual ENGINEERING AND - ARAN ...

Create successful ePaper yourself

Delete template?

Save as template?