NUI Galway – UL Alliance First Annual ENGINEERING AND - ARAN ...

More documents

Recommendations

Info

Life-Cycles and Mutual Effects of Scientific Communities Vaclav Belák, Marcel Karnstedt, Conor Hayes Digital Enterprise Research Institute, NUI Galway {vaclav.belak, marcel.karnstedt, conor.hayes}@deri.org 1. Introduction Claims for progress in a scientific community are generally assessed using cumulative citation measures. However, the analysis of the life-cycle of a community provides much greater explanatory power for the progress and potential of a scientific field. While previous work has examined scientific networks through co-citation and textual analysis, there is relatively little work on analysing the dynamics of cross-community behaviours, particularly where closely related communities are competing for scientific, funding and industrial capital. Figure 1 Community shift (left) and merge (right) Inspired by Thomas Kuhn's work [1], we identified several interesting cross-community phenomena, which we then mined in an automated manner. For example, a new community with a distinct topic can emerge from an established research community, where the emerging topic can be based on a novel approach or method. We call this phenomenon a community shift (see Fig 1). Similarly, a community can merge with another one. A community can also move in time from broader topics to more specific ones, which we call community specialization. 2. Methodology We extracted co-citation network of 5772 scientists from papers published between 2000-2009 in two related disciplines in computer science: Semantic Web (SW) and Information Retrieval (IR). We then divided the network into ten overlapping time-slices and identified communities in each slice using Infomap [2] and Louvain [3] methods. The communities were matched across the slices according to the highest Jaccard coefficient, and important ancestors and descendants were identified for each community using measures derived from Jaccard coefficient. Additionally, keywords were extracted from the papers, for which the text version was available, resulting in nearly 70% coverage of the network by the content. Finally, we applied several specifically tailored measures combining both structural and content features in order to detect the interesting phenomena. 83 3. Results An emergence of a trans-disciplinary community (community 15) that bridged the Semantic Web and Information Retrieval fields was detected between 2004-2007. This community was formed mainly by former members of Semantic Web community 0 depicted with red colour in the left part of the snapshots in Fig 2. We identified that the main research topic of community 15 had been Semantic Web until 2006– 2008, during which time information retrieval became one of its core topics. At the same time this topic disappeared for its ancestor community. In 2007 the whole community moved between the SW and IR communities (see Fig 2), which is supported by investigated rise of its normalized group betweenness from 0.09 in 2004 to 0.27 in 2007. Therefore, whereas community 0 kept its focus on the core SW-related topics, it also formed a new interdisciplinary community, which has functioned since then as a mutual intermediary between SW and IR communities. Analysis of different overlap measures revealed that an effort to establish this interdisciplinary collaboration came mainly from the SW community. Our approach uses community-finding techniques in combination with different overlap measures, special visualisations and automated metadata extraction and has enabled us to identify several other similar cases to support the hypotheses introduced above [4]. Figure 2 Network snapshots in 2004 (left) and 2007 (right). Note the central position of the transdisciplinary community 15 (violet). 4. References [1] Kuhn, T., Structure of Scientific Revolutions, University of Chicago Press, Chicago, USA, 1962 [2] Rosvall M., Bergstrom C., Maps of information flow reveal community structure in complex networks, PNAS, 2008 [3] Blondel, V. et al., Fast unfolding of communities in large networks, J. of Stat. Mech.: Theory and Experiment, 2008 [4] Belák,V et al., Cross-Community Dynamics in Science, arXiv:1010.4327, 2010 Acknowledgements The material presented in this work is based upon works jointly supported by the Science Foundation Ireland under Grant No. SFI/08/CE/I1380 (Lion-2) and under Grant No. 08/SRC/I1407 (Clique: Graph & Network Analysis Cluster).
1. Introduction A Corpus Framework For Cross-Lingual Search Nitish Aggarwal, Tobias Wunner, Paul Buitelaar Unit for Natural Language Processing, Digital Enterprise Research Institute, National University of Ireland, Galway Email: firstname.lastname@deri.org Cross-lingual queries on text documents based on specialized domain vocabularies are complex and dependent on the semantic, terminological and linguistic (STL) features of the vocabulary and language. The challenge for a cross-lingual search is to retrieve corpus objects which best match the user query based on these features. Therefore both the corpus and the query need to be STL enriched [1]. 2. Method In this work we present a framework to carry out a STL enrichment process for document, sentence and token corpus objects. The implementation is based on the blackboard architecture pattern with the corpus as a blackboard and S, T and L annotators acting on the corpus to perform the STL enrichment process (fig. 1). Figure. 1 STL corpus framework We implemented L and T annotators for NLP processing such as tokenization or part-of speech tagging for English, German, Spanish and Dutch processing and a S annotator enriching the corpus objects with vocabulary annotation. On the query side we manually implemented a set of queries with different STL features as shown in table 1. type Examples S query val=IFRS_FinancialAssets, lang=DE T query val= activos financieros, lang=ES L query part-of speech=Verb L query lemma=finance, part-of-speech=Noun Table. 1 implemented STL queries 84 3. Data-Set We have constructed a multi-lingual finance data set consisting of financial reports from Wind Energy companies (UNLP Wind Energy Corpus) and vocabularies for English, German, and Spanish. The corpus comprises 96 financial reports and 1421 news texts from 9 different wind energy companies. We also used two financial vocabularies with STL enriched terms. The first vocabulary is the International Finance Reporting Standard (IFRS), which used worldwide to create financial reports in XBRL (eXtended Business Reporting Language) format. The second vocabulary is developed by the xEBR (XBRL European Business Registers) group to describe legal enterprise entities within Europe. Terms examples IFRS 2487 Financial assets Amortization Computer Software xEBR 147 Financial fixed assets Company address, Country Table. 2 Financial vocabularies 4. Future work For the future we plan to develop a broader set of annotators and evaluate our approach on the UNLP Wind Energy corpus with queries constructed from the IFRS and the xEBR vocabulary. In particular we want to explore different combinations of S, T and L features of the queries using the framework. We also plan to extend the framework STL enrichment on the vocabulary side based on the lemon (lexicon model for ontologies) Generator 1 , as developed by the MONNET project 2 to facilitate richer STL searches. 5. References [1] Wunner, T., Buitelaar, P., O’Riain, S., Semantic, Terminological and Linguistic Interpretation of XBRL. In Proceedings of the Workshop on Reuse and Adaptation of Ontologies and Terminologies at the 17th International Conference on Knowledge Engineering and Knowledge Management (EKAW), Lisbon [2] Cimiano et al. (2010). D2.1 Ontology-Lexicon Model. Monnet Project Deliverable. 1 http://monnetproject.deri.ie/Lemon-Editor 2 http://www.monnet-project.eu/
Page 1 and 2:
NUI Galway - UL Alliance First Annu
Page 4 and 5:
FULL TABLE OF CONTENTS 1 GAMES, VIS
Page 6 and 7:
4 MECHANICAL AND BIOMEDICAL ENGINEE
Page 8 and 9:
5.21 Detecting Topics and Events in
Page 10 and 11:
8.7 Modelling Extreme Flood Events
Page 12 and 13:
GAMES, VISUALISATION & EDUCATION 1.
Page 14 and 15:
Generation and Analysis of Graph St
Page 16 and 17:
Evolution and Analysis of Strategie
Page 18 and 19:
Abstract The delivery of multimedia
Page 20 and 21:
Applications of Reinforcement Learn
Page 22 and 23:
Assessing the effects of interactiv
Page 24 and 25:
Real-time depth map generation usin
Page 26 and 27:
An analysis of the capability of pr
Page 28 and 29:
Building Information Modelling duri
Page 30 and 31:
Dwelling Energy Measurement Procedu
Page 32 and 33:
Numerical Modelling of Tidal Turbin
Page 34 and 35:
Energy Storage using Microencapsula
Page 36 and 37:
Data Centre Energy Efficiency Mark
Page 38 and 39:
An embodied energy and carbon asses
Page 40 and 41:
SmartOp - Smart Buildings Operation
Page 42 and 43:
Ocean Wave Energy Exploitation in D
Page 44 and 45: Future Smart Grid Synchronization C
Page 46 and 47: Web-Based Building Energy Usage Vis
Page 48 and 49: Image Recognition and Classificatio
Page 50 and 51: Android Based Multi-Feature Elderly
Page 52 and 53: Determining Subjects’ Activities
Page 54 and 55: New Analysis Techniques for ICU Dat
Page 56 and 57: National E-Prescribing Systems in I
Page 58 and 59: Using Mashups to Satisfy Personalis
Page 60 and 61: 3D Computational Modeling of Blood
Page 62 and 63: Experimental and Computational Inve
Page 64 and 65: Experimental Analysis of the Therma
Page 66 and 67: Simulating Actin Cytoskeleton Remod
Page 68 and 69: Computational Analysis of Transcath
Page 70 and 71: An In vitro Shear Stress System for
Page 72 and 73: Development of a Micropipette Aspir
Page 74 and 75: A Computational Test-Bed to Examine
Page 76 and 77: Computational Modeling of Ceramic-b
Page 78 and 79: Multi-Scale Computational Modelling
Page 80 and 81: Development of a mixed-mode cohesiv
Page 82 and 83: Active Computational Modelling of C
Page 84 and 85: Modelling the Management of Medical
Page 86 and 87: SOCIAL MEDIA, SEARCH & RECOMMENDATI
Page 88 and 89: Improving Twitter Search by Removin
Page 90 and 91: Abstract The goal of this research
Page 92 and 93: Generalized Blockmodeling Samantha
Page 96 and 97: dcat: Searching Public Sector Infor
Page 98 and 99: The Effect of User Features on Chur
Page 100 and 101: User Similarity and Interaction in
Page 102 and 103: Improving Categorisation in Social
Page 104 and 105: Natural Language Queries on Enterpr
Page 106 and 107: Studying Forum Dynamics from a User
Page 108 and 109: Provenance in the Web of Data: a bu
Page 110 and 111: Towards Social Descriptions of Serv
Page 112 and 113: ENVIRONMENTAL ENGINEERING 6.1 Asses
Page 114 and 115: Novel Agri-engineering solutions fo
Page 116 and 117: Evaluation of amendments to control
Page 118 and 119: Determination of optimal applicatio
Page 120 and 121: Treatment of Piggery Wastewaters us
Page 122 and 123: NEXT GENERATION INTERNET 7.1 Extens
Page 124 and 125: Enabling Federation of Government M
Page 126 and 127: Curated Entities for Enterprise Uma
Page 128 and 129: Mobile Web + Social Web + Semantic
Page 130 and 131: Engaging Citizens in the Policy-Mak
Page 132 and 133: Preference-based Discovery of Dynam
Page 134 and 135: RDF On the Go: An RDF Storage and Q
Page 136 and 137: Policy Modeling meets Linked Open D
Page 138 and 139: A Contextualized Perspective for Li
Page 140 and 141: Improving discovery in Life Science
Page 142 and 143: The Semantic Public Service Portal
Page 144 and 145:
Personalized Content Delivery on Mo
Page 146 and 147:
A Framework to Describe Localisatio
Page 148 and 149:
The influence of secondary settleme
Page 150 and 151:
Analysis of Shear Transfer in Void-
Page 152 and 153:
Cost-Effective Sustainable Construc
Page 154 and 155:
Modelling Extreme Flood Events due
Page 156 and 157:
Axial Load Capacity of a Driven Cas
Page 158 and 159:
Chemical amendment of dairy cattle
Page 160 and 161:
Seismic Design of Concentrically Br
Page 162 and 163:
MODELLING, ALGORITHMS & CONTROL 9.1
Page 164 and 165:
Eigen-based Approach for Leverage P
Page 166 and 167:
Evolutionary Modelling of Industria
Page 168 and 169:
Abstract: Graphical Semantic Wiki f
Page 170 and 171:
Low Coverage Genome Assembly Using
Page 172 and 173:
Evolving a Robust Open-Ended Langua
Page 174 and 175:
Context Stamp - A Topic-based Conte
Page 176 and 177:
DSP-Based Control of Multi-Rail DC-
Page 178 and 179:
Topographical Cues - Controlling Ce
Page 180 and 181:
Creep Relaxation and Crack Growth P
Page 182 and 183:
Finite Element Modelling of Failure
Page 184 and 185:
Influence of Fluorine and Nitrogen
Page 186 and 187:
Phase Decompositions of Bioceramic
Page 188 and 189:
High Resolution Microscopical Analy
Page 190 and 191:
An Experimental and Numerical Analy
Page 192 and 193:
Thermomechanical characterisation o
Page 194 and 195:
A multiaxial damage mechanics metho
Page 196:
The effect of citrate ester plastic
show all

NUI Galway – UL Alliance First Annual ENGINEERING AND - ARAN ...

Create successful ePaper yourself

Delete template?

Save as template?