NUI Galway – UL Alliance First Annual ENGINEERING AND - ARAN ...

More documents

Recommendations

Info

Generalized Blockmodeling Samantha Lam, Jeffrey Chan, Conor Hayes Digital Enterprise Research Institute, National University of Ireland, Galway samantha.lam@deri.org, jkc.chan@deri.org, conor.hayes@deri.org 1. Introduction As online social network data become increasingly available and popular, there is a ongoing need to analyze and model them in a scalable manner. To understand these large networks, it is useful to be able to reduce and summarize them in terms of their underlying structure. Popular approaches include community finding [1] and blockmodeling [2], both of which aim to group the strongly associated vertices together. Our research is focused on the latter approach. 2. Generalized blockmodeling Generalized blockmodeling [3] decomposes a network into partitions and assigns a relation type to each pair of partition (called a block), which describes the relationship between the partitions. A major component to the generality of this method comes from the use of regular equivalence as a defining feature of the blocks. The previously well-studied structural equivalence proved to be somewhat too restrictive to describe real-world networks which led to the proposition of regular equivalence [3]. Up to now, the generalized blockmodel analysis of social networks has not received much attention, partly due to the computational demands of the existing algorithms. Therefore, we have designed approaches based on genetic algorithms (GA) and simulated annealing (SA) to fit generalized blockmodels. We have found both approaches are at least two orders of magnitude faster than the existing method. 3. Improving algorithms In [3], they proposed the greedy KL-based approach to fit blockmodels. This algorithm considers the solution neighborhood of each vertex, and then greedily makes a move that minimizes the objective cost. A neighborhood move as a vertex moves from one partition to another, and the swapping of two vertices in different partitions were also considered. However, there was no description as to how to optimize the blocks types themselves. Therefore, we also introduce an additional step, where the blocks types are optimized after the partitions are optimized. 3.1. Results We evaluate the efficiency and optimization performance of three algorithms – the proposed KLbased, and two aforementioned ones, SA and GA. To measure the scalability and optimizing ability of the algorithms, we generated synthetic datasets using a community generating algorithm and he results indicate that for larger networks, if speed is important, than the 81 SA algorithm should be used, but if accuracy is more important, than the GA should be used. To demonstrate the importance of increasing the scalability of fitting generalized blockmodels, we fitted blockmodels to the Enron and flight route datasets that could be not be fitted before because of the limitations of the KL algorithm. We used the GA algorithm to explore the Enron dataset over three time periods - prior, during and after the crisis. As a guide to the communications between the employees we used results found by [4] to help us construct our blocks. The best-fitted blockmodels summarized the roles, the key relationships (block types) between the different roles. For the flight route dataset, we decided to use European airlines as it was shown that a hub-andspoke/hierarchy model existed . For this data, we found that an addition of a 'density' block to be a fruitful addition to the nine types specified by [3]. 4. Improving the objective function The current definition of the objective function proposed by [3] is somewhat naïve. It is essentially a simple count/percentage of the number of deviations of an element from its ideal block. We are currently investigating methods to improve upon this definition. 5. Future work We aim to investigate additional ways to measure and rank discovered blockmodels, such as improving the objective function and defining other block types. 6. References [1] A. Clauset, M.E.J. Newman, and C. Moore, “Finding community structure in very large networks”, Phys. Rev. E, American Physical Society, Vol. 7 Issue 6, 2004, 066111. [2] S. Wasserman, and K. Faust, “Social network analysis: Methods and applications”, Cambridge university press, 1994 [3] P. Doreian, V. Batagelj, and A. Ferligoj, “Generalized Blockmodeling”, Cambridge university press, 2005 [4] J. Diesner. and T.L. Frantz, and K.M. Carley, “Communication networks from the Enron email corpus “It's always about the people. Enron is no different”, Computational & Mathematical Organization Theory, Springer, Vol. 1 No. 3, 2005, pp. 201-228
Finding and Representing Interesting Tweets Jodi Schneider, Fabrizio Orlandi, Julia Anaya, Alexandre Passant Digital Enterprise Research Institute National University of Ireland Galway, Ireland firstname.lastname@deri.org Microblogging has become a popular way to communicate, and Twitter’s popularity has grown dramatically. Filtering is important [1]: users are motivated to seek information [2]. 1 While Twitter’s collective value is being mined in various ways, we lack an overall sense of the discussion topics, concerns, and personalities of everyday users, whose messages can be aggregated only by topic or location. Instead of a public timeline with messages from all users, the Twitter homepage promotes celebrities, “Top Tweets”, and trending topics. Interacting with people from diverse backgrounds should be easy [3], yet Twitter users and followers tend to be similar [4]. One of Twitter’s early successes was in bringing users a “social sixth sense” 2 . To extend this sense of peripheral awareness beyond a user’s own social circle, we seek to redevelop a public stream which is diverse yet interesting to read. We investigate four related research questions, as follows: 1) Which tweets are consistently rated as interesting? 2) What are the characteristics of these interesting tweets? 3) How can we represent tweets using existing SemanticWeb standards and activitystrea.ms? 4) How should we display interesting tweets to users? We collected data on 210 tweets from 42 users who rated 10 tweets each on one of 21 questionnaires. Tweets originated from Twitter’s public timeline API 3 , translated by the Google Translate API 4 . Questionnaires, answered by 2 users, asked: “Did you enjoy reading this tweet?” (“Yes”, “No, because I don’t like the topic”, “No, because I don’t know the people”, “No, because I couldn’t understand the words”) “What category (from [5]) best fits this tweet?” In a formative evaluation, 10 users answered a brief questionnaire about interfaces for displaying interesting tweets. 36 tweets (17%), including 10 translated tweets, were considered interesting (e.g. Table I) by both raters, while 44 tweets (21%) were considered uninteresting by both raters. Although we expected a correlation between the category and the interestingness of a tweet, 1 The work presented in this paper has been funded by Science Foundation Ireland under Grant No. SFI/08/CE/I1380 (Líon-2). 2 http://www.wired.com/techbiz/media/magazine/15-07/st_thompson 3 http://dev.twitter.com/doc/get/statuses/public timeline 4 http://translate.google.com/ 82 no such correlation was found. Category assignment varied between raters, and depended on the category as shown in Table 2. Interesting Tweets (followed by English translation as needed) espere pelo mais sábio dos conselhos: o tempo ! wait for the wisest of advice: the weather! Laat me iets gaan eten Leave me something to eat so it’s settled! May 8 is the day for big changes. Leaving Tokyo! :(( but so worried for my baby2!! :(( Mom said alcohol is ur enemy, but the bible said, love ur enemy. - @mayshaaurelia What honors english teacher does work ONLY from out of the text book? oh ya... mine! Love my doctor. He hugged me at my 20lbs weight loss. http://t.co/YEs1HuK - Al-Gaddafi International Prize for Human Rights - errr, yeah.... Zac Brown Band....i love you follow @Rebecca Jade x she is an amazing person Table 1 A Selection of Interesting Tweets Agreed Disagreed Category Tweets Tweets 1 12 Anecdote (AN) 23 51 Information Sharing (IS) 12 38 Me now (ME) 8 36 Opinions/Complaints (OC) 3 16 Presence Maintenance 5 10 Question to followers (QF) 4 18 Self Promotion (SP) 20 68 Statements and Random Thoughts (RT) Table 2 Tweet Categories References [1] D. Ramage, S. Dumais, and D. Liebling, “Characterizing microblogs with topic models,” in ICWSM, 2010. [2] P. Johnson and S. Yang, “Uses and gratifications of Twitter,” in AEJMC, 2009. [3] S. Yardi and d. boyd, “Dynamic Debates,” Bulletin of Science, Technology & Society, vol. 30, no. 5, p. 316, 2010. [4] M. Smith and C. Giraud-Carrier, “Bonding vs. Bridging Social Capital,” in ICWSM, 2010. [5] M. Naaman, J. Boase, and C. Lai, “Is it really about me?: Message content in social awareness streams,” in CSCW ’10, 2010.
Page 1 and 2:
NUI Galway - UL Alliance First Annu
Page 4 and 5:
FULL TABLE OF CONTENTS 1 GAMES, VIS
Page 6 and 7:
4 MECHANICAL AND BIOMEDICAL ENGINEE
Page 8 and 9:
5.21 Detecting Topics and Events in
Page 10 and 11:
8.7 Modelling Extreme Flood Events
Page 12 and 13:
GAMES, VISUALISATION & EDUCATION 1.
Page 14 and 15:
Generation and Analysis of Graph St
Page 16 and 17:
Evolution and Analysis of Strategie
Page 18 and 19:
Abstract The delivery of multimedia
Page 20 and 21:
Applications of Reinforcement Learn
Page 22 and 23:
Assessing the effects of interactiv
Page 24 and 25:
Real-time depth map generation usin
Page 26 and 27:
An analysis of the capability of pr
Page 28 and 29:
Building Information Modelling duri
Page 30 and 31:
Dwelling Energy Measurement Procedu
Page 32 and 33:
Numerical Modelling of Tidal Turbin
Page 34 and 35:
Energy Storage using Microencapsula
Page 36 and 37:
Data Centre Energy Efficiency Mark
Page 38 and 39:
An embodied energy and carbon asses
Page 40 and 41:
SmartOp - Smart Buildings Operation
Page 42 and 43: Ocean Wave Energy Exploitation in D
Page 44 and 45: Future Smart Grid Synchronization C
Page 46 and 47: Web-Based Building Energy Usage Vis
Page 48 and 49: Image Recognition and Classificatio
Page 50 and 51: Android Based Multi-Feature Elderly
Page 52 and 53: Determining Subjects’ Activities
Page 54 and 55: New Analysis Techniques for ICU Dat
Page 56 and 57: National E-Prescribing Systems in I
Page 58 and 59: Using Mashups to Satisfy Personalis
Page 60 and 61: 3D Computational Modeling of Blood
Page 62 and 63: Experimental and Computational Inve
Page 64 and 65: Experimental Analysis of the Therma
Page 66 and 67: Simulating Actin Cytoskeleton Remod
Page 68 and 69: Computational Analysis of Transcath
Page 70 and 71: An In vitro Shear Stress System for
Page 72 and 73: Development of a Micropipette Aspir
Page 74 and 75: A Computational Test-Bed to Examine
Page 76 and 77: Computational Modeling of Ceramic-b
Page 78 and 79: Multi-Scale Computational Modelling
Page 80 and 81: Development of a mixed-mode cohesiv
Page 82 and 83: Active Computational Modelling of C
Page 84 and 85: Modelling the Management of Medical
Page 86 and 87: SOCIAL MEDIA, SEARCH & RECOMMENDATI
Page 88 and 89: Improving Twitter Search by Removin
Page 90 and 91: Abstract The goal of this research
Page 94 and 95: Life-Cycles and Mutual Effects of S
Page 96 and 97: dcat: Searching Public Sector Infor
Page 98 and 99: The Effect of User Features on Chur
Page 100 and 101: User Similarity and Interaction in
Page 102 and 103: Improving Categorisation in Social
Page 104 and 105: Natural Language Queries on Enterpr
Page 106 and 107: Studying Forum Dynamics from a User
Page 108 and 109: Provenance in the Web of Data: a bu
Page 110 and 111: Towards Social Descriptions of Serv
Page 112 and 113: ENVIRONMENTAL ENGINEERING 6.1 Asses
Page 114 and 115: Novel Agri-engineering solutions fo
Page 116 and 117: Evaluation of amendments to control
Page 118 and 119: Determination of optimal applicatio
Page 120 and 121: Treatment of Piggery Wastewaters us
Page 122 and 123: NEXT GENERATION INTERNET 7.1 Extens
Page 124 and 125: Enabling Federation of Government M
Page 126 and 127: Curated Entities for Enterprise Uma
Page 128 and 129: Mobile Web + Social Web + Semantic
Page 130 and 131: Engaging Citizens in the Policy-Mak
Page 132 and 133: Preference-based Discovery of Dynam
Page 134 and 135: RDF On the Go: An RDF Storage and Q
Page 136 and 137: Policy Modeling meets Linked Open D
Page 138 and 139: A Contextualized Perspective for Li
Page 140 and 141: Improving discovery in Life Science
Page 142 and 143:
The Semantic Public Service Portal
Page 144 and 145:
Personalized Content Delivery on Mo
Page 146 and 147:
A Framework to Describe Localisatio
Page 148 and 149:
The influence of secondary settleme
Page 150 and 151:
Analysis of Shear Transfer in Void-
Page 152 and 153:
Cost-Effective Sustainable Construc
Page 154 and 155:
Modelling Extreme Flood Events due
Page 156 and 157:
Axial Load Capacity of a Driven Cas
Page 158 and 159:
Chemical amendment of dairy cattle
Page 160 and 161:
Seismic Design of Concentrically Br
Page 162 and 163:
MODELLING, ALGORITHMS & CONTROL 9.1
Page 164 and 165:
Eigen-based Approach for Leverage P
Page 166 and 167:
Evolutionary Modelling of Industria
Page 168 and 169:
Abstract: Graphical Semantic Wiki f
Page 170 and 171:
Low Coverage Genome Assembly Using
Page 172 and 173:
Evolving a Robust Open-Ended Langua
Page 174 and 175:
Context Stamp - A Topic-based Conte
Page 176 and 177:
DSP-Based Control of Multi-Rail DC-
Page 178 and 179:
Topographical Cues - Controlling Ce
Page 180 and 181:
Creep Relaxation and Crack Growth P
Page 182 and 183:
Finite Element Modelling of Failure
Page 184 and 185:
Influence of Fluorine and Nitrogen
Page 186 and 187:
Phase Decompositions of Bioceramic
Page 188 and 189:
High Resolution Microscopical Analy
Page 190 and 191:
An Experimental and Numerical Analy
Page 192 and 193:
Thermomechanical characterisation o
Page 194 and 195:
A multiaxial damage mechanics metho
Page 196:
The effect of citrate ester plastic
show all

NUI Galway – UL Alliance First Annual ENGINEERING AND - ARAN ...

Create successful ePaper yourself

Delete template?

Save as template?