NUI Galway – UL Alliance First Annual ENGINEERING AND - ARAN ...

More documents

Recommendations

Info

Low Coverage Genome Assembly Using Fuzzy Hash Maps John Healy Department of Computing & Mathematics <strong>Galway</strong>-Mayo Institute of Technology Ireland john.healy@gmit.ie Abstract Despite the high-throughput of sequence reads that characterise second generation sequencing technologies, genome assemblers require a large degree of over-sampling to produce a complete genomic sequence. Using a novel approach to comparative genome assembly, based on the application of fuzzy hash maps, low coverage sequence reads can be rapidly ordered and orientated in an assembly scaffold with a low error rate and a vastly increased N50 length. 1. Introduction The advent of second generation sequencing (SGS) technology, capable of rapidly sequencing a massive number of short-length reads, has resulted in a reappraisal of existing approaches to sequence alignment and genome assembly. The twin characteristics of large read number and short read length has resulted in a move away from assembly strategies based on the traditional overlap graph to more k-mer centric approaches such as de Bruijn graphs and sequence graphs. While the newer k-mer centric genome assemblers are ideal for use with short-length reads, the requirement for a large degree of oversampling, or coverage, renders these approaches unsuitable for assembling genomes of draft coverage or lower. Although comparative assemblers have been developed for assembling draft genomes, the underlying assembly model is invariably based on the traditional overlap graph. We describe how a fuzzy hash-map can be applied to rapidly and accurately assemble a prokaryotic genome, sampled at varying levels of low coverage, against the reference genome of a closely related species. 2. Assembly with Fuzzy Hash Maps Hash-tables or maps are dictionary data structures that use a key and a hashing function to provide constant time, O(1), insertion, deletion and retrieval operations. By generating a unique hash code from a given key, hash tables provide a rapid mapping from a domain of unique keys to a range of possible values. Fuzzy Hash Maps (FHM) leverage the power of objectoriented languages to allow a degree of variability in the composition of a hash key. In the Java programming language, a degree of fuzziness can be applied to a hash key by manipulating the contract between the hashCode() and equals() methods in the object used as 159 Desmond Chambers Department of Information Technology National University of Ireland <strong>Galway</strong> Ireland des.chambers@nuigalway.ie the hash key. In contrast to traditional hash maps, which seek to avoid collisions, FHMs encourage initial collisions in the map by reducing the size of the key used to compute a hash code. Dynamic programming algorithms can then be implemented in the equals() method to establish whether a full collision is permitted. Using a FHM as the underlying data structure, a de Bruijn graph approach can be used to anchor a set of draft sequence against a reference genome and assemble the draft reads into contiguous sequences [1- 2]. 3. Results The results of assembling the 0.58Mb genome of M.genitalium at varying levels of coverage are shown in Table 1. The 0.81Mb genome of M.pneumoniae was used as a reference sequence and anchored 65.56% of the M.genitalium reads. Table 1. Summary of Assembly Results at Varying Levels of Coverage Coverage N50 Contig N50 Scaffold % Ordering Errors % Orientation Errors Time (s) 2.0 2141 51215 1.21 0.12 19.2 1.8 1918 14787 4.60 1.00 17.3 1.6 1798 14853 1.12 0.17 16.2 1.4 1739 9734 0.39 0.59 14.6 1.2 1456 10322 2.18 0.46 12.8 1.0 1269 6001 1.24 0.97 11.1 0.8 1228 8240 1.55 0.69 7.5 0.6 992 4743 0.92 0.46 7.8 0.4 -‐ 2450 2.07 0.00 6.0 0.2 -‐ 2539 1.38 2.07 4.1 4. Conclusions The FHM approach is capable of rapidly and accurately ordering and orientating low coverage sequence reads without sacrificing the execution speed inherent in hash maps. 5. References [1] J. Healy and D. Chambers, "Fast and Accurate Genome Anchoring Using Fuzzy Hash Maps", Proceedings 5th International Conference on Practical Applications of Computational Biology & Bioinformatics, 2011, pp. 149- 156. [2] J. Healy and D. Chambers, "De Novo Draft Genome Assembly Using Fuzzy K-mers", Proceedings 3rd International Conference on Bioinformatics, Biocomputational Systems and Biotechnologies, 2011.
Traffic-Aware Intelligent Access Point (TAIAP) Padraig O Flaithearta, Hugh Melvin College of Engineering and Informatics p.oflaithearta1@nuigalway.ie, hugh.melvin@nuigalway.ie Abstract The overall goal of our research is to investigate the benefits of applying synchronized time to wireless networks. For VoIP, Mouth-to-Ear (M2E) delays over wireless networks can vary significantly due to contention and asymmetries. Real-time applications such as VoIP are very sensitive to delay and Quality of Service (QoS) can be much improved if the M2E delay in a VoIP session is known and can be controlled. We are developing a Traffic-Aware Intelligent Access Point (TAIAP) that will calculate and profile M2E delays in both directions for each active VoIP session in a BSS, in order to prioritize VoIP sessions that are suffering from lower QoS based on M2E delays. 1. Background 802.11e is an extension of the 802.11 WiFi standard that was developed to provide Quality of Service (QoS). It prioritizes traffic into four categories with Voice traffic being the highest priority followed by Video, Best Effort and Background. Our Access Point (AP) will further prioritize traffic within the voice category, by tuning 802.11e EDCA parameters for individual clients and sessions. For our experiments, we will implement the Network Time Protocol (NTP) on nodes at both ends of a VoIP session and on the intermediate AP in order to calculate the M2E delay for that session. 2. Experimental setup Our experiments will involve running multiple VoIP sessions over a wireless network, each connecting to a wired client Fig. 1. Fig. 1: Scenario topology Once the sessions have been initiated, there are a number of different delay values that must be calculated 160 for each session. We must calculate M2E delays in each direction for each individual session. This information can be computed by analyzing RTCP Sender Reports (SR) and Receiver Reports (RR) at the Access Point. These RTCP reports contain system timestamps (TS) in NTP format, which allows accurate computation of delays when all nodes including the AP are synchronized with NTP. A script file running on the AP will use these delay values along with log information provided by TCPdump on the AP, to generate a real-time picture of delays for all active sessions. This information will then be plugged into an E-Model algorithm to generate R- Values for each session. A further algorithm will prioritize certain VoIP sessions based on their R- Values. This prioritization will be implemented by the issuance of new 802.11e EDCA parameters to all clients via broadcast beacon frames or by sending unicast beacon frames to relevant clients. The life cycle of this process can be seen in Fig.2: Fig.2: Life cycle of role of TAIAP in BSS 3. Conclusion This project is being carried out in parallel with the development of an extension module for the NS-2 Network Simulator, and is intended to validate simulation results found in NS-2[2]. We believe that a Traffic Aware intelligent access point would greatly enhance the QoS of multimedia, particularly VoIP, in an office environment, where network administrators could implement this as a dedicated wireless voice communications framework. 4. References [1] IEEE Standard 802.11e-2005 [2] P.O Flaithearta, H. Melvin, E-Model based Prioritization of Multiple VoIP sessions over 802.11e, Digital Technologies, Zilina, SK, Nov. 2011
Page 1 and 2:
NUI Galway - UL Alliance First Annu
Page 4 and 5:
FULL TABLE OF CONTENTS 1 GAMES, VIS
Page 6 and 7:
4 MECHANICAL AND BIOMEDICAL ENGINEE
Page 8 and 9:
5.21 Detecting Topics and Events in
Page 10 and 11:
8.7 Modelling Extreme Flood Events
Page 12 and 13:
GAMES, VISUALISATION & EDUCATION 1.
Page 14 and 15:
Generation and Analysis of Graph St
Page 16 and 17:
Evolution and Analysis of Strategie
Page 18 and 19:
Abstract The delivery of multimedia
Page 20 and 21:
Applications of Reinforcement Learn
Page 22 and 23:
Assessing the effects of interactiv
Page 24 and 25:
Real-time depth map generation usin
Page 26 and 27:
An analysis of the capability of pr
Page 28 and 29:
Building Information Modelling duri
Page 30 and 31:
Dwelling Energy Measurement Procedu
Page 32 and 33:
Numerical Modelling of Tidal Turbin
Page 34 and 35:
Energy Storage using Microencapsula
Page 36 and 37:
Data Centre Energy Efficiency Mark
Page 38 and 39:
An embodied energy and carbon asses
Page 40 and 41:
SmartOp - Smart Buildings Operation
Page 42 and 43:
Ocean Wave Energy Exploitation in D
Page 44 and 45:
Future Smart Grid Synchronization C
Page 46 and 47:
Web-Based Building Energy Usage Vis
Page 48 and 49:
Image Recognition and Classificatio
Page 50 and 51:
Android Based Multi-Feature Elderly
Page 52 and 53:
Determining Subjects’ Activities
Page 54 and 55:
New Analysis Techniques for ICU Dat
Page 56 and 57:
National E-Prescribing Systems in I
Page 58 and 59:
Using Mashups to Satisfy Personalis
Page 60 and 61:
3D Computational Modeling of Blood
Page 62 and 63:
Experimental and Computational Inve
Page 64 and 65:
Experimental Analysis of the Therma
Page 66 and 67:
Simulating Actin Cytoskeleton Remod
Page 68 and 69:
Computational Analysis of Transcath
Page 70 and 71:
An In vitro Shear Stress System for
Page 72 and 73:
Development of a Micropipette Aspir
Page 74 and 75:
A Computational Test-Bed to Examine
Page 76 and 77:
Computational Modeling of Ceramic-b
Page 78 and 79:
Multi-Scale Computational Modelling
Page 80 and 81:
Development of a mixed-mode cohesiv
Page 82 and 83:
Active Computational Modelling of C
Page 84 and 85:
Modelling the Management of Medical
Page 86 and 87:
SOCIAL MEDIA, SEARCH & RECOMMENDATI
Page 88 and 89:
Improving Twitter Search by Removin
Page 90 and 91:
Abstract The goal of this research
Page 92 and 93:
Generalized Blockmodeling Samantha
Page 94 and 95:
Life-Cycles and Mutual Effects of S
Page 96 and 97:
dcat: Searching Public Sector Infor
Page 98 and 99:
The Effect of User Features on Chur
Page 100 and 101:
User Similarity and Interaction in
Page 102 and 103:
Improving Categorisation in Social
Page 104 and 105:
Natural Language Queries on Enterpr
Page 106 and 107:
Studying Forum Dynamics from a User
Page 108 and 109:
Provenance in the Web of Data: a bu
Page 110 and 111:
Towards Social Descriptions of Serv
Page 112 and 113:
ENVIRONMENTAL ENGINEERING 6.1 Asses
Page 114 and 115:
Novel Agri-engineering solutions fo
Page 116 and 117:
Evaluation of amendments to control
Page 118 and 119:
Determination of optimal applicatio
Page 120 and 121: Treatment of Piggery Wastewaters us
Page 122 and 123: NEXT GENERATION INTERNET 7.1 Extens
Page 124 and 125: Enabling Federation of Government M
Page 126 and 127: Curated Entities for Enterprise Uma
Page 128 and 129: Mobile Web + Social Web + Semantic
Page 130 and 131: Engaging Citizens in the Policy-Mak
Page 132 and 133: Preference-based Discovery of Dynam
Page 134 and 135: RDF On the Go: An RDF Storage and Q
Page 136 and 137: Policy Modeling meets Linked Open D
Page 138 and 139: A Contextualized Perspective for Li
Page 140 and 141: Improving discovery in Life Science
Page 142 and 143: The Semantic Public Service Portal
Page 144 and 145: Personalized Content Delivery on Mo
Page 146 and 147: A Framework to Describe Localisatio
Page 148 and 149: The influence of secondary settleme
Page 150 and 151: Analysis of Shear Transfer in Void-
Page 152 and 153: Cost-Effective Sustainable Construc
Page 154 and 155: Modelling Extreme Flood Events due
Page 156 and 157: Axial Load Capacity of a Driven Cas
Page 158 and 159: Chemical amendment of dairy cattle
Page 160 and 161: Seismic Design of Concentrically Br
Page 162 and 163: MODELLING, ALGORITHMS & CONTROL 9.1
Page 164 and 165: Eigen-based Approach for Leverage P
Page 166 and 167: Evolutionary Modelling of Industria
Page 168 and 169: Abstract: Graphical Semantic Wiki f
Page 172 and 173: Evolving a Robust Open-Ended Langua
Page 174 and 175: Context Stamp - A Topic-based Conte
Page 176 and 177: DSP-Based Control of Multi-Rail DC-
Page 178 and 179: Topographical Cues - Controlling Ce
Page 180 and 181: Creep Relaxation and Crack Growth P
Page 182 and 183: Finite Element Modelling of Failure
Page 184 and 185: Influence of Fluorine and Nitrogen
Page 186 and 187: Phase Decompositions of Bioceramic
Page 188 and 189: High Resolution Microscopical Analy
Page 190 and 191: An Experimental and Numerical Analy
Page 192 and 193: Thermomechanical characterisation o
Page 194 and 195: A multiaxial damage mechanics metho
Page 196: The effect of citrate ester plastic
show all

NUI Galway – UL Alliance First Annual ENGINEERING AND - ARAN ...

Create successful ePaper yourself

Delete template?

Save as template?