William Acosta locking of existing data is necessary. It also allows for the analysis framework to make use novel summary data structures and algorithms that can incorporate the changes made to the data without requiring analysis of the full dataset. 2.1 Storage and data management The large quantity of data makes a centralized storage solution unfeasible; instead, a distributed storage solution Is favored. The parallel nature of many of the algorithms makes a distributed solution not only more feasible, but also desirable. Distributed storage systems such as Google’s BigTable (Chang et al. 2006), Yahoo’s PNUTS (Cooper et al. 2008), and Amazon’s Dynamo (DeCandia et al. 2007) provide the low-level mechanisms for storing, and managing large quantities of data. These systems were designed to support coordinated reads and updates of data in a distributed environment. To support the needs of applications like cyber-warfare threat detection, a distributed storage system should provide efficient, low-level support for append-only writes of raw data, as well as efficient tracking of incremental additions and updates of the dataset. 2.2 Distributed processing of data Recently, there has been a great deal of research in Google’s MapReduce (Dean & Ghemawat 2004) distributed computing software framework for processing large datasets. However, its batch-oriented nature was not designed to deal with incremental or continuous data updates. This makes it unsuitable for a variety of applications including cyber-warfare threat analysis and detection. Systems like Haloop (Bu et al. 2010) and MapReduce Online (Condie et al. 2010) have sought to add continuous query support to MapReduce. To achieve this, these systems had to make fundamental changes to the API and underlying architecture of MapReduce. This paper argues that what is needed instead is a system designed from the ground-up to support the demands of analysis and mining algorithms on large sets of continuously generated data. 2.3 Data management and analysis The problem of analyzing continuous data has been explored by stream databases (Abadi et al. 2005, Shah et al. 2004). Similarly, continuous queries in databases have been proposed with systems such as TelegraphCQ (Chandrasekaran et al. 2003) and CQL (Arasu et al. 2006). These systems can handle processing queries on streams of data with long-running/continuous queries. However, they lack the ability to support analytic algorithms over a large and diverse dataset. In contrast, verticallypartitioned databases such as C-Store (Stonebraker et al. 2005) excel at fast and efficient support of complex analytics. Unfortunately, vertically-partitioned databases suffer from poor performance on writes. In essence, insertions and updates require that the index be rebuilt. Although performance of reads is very fast once the index is built, building the index is very expensive. What is needed is a system that can perform complex analytics on continuous data without requiring a complex index to be completely rebuilt as a result of data updates. This paper proposes a new, incremental indexing system that keeps track of summarized historical data while allowing for many small [incremental] updates to be incorporated. The key difference is that, unlike traditional database indexes, the new incremental index would not be build off-line (batch-process). Instead, the index would incorporate the many incremental updates on-line so that the index of past data is always active and valid. In addition to the storage and distributed computing framework, it is also important to consider the needs of the algorithms that will be used in the system. Applications with such diverse data require equally diverse analysis. For example, detecting hidden correlations and associations between events seen in server logs requires mining association rules (Agrawal & Srikant 1994) whereas detecting interaction of attackers in a network may involve graph theoretic algorithms. 3. Conclusion This paper presents a case for a new distributed computing system that is explicitly designed to meet the unique needs of applications such as cyber-warfare threat detection. The system should support large quantities of diverse data such as server logs, emails, social-network data, etc. It should allow for a variety of mining and analysis algorithms and support for those algorithms to be processed in a parallel and distributed manner. The system must not only meet these needs, but also do so in a way that can efficiently support continuous analysis of data that is continuously generated. 318
References William Acosta Abadi, D. J., Ahmad, Y., Balazinska, M., Cherniack, M., hyon Hwang, J., Lindner, W., Maskey, A. S., Rasin, E., Ryvkina, E., Tatbul, N., Xing, Y. & Zdonik, S. (2005), The design of the borealis stream processing engine, in ‘CIDR ’05: Proceedings of the second biennial <strong>Conference</strong> on Innovative Data Systems Research’, pp. 277–289. Agrawal, R. & Srikant, R. (1994), Fast algorithms for mining association rules, in J. B. Bocca, M. Jarke & C. Zaniolo, eds, ‘Proc. 20th Int. Conf. Very Large Data Bases, VLDB’, Morgan Kaufmann, pp. 487–499. Arasu, A., Babu, S. & Widom, J. (2006), ‘The cql continuous query language: semantic foundations and query execution’, The VLDB Journal 15(2), 121–142. Bu, Y., Howe, B., Balazinska, M. & Ernst, M. D. (2010), Haloop: Efficient iterative data processing on large clusters, in ‘Proceedings of the VLDB Endowment’, Vol. 3. Chandrasekaran, S., Cooper, O., Deshpande, A., Franklin, M. J., Hellerstein, J. M., Hong, W., Krishnamurthy, S., Madden, S., Raman, V., Reiss, F. & Shah, M. (2003), Telegraphcq: Continuous dataflow processing for an uncertain world, in ‘CIDR ’03: Proceedings of the first biennial <strong>Conference</strong> on Innovative Data Systems Research’. Chang, F., Dean, J., Ghemawat, S., Hsieh, W. C., Wallach, D. A., Burrows, M., Chandra, T., Fikes, A. & Gruber, R. E. (2006), Bigtable: A distributed storage system for structured data, in ‘Proceedings of the 7th symposium on Operating systems design and implementation (OSDI ’06)’, Seattle, WA. Condie, T., Conway, N., Alvaro, P., Elmeleegy, J. M. H. K. & Sears, R. (2010), Mapreduce online, in ‘Proceedings of the Seventh USENIX Symposium on Networked System Design and Implementation (NSDI 2010)’, San Jose, CA. Cooper, B. F., Ramakrishnan, R., Srivastava, U., Silberstein, A., Bohannon, P., arno Jacobsen, H., Puz, N., Weaver, D. & Yerneni, R. (2008), Pnuts: Yahoo!s hosted data serving platform, in ‘Proceedings of the 34th International <strong>Conference</strong> on Very Large Data Bases (VLDB ’08)’, Auckland, New Zealand. Dean, J. & Ghemawat, S. (2004), Mapreduce: simplified data processing on large clusters, in ‘OSDI’04: Proceedings of the <strong>6th</strong> conference on Symposium on Operating Systems Design & Implementation’, USENIX Association, Berkeley, CA, USA, pp. 10–10. DeCandia, G., Hastorun, D., Jampani, M., Kakulapati, G., Lakshman, A., Pilchin, A., Sivasubramanian, S., Vosshall, P. & Vogels, W. (2007), Dynamo: amazon’s highly available key-value store, in ‘SOSP ’07: Proceedings of twenty-first ACM SIGOPS symposium on Operating systems principles’, ACM, New York, NY, USA, pp. 205–220. Metz, C. (2010), ‘Google search index splits with mapreduce’. URL: http: // www. theregister. co. uk/ 2010/ 09/ 09/ google_ caffeine_ explained/ Myers, J., Grimaila, M. & Mills, R. (2010), Insider threat detection using distributed event correlation of web server logs, in ‘ICIW ’10: Proceedings of the 5th International <strong>Conference</strong> on Information-Warfare and Security’. Peng, D. & Dabek, F. (2010), Large-scale incremental processing using distributed transactions and notifications, in ‘OSDI ’10: Proceedings of the Ninth USENIX Symposium on Operating Systems Design and Implementation’. Shah, M. A., Hellerstein, J. M. & Brewer, E. (2004), Highly available, fault-tolerant, parallel dataflows, in ‘SIGMOD ’04: Proceedings of the 2004 ACM SIGMOD international conference on Management of data’, ACM, New York, NY, USA, pp. 827–838. Stonebraker, M., Abadi, D. J., Batkin, A., Chen, X., Cherniack, M., Ferreira, M., Lau, E., Lin, A., Madden, S., O’Neil, E., O’Neil, P., Rasin, A., Tran, N. & Zdonik, S. (2005), C-store: a column-oriented dbms, in ‘VLDB ’05: Proceedings of the 31st international conference on Very large data bases’, VLDB Endowment, pp. 553–564. 319
- Page 1 and 2:
The Proceedings of the 6th Internat
- Page 3 and 4:
Contents Paper Title Author(s) Page
- Page 5 and 6:
Preface These Proceedings are the w
- Page 7 and 8:
Biographies of contributing authors
- Page 9 and 10:
Department of Computer Science, IQR
- Page 11 and 12:
Using the Longest Common Substring
- Page 13 and 14:
Jaime Acosta Some techniques that u
- Page 15 and 16:
Jaime Acosta assigned by an anti-vi
- Page 17 and 18:
Jaime Acosta Cormen, T.H., Leiserso
- Page 19 and 20:
Hind Al Falasi and Liren Zhang tran
- Page 21 and 22:
Hind Al Falasi and Liren Zhang ther
- Page 23 and 24:
Hind Al Falasi and Liren Zhang the
- Page 25 and 26:
Edwin Leigh Armistead and Thomas Mu
- Page 27 and 28:
Deception Operations Security (OPS
- Page 29 and 30:
Edwin Leigh Armistead and Thomas Mu
- Page 31 and 32:
Edwin Leigh Armistead and Thomas Mu
- Page 33 and 34:
The Uses and Limits of Game Theory
- Page 35 and 36:
3. Limits to using game theory 3.1
- Page 37 and 38:
Merritt Baer Effective cyberintrusi
- Page 39 and 40:
Merritt Baer 1.5), there seems to b
- Page 41 and 42:
Merritt Baer Report of the Defense
- Page 43 and 44:
Ivan Burke and Renier van Heerden F
- Page 45 and 46:
Ivan Burke and Renier van Heerden a
- Page 47 and 48:
Ivan Burke and Renier van Heerden F
- Page 49 and 50:
Ivan Burke and Renier van Heerden F
- Page 51 and 52:
Ivan Burke and Renier van Heerden F
- Page 53 and 54:
Marco Carvalho et al. systems start
- Page 55 and 56:
Marco Carvalho et al. Benatallah, 2
- Page 57 and 58:
Marco Carvalho et al. management la
- Page 59 and 60:
Marco Carvalho et al. Figure 4: Sta
- Page 61 and 62:
Marco Carvalho et al. Sorrels, D.,
- Page 63 and 64:
Manoj Cherukuri and Srinivas Mukkam
- Page 65 and 66:
Manoj Cherukuri and Srinivas Mukkam
- Page 67 and 68:
Manoj Cherukuri and Srinivas Mukkam
- Page 69 and 70:
Manoj Cherukuri and Srinivas Mukkam
- Page 71 and 72:
Manoj Cherukuri and Srinivas Mukkam
- Page 73 and 74:
Manoj Cherukuri and Srinivas Mukkam
- Page 75 and 76:
Manoj Cherukuri and Srinivas Mukkam
- Page 77 and 78:
Manoj Cherukuri and Srinivas Mukkam
- Page 79 and 80:
Mecealus Cronkrite et al. to see bo
- Page 81 and 82:
Mecealus Cronkrite et al. trivial.
- Page 83 and 84:
Mecealus Cronkrite et al. socially
- Page 85 and 86:
Mecealus Cronkrite et al. The views
- Page 87 and 88:
Vincent Garramone and Daniel Likari
- Page 89 and 90:
Vincent Garramone and Daniel Likari
- Page 91 and 92:
Vincent Garramone and Daniel Likari
- Page 93 and 94:
Vincent Garramone and Daniel Likari
- Page 95 and 96:
Stephen Groat et al. Sections 4 and
- Page 97 and 98:
Stephen Groat et al. probability fo
- Page 99 and 100:
Stephen Groat et al. host. In this
- Page 101 and 102:
Stephen Groat et al. changing addre
- Page 103 and 104:
Marthie Grobler et al. leadership,
- Page 105 and 106:
Marthie Grobler et al. apply to sta
- Page 107 and 108:
Marthie Grobler et al. 6. Working t
- Page 109 and 110:
Cyber Strategy and the Law of Armed
- Page 111 and 112:
Ulf Haeussler Alliance and Allies r
- Page 113 and 114:
Ulf Haeussler following the invocat
- Page 115 and 116:
Ulf Haeussler NCSA (2009) NCSA Supp
- Page 117 and 118:
Karim Hamza and Van Dalen of respon
- Page 119 and 120:
Karim Hamza and Van Dalen From a mi
- Page 121 and 122:
Karim Hamza and Van Dalen productiv
- Page 123 and 124:
Intelligence-Driven Computer Networ
- Page 125 and 126:
Eric Hutchins et al. of defensive a
- Page 127 and 128:
Eric Hutchins et al. Defenders can
- Page 129 and 130:
Eric Hutchins et al. Equally as imp
- Page 131 and 132:
Eric Hutchins et al. X-Mailer: Yaho
- Page 133 and 134:
Eric Hutchins et al. Received: (qma
- Page 135 and 136:
Eric Hutchins et al. U.S.-China Eco
- Page 137 and 138:
Saara Jantunen and Aki-Mauri Huhtin
- Page 139 and 140:
Saara Jantunen and Aki-Mauri Huhtin
- Page 141 and 142:
Saara Jantunen and Aki-Mauri Huhtin
- Page 143 and 144:
Saara Jantunen and Aki-Mauri Huhtin
- Page 145 and 146:
Brian Jewell and Justin Beaver In t
- Page 147 and 148:
Brian Jewell and Justin Beaver Figu
- Page 149 and 150:
4. Evaluation Brian Jewell and Just
- Page 151 and 152:
Brian Jewell and Justin Beaver othe
- Page 153 and 154:
Detection of YASS Using Calibration
- Page 155 and 156:
Kesav Kancherla and Srinivas Mukkam
- Page 157 and 158:
M M M Su−2 Sv ∑∑ u= 1 v= 1 h(
- Page 159 and 160:
5.2 ROC curves Kesav Kancherla and
- Page 161 and 162:
Developing a Knowledge System for I
- Page 163 and 164:
Louise Leenen et al. We distinction
- Page 165 and 166:
Louise Leenen et al. There is growi
- Page 167 and 168:
3.1 Needs analysis Louise Leenen et
- Page 169 and 170:
Louise Leenen et al. Kroenke, D.M.
- Page 171 and 172:
Jose Mas y Rubi et al. As we can se
- Page 173 and 174:
Figure 2: CALEA forensic model (Pel
- Page 175 and 176:
Jose Mas y Rubi et al. Table 2: Com
- Page 177 and 178:
Jose Mas y Rubi et al. Another pend
- Page 179 and 180:
Tree of Objectives Acknowledgements
- Page 181 and 182:
Secure Proactive Recovery - a Hardw
- Page 183 and 184:
Ruchika Mehresh et al. implementing
- Page 185 and 186:
Ruchika Mehresh et al. The coordina
- Page 187 and 188:
Ruchika Mehresh et al. multiplicati
- Page 189 and 190:
Ruchika Mehresh et al. Table 2: App
- Page 191 and 192:
2. Network infiltration detection D
- Page 193 and 194:
David Merritt and Barry Mullins on
- Page 195 and 196:
David Merritt and Barry Mullins Ess
- Page 197 and 198:
David Merritt and Barry Mullins Dev
- Page 199 and 200:
Muhammad Naveed Pakistan Computer E
- Page 201 and 202:
Muhammad Naveed response could also
- Page 203 and 204:
Muhammad Naveed Table 10: Aggressiv
- Page 205 and 206:
Muhammad Naveed 2006 Tcp Open Mysql
- Page 207 and 208:
Muhammad Naveed 8009 Tcp Open Ajp13
- Page 209 and 210:
Muhammad Naveed Austalian Taxation
- Page 211 and 212:
Alexandru Nitu world and bring it i
- Page 213 and 214:
Alexandru Nitu Article 51 restricts
- Page 215 and 216:
Alexandru Nitu As IW strategy and t
- Page 217 and 218:
Cyberwarfare and Anonymity Christop
- Page 219 and 220:
Christopher Perr attacks again help
- Page 221 and 222:
Christopher Perr about the current
- Page 223 and 224:
Catch me if you can: Cyber Anonymit
- Page 225 and 226:
David Rohret and Michael Kraft reve
- Page 227 and 228:
David Rohret and Michael Kraft sary
- Page 229 and 230:
Data (Evidence) Removal Shield Davi
- Page 231 and 232:
Neutrality in the Context of Cyberw
- Page 233 and 234:
Julie Ryan and Daniel Ryan 18th cen
- Page 235 and 236:
Julie Ryan and Daniel Ryan “Decla
- Page 237 and 238:
Julie Ryan and Daniel Ryan von Glah
- Page 239 and 240:
Harm Schotanus et al. In the remain
- Page 241 and 242:
Harm Schotanus et al. 2.3.1 Secure
- Page 243 and 244:
Harm Schotanus et al. In this setup
- Page 245 and 246:
Harm Schotanus et al. the label (by
- Page 247 and 248:
Harm Schotanus et al. these aspects
- Page 249 and 250:
Maria Semmelrock-Picej et al. they
- Page 251 and 252:
Maria Semmelrock-Picej et al. User
- Page 253 and 254:
Maria Semmelrock-Picej et al. SPIKE
- Page 255 and 256:
Maria Semmelrock-Picej et al. A cl
- Page 257 and 258:
Maria Semmelrock-Picej et al. in co
- Page 259 and 260:
Maria Semmelrock-Picej et al. In th
- Page 261 and 262:
Maria Semmelrock-Picej et al. Fuchs
- Page 263 and 264:
Madhu Shankarapani and Srinivas Muk
- Page 265 and 266:
Figure 1: UPX packed Trojan Figure
- Page 267 and 268:
Trojan.Zb ot- 1342.mal Trojan.Sp y.
- Page 269 and 270:
Madhu Shankarapani and Srinivas Muk
- Page 271 and 272:
Namosha Veerasamy and Marthie Grobl
- Page 273 and 274:
Namosha Veerasamy and Marthie Grobl
- Page 275 and 276:
Namosha Veerasamy and Marthie Grobl
- Page 277 and 278: 4. Conclusion Namosha Veerasamy and
- Page 279 and 280: Tanya Zlateva et al. Computer Infor
- Page 281 and 282: Tanya Zlateva et al. security and v
- Page 283 and 284: Tanya Zlateva et al. court opinions
- Page 285 and 286: Tanya Zlateva et al. 5. Pedagogy, e
- Page 287 and 288: PhD Research Papers 277
- Page 289 and 290: Shada Alsalamah et al. Level 3 all
- Page 291 and 292: Shada Alsalamah et al. assure the a
- Page 293 and 294: Shada Alsalamah et al. for Health I
- Page 295 and 296: 3. Hematology Laboratory System Who
- Page 297 and 298: Shada Alsalamah et al. Pirnejad, H.
- Page 299 and 300: Michael Bilzor a diverse base of U.
- Page 301 and 302: Michael Bilzor In our current exper
- Page 303 and 304: 5. Execution monitor theory Michael
- Page 305 and 306: Michael Bilzor design was run in si
- Page 307 and 308: Michael Bilzor over testbench metho
- Page 309 and 310: Evan Dembskey and Elmarie Biermann
- Page 311 and 312: Evan Dembskey and Elmarie Biermann
- Page 313 and 314: Evan Dembskey and Elmarie Biermann
- Page 315 and 316: Evan Dembskey and Elmarie Biermann
- Page 317 and 318: Theoretical Offensive Cyber Militia
- Page 319 and 320: Rain Ottis Last, but not least, it
- Page 321 and 322: Rain Ottis an infantry battalion, w
- Page 323 and 324: Rain Ottis Ottis, R. (2008) “Anal
- Page 325 and 326: Work in Progress Papers 315
- Page 327: Large-Scale Analysis of Continuous
- Page 331 and 332: Natarajan Vijayarangan top box unit
- Page 333 and 334: Natarajan Vijayarangan The proposed