6th European Conference - Academic Conferences

More documents

Recommendations

Info

Host-Based Data Exfiltration Detection via System Call Sequences Brian Jewell 1 and Justin Beaver 2 1 Tennessee Technological University, Cookeville, USA 2 Oak Ridge National Laboratory, Oak Ridge, USA bcjewell21@tntech.edu beaverjm@ornl.gov Abstract: The host-based detection of malicious data exfiltration activities is currently a sparse area of research and mostly limited to methods that analyze network traffic or signature based detection methods that target specific processes. In this paper we explore an alternative method to host-based detection that exploits sequences of system calls and new collection methods that allow us to catch these activities in real time. We show that system call sequences can be found to reach a steady state across processes and users, and explore the viability of new methods as heuristics for profiling user behaviors. Keywords: data exfiltration, data security, intrusion detection 1. Introduction A successful attack on an organization involving the theft of sensitive data can be devastating. Data exfiltration is the term used to describe this type of theft and can be defined as the unauthorized transfer of information from a computer system. Data exfiltration attacks represent a tremendous threat to both government entities and commercial enterprises. Government organizations maintain repositories for sensitive and classified information, and breaches into protected systems or leaks into the public domain can have implications that threaten national security. Commercial enterprises manage complex levels of proprietary tools and data that, if compromised, could endanger the financial security of their institutions and/or their customers. Recent studies find that information leaks are the most prevalent security threat for organizations 0 and that in recent years attackers have exfiltrated more than 20 terabytes of data, much of which is sensitive, from the U.S. Department of Defense and Defense Industrial Base organizations, as well as civilian government organizations 0. Despite the threat, the approach to defending against these attacks is surprisingly unsophisticated. Off-the-shelf intrusion detection systems (IDS) monitor for known malicious network signatures at the system boundary. These systems are relied upon to flag potential network breaches, which are then typically investigated manually (often these are guided analyses that leverage custom-built scripts) in order to trace potential unauthorized activities. Unfortunately, the model of perimeter defense leaves attackers free to navigate, investigate, and extrude information if the perimeter is breached undetected. Host intrusion detection systems (HIDS) are software programs that run on each computer host in a network and attempt to detect malicious events in the operation of the host. Commercial virus protection packages (McAfee, 2003) are examples of HIDS and monitor system services, registry changes, and check individual files for signatures of known malicious programs. We approach the detection of data exfiltration attacks as a HIDS. Once the boundary defense is breached, it is from the individual hosts that a malicious user will explore file systems, package data, and export it to an outside network. We postulate that, given insight into the activities of individual users and processes on a given host, acts of unauthorized data exfiltration can be discriminated from normal user/process behaviors. Our hypothesis hinges on the availability of low-level data that reflects the operation of processes on a computer host. We propose to achieve this insight into the computer’s operation through the monitoring of system calls, which are low-level process interactions with the host computer’s operating system. System calls provide a window into what all processes and users on a host machine are executing, regardless of how they are interacting with the machine. In addition, they provide more fidelity in identifying individual actions than a process monitor. 134
Brian Jewell and Justin Beaver In this paper, we propose the use of a method by which unique sequences of system calls, managed at the process/user level, are the basis for discriminating normal and anomalous behaviors by users for use as an exfiltration detection agent. We then evaluate this model to fit our 3 criteria for a viable detection agent. Tractable- The chosen method must be able to run in real time while having negligible effect on a system as experienced by the end user. Environmentally Neutral- Our method must also be portable and adapt to any environment. Responsive- Lastly, our ideal method should reliably report on data exfiltration behaviors. The following sections are organized as follows: Section 2 provides a review of similar work. Section 3 formalizes the methodology we used to categorize normal behavior and collect a profile from the system call data traces. Section 4 evaluates our method according to the 3 criteria we set, and Section 5 gives a detailed account of our results, conclusions, and ideas for future work. 2. Related work The detection of data exfiltrations has been a recent focus of cyber security research. Exfiltration detection is a difficult problem due to the wide range of methods available, and the subtlety with which it can be performed 0. Current IDS systems are mostly concerned with intrusion attempts, although there are extrusion detection systems that are commercially available (e.g., 0. Like network-based IDSs, these are primarily signature-based solutions that perform network traffic analysis through custom hardware. Many more advanced data analysis approaches have been proposed, including clustering of network traffic for anomaly detection 0, the application of statistical and signal processing methods to outbound traffic for signature identification 0, and the application of data mining techniques 0 to network data. These approaches yielded varying degrees of success, but inevitably were plagued with base-rate fallacy 0 issues or a narrow problem focus. However, when we look at previous work on host-based IDSs there is some inspiration for host-based data exfiltration detection. In 1996, Forrest, et al, proposed a host-based intrusion detection method based on the monitoring of system calls (Forrest, 1996). This early work was inspired by the human immune system's ability to recognize what cells are part of the host organism (it's self) or foreign (nonself). They used this principle in developing their own methodology for constructing a "sense of self" for Unix based systems using available system trace data. Forrest’s methodology used lookahead pairs, or sets containing pairs of system calls formed by the originating system call and the one that follows it with spacing 1, 2, 3, .. k. These pairs were used to form a database of normal process behavior (or self), and used to monitor for previously unfound patterns, that were then tagged as anomalous (or non-self). While their results were only preliminary, they did show that a stable signature of normal process behavior could be constructed using very simple methods. Many other approaches have been taken since to model the behavior of processes using system calls, including the use of Hidden Markov Models (HMM) (Gao, 2006), neural networks (Endler, 1998), k-nearest neighbors (Liao, 2002), and Bayes models (Kosoresow, 1997). These models were all developed in hopes of producing more accurate models while reducing false positives which comes at a high computational cost. The most notable advantage of Forrest's model is the ability to track processes for anomalous behavior at the application layer of each individual host in real time at a very low computational cost. Forrest et al., later improves upon their work in (Forrest, 2008) by introducing another simple model that is suitable for real-time detection dubbed sequence time-delay embedding (stide), and again involves the enumeration of system call sequences. However, this time their method uses contiguous sequences of fixed length to form a database of normal behavior. They also introduce a new modification to their method called sequence time-delay embedding with frequency threshold (t-stide). This method explores the hypothesis that sequences with very low occurrence rates in training data are suspicious. 135
Page 1 and 2:
The Proceedings of the 6th Internat
Page 3 and 4:
Contents Paper Title Author(s) Page
Page 5 and 6:
Preface These Proceedings are the w
Page 7 and 8:
Biographies of contributing authors
Page 9 and 10:
Department of Computer Science, IQR
Page 11 and 12:
Using the Longest Common Substring
Page 13 and 14:
Jaime Acosta Some techniques that u
Page 15 and 16:
Jaime Acosta assigned by an anti-vi
Page 17 and 18:
Jaime Acosta Cormen, T.H., Leiserso
Page 19 and 20:
Hind Al Falasi and Liren Zhang tran
Page 21 and 22:
Hind Al Falasi and Liren Zhang ther
Page 23 and 24:
Hind Al Falasi and Liren Zhang the
Page 25 and 26:
Edwin Leigh Armistead and Thomas Mu
Page 27 and 28:
Deception Operations Security (OPS
Page 29 and 30:
Page 31 and 32:
Page 33 and 34:
The Uses and Limits of Game Theory
Page 35 and 36:
3. Limits to using game theory 3.1
Page 37 and 38:
Merritt Baer Effective cyberintrusi
Page 39 and 40:
Merritt Baer 1.5), there seems to b
Page 41 and 42:
Merritt Baer Report of the Defense
Page 43 and 44:
Ivan Burke and Renier van Heerden F
Page 45 and 46:
Ivan Burke and Renier van Heerden a
Page 47 and 48:
Page 49 and 50:
Page 51 and 52:
Page 53 and 54:
Marco Carvalho et al. systems start
Page 55 and 56:
Marco Carvalho et al. Benatallah, 2
Page 57 and 58:
Marco Carvalho et al. management la
Page 59 and 60:
Marco Carvalho et al. Figure 4: Sta
Page 61 and 62:
Marco Carvalho et al. Sorrels, D.,
Page 63 and 64:
Manoj Cherukuri and Srinivas Mukkam
Page 65 and 66:
Page 67 and 68:
Page 69 and 70:
Page 71 and 72:
Page 73 and 74:
Page 75 and 76:
Page 77 and 78:
Page 79 and 80:
Mecealus Cronkrite et al. to see bo
Page 81 and 82:
Mecealus Cronkrite et al. trivial.
Page 83 and 84:
Mecealus Cronkrite et al. socially
Page 85 and 86:
Mecealus Cronkrite et al. The views
Page 87 and 88:
Vincent Garramone and Daniel Likari
Page 89 and 90:
Page 91 and 92:
Page 93 and 94: Vincent Garramone and Daniel Likari
Page 95 and 96: Stephen Groat et al. Sections 4 and
Page 97 and 98: Stephen Groat et al. probability fo
Page 99 and 100: Stephen Groat et al. host. In this
Page 101 and 102: Stephen Groat et al. changing addre
Page 103 and 104: Marthie Grobler et al. leadership,
Page 105 and 106: Marthie Grobler et al. apply to sta
Page 107 and 108: Marthie Grobler et al. 6. Working t
Page 109 and 110: Cyber Strategy and the Law of Armed
Page 111 and 112: Ulf Haeussler Alliance and Allies r
Page 113 and 114: Ulf Haeussler following the invocat
Page 115 and 116: Ulf Haeussler NCSA (2009) NCSA Supp
Page 117 and 118: Karim Hamza and Van Dalen of respon
Page 119 and 120: Karim Hamza and Van Dalen From a mi
Page 121 and 122: Karim Hamza and Van Dalen productiv
Page 123 and 124: Intelligence-Driven Computer Networ
Page 125 and 126: Eric Hutchins et al. of defensive a
Page 127 and 128: Eric Hutchins et al. Defenders can
Page 129 and 130: Eric Hutchins et al. Equally as imp
Page 131 and 132: Eric Hutchins et al. X-Mailer: Yaho
Page 133 and 134: Eric Hutchins et al. Received: (qma
Page 135 and 136: Eric Hutchins et al. U.S.-China Eco
Page 137 and 138: Saara Jantunen and Aki-Mauri Huhtin
Page 143: Saara Jantunen and Aki-Mauri Huhtin
Page 147 and 148: Brian Jewell and Justin Beaver Figu
Page 149 and 150: 4. Evaluation Brian Jewell and Just
Page 151 and 152: Brian Jewell and Justin Beaver othe
Page 153 and 154: Detection of YASS Using Calibration
Page 155 and 156: Kesav Kancherla and Srinivas Mukkam
Page 157 and 158: M M M Su−2 Sv ∑∑ u= 1 v= 1 h(
Page 159 and 160: 5.2 ROC curves Kesav Kancherla and
Page 161 and 162: Developing a Knowledge System for I
Page 163 and 164: Louise Leenen et al. We distinction
Page 165 and 166: Louise Leenen et al. There is growi
Page 167 and 168: 3.1 Needs analysis Louise Leenen et
Page 169 and 170: Louise Leenen et al. Kroenke, D.M.
Page 171 and 172: Jose Mas y Rubi et al. As we can se
Page 173 and 174: Figure 2: CALEA forensic model (Pel
Page 175 and 176: Jose Mas y Rubi et al. Table 2: Com
Page 177 and 178: Jose Mas y Rubi et al. Another pend
Page 179 and 180: Tree of Objectives Acknowledgements
Page 181 and 182: Secure Proactive Recovery - a Hardw
Page 183 and 184: Ruchika Mehresh et al. implementing
Page 185 and 186: Ruchika Mehresh et al. The coordina
Page 187 and 188: Ruchika Mehresh et al. multiplicati
Page 189 and 190: Ruchika Mehresh et al. Table 2: App
Page 191 and 192: 2. Network infiltration detection D
Page 193 and 194: David Merritt and Barry Mullins on
Page 195 and 196:
David Merritt and Barry Mullins Ess
Page 197 and 198:
David Merritt and Barry Mullins Dev
Page 199 and 200:
Muhammad Naveed Pakistan Computer E
Page 201 and 202:
Muhammad Naveed response could also
Page 203 and 204:
Muhammad Naveed Table 10: Aggressiv
Page 205 and 206:
Muhammad Naveed 2006 Tcp Open Mysql
Page 207 and 208:
Muhammad Naveed 8009 Tcp Open Ajp13
Page 209 and 210:
Muhammad Naveed Austalian Taxation
Page 211 and 212:
Alexandru Nitu world and bring it i
Page 213 and 214:
Alexandru Nitu Article 51 restricts
Page 215 and 216:
Alexandru Nitu As IW strategy and t
Page 217 and 218:
Cyberwarfare and Anonymity Christop
Page 219 and 220:
Christopher Perr attacks again help
Page 221 and 222:
Christopher Perr about the current
Page 223 and 224:
Catch me if you can: Cyber Anonymit
Page 225 and 226:
David Rohret and Michael Kraft reve
Page 227 and 228:
David Rohret and Michael Kraft sary
Page 229 and 230:
Data (Evidence) Removal Shield Davi
Page 231 and 232:
Neutrality in the Context of Cyberw
Page 233 and 234:
Julie Ryan and Daniel Ryan 18th cen
Page 235 and 236:
Julie Ryan and Daniel Ryan “Decla
Page 237 and 238:
Julie Ryan and Daniel Ryan von Glah
Page 239 and 240:
Harm Schotanus et al. In the remain
Page 241 and 242:
Harm Schotanus et al. 2.3.1 Secure
Page 243 and 244:
Harm Schotanus et al. In this setup
Page 245 and 246:
Harm Schotanus et al. the label (by
Page 247 and 248:
Harm Schotanus et al. these aspects
Page 249 and 250:
Maria Semmelrock-Picej et al. they
Page 251 and 252:
Maria Semmelrock-Picej et al. User
Page 253 and 254:
Maria Semmelrock-Picej et al. SPIKE
Page 255 and 256:
Maria Semmelrock-Picej et al. A cl
Page 257 and 258:
Maria Semmelrock-Picej et al. in co
Page 259 and 260:
Maria Semmelrock-Picej et al. In th
Page 261 and 262:
Maria Semmelrock-Picej et al. Fuchs
Page 263 and 264:
Madhu Shankarapani and Srinivas Muk
Page 265 and 266:
Figure 1: UPX packed Trojan Figure
Page 267 and 268:
Trojan.Zb ot- 1342.mal Trojan.Sp y.
Page 269 and 270:
Madhu Shankarapani and Srinivas Muk
Page 271 and 272:
Namosha Veerasamy and Marthie Grobl
Page 273 and 274:
Page 275 and 276:
Page 277 and 278:
4. Conclusion Namosha Veerasamy and
Page 279 and 280:
Tanya Zlateva et al. Computer Infor
Page 281 and 282:
Tanya Zlateva et al. security and v
Page 283 and 284:
Tanya Zlateva et al. court opinions
Page 285 and 286:
Tanya Zlateva et al. 5. Pedagogy, e
Page 287 and 288:
PhD Research Papers 277
Page 289 and 290:
Shada Alsalamah et al. Level 3 all
Page 291 and 292:
Shada Alsalamah et al. assure the a
Page 293 and 294:
Shada Alsalamah et al. for Health I
Page 295 and 296:
3. Hematology Laboratory System Who
Page 297 and 298:
Shada Alsalamah et al. Pirnejad, H.
Page 299 and 300:
Michael Bilzor a diverse base of U.
Page 301 and 302:
Michael Bilzor In our current exper
Page 303 and 304:
5. Execution monitor theory Michael
Page 305 and 306:
Michael Bilzor design was run in si
Page 307 and 308:
Michael Bilzor over testbench metho
Page 309 and 310:
Evan Dembskey and Elmarie Biermann
Page 311 and 312:
Page 313 and 314:
Page 315 and 316:
Page 317 and 318:
Theoretical Offensive Cyber Militia
Page 319 and 320:
Rain Ottis Last, but not least, it
Page 321 and 322:
Rain Ottis an infantry battalion, w
Page 323 and 324:
Rain Ottis Ottis, R. (2008) “Anal
Page 325 and 326:
Work in Progress Papers 315
Page 327 and 328:
Large-Scale Analysis of Continuous
Page 329 and 330:
References William Acosta Abadi, D.
Page 331 and 332:
Natarajan Vijayarangan top box unit
Page 333 and 334:
Natarajan Vijayarangan The proposed
show all

6th European Conference - Academic Conferences

Create successful ePaper yourself

Delete template?

Save as template?