Download - Academy Publisher

topic because it divides sample data sets flexibly and can detect intrusions more objectively.

A. The Theory of FCM

Fuzzy clustering is a technique for classifying objects: it constructs a fuzzy similarity relation from the characteristics of the objects, their relatedness, and their similarity. According to Ref. [1], fuzzy clustering analysis methods fall into three categories:

1) The number of categories is indeterminate; the data are clustered dynamically according to different requirements.
2) The number of categories is given; the goal is to find the best classification of the data. This method clusters by optimizing an objective function and is called the fuzzy C-means (FCM) algorithm, or fuzzy ISODATA clustering.
3) Under significant perturbation, the data are clustered according to the fuzzy similarity matrix. This method is called fuzzy clustering based on perturbation.

The theory of fuzzy C-means clustering (FCM) [2,3]: FCM is a partition-based algorithm that improves on C-means. C-means partitions the data rigidly, whereas FCM partitions it flexibly and fuzzily. FCM minimizes the within-group sum of squares and uses memberships to assign each data instance. It divides a data set $X = \{X_i \mid X_i \in R^p,\ i = 1, 2, \ldots, n\}$ of $n$ instances into $k$ categories ($1 < k < n$) and calculates the cluster center of each category so as to minimize a dissimilarity value function. The classification matrix is $U = (u_{ij})$, $i = 1, 2, \ldots, n$; $j = 1, 2, \ldots, k$, where $u_{ij}$ is the membership of data instance $X_i$ in category $j$ and satisfies the following condition:

$$\sum_{j=1}^{k} u_{ij} = 1, \quad \forall i = 1, \ldots, n. \tag{1}$$

Under this fuzzy partition, the elements of $U$ take values between 0 and 1, and each data instance is assigned to the categories according to its memberships.
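The condition on the classification matrix can be illustrated with a small sketch. The function names `is_fuzzy_partition` and `hard_labels` and the three-row example matrix are illustrative assumptions, not part of the cited algorithms; the code only checks that each row of memberships lies in [0, 1] and sums to 1, and then picks the category with the largest membership for each instance.

```python
def is_fuzzy_partition(U, tol=1e-9):
    """Check that every membership lies in [0, 1] and every row sums to 1."""
    return all(
        all(0.0 <= u <= 1.0 for u in row) and abs(sum(row) - 1.0) <= tol
        for row in U
    )


def hard_labels(U):
    """Pick, for each instance, the category with the largest membership."""
    return [max(range(len(row)), key=row.__getitem__) for row in U]


# Hypothetical memberships of three instances in two categories.
U = [
    [0.9, 0.1],  # almost certainly category 0
    [0.5, 0.5],  # undecided; the tie goes to the lower index
    [0.2, 0.8],  # mostly category 1
]
print(is_fuzzy_partition(U))  # True
print(hard_labels(U))         # [0, 0, 1]
```

A hard (C-means) partition is the special case in which every row contains a single 1 and zeros elsewhere.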
The value function is defined as follows:

$$J_m(U, C) = \sum_{i=1}^{n} \sum_{j=1}^{k} u_{ij}^{m}\, d_{ij}^{2}(X_i, C_j) \tag{2}$$

$J_m$ can be seen as the membership-weighted sum of squared distances between each data instance and each cluster center. In (2), $C = \{C_j \mid C_j \in I,\ j = 1, 2, \ldots, k\}$, where $C_j \in I$ are the cluster centers and $X_i \in I$ are the data instances; $u_{ij}$ is the membership of data instance $X_i$ in the cluster with center $C_j$; $U = \{u_{ij}\}$ is an $n \times k$ matrix with entries between 0 and 1; $C = [C_1, C_2, \ldots, C_k]$ is an $s \times k$ matrix whose column $C_j$ is the cluster center of the $j$-th fuzzy group; $X_i \in R^p$ are the data instances; $d_{ij}(X_i, C_j)$ is the distance between data instance $X_i$ and cluster center $C_j$; $m$ is the fuzzy coefficient ($1 \le m < \infty$); and $k$ is the preset number of categories, determined by the initial clustering.

The Lagrange multiplier method gives the necessary conditions for a minimum of $J_m$:

$$u_{ij} = \frac{1}{\sum_{l=1}^{k} \left( d_{ij} / d_{il} \right)^{2/(m-1)}}, \quad \forall i, j \tag{3}$$

$$C_j = \frac{\sum_{i=1}^{n} u_{ij}^{m} X_i}{\sum_{i=1}^{n} u_{ij}^{m}}, \quad \forall j \tag{4}$$

The parameter $m$ in the formulas above is a scalar that controls the fuzziness of the classification matrix $U$: the larger $m$ is, the fuzzier the partition. If $m = 1$, FCM degenerates into the hard C-means (HCM) clustering algorithm. FCM needs many iterations for the value function to reach its minimum.

B. FCM Used in Anomaly Detection

The intrusion detection algorithm based on FCM [3]: as the discussion above shows, the FCM algorithm requires two parameters, the number of clusters $k$ and the fuzzy coefficient $m$. The number of clusters can be taken from an initial clustering and must be less than the total number of samples. The detection optimization follows these steps:

Step 1: initialize the membership matrix $U$ with random numbers between 0 and 1 such that $\sum_{j=1}^{k} u_{ij} = 1,\ \forall i = 1, \ldots, n$.

Step 2: use $C_j = \sum_{i=1}^{n} u_{ij}^{m} X_i \big/ \sum_{i=1}^{n} u_{ij}^{m}$ to calculate the cluster centers $C_j$, $j = 1, \ldots, k$.

Step 3: use $u_{ij} = 1 \big/ \sum_{l=1}^{k} \left( d_{ij} / d_{il} \right)^{2/(m-1)}$ to calculate the new membership matrix $U$.

Step 4: calculate the value function $J_m(U, C) = \sum_{i=1}^{n} \sum_{j=1}^{k} u_{ij}^{m}\, d_{ij}^{2}(X_i, C_j)$. If it is smaller than a given threshold, or if its change from the previous iteration is smaller than a given threshold, stop and output the clustering result. Otherwise, return to Step 2 and continue iterating.

The output of the algorithm is an $n \times k$ fuzzy partition matrix that gives the degree to which
each sample belongs to each category. From this matrix, each sample can be assigned to a single category by the principle of maximum membership. The algorithm clusters normally distributed data well and is sensitive to isolated points.

C. Problems about the Application of FCM

In recent years, cluster analysis has become an important data mining technique. Although many clustering methods are widely applied, each has its own limitations and range of applicability. FCM is the most widely used fuzzy clustering algorithm, but it has several problems. First, the number of clusters must be preset by the user; choosing an appropriate number is a precondition for precise clustering, yet that number is difficult to determine. Second, FCM is essentially a local optimization technique that searches for the optimum iteratively by hill climbing, so it is particularly sensitive to initialization and easily falls into a local minimum instead of reaching the global optimum.

IV. OPTIMIZATION OF FUZZY CLUSTERING ALGORITHM

In intrusion detection based on fuzzy clustering, FCM is often combined with other methods. Hybrid methods include the combination of FCM with an adaptive immune system, the use of the average information entropy, the combination of FCM with a support vector machine, and the fuzzy genetic algorithm. The following subsections review recent research on how to determine the number of clusters and how to obtain the optimal solution.

A. Determination of The Number of Clusters

Many studies address the number of clusters in FCM and the selection of the initial cluster centers, but most of them consider only one of the two problems. Ref. [4] introduced a method based on the average information entropy to determine the number of clusters in FCM, and used a density function to obtain the initial cluster centers. The more reasonable the division into clusters, the more definite the assignment of the data to a cluster, and the smaller the information entropy. Ref. [4] therefore used the average information entropy as the criterion for the number of clusters. It is defined as follows:

$$H(k) = -\sum_{i=1}^{n} \sum_{j=1}^{k} \frac{u_{ij} \log_2(u_{ij}) + (1 - u_{ij}) \log_2(1 - u_{ij})}{n} \tag{5}$$

In (5), $u_{ij} \in [0, 1]$ is the degree to which sample $i$ belongs to cluster $j$, for all $i, j$, and $n$ is the number of samples. First, define the range $[C_{\min}, C_{\max}]$ of the number of clusters. As $k$ increases from $C_{\min}$ to $C_{\max}$, it yields $C_{\max} - C_{\min} + 1$ values of $H(k)$. Because a smaller information entropy means that the data belong to their clusters more definitely, the $k$ with the smallest $H(k)$ is selected as the final number of clusters.

Ref. [5] proposed a fuzzy C-means and support vector machine algorithm (F-CMSVM) for automatic determination of the number of clusters; it addresses the requirement of the fuzzy C-means algorithm (FCM) that the number of clusters be predefined.
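For the entropy criterion in (5), a minimal sketch in plain Python may help. It runs the four FCM steps of Section III.B for every candidate number of clusters and keeps the one with the smallest average information entropy. Several points are assumptions rather than details given by Ref. [4]: the distance is squared Euclidean, the initial memberships are random instead of density-based, $m > 1$ is required, $0 \log_2 0$ is treated as 0, and the names `fcm`, `average_entropy`, and `select_k` as well as the toy data are hypothetical.

```python
import math
import random


def fcm(X, k, m=2.0, tol=1e-6, max_iter=200, seed=0):
    """Fuzzy C-means on a list of points; returns (U, centers, J). Needs m > 1."""
    rng = random.Random(seed)
    n, dim = len(X), len(X[0])
    # Step 1: random memberships, each row normalized to sum to 1.
    U = []
    for _ in range(n):
        row = [rng.random() + 1e-9 for _ in range(k)]
        total = sum(row)
        U.append([v / total for v in row])
    prev_J = float("inf")
    for _ in range(max_iter):
        # Step 2: cluster centers as membership-weighted means.
        centers = []
        for j in range(k):
            w = [U[i][j] ** m for i in range(n)]
            sw = sum(w)
            centers.append(
                [sum(w[i] * X[i][d] for i in range(n)) / sw for d in range(dim)]
            )
        # Squared Euclidean distances, floored to avoid division by zero.
        D2 = [
            [max(sum((X[i][d] - centers[j][d]) ** 2 for d in range(dim)), 1e-12)
             for j in range(k)]
            for i in range(n)
        ]
        # Step 3: membership update, written in terms of squared distances.
        p = 1.0 / (m - 1.0)
        for i in range(n):
            inv = [D2[i][j] ** (-p) for j in range(k)]
            total = sum(inv)
            U[i] = [v / total for v in inv]
        # Step 4: value function; stop when it changes by less than tol.
        J = sum(U[i][j] ** m * D2[i][j] for i in range(n) for j in range(k))
        if abs(prev_J - J) < tol:
            break
        prev_J = J
    return U, centers, J


def average_entropy(U):
    """Average information entropy of a membership matrix, as in (5)."""
    h = 0.0
    for row in U:
        for u in row:
            for q in (u, 1.0 - u):
                if q > 0.0:  # treat 0 * log2(0) as 0
                    h -= q * math.log2(q)
    return h / len(U)


def select_k(X, c_min, c_max, m=2.0):
    """Run FCM for each k in [c_min, c_max]; return the k of smallest entropy."""
    scores = {k: average_entropy(fcm(X, k, m=m)[0])
              for k in range(c_min, c_max + 1)}
    return min(scores, key=scores.get), scores


# Toy data: two well-separated groups of four points each.
X = [(0, 0), (0, 1), (1, 0), (1, 1), (10, 10), (10, 11), (11, 10), (11, 11)]
U, centers, J = fcm(X, k=2)
best_k, scores = select_k(X, c_min=2, c_max=4)
```

Because FCM is sensitive to initialization, the entropy values in `scores` can change with the seed; in practice the runs would be repeated or started from better initial centers.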
F-CMSVM uses a support vector machine with a fuzzy membership function: the membership matrix obtained from the fuzzy C-means algorithm serves as the fuzzy membership function, so that each input sample receives its own penalty value and an optimized separating hyperplane can be obtained. The algorithm first supposes that the given data set can be divided into two categories ($k = 2$) and clusters it with fuzzy C-means. It then uses the resulting membership matrix as the fuzzy membership of the fuzzy support vector machine and trains on the data set to obtain the support vectors and the separating hyperplane. To test the assumption, Ref. [5] proposed a new criterion: let $d_{SV}$ be the distance between the two categories, and let $d_{S1}$ and $d_{S2}$ be the average distances between the support vectors of $S_1$ and $S_2$, respectively, and their nearest neighbor points. If $d_{SV} \le \min(d_{S1}, d_{S2})$, the original data set cannot be divided and the assumption is false; otherwise the assumption holds and the original data set can be divided into at least two categories.

B. Obtain The Global Optimal Solution

Because FCM is sensitive to initialization, it easily falls into a local minimum. Clustering methods based on genetic algorithms have been proposed for this problem; they converge to the global optimum with higher probability, but their convergence is slower and they are prone to premature convergence. To solve this problem, Ref.
[6] used the clonal selection algorithm (CSA) to optimize the objective function of the unsupervised FCM clustering algorithm. CSA uses the mechanism of antibody cloning to construct a clone operator that combines the characteristics of evolutionary search, global search, stochastic search, and local search. Because CSA based on the clone operator is a population-based strategy, it has parallelism and randomness, so that it can obtain the global optimal solution with a
Second International Symposium on N
ISBN 978-952-5726-09-1 (Print) Proc