1604 JOURNAL OF COMPUTERS, VOL. 8, NO. 6, JUNE 2013
The class a sample belongs to is that of the component with the maximum value.

$k(\cdot,\cdot)$: kernel function of the variables $x$ and $z$.
$K$: kernel matrix. Its columns are the coefficients of the kernel functions used to represent the discriminative function $f$.
$\|\cdot\|_{\mathcal{H}}$: norm in the Hilbert space $\mathcal{H}$.
$\langle\cdot,\cdot\rangle_{\mathcal{H}}$: inner product in the Hilbert space $\mathcal{H}$.
$\mathrm{tr}(M)$: the trace of the matrix $M$, that is, the sum of the diagonal elements of $M$.
$\mathrm{span}\{\cdot\}$: subspace spanned by the given functions.
In implementations of this kernel-based learning approach, the Radial Basis Function (RBF), or Gaussian, kernel is commonly used, and the kernel $k$ defines a unique RKHS. Since the Gaussian kernel is isotropic, it is spherically symmetric: in general, it does not conform to the particular geometry of the underlying classes. In other words, the underlying data structure is ignored, and as a result the kernel cannot provide an accurate decision surface. To address these limitations, it is crucial to define a new kernel that is well adapted to the geometry of the data distribution.
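To make the isotropy concrete, the following minimal sketch (function and variable names are hypothetical, not from the paper) computes a Gaussian Gram matrix; the kernel value depends only on the distance $\|x - z\|$, which is exactly the spherical symmetry discussed above:

```python
import numpy as np

def rbf_kernel(X, Z, sigma=1.0):
    """Gaussian (RBF) kernel matrix: k(x, z) = exp(-||x - z||^2 / (2 sigma^2))."""
    # Squared Euclidean distances between every row of X and every row of Z.
    d2 = np.sum(X**2, axis=1)[:, None] + np.sum(Z**2, axis=1)[None, :] - 2 * X @ Z.T
    return np.exp(-np.maximum(d2, 0.0) / (2.0 * sigma**2))

X = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 2.0]])
K = rbf_kernel(X, X, sigma=1.0)
# K depends only on pairwise distances: rotating all points about a common
# centre leaves K unchanged, regardless of the geometry of the classes.
```

Because the similarity contours are spheres around each sample, no direction in feature space is preferred, which is why the data-distribution geometry is not captured.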
Instead of solving (8) as in a traditional kernel-based learning approach, we modify (or deform) the original kernel in order to adapt it to the geometry of the underlying distribution. Defining a new deformed kernel $\tilde{k}$ with associated RKHS $\tilde{\mathcal{H}}$, the new problem to be solved becomes

$$f^* = \arg\min_{f \in \tilde{\mathcal{H}}} \frac{1}{l}\sum_{i=1}^{l} V(x_i, y_i, f) + \gamma_A \|f\|_{\tilde{\mathcal{H}}}^2, \qquad (9)$$

Problems (8) and (9) are solved with different kernels, and thus in different RKHSs. By the representer theorem, the solution of (9) is

$$f^*(x) = \sum_{i=1}^{l} \alpha_i\, \tilde{k}(x_i, x), \qquad (10)$$

which is appropriate for real settings.
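If the loss $V$ is taken to be the squared loss (an assumption; the section does not fix $V$), the coefficients of the representer-theorem expansion in (10) have a well-known closed form, $\alpha = (K + \gamma l I)^{-1} y$. A minimal sketch (names hypothetical):

```python
import numpy as np

def kernel_rls(K, y, gamma):
    """Regularized least squares in an RKHS with Gram matrix K.

    With squared loss V, the representer theorem gives f(x) = sum_i alpha_i k(x_i, x),
    and the coefficients solve the linear system (K + gamma * l * I) alpha = y.
    """
    l = len(y)
    return np.linalg.solve(K + gamma * l * np.eye(l), y)

# Toy usage with a linear kernel on 1-D data.
X = np.array([[0.0], [1.0], [2.0]])
y = np.array([0.0, 1.0, 2.0])
K = X @ X.T
alpha = kernel_rls(K, y, gamma=1e-6)
f_train = K @ alpha  # predictions at the training points, close to y
```

The same solver applies unchanged once $K$ is replaced by the Gram matrix of the deformed kernel, since only the kernel (and hence the RKHS) differs between (8) and (9).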
To “deform” the original kernel and adapt it to the geometry of the underlying distribution, let $\mathcal{V}$ be a linear space with a positive semi-definite inner product, and let $S: \mathcal{H} \to \mathcal{V}$ be a bounded linear operator. Define $\tilde{\mathcal{H}}$ to be the space containing the same functions as $\mathcal{H}$, with its inner product defined as

$$\langle f, g \rangle_{\tilde{\mathcal{H}}} = \langle f, g \rangle_{\mathcal{H}} + \langle Sf, Sg \rangle_{\mathcal{V}}, \qquad (11)$$

It is proved in [27] that $\tilde{\mathcal{H}}$ is a valid RKHS. In this specific problem, it is required that $\mathcal{V}$ and $S$ depend on the data. Therefore, let $\mathcal{V}$ be $\mathbb{R}^n$, and define $S$ as the evaluation map $Sf = (f(x_1), \ldots, f(x_n))^T$. Using a symmetric positive semidefinite matrix $M$, the semi-norm on $\mathcal{V}$ can be written as $\|Sf\|_{\mathcal{V}}^2 = \mathbf{f}^T M \mathbf{f}$, with $\mathbf{f} = (f(x_1), \ldots, f(x_n))^T$. With such a norm, the regularization problem in (9) becomes

$$f^* = \arg\min_{f \in \mathcal{H}} \frac{1}{l}\sum_{i=1}^{l} V(x_i, y_i, f) + \gamma_A \|f\|_{\mathcal{H}}^2 + \gamma_A\, \mathbf{f}^T M \mathbf{f} \qquad (12)$$
where $\mathbf{f}$ includes evaluations at both labeled and unlabeled data and the matrix $M$ encodes smoothness w.r.t. the graph or manifold. Let

$$M = \frac{\gamma_I}{\gamma_A} L$$

and substitute it into (12); we have

$$f^* = \arg\min_{f \in \mathcal{H}} \frac{1}{l}\sum_{i=1}^{l} V(x_i, y_i, f) + \gamma_A \|f\|_{\mathcal{H}}^2 + \gamma_I\, \mathbf{f}^T L \mathbf{f} \qquad (13)$$

where $\gamma_I / \gamma_A$ is a free parameter that controls the “deformation” of the original kernel. Equation (13) is, in fact, a graph-based semi-supervised learning problem based on the manifold assumption; it can be set out indirectly using (12) and solved using (10).
To utilize the geometric information of the data distribution, a graph $G$ can be constructed using labeled and unlabeled pixels. The graph Laplacian of $G$ is an $n \times n$ matrix defined as $L = D - W$, where $W$ is the adjacency matrix. The elements $W_{ij}$ are measures of the similarity between pixels $x_i$ and $x_j$, and the diagonal matrix $D$ is given by $D_{ii} = \sum_{j=1}^{n} W_{ij}$. The graph Laplacian $L$ measures the variation of the function $f$ along the graph built upon all labeled and unlabeled samples. By fixing $\gamma_I = 0$, the original (undeformed) kernel is obtained.
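The Laplacian construction above can be sketched as follows, assuming a fully connected graph with Gaussian similarities (a common choice; a k-nearest-neighbour adjacency would work equally well, and all names are hypothetical):

```python
import numpy as np

def graph_laplacian(X, sigma=1.0):
    """Unnormalized graph Laplacian L = D - W over labeled and unlabeled samples.

    W_ij is a Gaussian similarity between samples i and j, and
    D is the diagonal degree matrix with D_ii = sum_j W_ij.
    """
    d2 = np.sum(X**2, 1)[:, None] + np.sum(X**2, 1)[None, :] - 2 * X @ X.T
    W = np.exp(-np.maximum(d2, 0.0) / (2.0 * sigma**2))
    np.fill_diagonal(W, 0.0)        # no self-loops
    D = np.diag(W.sum(axis=1))      # degree matrix
    return D - W

X = np.random.default_rng(0).normal(size=(6, 2))
L = graph_laplacian(X)
# Each row of L sums to zero, and f^T L f = 0.5 * sum_ij W_ij (f_i - f_j)^2 >= 0,
# which is exactly the "variation along the graph" penalized in (13).
```

The quadratic form $\mathbf{f}^T L \mathbf{f}$ is small precisely when $f$ changes little between similar (strongly connected) samples, which is how the manifold assumption enters the objective.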
Next, we discuss the computation of the deformed kernel $\tilde{k}$. In [27], the resulting new kernel was computed explicitly in terms of labeled and unlabeled data. It is proved that $\tilde{k}_x \in \mathcal{H}$ and

$$\mathrm{span}\{\tilde{k}_{x_1}, \ldots, \tilde{k}_{x_n}\} = \mathrm{span}\{k_{x_1}, \ldots, k_{x_n}\}. \qquad (14)$$

Thus, the two spans are the same and we have

$$\tilde{k}_x = k_x + \sum_{j=1}^{n} \beta_j(x)\, k_{x_j}, \qquad (15)$$

where the coefficients $\beta_j(x)$ depend on $x$; let

$$\boldsymbol{\beta}(x) = (\beta_1(x), \ldots, \beta_n(x))^T. \qquad (16)$$

Using the reproducing property of $\tilde{\mathcal{H}}$, we can compute $f(x)$ at $x$:

$$f(x) = \langle f, \tilde{k}_x \rangle_{\tilde{\mathcal{H}}} = f(x) + \mathbf{g}^T \boldsymbol{\beta}(x) + \mathbf{g}^T M \big(\mathbf{k}_x + K \boldsymbol{\beta}(x)\big), \qquad (17)$$

where $\mathbf{k}_x = (k(x_1, x), \ldots, k(x_n, x))^T$ and $\mathbf{g}$ is the vector given by the components $g_j = f(x_j)$. Since (17) must hold for every $f$, it can be derived from (17) that

$$(I + MK)\, \boldsymbol{\beta}(x) = -M \mathbf{k}_x, \qquad (18)$$

where $K$ is the kernel matrix with entries $K_{ij} = k(x_i, x_j)$ and $I$ is the $n \times n$ identity matrix. Finally, we obtain the following explicit form for $\tilde{k}$:

$$\tilde{k}(x, z) = k(x, z) - \mathbf{k}_x^T (I + MK)^{-1} M\, \mathbf{k}_z, \qquad (19)$$

where $\mathbf{k}_z = (k(x_1, z), \ldots, k(x_n, z))^T$. It satisfies Mercer’s conditions, being a valid kernel. If $\gamma_I = 0$, so that $M = 0$, the deformed kernel degenerates into the original kernel.
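Restricted to the $n$ graph points themselves, the vectors $\mathbf{k}_x$ in (19) are just the columns of $K$, so the whole deformed Gram matrix collapses to the single expression $\tilde{K} = K - K(I + MK)^{-1}MK$. A minimal sketch (names hypothetical):

```python
import numpy as np

def deformed_kernel_matrix(K, M):
    """Gram matrix of the deformed kernel, following the explicit form (19):

        k~(x, z) = k(x, z) - k_x^T (I + M K)^{-1} M k_z,

    evaluated at the n points defining K, where each k_x is a column of K,
    so that K~ = K - K (I + M K)^{-1} M K.
    """
    n = K.shape[0]
    return K - K @ np.linalg.solve(np.eye(n) + M @ K, M @ K)

# Toy usage: an RBF Gram matrix deformed by a Laplacian penalty M = (gamma_I/gamma_A) * L.
rng = np.random.default_rng(0)
X = rng.normal(size=(5, 2))
d2 = np.sum(X**2, 1)[:, None] + np.sum(X**2, 1)[None, :] - 2 * X @ X.T
K = np.exp(-np.maximum(d2, 0.0) / 2.0)
W = np.exp(-np.maximum(d2, 0.0) / 2.0)
np.fill_diagonal(W, 0.0)
L = np.diag(W.sum(axis=1)) - W
K_tilde = deformed_kernel_matrix(K, 0.5 * L)
# With M = 0 the linear solve returns zero, so K~ = K: the deformation vanishes,
# matching the gamma_I = 0 degenerate case in the text.
```

Since $K$ and $M$ are symmetric positive semidefinite, $\tilde{K}$ remains symmetric positive semidefinite, consistent with $\tilde{k}$ satisfying Mercer’s conditions.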
© 2013 ACADEMY PUBLISHER