Download Full Issue in PDF - Academy Publisher

1522 JOURNAL OF COMPUTERS, VOL. 8, NO. 6, JUNE 2013

category. But this gives only a rough analysis of leaf shapes, so the next step is a refined analysis.

C. Classification Model of Leaves 2

Factor analysis is then performed on the leaf shapes within one category to calculate factor scores, which are used for clustering. This gives a more refined clustering analysis. Several dozen factors describe leaf shapes, such as leaf shape, leaf width, leaf length, and leaf vein, but the length of the veins determines leaf length and leaf width to a certain extent, and some factors can be described completely by other factors. We therefore reduce the dimension first and then cluster: factor analysis reduces the dimension of the influential factors and yields the factor scores used for clustering. This method not only distinguishes leaf shapes well but also reduces the complexity of the problem.

The mathematical model for factor analysis is as follows:

\[
\begin{cases}
X_1 = a_{11}F_1 + a_{12}F_2 + \cdots + a_{1m}F_m + \varepsilon_1 \\
X_2 = a_{21}F_1 + a_{22}F_2 + \cdots + a_{2m}F_m + \varepsilon_2 \\
\quad\vdots \\
X_P = a_{P1}F_1 + a_{P2}F_2 + \cdots + a_{Pm}F_m + \varepsilon_P
\end{cases}
\tag{1}
\]

represented in matrix form:

\[
\begin{bmatrix} X_1 \\ X_2 \\ \vdots \\ X_P \end{bmatrix}
=
\begin{bmatrix}
a_{11} & a_{12} & \cdots & a_{1m} \\
a_{21} & a_{22} & \cdots & a_{2m} \\
\vdots & \vdots & & \vdots \\
a_{P1} & a_{P2} & \cdots & a_{Pm}
\end{bmatrix}
\begin{bmatrix} F_1 \\ F_2 \\ \vdots \\ F_m \end{bmatrix}
+
\begin{bmatrix} \varepsilon_1 \\ \varepsilon_2 \\ \vdots \\ \varepsilon_P \end{bmatrix}
\]

Simply recorded as:

\[
X = AF + \varepsilon \tag{2}
\]

which satisfies:

1) $m \le P$;
2) $\operatorname{cov}(F, \varepsilon) = 0$;
3) $D(F) = \begin{bmatrix} 1 & & 0 \\ & \ddots & \\ 0 & & 1 \end{bmatrix} = I_m$, i.e., $F_1, \ldots, F_m$ are uncorrelated and each has variance 1;
4) $D(\varepsilon) = \begin{bmatrix} \sigma_1^2 & & 0 \\ & \ddots & \\ 0 & & \sigma_P^2 \end{bmatrix}$, i.e., $\varepsilon_1, \ldots, \varepsilon_P$ are uncorrelated and have different variances.

Here $X = (X_1, \ldots, X_P)'$ is the $P$-dimensional random vector made up of the $P$ indexes obtained in actual observation, and $F = (F_1, \ldots, F_m)'$ is an unobservable vector called the common factor of the above-mentioned integrated variable $X$. $A$ is the factor loading matrix, to which a maximum variance rotation is applied so that the structure of $A$ is simplified.
In other words, the squared elements in each column of the loading matrix are polarized toward 0 or 1; the more dispersed the contribution rates of the common factors, the better the result. The variables in factor analysis are represented as linear combinations of the common factors:

\[
X_i = a_{i1}F_1 + a_{i2}F_2 + \cdots + a_{im}F_m + \varepsilon_i, \quad i = 1, 2, \ldots, P \tag{3}
\]

In practice, however, it is often more convenient to describe the characteristics of the research object in terms of the common factors. Therefore, the common factors are in turn represented as linear combinations of the variables, i.e., the factor score function:

\[
F_j' = \beta_{j1}X_1 + \beta_{j2}X_2 + \cdots + \beta_{jP}X_P, \quad j = 1, 2, \ldots, m \tag{4}
\]

We calculate $m$ factor scores for each leaf sample. The scores of these $m$ factors are then used as variable values to cluster the different leaves with the K-means clustering method.

D. Clustering Error Estimation

We give an evaluation method for judging the clustering effect. Usually the back substitution misjudgment probability and the cross misjudgment probability are used. If the number of samples belonging to $G_1$ that are misjudged as belonging to $G_2$ is $N_1$, the number of samples belonging to $G_2$ that are misjudged as belonging to $G_1$ is $N_2$, and the total number of samples in the two populations is $n$, then the misjudgment probability is:

\[
p = \frac{N_1 + N_2}{n} \tag{5}
\]

Back substitution misjudgment probability

Let $G_1$, $G_2$ be two populations, and let $X_1, \ldots, X_m$ and $Y_1, \ldots, Y_n$ be training samples from $G_1$ and $G_2$ respectively. All the training samples are treated as $m + n$ new samples and substituted one by one into the established criterion to judge which population each new sample belongs to. This process is called back substitution.
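The clustering step and the misjudgment probability of Eq. (5) can be illustrated with a minimal sketch. It assumes the factor scores have already been computed; the function names `kmeans` and `misjudgment_probability`, the choice of initial centroids, and the sample scores are hypothetical and are not taken from the paper.

```python
from math import dist


def kmeans(points, centroids, max_iter=100):
    """Plain Lloyd's algorithm on factor-score vectors; returns (labels, centroids)."""
    centroids = [tuple(c) for c in centroids]
    labels = []
    for _ in range(max_iter):
        # Assignment step: each sample goes to its nearest centroid (Euclidean distance).
        new_labels = [
            min(range(len(centroids)), key=lambda k: dist(p, centroids[k]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments no longer change
        labels = new_labels
        # Update step: each centroid becomes the mean of its members.
        for k in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:  # an empty cluster keeps its previous centroid
                centroids[k] = tuple(sum(col) / len(members) for col in zip(*members))
    return labels, centroids


def misjudgment_probability(true_labels, predicted_labels):
    """Eq. (5): p = (N1 + N2) / n, with G1 labelled 0 and G2 labelled 1."""
    pairs = list(zip(true_labels, predicted_labels))
    n1 = sum(1 for t, p in pairs if t == 0 and p == 1)  # G1 misjudged as G2
    n2 = sum(1 for t, p in pairs if t == 1 and p == 0)  # G2 misjudged as G1
    return (n1 + n2) / len(pairs)


# Made-up two-factor scores for six leaf samples (illustration only).
scores = [(0.1, 0.2), (0.0, -0.1), (0.3, 0.1), (2.1, 1.9), (1.8, 2.2), (2.0, 2.0)]
true = [0, 0, 0, 1, 1, 1]
labels, centers = kmeans(scores, centroids=[scores[0], scores[3]])
p = misjudgment_probability(true, labels)
```

Seeding the centroids with one sample from each group keeps cluster 0 aligned with $G_1$ in this toy case; with arbitrary initial centroids, the cluster indices would first have to be matched to the groups before Eq. (5) is applied.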
If the number of samples belonging to $G_1$ that are misjudged as belonging to $G_2$ is $N_1$, and the number of samples belonging to $G_2$ that are misjudged as belonging to $G_1$ is $N_2$, then the misjudgment probability is estimated as:

\[
\hat{p} = \frac{N_1 + N_2}{m + n}
\]

Cross misjudgment probability

The cross misjudgment probability is obtained by eliminating one sample at a time, using the remaining $m + n - 1$ training samples to establish a judgment criterion, and then using that criterion to judge the eliminated sample. This analysis is made on every training sample, and the proportion of misjudged samples is used as the misjudgment probability. The specific procedure is as follows:

1) From the training samples of population $G_1$, eliminate one sample, and use the remaining $m - 1$ samples plus all samples in $G_2$ to establish a discriminant function;
2) Use the established discriminant function to judge the eliminated sample;
3) Repeat steps 1) and 2) until every sample in $G_1$ has in turn been eliminated and judged. The number of misjudged samples is recorded as $m_{12}$;
4) Repeat steps 1), 2) and 3) for the samples in $G_2$, until every sample in $G_2$ has in turn been eliminated and judged. The number of misjudged samples is recorded as $n_{21}$.

So the cross misjudgment probability is estimated as:

\[
\hat{p} = \frac{m_{12} + n_{21}}{m + n} \tag{6}
\]

If the clustering result is poor, the following optimizations can be carried out:

1) Increase the sample size;
2) Add new index variables;
3) If the statistical data are wrong, collect the data again.

III. PROPOSED MODEL BASED ON SNOWFLAKE THEORY

A. Snowflake Theory

Each snowflake as a whole is a hexagonal star with six trunks; each trunk has small branches, smaller branches grow on those, and so on, as shown in figure 1 below. A snowflake takes shape by constantly copying its parts and its whole. A growth process with these characteristics is called snowflake theory.

Figure 1. Snowflake

We already know from the above that each tree species has its own particular branching angle. We treat the tree trunk as straight and, from another perspective, regard it as a lateral branch. Every lateral branch can itself branch, and all lateral branches have the same status. Each layer of branches branches according to a similar rule. According to this growth rule, we simulate the outline of a tree, as shown in Fig. 2.

Figure 2. The Tree of Computer Simulation

Following the ideas of snowflake theory, the growth process of the tree is built up until it reaches the state of the observed tree.
The law governing the change between the state of one level of branching and the state of its sub-level must be found in order to form the recursion relationship for programming. For a given level of branches, the main parameters are the number of branches, number of sections, interval between sections, azimuth, included angle of branching, curvature, length of branches, and stem. In [1] three ways of branching are mentioned, i.e., single axis branching, false binary branching and merging axis branching. To simulate the growth of a tree, we must first determine which way of branching it follows. Once the law of its branching is found, a computer can be used to simulate its growth process.

B. Ways for Branching

We know that the growth process of trees is characterized by self-adaptation, uncertainty, emergence, finality and openness. Different kinds of trees have different ways of branching, and their laws of copying differ. So we analyze the ways of branching. Roughly, there are three ways of branching for trees:

• Single axis branching: The apical bud of the tree grows vigorously and continuously, forming a stout trunk. Lateral buds also grow into lateral branches, on which sub-branches grow in turn, as shown in figure 3 below. The trunk in single axis branching is comparatively straight, and the branches at all other levels grow less vigorously than the trunk. Poplar, metasequoia, etc., all belong to the single axis branching group.
• False binary branching: The apical bud of the tree stops growing after forming a branch. Close to the branch, two opposite axillary buds grow simultaneously into a pair of opposite lateral branches. The apical bud and axillary buds on the two opposite lateral branches then repeat the same growing process, as shown in the figure below. Clove, carnation and horse chestnut, etc., all belong to the false binary branching group.

© 2013 ACADEMY PUBLISHER
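The recursion relationship described in Section III, where every branch repeats the same branching rule at a smaller scale, can be sketched as follows. This is only an illustration under stated assumptions: the function name `grow`, the 30-degree branching angle, the length ratio of 0.7, and the use of exactly two sub-branches per branch (loosely resembling false binary branching) are hypothetical choices, not parameters given in the paper.

```python
import math


def grow(x, y, heading, length, depth, angle=math.radians(30), ratio=0.7):
    """Return the line segments ((x0, y0), (x1, y1)) of one branch and all its sub-branches."""
    # The branch itself: a straight segment starting at (x, y) in direction `heading`.
    x1 = x + length * math.cos(heading)
    y1 = y + length * math.sin(heading)
    segments = [((x, y), (x1, y1))]
    if depth > 0:
        # Each branch tip spawns two shorter sub-branches, turned left and right
        # by the (assumed) branching angle; every level follows the same rule.
        for turn in (-angle, angle):
            segments += grow(x1, y1, heading + turn, length * ratio,
                             depth - 1, angle, ratio)
    return segments


# A trunk of length 1 pointing straight up, followed by four levels of branching.
tree = grow(0.0, 0.0, math.pi / 2, 1.0, depth=4)
```

The returned segments can be passed to any plotting routine to draw an outline. Modelling a species-specific tree would mean replacing the fixed angle, ratio, and branch count with the measured parameters listed above (number of branches, azimuth, included angle, branch length, and so on).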