
…reached. The graphical structure of DTs makes them easy to read, to interpret and to design, in comparison with black-box classifiers such as neural networks, generalized linear models or maximum likelihood classifiers (Pal and Mather, 2003). Such DTs are deterministic: two instances carrying the same values for their attributes will be routed by the same test nodes to the same decision leaf.
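As a minimal illustration of this determinism (a sketch only; the attribute names, thresholds and crop labels below are invented, not the study's actual tree):

```python
def classify(instance: dict) -> str:
    """Route an instance through fixed test nodes to a decision leaf."""
    if instance["field_area"] < 2.5:          # test node on a continuous attribute
        return "permanent grassland"          # decision leaf
    if instance["waterlogging"] == "high":    # test node on a discrete attribute
        return "permanent grassland"
    return "maize"

# Identical attribute values always reach the same leaf:
a = {"field_area": 4.0, "waterlogging": "low"}
b = {"field_area": 4.0, "waterlogging": "low"}
assert classify(a) == classify(b)
```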

DTs do not require expert knowledge to be built: they can be extracted from a learning dataset. We used the C4.5 algorithm (Quinlan, 1993) because it handles both discrete attributes (the 10 crop classes, the three soil waterlogging classes) and continuous attributes (distance-to-farmstead and field area are real numbers) without making any assumption about the distribution of the input attributes (Friedl and Brodley, 1997). The learning dataset supplied to C4.5 must contain instances characterized by explanatory attributes (the observed current crop, field area, field distance-to-farmstead, soil waterlogging class) and one target attribute (the observed next crop).
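The expected shape of such a learning dataset can be sketched as follows (the rows and values are invented for illustration, not taken from the observed data):

```python
# Each instance: explanatory attributes + one target attribute (the next crop).
learning_set = [
    # current_crop, area_ha, dist_to_farmstead_m, waterlogging, next_crop (target)
    ("maize",   3.2,  450.0, "low",    "wheat"),
    ("wheat",   1.1,  120.0, "high",   "grassland"),
    ("maize",   5.8,  900.0, "medium", "maize"),
]
```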

The learning process starts with the “tree growing” phase, which recursively subdivides the learning dataset into smaller partitions, by testing the values of explanatory attributes and maximizing the information gain ratio. The information carried by a dataset partition is evaluated by an indicator derived from the Shannon entropy index (Shannon, 1948):

$$\mathrm{info}(T) = -\sum_{j=1}^{m} p_{j/T}\,\log_2 p_{j/T}\,,\qquad \text{with } p_{j/T} = \frac{n_{j/T}}{\sum_{j=1}^{m} n_{j/T}} \qquad (8)$$

where $\mathrm{info}(T)$ is the information entropy of data partition $T$, $\log_2$ is the logarithm to base 2, $m$ is the number of crop classes, and $p_{j/T}$ is the proportion of instances in $T$ carrying the crop class $j$. $p_{j/T}$ is therefore the ratio between $n_{j/T}$, the number of instances carrying class $j$ in $T$, and the total number of instances of $T$.
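A direct transcription of eq. (8), assuming each instance stores its crop class as the last element (a sketch under that assumption, not the C4.5 implementation itself):

```python
import math
from collections import Counter

def info(partition):
    """Shannon entropy of partition T, eq. (8): -sum_j p_{j/T} * log2(p_{j/T})."""
    total = len(partition)
    counts = Counter(inst[-1] for inst in partition)  # n_{j/T} for each class j
    return -sum((n / total) * math.log2(n / total) for n in counts.values())
```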

Given a test X that partitions T into n outcomes, the total information content after applying X is the sum of the information of the sub-partitions, weighted by the number of instances in each sub-partition:

$$\mathrm{info}_X(T) = \sum_{i=1}^{n} \frac{\sum_{j=1}^{m} n_{j/T_i}}{\sum_{j=1}^{m} n_{j/T}}\;\mathrm{info}(T_i) \qquad (9)$$
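Eq. (9) can be transcribed the same way, reusing the `info()` sketch above; since the denominator $\sum_{j} n_{j/T}$ is simply the size of $T$, each weight reduces to $|T_i| / |T|$:

```python
def info_x(partition, test):
    """Weighted entropy after applying test X, eq. (9).
    `test` maps an instance to one of the n outcomes of X."""
    total = len(partition)
    outcomes = {}
    for inst in partition:
        outcomes.setdefault(test(inst), []).append(inst)  # build the T_i
    return sum(len(sub) / total * info(sub) for sub in outcomes.values())
```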

The information gained by splitting T using X is:

$$\mathrm{gain}(X) = \mathrm{info}(T) - \mathrm{info}_X(T) \qquad (10)$$


The gain criterion selects the test X for which gain(X) is maximal.
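Putting eqs. (8)–(10) together, the gain criterion can be sketched as a simple argmax over candidate tests (note that C4.5 proper normalizes this gain into a gain *ratio*, which is not shown in this sketch):

```python
def gain(partition, test):
    """Information gain of test X on partition T, eq. (10)."""
    return info(partition) - info_x(partition, test)

def best_test(partition, candidate_tests):
    """Select the candidate test with maximum gain(X)."""
    return max(candidate_tests, key=lambda t: gain(partition, t))
```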


