Development of a Liquid Scintillator and of Data ... - Borexino - Infn
5.2 Artificial Neural Networks

In this configuration the neurons are grouped in layers: an input layer, one or more hidden layers and an output layer. Each neuron is connected with all the neurons of the adjacent layers, but no back-coupling links or links within the same layer are allowed (see fig. 5.5). In principle, one hidden layer is sufficient to fit any continuous function, but it may well be more practical to use a network with more than one hidden layer.
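The layered, forward-only structure described above can be sketched in Python (the thesis itself used the Fortran package JetNet; the layer sizes and the sigmoid here are illustrative assumptions):

```python
import numpy as np

def sigmoid(x):
    """Sigmoidal activation function with output in (0, 1)."""
    return 1.0 / (1.0 + np.exp(-x))

def feed_forward(pattern, weight_matrices):
    """Propagate one input pattern through a fully connected feed-forward net.

    weight_matrices holds one matrix per pair of adjacent layers; signals
    only move forward, so there are no back-coupling or intra-layer links.
    """
    activation = pattern
    for W in weight_matrices:
        activation = sigmoid(W @ activation)
    return activation
```

Each weight matrix connects every neuron of one layer to every neuron of the next, so a three-layer net (input, one hidden, output) needs two matrices.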
Once the weights are set in the training process, the network architecture is completely determined. There is a variety of training methods. In the present case I have used a method known as back-propagation. A set of Monte Carlo simulated data is presented to the net together with the corresponding output. For each input pattern $p$ the energy function is evaluated:
\[
E_p = \frac{1}{2} \sum_i \left( O_i - T_i \right)^2 ,
\]
where $O_i$ is the computed network output for the given pattern and $T_i$ is the desired value (target) for the same pattern. The energy function is computed by the net and its value is propagated backwards to modify each weight. The updating function is
\[
\Delta w_{ij}^{(n+1)} = -\eta \, \frac{\partial E_p}{\partial w_{ij}} + p \, \Delta w_{ij}^{(n)} ,
\]
where $\Delta w_{ij}^{(n+1)}$ is the $(n+1)$-st update of the weight $w_{ij}$ and $\eta$ is the learning parameter, representing the step length on the error surface that is defined by the value of the error as a function of all weights. The last term is a momentum term which stabilizes the training by avoiding oscillations. The strength of the damping is set by the parameter $p$, which should be between 0 and 1. The updating of the weights can be done after each pattern, after a group of patterns, or just once after the complete training set, called one epoch. The smaller the number of patterns the energy function is averaged over, the faster, but the more unstable, the training process can be. The training starts with randomly selected weights and relatively high values of the learning rate and the momentum; these factors are changed during the training over several thousands of epochs, until the error function reaches a minimum on a validation set, a Monte Carlo generated set different from the learning set.
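The weight update with momentum can be sketched as follows (a minimal Python illustration of the formula above; the default values of $\eta$ and $p$ are placeholders, not the values used in the thesis):

```python
import numpy as np

def momentum_update(w, grad_E, prev_dw, eta=0.1, p=0.5):
    """One back-propagation weight update with a momentum term:

        dw^(n+1) = -eta * dE/dw + p * dw^(n)

    eta is the learning parameter (the step length on the error surface)
    and p, between 0 and 1, sets the strength of the damping.
    """
    dw = -eta * grad_E + p * prev_dw
    return w + dw, dw
```

Setting $p = 0$ recovers plain gradient descent; the returned `dw` is fed back in as `prev_dw` on the next call, which is what damps the oscillations.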
It is important that the input variables are of the same order of magnitude and range in order to obtain stable convergence towards the minimum: the simplest method to achieve this is to renormalize the input variables to the interval $[0,1]$. Furthermore, in order to avoid local minima, it is recommended to pick the input events at random from the simulated set, and, to prevent overfitting, to have the number of simulated learning patterns at least one order of magnitude higher than the number of weights in the net. The problem of overfitting arises when there are too many weights in the net with respect to the number of learning patterns, so that the net becomes very specialized in fitting only those patterns and loses its generalization capability.
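The renormalization of the inputs is a simple linear rescaling, sketched here in Python (a min-max rescale per input variable, assuming the patterns are stored as rows):

```python
import numpy as np

def rescale_inputs(patterns):
    """Linearly rescale each input variable (column) onto [0, 1] so that
    all inputs share the same order of magnitude and range."""
    lo = patterns.min(axis=0)
    hi = patterns.max(axis=0)
    return (patterns - lo) / (hi - lo)
```

The same `lo` and `hi` obtained from the learning set would then be applied unchanged to the validation set and to real data, so that all samples are mapped consistently.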
For the NN computations the Fortran package JetNet 3.4 [Pet93] was used. A three-layer NN was chosen, with a sigmoidal activation function having its output in the range $[0,1]$. The training was performed using the back-propagation algorithm and was stopped when the behaviour of the error function became flat.
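A stopping criterion of this kind can be sketched as follows; the helper names (`run_epoch`, `validation_error`) and the `patience` threshold are hypothetical, standing in for one JetNet training epoch and for the error evaluated on the Monte Carlo validation set:

```python
def train_until_flat(run_epoch, validation_error, max_epochs=5000, patience=50):
    """Stop training once the error on the validation set has not improved
    for `patience` consecutive epochs, i.e. its behaviour became flat.

    run_epoch performs one training epoch; validation_error evaluates the
    current net on the validation set (both are assumed helpers).
    """
    best_error = float("inf")
    best_epoch = 0
    for epoch in range(max_epochs):
        run_epoch()
        error = validation_error()
        if error < best_error:
            best_error, best_epoch = error, epoch
        elif epoch - best_epoch >= patience:
            break
    return best_error
```

Monitoring the validation set rather than the learning set is what makes this an overfitting guard: the learning error keeps decreasing even after the net has started to memorize the patterns, while the validation error flattens out or rises.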