Tesis y Tesistas 2020 - Postgrado - Fac. de Informática - UNLP
DOCTORATE IN
COMPUTER SCIENCE
Dr. Facundo
Manuel Quiroga
facundoq@gmail.com
Advisor
Dr. Laura Lanzarini
Thesis defense date
March 13, 2020
SEDICI
http://sedici.unlp.edu.ar/handle/10915/90903
Transformational Invariance and Equivariance Measures for Convolutional Neural Networks. Applications in Handshape Recognition
Keywords: Equivariance; Invariance; Neural Networks; Convolutional Neural Networks; Handshape Classification;
Measures; Transformations.
Motivation
Neural Networks are machine learning models that currently
achieve the best performance in a wide variety of problems.
Paired with optimization algorithms based on gradient descent,
they can optimize thousands or millions of parameters with
respect to a custom error function. Neural Networks distinguish
themselves from other models by not requiring manual design
of features to work correctly; instead, features are learned
automatically through the optimization process, which is also
called training. The design of a Neural Network is organized
in layers, which determine its architecture. Recently,
Neural Networks with a large number of layers have been
successfully trained using a set of techniques usually known
as Deep Learning.
In particular, Convolutional Neural Networks, that is, Neural
Networks that employ Convolutional Layers, are the state-of-the-art
in most computer vision problems, including image
classification.
Many of the problems for which Convolutional Networks are
the state-of-the-art require models to behave in a predictable
way whenever their input is transformed. There are two
fundamental properties that capture this requirement:
invariance and equivariance. Invariance states that the output
of the model is not affected at all by the transformations.
Equivariance generalizes invariance by allowing the output to
be affected, but in a controlled and useful manner.
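In symbols (a rough formalization; the notation is assumed here for
illustration rather than taken verbatim from the thesis), for a model f,
an input x, and a transformation t drawn from a set T with a
corresponding output transformation t':

    \text{Invariance:}\quad f(t(x)) = f(x) \qquad \forall x,\ \forall t \in T
    \text{Equivariance:}\quad f(t(x)) = t'(f(x)) \qquad \forall x,\ \forall t \in T

Invariance is thus the special case of equivariance in which t' is the
identity.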
While traditional models based on Convolutional Networks
are equivariant to translation by design, they are neither
equivariant to other transformations nor invariant to any set
of transformations, at least in the usual training and testing
scenarios. There are two main options to grant invariance or
equivariance to a neural network model. The most traditional
option has been to modify the design of the model so that it
possesses such properties. The other option is to train it using
data augmentation with the same set of transformations with
respect to which invariance or equivariance is desired, as
sketched below.
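As an illustration of the data augmentation option, the following is a
minimal sketch in Python using torchvision; the transformation set and
parameter ranges are assumptions for illustration, not the exact setup
used in the thesis.

    import torchvision.transforms as T
    from torchvision.datasets import MNIST

    # Augmentation pipeline: random rotations and small affine
    # perturbations are applied to each training image, so that the
    # trained model becomes approximately invariant to them.
    train_transform = T.Compose([
        T.RandomRotation(degrees=180),        # rotation in [-180, 180]
        T.RandomAffine(degrees=0,
                       translate=(0.1, 0.1),  # up to 10% shift per axis
                       scale=(0.9, 1.1)),     # +/- 10% scaling
        T.ToTensor(),
    ])

    # Example usage with a standard dataset (MNIST stands in here for a
    # handshape dataset).
    train_set = MNIST(root="./data", train=True, download=True,
                      transform=train_transform)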
However, the mechanisms by which these models acquire these
properties are unclear, whether for modified models or for
data-augmented ones. Furthermore, the impact of such
modifications on the models' efficiency and representational
power is unknown. Moreover, for traditional models trained with
data augmentation, the different augmentation strategies for
acquiring equivariance have not been studied.
Objectives
Our main objective in this thesis is to contribute to the
understanding and improvement of equivariance in neural
network models. In terms of applications, we focus on
handshape classification for sign language and other types of
gestures using convolutional networks.
Therefore, we set the following specific goals:
1. Analyze CNN models designed specifically for equivariance.
2. Compare specific models and data augmentation as means
to obtain equivariance. Evaluate transfer learning strategies to
obtain equivariant models starting with non-equivariant ones.
3. Develop equivariance measures for activations or inner
representations in Neural Networks. Implement those
measures in an open source library. Analyze the measures'
behaviour, and compare them with existing measures (a sketch
of one such measure appears after this list).
4. Characterize CNN models for image classification in terms
of equivariance using the proposed measures.
5. Compare CNN models, with or without equivariance, for
handshape classification.
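As a rough illustration of goal 3, below is a minimal sketch of a
normalized variance-based invariance measure over a layer's activations,
written with NumPy. The function name, tensor layout, and normalization
are assumptions for illustration only; the thesis defines its own
measures.

    import numpy as np

    def variance_invariance_measure(activations):
        """Hypothetical variance-based invariance score for one layer.

        activations: array of shape (n_samples, n_transforms, n_features),
        the layer's activations for each sample under each transformation.
        Lower scores indicate higher invariance to the transformation set.
        """
        # Variance across transformations, averaged over samples: how much
        # each activation moves when only the transformation changes.
        transform_var = activations.var(axis=1).mean(axis=0)   # (n_features,)
        # Variance across samples of the transformation-averaged
        # activations, used to normalize features with different scales.
        sample_var = activations.mean(axis=1).var(axis=0)      # (n_features,)
        ratio = transform_var / (sample_var + 1e-8)
        return ratio.mean()  # scalar score for the layer

    # Example: 100 samples, 8 transformations, 64 activations in a layer.
    acts = np.random.rand(100, 8, 64)
    print(variance_invariance_measure(acts))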
Given the existence of multiple methods to achieve