12.07.2015 Views

Zriadenie Fakulty informatiky a informačných technológií - FIIT STU

Zriadenie Fakulty informatiky a informačných technológií - FIIT STU

Zriadenie Fakulty informatiky a informačných technológií - FIIT STU

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

44 <strong>STU</strong> Faculty of Informatics and Information Technologies−−−−Poláčik, Michal: Adaptation of Recurrent Neural Network Training Algorithms forMultiprocessor Computer Environment. May 2009. Supervisor: M. ČerňanskýPorubčan, Zdenko: Computing Models for Hierarchical Temporal Memory.May 2009. Supervisor: J. ŠtefanovičPrinczkel, Daniel: Adaptation of Neural Networks Using Newton Raphson Method.May 2009. Supervisor: V. KvasničkaSim, Lukáš: Optimalization of Random Forest Algorithm for Prediction Tasks.May 2009. Supervisor: P. Angelovič− Svrček, Matúš: Evolution of Optimal Networks. May 2009. Supervisor: J.Pospíchal−−Šillík, Milan: Multimedial E-learning System about Video Compression. May2009. Supervisor: M. ŠperkaŠimko, Alexander: Symbolic Sequence Processing with Recurrent NeuralNetworks. May 2009. Supervisor: M. Čerňanský− Valaška, Ján: Manipulation with Parts of Different Program's. May 2009.Supervisor: J. Parízková−Varga, Ľubomír: Classification of Visual Data by Neural Networks.May 2009. Supervisor: P. Lacko− Zeman, Jozef: GIS for the Support of Tourism in Towns and Regions. May 2009.Supervisor: M. GalbavýDoctoral (PhD.) ThesesStudent name: Peter LackoDegree program: Artificial IntelligenceThesis title: Emergence of Game Strategy in Multiagent SystemsSupervisor: Vladimír Kvasnička, ProfessorDefended on: June 25, 2009Annotation: In this thesis we focused on subsymbolic approach to machine gameplay problem. We worked on two different methods of learning. Ourfirst goal was to test the ability of common feed-forward neural networksand the mixture of expert topology. We have derived reinforcementlearning algorithm for mixture of expert network topology.This topology is capable to split the problem into smaller parts, whichare easier to be solved by an expert neural network. We have comparedthe quality of strategy emergence between mixture of expertnetworks and feed-forward networks. Our experiments demonstratethat mixture of experts is able to play a game at the same level asfeed-forward networks with equal number of weights. The secondapproach derived in this work is reinforcement learning with usage ofextended Kalman filer. Extended Kalman filter can be used for neuralnetwork training. Its advantage is very high learning rate in terms oftraining cycles. We have proposed usage of extended Kalman filterfor reinforcement learning with TD(0) and Monte Carlo method. Wehave compared the quality of strategy emergence between extended

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!