April 2012 Volume 15 Number 2 - Educational Technology & Society
April 2012 Volume 15 Number 2 - Educational Technology & Society
April 2012 Volume 15 Number 2 - Educational Technology & Society
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
Valsamidis, S., Kontogiannis, S., Kazanidis, I., Theodosiou, T., & Karakos, A. (<strong>2012</strong>). A Clustering Methodology of Web Log<br />
Data for Learning Management Systems. <strong>Educational</strong> <strong>Technology</strong> & <strong>Society</strong>, <strong>15</strong> (2), <strong>15</strong>4–167.<br />
A Clustering Methodology of Web Log Data for Learning Management<br />
Systems<br />
Stavros Valsamidis 1 , Sotirios Kontogiannis 1 , Ioannis Kazanidis 2 , Theodosios Theodosiou 2<br />
and Alexandros Karakos 1<br />
1 Department of Electrical and Computer Engineering, Democritus University of Thrace, Xanthi, Greece //<br />
2 Accounting Department, Kavala Institute of <strong>Technology</strong>, Agios Loukas, 65404, Kavala, Greece //<br />
svalsam@ee.duth.gr // skontog@ee.duth.gr // kazanidis@teikav.edu.gr // theodosios.theodosiou@gmail.com //<br />
karakos@ee.duth.gr<br />
(Submitted July 2, 2010; Revised January 14, 2011; Accepted <strong>April</strong> 1, 2011)<br />
ABSTRACT<br />
Learning Management Systems (LMS) collect large amounts of data. Data mining techniques can be applied to<br />
analyse their web data log files. The instructors may use this data for assessing and measuring their courses. In<br />
this respect, we have proposed a methodology for analysing LMS courses and students’ activity. This<br />
methodology uses a Markov CLustering (MCL) algorithm for clustering the students’ activity and a<br />
SimpleKMeans algorithm for clustering the courses. Additionally we provide a visualisation of the results using<br />
scatter plots and 3D graphs. We propose specific metrics for the assessment of the courses based on the course<br />
usage. These metrics applied to data originated from the LMS log files of the Information Management<br />
Department of the TEI of Kavala. The results show that these metrics, if combined properly, can quantify<br />
quality characteristics of the courses. Furthermore, the application of the MCL algorithm to students’ activities<br />
provides useful insights to their usage of the LMS platform.<br />
Keywords<br />
E-learning, Web mining, Clustering, Metrics<br />
Introduction<br />
Learning Management Systems (LMSs) offer a lot of methods for the distribution of information and for the<br />
communication between the participants on a course. They allow instructors to deliver assignments to the students,<br />
produce and publish educational material, prepare assessments and tests, tutor distant classes and activate archive<br />
storage, news feeds and students’ interaction with multimedia. They also enhance collaborative learning with<br />
discussion forums, chats and wikis (Romero et al., 2008a).<br />
Some of the most well-known commercial LMS are Blackboard, Virtual-U, WebCT and TopClass, while Moodle,<br />
Ilias, Claroline and aTutor are open source, freely distributed LMSs (Romero et al., 2008a). In Greece, the Greek<br />
University Network (GUNet) uses the platform Open eClass (GUNet, 2009), which is an evolution of Claroline<br />
(Claroline, 2009). This system is an asynchronous distance education platform which uses Apache as a web server,<br />
MySQL as its database server and has been implemented in PHP. Open eClass is open source software under<br />
General Public Licence (GPL).<br />
Due to the volume of data, one of the main problems of any LMS is the lack of exploitation of the acquired<br />
information. Most of the times, these systems produce reports with statistical data, which, however, don’t help<br />
instructors to draw useful conclusions either about the course or about the students; they are useful only for<br />
administrative purposes of each platform. Moreover, the existing e-learning platforms do not offer concrete tools for<br />
the assessment of user actions and course educational content.<br />
Data and web mining<br />
Data mining is the search for relationships and patterns that exist in large databases, but are 'hidden' among the vast<br />
amounts of data. It is part of the whole Knowledge Data Discovery (KDD) process. KDD is the complete set of<br />
processes for knowledge discovery in databases that aims at the detection of valid information and pattern<br />
recognition in raw data (Kantardzic, 2003). The classical KDD process includes 5 phases: data pre-processing, data<br />
ISSN 1436-4522 (online) and 1176-3647 (print). © International Forum of <strong>Educational</strong> <strong>Technology</strong> & <strong>Society</strong> (IFETS). The authors and the forum jointly retain the<br />
copyright of the articles. Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies<br />
are not made or distributed for profit or commercial advantage and that copies bear the full citation on the first page. Copyrights for components of this work owned by<br />
others than IFETS must be honoured. Abstracting with credit is permitted. To copy otherwise, to republish, to post on servers, or to redistribute to lists, requires prior<br />
specific permission and/or a fee. Request permissions from the editors at kinshuk@ieee.org.<br />
<strong>15</strong>4