05.01.2013 Views

April 2012 Volume 15 Number 2 - Educational Technology & Society

April 2012 Volume 15 Number 2 - Educational Technology & Society

April 2012 Volume 15 Number 2 - Educational Technology & Society

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Valsamidis, S., Kontogiannis, S., Kazanidis, I., Theodosiou, T., & Karakos, A. (<strong>2012</strong>). A Clustering Methodology of Web Log<br />

Data for Learning Management Systems. <strong>Educational</strong> <strong>Technology</strong> & <strong>Society</strong>, <strong>15</strong> (2), <strong>15</strong>4–167.<br />

A Clustering Methodology of Web Log Data for Learning Management<br />

Systems<br />

Stavros Valsamidis 1 , Sotirios Kontogiannis 1 , Ioannis Kazanidis 2 , Theodosios Theodosiou 2<br />

and Alexandros Karakos 1<br />

1 Department of Electrical and Computer Engineering, Democritus University of Thrace, Xanthi, Greece //<br />

2 Accounting Department, Kavala Institute of <strong>Technology</strong>, Agios Loukas, 65404, Kavala, Greece //<br />

svalsam@ee.duth.gr // skontog@ee.duth.gr // kazanidis@teikav.edu.gr // theodosios.theodosiou@gmail.com //<br />

karakos@ee.duth.gr<br />

(Submitted July 2, 2010; Revised January 14, 2011; Accepted <strong>April</strong> 1, 2011)<br />

ABSTRACT<br />

Learning Management Systems (LMS) collect large amounts of data. Data mining techniques can be applied to<br />

analyse their web data log files. The instructors may use this data for assessing and measuring their courses. In<br />

this respect, we have proposed a methodology for analysing LMS courses and students’ activity. This<br />

methodology uses a Markov CLustering (MCL) algorithm for clustering the students’ activity and a<br />

SimpleKMeans algorithm for clustering the courses. Additionally we provide a visualisation of the results using<br />

scatter plots and 3D graphs. We propose specific metrics for the assessment of the courses based on the course<br />

usage. These metrics applied to data originated from the LMS log files of the Information Management<br />

Department of the TEI of Kavala. The results show that these metrics, if combined properly, can quantify<br />

quality characteristics of the courses. Furthermore, the application of the MCL algorithm to students’ activities<br />

provides useful insights to their usage of the LMS platform.<br />

Keywords<br />

E-learning, Web mining, Clustering, Metrics<br />

Introduction<br />

Learning Management Systems (LMSs) offer a lot of methods for the distribution of information and for the<br />

communication between the participants on a course. They allow instructors to deliver assignments to the students,<br />

produce and publish educational material, prepare assessments and tests, tutor distant classes and activate archive<br />

storage, news feeds and students’ interaction with multimedia. They also enhance collaborative learning with<br />

discussion forums, chats and wikis (Romero et al., 2008a).<br />

Some of the most well-known commercial LMS are Blackboard, Virtual-U, WebCT and TopClass, while Moodle,<br />

Ilias, Claroline and aTutor are open source, freely distributed LMSs (Romero et al., 2008a). In Greece, the Greek<br />

University Network (GUNet) uses the platform Open eClass (GUNet, 2009), which is an evolution of Claroline<br />

(Claroline, 2009). This system is an asynchronous distance education platform which uses Apache as a web server,<br />

MySQL as its database server and has been implemented in PHP. Open eClass is open source software under<br />

General Public Licence (GPL).<br />

Due to the volume of data, one of the main problems of any LMS is the lack of exploitation of the acquired<br />

information. Most of the times, these systems produce reports with statistical data, which, however, don’t help<br />

instructors to draw useful conclusions either about the course or about the students; they are useful only for<br />

administrative purposes of each platform. Moreover, the existing e-learning platforms do not offer concrete tools for<br />

the assessment of user actions and course educational content.<br />

Data and web mining<br />

Data mining is the search for relationships and patterns that exist in large databases, but are 'hidden' among the vast<br />

amounts of data. It is part of the whole Knowledge Data Discovery (KDD) process. KDD is the complete set of<br />

processes for knowledge discovery in databases that aims at the detection of valid information and pattern<br />

recognition in raw data (Kantardzic, 2003). The classical KDD process includes 5 phases: data pre-processing, data<br />

ISSN 1436-4522 (online) and 1176-3647 (print). © International Forum of <strong>Educational</strong> <strong>Technology</strong> & <strong>Society</strong> (IFETS). The authors and the forum jointly retain the<br />

copyright of the articles. Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies<br />

are not made or distributed for profit or commercial advantage and that copies bear the full citation on the first page. Copyrights for components of this work owned by<br />

others than IFETS must be honoured. Abstracting with credit is permitted. To copy otherwise, to republish, to post on servers, or to redistribute to lists, requires prior<br />

specific permission and/or a fee. Request permissions from the editors at kinshuk@ieee.org.<br />

<strong>15</strong>4

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!