27.06.2013 Views

Volume Two - Academic Conferences

Volume Two - Academic Conferences

Volume Two - Academic Conferences

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

4.1 The Mining Plans Selector system<br />

Cristina Wanzeller and Orlando Belo<br />

Figure 1 shows the MPS system fundamental functional components. We adapted the acknowledge<br />

CBR cycle (Aamod and Plaza 1994) to the tasks at hands, devising (six) MPS constituent steps and<br />

forming a problem solving and learning from experience strategy. The R’s steps, (Retrieve, Reuse,<br />

Revise and Retain) are the original ones. Other steps are specific of the MPS system.<br />

Figure 1: MPS functional components<br />

The problem solving part of the system takes as inputs the clickstream data and the requirements<br />

and, based on the cases kept on the knowledge base, delivers appropriate WUM plans. One MPS’s<br />

specific task is to “characterize” the target usage data, producing metadata. We need a data<br />

characterization consistent and independent from KD tool, in order to compare different datasets<br />

systematically. Further, the extraction of some metadata must be automatized, as much as possible,<br />

since clickstream data is typically huge. Another particular task is to “construct” a new WUM problem,<br />

guiding, gathering and organizing the user’s explicit analysis constraints specification, through the<br />

provided abstractions (e.g. analysis goals and application areas).<br />

The “retrieve” step is a typical one and plays a vital role on problem solving. This step selects the<br />

most plausible cases to found the construction of mining plans to recommend, according to the target<br />

problem. The retrieve is based on the similarity threshold value, to choose the cases potentially more<br />

effective. Next, the “reuse” task generates WUM plans, mostly based on the mining methods and the<br />

levels of similarity of the retrieved cases, and considering also the evaluation criteria most important<br />

to the analyst. This step does not performs extensive adaptation of the solution to the current<br />

problem, namely in the wide sense intended by the original step. Yet, it focus the main parts of the<br />

candidate cases that may be transferred to the target problems, by recommending mining methods<br />

instead of mere cases. So, it prepares the reuse of the methods that constructed the solution. The<br />

“revise” step is accomplished by the analyst, outside of the system, using a KD tool.<br />

From the learning point of view, the system operates accepting heterogeneous descriptions of new<br />

WUM processes and acquiring knowledge, through a semi-automated approach. The goal is to<br />

simplify arduous activity, due to the great number of details. The accepted incomings are: documents<br />

describing mining activities, generated by the KD tool in PMML; the process complementary<br />

description, which would be exhaustive, if the used tool does not supports the PMML standard. The<br />

learning starts with a “conciliate” task, to transform and combine the heterogeneous descriptions<br />

items, supplied by user interaction and documents in PMML. Then the traditional “retain” task<br />

essentially adds a new case to the knowledge base. This is realised by integrating and structuring the<br />

incoming elements, considering the internal schema of the cases’ representation. After that, the case<br />

becomes available for edition, to obtain additional information, such as items omitted in the<br />

documents, decisions’ and settings’ explanations/justifications, discoveries and even new stages.<br />

Indeed, the conciliate step may be accomplished in several iterations to enrich progressively the<br />

description and benefit from the availability of the items that are extracted automatically.<br />

4.2 Exemplifying problem solving<br />

To exemplify problem solving we use a simple dataset and specify requisites comprising all the D, T<br />

and P dimensions of problem description. We consider the improving site structure WUM typical<br />

problem. We want to discover meaning relationships among learning materials. The idea is to find out<br />

contents relevance and relations, taking into account the student’ point of view.<br />

857

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!