13.07.2015 Views

Data Mining: Practical Machine Learning Tools and ... - LIDeCC

Data Mining: Practical Machine Learning Tools and ... - LIDeCC

Data Mining: Practical Machine Learning Tools and ... - LIDeCC

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

402 CHAPTER 10 | THE EXPLORER(a)Figure 10.17 Using Weka’s metalearner for discretization: (a) configuring FilteredClassifier,<strong>and</strong> (b) the menu of filters.(b)Table 10.3NameAttributeSelectionClassOrderDiscretizeNominalToBinarySupervised attribute filters.FunctionProvides access to the same attribute selection methods as theSelect attributes panelR<strong>and</strong>omize, or otherwise alter, the ordering of class valuesConvert numeric attributes to nominalConvert nominal attributes to binary, using a supervised methodif the class is numericNameTable 10.4Supervised instance filters.FunctionResampleSpreadSubsampleStratifiedRemoveFoldsProduce a r<strong>and</strong>om subsample of a dataset, sampling withreplacementProduce a r<strong>and</strong>om subsample with a given spread betweenclass frequencies, sampling with replacementOutput a specified stratified cross-validation fold for the datasetSupervised attribute filtersDiscretize, highlighted in Figure 10.17, uses the MDL method of supervised discretization(Section 7.2). You can specify a range of attributes or force the discretizedattribute to be binary. The class must be nominal. By default Fayyad<strong>and</strong> Irani’s (1993) criterion is used, but Kononenko’s method (1995) is anoption.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!