
Last, Maimon, and Eberbach (2001) introduced an interruptible anytime algorithm for feature selection. Their proposed method performs a hill-climbing forward search in the space of feature subsets, where attributes are added to the selected subset one at a time and each candidate is evaluated using a model constructed from the previously selected attributes. The algorithm can be stopped at any moment, returning the set of currently selected features.
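To make the interruptible behavior concrete, the following is a minimal sketch of such a hill-climbing forward selection loop. The merit function and the stopping callback are illustrative placeholders, not the actual components of Last et al.'s method.

```python
import time

def anytime_forward_selection(features, merit, should_stop):
    """Greedy forward search over feature subsets, interruptible at any time.

    features    -- iterable of candidate feature names
    merit       -- merit(subset) -> float; stand-in for a model-based
                   evaluation of a feature subset (assumed, not Last et al.'s)
    should_stop -- zero-argument callable polled between iterations
    """
    selected = []
    remaining = list(features)
    while remaining and not should_stop():
        # Score each candidate when added to the currently selected subset.
        best = max(remaining, key=lambda f: merit(selected + [f]))
        if merit(selected + [best]) <= merit(selected):
            break  # hill climbing: no candidate improves the subset
        selected.append(best)
        remaining.remove(best)
    # Whenever interrupted, the currently selected subset is a valid answer.
    return selected

if __name__ == "__main__":
    deadline = time.time() + 0.1  # allow 100 ms of search

    # Toy merit: reward subsets containing "a" and "b", penalize size.
    def toy_merit(subset):
        return len(set(subset) & {"a", "b"}) - 0.01 * len(subset)

    print(anytime_forward_selection(
        ["a", "b", "c", "d"], toy_merit,
        should_stop=lambda: time.time() > deadline))  # -> ['a', 'b']
```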

All these methods are orthogonal to our induction methods and can complement them. An interesting research direction is to determine resource distribution between the various learning stages.

Costs are also involved in the learning phase, during example acquisition and during model learning. The problem of budgeted learning has been studied by Lizotte et al. (2003): there is a cost associated with obtaining each attribute value of a training example, and the task is to determine which attribute values to purchase given a budget.
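As a rough illustration of this setting, the sketch below simulates such an acquisition loop with a hypothetical greedy benefit-per-cost policy; the policy, the scoring function, and the oracle are assumptions for illustration, not the algorithms of Lizotte et al.

```python
import random

def budgeted_acquisition(examples, costs, budget, score, oracle):
    """Buy attribute values one at a time until the budget is exhausted.

    examples -- list of dicts mapping attribute -> value, or None if unknown
    costs    -- dict mapping attribute -> acquisition cost
    budget   -- total amount available for purchases
    score    -- score(attr) -> float, expected benefit of one more value
                of that attribute (a hypothetical placeholder)
    oracle   -- oracle(example, attr) -> the purchased attribute value
    """
    spent = 0.0
    while True:
        # Attributes that are still missing somewhere and still affordable.
        affordable = [a for a in costs
                      if spent + costs[a] <= budget
                      and any(ex[a] is None for ex in examples)]
        if not affordable:
            break
        # Greedy policy: best expected benefit per unit cost.
        a = max(affordable, key=lambda x: score(x) / costs[x])
        ex = random.choice([e for e in examples if e[a] is None])
        ex[a] = oracle(ex, a)  # pay for and record the value
        spent += costs[a]
    return spent

if __name__ == "__main__":
    examples = [{"f1": None, "f2": None} for _ in range(5)]
    spent = budgeted_acquisition(
        examples, costs={"f1": 1.0, "f2": 3.0}, budget=6.0,
        score=lambda a: 1.0,                  # uniform benefit (toy)
        oracle=lambda ex, a: random.random()) # simulated attribute value
    print(spent, examples)
```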

A related problem is active feature-value acquisition. In this setup one tries to reduce the cost of improving accuracy by identifying highly informative instances. Melville et al. (2004) introduced an approach in which instances are selected for acquisition based on the accuracy of the current model and the model's confidence in the prediction.
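The sketch below shows one way such a criterion could look: incomplete instances are ranked so that misclassified ones are acquired first, followed by those the model is least confident about. The scoring rule is an assumption for illustration and not Melville et al.'s exact method.

```python
def rank_for_acquisition(instances, labels, predict_proba):
    """Order instance indices by acquisition priority.

    predict_proba(x) -> dict mapping label -> probability, produced by
    the current model (a hypothetical interface, assumed here).
    """
    def priority(i):
        probs = predict_proba(instances[i])
        predicted = max(probs, key=probs.get)
        if predicted != labels[i]:
            return 2.0                     # misclassified: acquire first
        return 1.0 - probs[predicted]      # correct but uncertain: next
    return sorted(range(len(instances)), key=priority, reverse=True)

if __name__ == "__main__":
    xs, ys = ["x0", "x1", "x2"], ["+", "-", "+"]
    # Toy model: fixed class probabilities per instance.
    table = {"x0": {"+": 0.90, "-": 0.10},
             "x1": {"+": 0.80, "-": 0.20},   # misclassifies x1
             "x2": {"+": 0.55, "-": 0.45}}   # correct but unsure on x2
    print(rank_for_acquisition(xs, ys, lambda x: table[x]))  # -> [1, 2, 0]
```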

The work on optimal fixed-depth trees (Greiner et al., 2002) has been extended to the case where tests have costs, both during learning and during classification. Kapoor and Greiner (2005) presented a budgeted learning framework that operates under a strict budget for feature-value acquisition during learning and produces a classifier with a limited classification budget. Provost, Melville, and Saar-Tsechansky (2007) discussed the challenges that electronic commerce environments bring to data acquisition during learning and classification. They described several settings and presented a unified framework that integrates acquisition of different types, with any cost structure and any predictive modeling objective.

Madani, Lizotte, and Greiner (2004) studied the setup of budgeted active model selection. In this setup the learner can use a fixed budget of model probes to identify which of a given set of possible models has the highest expected accuracy. In each probe a model is evaluated on a randomly drawn instance. The goal is a policy that sequentially determines which model to probe next, based on the information observed so far. Madani et al. formalized the task and showed that it is NP-hard in general. They also studied several algorithms for budgeted active model selection and compared them empirically.
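The sketch below illustrates the probing setup with a simple round-robin policy over simulated models; both the policy and the simulation are illustrative only, not the algorithms analyzed by Madani et al.

```python
import random

def select_model(probe, n_models, budget):
    """Spend `budget` probes, then return the index of the model with
    the highest observed accuracy. probe(i) -> True/False is one paid
    evaluation of model i on a random instance."""
    successes = [0] * n_models
    trials = [0] * n_models
    for t in range(budget):
        i = t % n_models  # round-robin: probe the models in turn
        successes[i] += probe(i)
        trials[i] += 1
    return max(range(n_models),
               key=lambda i: successes[i] / trials[i] if trials[i] else 0.0)

if __name__ == "__main__":
    hidden_acc = [0.6, 0.8, 0.7]  # hidden accuracies (simulation only)
    best = select_model(lambda i: random.random() < hidden_acc[i],
                        n_models=3, budget=60)
    print(best)  # usually prints 1, the model with the highest accuracy
```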
