13.07.2015 Views

Boosted Regression (Boosting): An introductory tutorial and a Stata ...

Boosted Regression (Boosting): An introductory tutorial and a Stata ...

Boosted Regression (Boosting): An introductory tutorial and a Stata ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

IterationsObs Variables 1000 5000 10000Interactions Interactions Interactions2 4 6 2 4 6 2 4 6100100010000100.3 0.5 0.6 1.3 2.2 3 2.6 4.4 6.20 0 0 0 0 0 0 0 0300.6 1.1 1.6 3 5.5 7.8 6.2 11.1 15.60 0 0 0 0 0 0 0 01002 3.5 5 9.5 17.9 25 19.1 35.3 500 0 0 0 0 0 0 0 0103.9 5.6 7.9 15.8 22.1 31.1 27.6 43 62.90 0 0 0 0 0 0 0 0307 11.4 17.2 32.3 57.6 82.6 63.8 113.9 158.70 0 0 0 0 0 0 0 039.7 75.9 111.1 197 380.9 557.6 394.5 758.4 1118100 0 0 0 0.1 0.1 0.2 0.1 0.2 0.329.8 64.5 91.1 156.5 328 422.1 377 599.5 862.510 0 0 0 0 0.1 0.1 0.1 0.2 0.297 178.6 249.6 479.2 906.3 1311.7 964.3 1683.4 2780.730 0 0 0.1 0.1 0.3 0.4 0.3 0.5 0.8487.8 935.1 1382.6 2443.7 4668.2 6840.8 4878.4 9360.5 13615.2100 0.1 0.3 0.4 0.7 1.3 1.9 1.4 2.6 3.8Table 3: Benchmark runtimes for boosting: various combinations of the number of observations(50% used for training, 50% for testing), number of variables, boosting iterations, <strong>and</strong> number ofinteractions chosen. The time is given both in seconds (top number) <strong>and</strong> in hours (bottom number).The runtime range from 0.3 seconds (100 observations, 10 variables, main effectsonly, 1000 iterations) to 3.8 hours (10,000 observations, 100 variables, six-wayinteractions, 10,000 iterations). The time increases roughly linearly with the number ofiterations, the number of interactions, <strong>and</strong> the number of variables. The time increasesmore than linearly with the number of observations. Because the observations are sortedthe runtime is O(n log(n)) where n is the number of observations (i.e. the runtime isbounded by a constant times n log(n); for linear increases the runtime would be boundedby a constant times n ) .30

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!