27.12.2013 Views

The influence of the place-value structure of the Arabic number ...

The influence of the place-value structure of the Arabic number ...

The influence of the place-value structure of the Arabic number ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

analyses. However, in <strong>the</strong> following, <strong>the</strong> results <strong>of</strong> two additional analyses will be presented<br />

addressing <strong>the</strong> problem <strong>of</strong> possible overfitting <strong>of</strong> <strong>the</strong> data by <strong>the</strong> more complex two-linear<br />

model.<br />

In a first approach it was evaluated in how far <strong>the</strong> results observed depended on <strong>the</strong><br />

inclusion <strong>of</strong> one particular data point (i.e., to-be-estimated <strong>number</strong>) to examine over-fitting <strong>of</strong><br />

<strong>the</strong> data. When iteratively omitting one item from <strong>the</strong> analysis results in considerable changes<br />

<strong>of</strong> each <strong>of</strong> <strong>the</strong> models’ performance this indicates that <strong>the</strong> overall result is highly dependable<br />

on <strong>the</strong> inclusion <strong>of</strong> particular data points. On <strong>the</strong> o<strong>the</strong>r hand, when results do not change<br />

considerably this argues for a high stability <strong>of</strong> <strong>the</strong> overall result indicating that it is ra<strong>the</strong>r<br />

invariant to <strong>the</strong> exclusion <strong>of</strong> specific items. A respective analysis for <strong>the</strong> current data showed<br />

that <strong>the</strong> results for both models were very stable (logarithmic: mean adjusted R 2 = .68, SEM <<br />

0.001; two-linear: mean adjusted R 2 = .82, SEM < 0.001; mean <strong>number</strong> <strong>of</strong> participants with an<br />

optimal break-point around 10: 62.72, SEM = 0.87; mean slope <strong>of</strong> single digit segment: 3.93,<br />

SEM < 0.01; mean slope <strong>of</strong> two-digit segment: 0.32, SEM < 0.001). More interestingly, <strong>the</strong><br />

two-linear model outperformed <strong>the</strong> logarithmic one reliably in terms <strong>of</strong> adjusted R 2 no matter<br />

which data point was omitted from <strong>the</strong> analysis (all t > 10.23, all p < .001). Note that this<br />

stability is true for a logarithmic model and for a two-linear model with a fixed break point. A<br />

model with a variable break point could be potentially more flexible and thus more variable<br />

when single data points are omitted.<br />

Ano<strong>the</strong>r way to address <strong>the</strong> point <strong>of</strong> possible overfitting <strong>of</strong> <strong>the</strong> two-linear model is to<br />

evaluate in how far a two-linear model accounts for data produced by a logarithmic model and<br />

vice versa. Generally, data produced by logarithmic model (including some random noise)<br />

must not be accounted for better by two-linear fitting as compared to logarithmic fitting.<br />

When data produced by a logarithmic model is accounted for better by two-linear fitting than<br />

by logarithmic fitting this would suggest overfitting <strong>of</strong> <strong>the</strong> data by <strong>the</strong> two-linear model.<br />

122

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!