Programme booklet (pdf)
Programme booklet (pdf)
Programme booklet (pdf)
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
PRESENTATION ABSTRACTS<br />
Abstract<br />
Memory-based text completion<br />
van den Bosch, Antal<br />
Tilburg University<br />
The commonly accepted technology for fast and efficient word completion is the prefix<br />
tree, or trie. As a word is keyed in, the trie can be queried for unicity points and best<br />
guesses. We present three improvements over the normal prefix trie in experiments in<br />
which we measure the percentage of keypresses saved on both in-domain and out-ofdomain<br />
test text, emulating a perfectly alert user who would select correct suggestions<br />
promptly. First, we train a suffix trie that tests backwards from the most recent<br />
keypresses. Conditioned on first letters, the suffix trie model yields about 10% more<br />
saved keypresses than the baseline character saving percentage on in-domain test<br />
data. Second, the suffix trie model can be straightforwardly extended to testing on<br />
characters of previous words. Adding this context yields another 10% increase in<br />
character savings. Third, when we train the context-rich suffix trie model to complete<br />
the current word and predict the next one in one go, character savings go up another<br />
4%. In a learning experiment on Dutch texts we observe character savings of up to 44%<br />
on in-domain test data where the baseline prefix tree savings percentage is 19%. On<br />
out-of-domain twitter data, the prefix trie baseline of 19% is only mildly surpassed by<br />
the suffix tree variants to 24% character savings. We develop an explanation for the<br />
spectacular success of the suffix tree approach on in-domain data, and review the<br />
applicability of the approach in real-world text entry contexts.<br />
Corresponding author: Antal.vdnBosch@uvt.nl<br />
51