12.06.2015 Views

The Annoyance Filter.pdf - Fourmilab

The Annoyance Filter.pdf - Fourmilab

The Annoyance Filter.pdf - Fourmilab

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

§9 ANNOYANCE-FILTER A BRIEF HISTORY OF ANNOYANCE-FILTER 17<br />

<strong>The</strong> annoyance−filter is based on Graham’s crystalline vision of Bayesian scoring of messages by<br />

empirically determined word probabilities. It includes the tedious but essential machinery required to parse<br />

MIME multi-part mail attachments, decode non-plain-text parts, and interpret character sets in languages<br />

the user isn’t accustomed to reading. This makes for great snowdrifts of software, but fortunately few details<br />

about which the typical user need fret.<br />

Preliminary tests indicate annoyance−filter is inordinately effective in discriminating legitimate from<br />

junk mail. But this entire endeavour remains very much an active area of research and, consequently,<br />

annoyance−filter has been implemented as a toolkit intended to facilitate experiments with various filtering<br />

strategies and measuring the characteristics which best identify mail worth reading. You’re more than<br />

welcome to build and install the program using the cookbook instructions but, if you’re inclined to delve<br />

deeper, feel free to jump in—the programming’s fine! Everyone is invited to contribute their own wisdom<br />

and creativity toward bringing to an end this intellectual pollution. Remember, when nobody ever sees junk<br />

mail, nobody will bother to send it. Let us commence rowing toward that happy landfall.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!