12.06.2015 Views

The Annoyance Filter.pdf - Fourmilab

The Annoyance Filter.pdf - Fourmilab

The Annoyance Filter.pdf - Fourmilab

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

§256 ANNOYANCE-FILTER DEVELOPMENT LOG 217<br />

contains a comma or a quote (in which case it forces the quote by doubling it). Since our token definition<br />

doesn’t permit either a comma or a quote within a token, we’re still safe.<br />

2002 October 21<br />

Added a −−phraselimit option to discard phrases longer than the specified limit on the fly. This<br />

prevents dictionary bloat due to “phrases” generated by concatenation of gibberish from headers and<br />

strings decoded from binary attachments. <strong>The</strong>se will usually be eliminated by a −−prune, but that<br />

doesn’t help if the swap file’s already filled up with garbage phrases before reaching the end of the mail<br />

folder. <strong>The</strong> default −−phraselimit is 0, which imposes no limit on the length of phrases.<br />

2002 October 22<br />

When the default getNextEncodedLine of a MIMEdecoder encountered the “From␣” line of the next<br />

message in a mail folder, it failed to store the line as the part boundary, which in turn caused mailFolder<br />

to mis-count the number of messages in a folder being parsed when training. I fixed this, and in the<br />

process re-wrote an archaic C string test used in 〈 Check for start of new message in folder 139 〉 to use<br />

a proper C++ string comparison.<br />

Corrected some ancient URLs in README, and added information on the SourceForge project there and<br />

in annoyance−filter.manm.<br />

Release 0.1-RC6.<br />

2002 October 23<br />

Modified docutil/fixman2html.pl to include an absolute URL for the “<strong>Fourmilab</strong> Home Page” link.<br />

This gets people back to the site when the resulting manual page is posted on SourceForge.<br />

Updated the distclean target in Makefile.in to get rid of several intermediate files which had crept<br />

in since the last housecleaning. <strong>The</strong>se made it more difficult to detect any new files which required<br />

adding to the CVS repository.<br />

Added the utilities/maildir_filter.pl utility contributed by Travis Groth. This has been added<br />

with CVS but not yet committed.<br />

2002 October 26<br />

Added a −−biasmail option to set the frequency bias for words and phrases found in legitimate mail.<br />

Previously this was fixed at 2, which remains the default.<br />

Added autoconf plumbing to detect all the myriad stuff required to support POP3 proxying. We<br />

attempt to distill all of these detections down to a POP3_PROXY_SERVER definition which controls all<br />

code related to that capability.<br />

2002 October 27<br />

Integrated the stand-alone POP3 test article as a new POP3Proxy class with a hard-coded exerciser<br />

in the −−jig. At the moment, it’s purely a proxy—it doesn’t interpose the filter.<br />

2002 October 30<br />

After much struggling, the POP3 procy now seems to be working, so it’s time to integrate it fully into<br />

the program.<br />

Added a −−pop3port option to specify the port on which the POP3 proxy listens for connections. If no<br />

specified, the port number defaults to 9110.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!