The Annoyance Filter.pdf - Fourmilab
§256 ANNOYANCE-FILTER DEVELOPMENT LOG 217
contains a comma or a quote (in which case it forces the quote by doubling it). Since our token definition
doesn’t permit either a comma or a quote within a token, we’re still safe.
2002 October 21<br />
Added a --phraselimit option to discard phrases longer than the specified limit on the fly. This
prevents dictionary bloat due to “phrases” generated by concatenation of gibberish from headers and
strings decoded from binary attachments. These will usually be eliminated by a --prune, but that
doesn’t help if the swap file’s already filled up with garbage phrases before reaching the end of the mail
folder. The default --phraselimit is 0, which imposes no limit on the length of phrases.
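The limit test itself can be sketched in a few lines. This is only an illustration: the function name phraseWithinLimit is hypothetical, and it assumes the limit is measured in characters.

```cpp
#include <cstddef>
#include <string>

// Hypothetical helper, not taken from the program source.
// Returns true if the phrase should be kept in the dictionary.
// A limit of 0 means "no limit", matching the documented default.
// Assumption: the limit counts characters in the assembled phrase.
bool phraseWithinLimit(const std::string &phrase, std::size_t phraseLimit)
{
    return phraseLimit == 0 || phrase.length() <= phraseLimit;
}
```

Applying a check like this as each phrase is assembled, rather than afterwards, is what keeps over-long gibberish from ever reaching the dictionary.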
2002 October 22<br />
When the default getNextEncodedLine of a MIMEdecoder encountered the “From␣” line of the next<br />
message in a mail folder, it failed to store the line as the part boundary, which in turn caused mailFolder<br />
to mis-count the number of messages in a folder being parsed when training. I fixed this, and in the<br />
process re-wrote an archaic C string test used in 〈 Check for start of new message in folder 139 〉 to use<br />
a proper C++ string comparison.<br />
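For illustration, a C++ string comparison of this kind might look like the sketch below. The function name isMessageSeparator is hypothetical, and the actual code in the referenced section may be written differently.

```cpp
#include <string>

// Hypothetical helper, not taken from the program source.
// Returns true if the line starts with the five characters "From ",
// which mark the start of the next message in a mail folder.
// The C-style equivalent would be strncmp(buf, "From ", 5) == 0.
bool isMessageSeparator(const std::string &line)
{
    // compare() clamps the length to the string's size, so a line
    // shorter than five characters simply fails to match.
    return line.compare(0, 5, "From ") == 0;
}
```

Note that a "From:" header line does not match, since the fifth character must be a space.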
Corrected some ancient URLs in README, and added information on the SourceForge project there and<br />
in annoyance-filter.manm.
Release 0.1-RC6.<br />
2002 October 23<br />
Modified docutil/fixman2html.pl to include an absolute URL for the “Fourmilab Home Page” link.
This gets people back to the site when the resulting manual page is posted on SourceForge.<br />
Updated the distclean target in Makefile.in to get rid of several intermediate files which had crept<br />
in since the last housecleaning. These made it more difficult to detect any new files which required
adding to the CVS repository.<br />
Added the utilities/maildir_filter.pl utility contributed by Travis Groth. This has been added<br />
with CVS but not yet committed.<br />
2002 October 26<br />
Added a --biasmail option to set the frequency bias for words and phrases found in legitimate mail.
Previously this was fixed at 2, which remains the default.<br />
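To show where such a bias could enter, here is a minimal sketch in the style of Paul Graham's “A Plan for Spam” weighting, in which occurrences in legitimate mail are multiplied by the bias before the two frequencies are compared. That the program computes probabilities exactly this way is an assumption. The function name, parameter names, and the neutral 0.5 fallback are all hypothetical.

```cpp
#include <algorithm>

// Hypothetical sketch, not taken from the program source.
// mailCount / junkCount: occurrences of the token in legitimate and junk mail.
// nMail / nJunk: total tokens seen in each category.
// biasMail: multiplier favouring legitimate mail (documented default is 2).
double junkProbability(unsigned mailCount, unsigned junkCount,
                       unsigned nMail, unsigned nJunk,
                       double biasMail = 2.0)
{
    if (nMail == 0 || nJunk == 0) {
        return 0.5;                 // Assumption: neutral when untrained
    }
    double g = std::min(1.0, biasMail * mailCount / nMail);
    double b = std::min(1.0, static_cast<double>(junkCount) / nJunk);
    if (g + b == 0.0) {
        return 0.5;                 // Assumption: neutral for unseen tokens
    }
    return b / (g + b);
}
```

Under these assumptions, a larger bias pulls the probability of any token seen in legitimate mail toward zero, which trades missed junk for fewer false positives.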
Added autoconf plumbing to detect all the myriad stuff required to support POP3 proxying. We<br />
attempt to distill all of these detections down to a POP3_PROXY_SERVER definition which controls all<br />
code related to that capability.<br />
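A single definition of this kind is typically used as a conditional-compilation guard. The sketch below shows the general pattern only; the function pop3ProxyCompiledIn is hypothetical, and in a real build POP3_PROXY_SERVER would come from the configure-generated header rather than being set by hand.

```cpp
// Hypothetical illustration of guarding code with POP3_PROXY_SERVER.
// Returns true only when the build detected everything the proxy needs.
bool pop3ProxyCompiledIn()
{
#ifdef POP3_PROXY_SERVER
    return true;        // Proxy code is present in this build
#else
    return false;       // Proxy code was configured out
#endif
}
```

Funnelling every detection into one symbol means the rest of the program never tests individual headers or library functions directly.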
2002 October 27<br />
Integrated the stand-alone POP3 test article as a new POP3Proxy class with a hard-coded exerciser<br />
in the --jig. At the moment, it’s purely a proxy; it doesn’t interpose the filter.
2002 October 30<br />
After much struggling, the POP3 proxy now seems to be working, so it’s time to integrate it fully into
the program.<br />
Added a --pop3port option to specify the port on which the POP3 proxy listens for connections. If not
specified, the port number defaults to 9110.
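The defaulting behaviour can be sketched as follows. The names defaultPop3Port and pop3PortFromOption are hypothetical, and falling back to the default on an invalid argument is an assumption made for this example; the program may instead report an error.

```cpp
#include <cstdlib>

// Documented default port for the POP3 proxy.
const int defaultPop3Port = 9110;

// Hypothetical helper, not taken from the program source.
// arg is the text following --pop3port, or nullptr if the option
// was not given on the command line.
int pop3PortFromOption(const char *arg)
{
    if (arg == nullptr) {
        return defaultPop3Port;
    }
    char *end = nullptr;
    long port = std::strtol(arg, &end, 10);
    // Assumption: reject non-numeric text and out-of-range ports
    // by falling back to the default.
    if (end == arg || *end != '\0' || port < 1 || port > 65535) {
        return defaultPop3Port;
    }
    return static_cast<int>(port);
}
```

A default above 1023 such as 9110 lets the proxy be run by an ordinary user, since binding the standard POP3 port 110 generally requires elevated privileges on Unix-like systems.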