First of all I'd Like to say a big thaks to the makers of spam cop. This software is really great.
I found that some of the filters are a bit too picky for my taste. Some legitimate mails have been marked as 'Online Pharmacy spam' for example. I would suggest to fix this by reducing the number of wildcards between the letters of the words from five to a more modest setting (2 or 3).
in Edit > Blacklist:Header or Body > Online Pharmacy spam
I replaced the Value:
(?i)(pharmacy)|(medication)|(prescription)|((p(.{1,5})?h(.{1,5})?
a(.{1,5})?r(.{1,5})?m(.{1,5})?a(.{1,5})?c(.{1,5})?y))|((m(.{1,5})
?e(.{1,5})?d(.{1,5})?i(.{1,5})?c(.{1,5})?a(.{1,5})?t(.{1,5})?i(.{1,5})
?o(.{1,5})?n))|((d(.{1,3})?r(.{1,3})?u(.{1,3})?g(.{1,3})?s))
with:
(?i)(pharmacy)|(medication)|(prescription)|((p(.{1,3})?h(.{1,3})?
a(.{1,3})?r(.{1,3)?m(.{1,3})?a(.{1,3})?c(.{1,3})?y))|((m(.{1,3})
?e(.{1,3})?d(.{1,3})?i(.{1,3})?c(.{1,3})?a(.{1,3})?t(.{1,3})?i(.{1,3})
?o(.{1,3})?n))|((d(.{1,3})?r(.{1,3})?u(.{1,3})?g(.{1,3})?s))
This seems to make the filter more tolerant. If anybody else has some useful ideas for blacklist or whitelist filters they would like to share then please post them here.
Thanks
Share Your Filters here!Moderators: Alex Markov, marisp
54 posts
• Page 1 of 4 • 1, 2, 3, 4
Share Your Filters here!Since SpamCombat has no filter by country, for those who cannot read messages written with non-Western European characters (Chinese, Korean, Japanese, Russian, Arabic, etc.) I suggested the creation of 6 filters (see "non-Western European character sets" in this Forum).
Each of these 6 filters had only one character in the field “Subject”. These characters are rarely used in Western European languages, but I found them often in the “Subject” of 45 spam of this type, read by SpamCombat. The more frequent characters found in these 45 spam, and their ANSI code were: Æ: Alt 0198, ¶: Alt 0182, ×: Alt 0215, (it is not an x or X) ø: Alt 0248, Ø: Alt 0216, ±: Alt 0177 Since then, I collected more 400 spam of this type (~20% of my total spam) and I include a new important filter with the following character: Ð: Alt 0208 With these seven filters I had no more false positives of this type (bad messages marked as good). The frequency of messages detected by each of these characters, among those 445 spam, was: ¶: (30%), Ð: (29%), Æ: (28%), ×: (11%), ø: (1%), Ø: (0,5%), ±: (0,5%) DC
Share Your Filters here!To blacklist the emails with non-English characters in the message header or/and body, you can use a regular expression with the following syntax:
\xhh hexadecimal character (up to 2 hex digits) An example of the regular expression shown below allows to catch ANY non-English character:
Share Your Filters here!Thanks Alex,
However I did not mean any non-English character, but non-Western European characters (Asian, Arabic, Cyrillic...) because I cannot read them. The seven filters I suggested are very good in this job. I tested your suggestion and I would be careful about it: all messages written in Spanish would be considered Spam. The same if they were in French, Italian, Portuguese or German (characters Ä, Ü, usw). DC
Share Your Filters here!When non-Western European characters are presented in the hexadecimal format, they look like below:
0198 = C6 0182 = B6 0215 = D7 0248 = F8 0216 = D8 0177 = B1 0208 = D0 So, I would recommend to use the following reg expression instead of your 7 filters:
Viagra in message body blaclistI deleted this filter by mistake. Could someone please post the code? Thanks.
Blacklist Emails With "Viagra" in the Message BodyRegular expression:
Blacklist Virus EmailsSince in the version 1.30 of SpamCombat added the ability to show the attached file name if an email has an attachment, you can block emails with virus and other suspicious attachments.
Here are the regular expressions that catch emails with W32.Novarg virus in the email body and attachment: W32.Novarg in Message Body - select MessageBody and add:
W32.Novarg in Attachment - select X-GSC-Attachment and add:
Here is the regular expression that allows to block emails with any suspicious attachment: Suspicious Attachment - select X-GSC-Attachment and add:
You can add these regular expressions to the SpamCombat blacklist. Since you add the regular expressions, enable the RegExp checkbox.
Where is the attached filename shown? I've tried looking at several e-mails with attachments but cannot see where the attachment filenames are displayed. I've looked in the main program display and also in the list of fields but no attachments shown. Am I on the wrong track here?
As regards tomtom's query. Shouldn't files with attachments have something like Content-Type: multipart/alternative; in the headers (body ?) ?
Also, FWIW a partial filter that might be handy (not checked with SpamCombat yet) is one I came across in another newsgroup :
Not sure how that would need to be changed to work with SpamCombat though. Regards, John.
I realize that the reg expression above is intented to catch emails with W32.Novarg virus. In the current version of SpamCombat this reg expression will not work yet. Please, see the post above (Blacklist Virus Emails) where the reg expressions to catch virus emails are described.
filtering messages with non-Western charactersSometime ago (see above), I suggested seven filters for those who cannot read messages written with non-Western characters (Chinese, Korean, Japanese, etc.).
Since then, I identify two more characters which identify the spam not filtered by those previous (in parenthesis their ANSI codes): ½ (Alt 0189), » (Alt 0187), The hexadecimal codes (to use in regular expressions) for these two characters are: xBD and xBB. I added these two codes to the seven I already had and the regular expression now sees like that: [\xC6\xB6\xD7\xF8\xD8\xB1\xD0\xBB\xBD] I use this expression in the Black List ("Subject" field) for about two weeks. Since then, SpamCombat classified as spam ALL messages I received (about 200) written with non-Western characters. DC By the way, the page below has a nice converter of characters from normal to ANSI and to hexadecimal codes: http://code.cside.com/3rdpage/us/unicode/converter.html
First you can add a condition To: your email address to the whitelist. As the whitelist is prioritized, all the emails with your email address in TO fiel will be automatically marked as good.
Then you can add this regular expression to the blacklist:
This regular expression will catch any email in the TO field and automatically mark the message as spam.
54 posts
• Page 1 of 4 • 1, 2, 3, 4
Who is onlineUsers browsing this forum: Yahoo [Bot] and 1 guest |





