For Programmers: Free Programming Magazines  


Home > Archive > PERL Miscellaneous > September 2004 > matching devious non ascii spam strings









You are viewing an archived Text-only version of the thread. To view this thread in it's original format and/or if you want to reply to this thread please [click here]

 

Author matching devious non ascii spam strings
Dan Jacobson

2004-09-24, 8:56 pm

To match the name of a famous spam drug for spamassassin, I use
/v[il1\xA0-\xFF]agra/i
as they now are using all the accented versions of "i".
I suppose I will have to do the same for the a's etc. too.
I have just used the entire range I see on "man iso_8859_1" and more,
rather than whittle it down. I don't suppose there are much better ways.
Chris Mattern

2004-09-25, 3:59 am

Dan Jacobson wrote:

> To match the name of a famous spam drug for spamassassin, I use
> /v[il1\xA0-\xFF]agra/i
> as they now are using all the accented versions of "i".
> I suppose I will have to do the same for the a's etc. too.
> I have just used the entire range I see on "man iso_8859_1" and more,
> rather than whittle it down. I don't suppose there are much better ways.


The Bayesian filters worry about that kind of crud automatically.
Let them do the work.

--
Christopher Mattern

"Which one you figure tracked us?"
"The ugly one, sir."
"...Could you be more specific?"
Sponsored Links







Also available: Server administration forum archive | Web Design forum archive | Software forum archive | Hardware reviews archive

Copyright 2008 codecomments.com