Re: CRIME Interesting way around spam filter

From: Shaun Savage (savages@private)
Date: Tue Jun 03 2003 - 11:53:10 PDT

  • Next message: Crispin Cowan: "Re: CRIME Interesting way around spam filter"

    -----BEGIN PGP SIGNED MESSAGE-----
    Hash: SHA1
    
    |
    | But I'd still really like to know which forms of text Mozilla's Bayesian
    | spam filter is looking at: just the raw text? Or is it looking at
    | formatted results?
    
    It looks at raw text. The tokens are found using a fixed set of
    delimiters.  The reason for this is the mozilla spam filter uses the
    html tags to help determine spam, alot of spam uses 'color' font.  Also
    ~ one of the delimiters is '<' '>'  so it can't determine what is a html
    tag.
    
    Shaun
    
    |
    | Shaun? Do you happen to know?
    |
    | Crispin
    |
    
    -----BEGIN PGP SIGNATURE-----
    Version: GnuPG v1.2.1 (GNU/Linux)
    Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org
    
    iD8DBQE+3O6UHrFvgoWMI60RAslXAKChNIE6yeI56AzZlkxBqEAGwK8cJACgnlCN
    Z1BUTyRoDnyKo8cFlSKFQnk=
    =RdhF
    -----END PGP SIGNATURE-----
    



    This archive was generated by hypermail 2b30 : Tue Jun 03 2003 - 12:18:06 PDT