[logs] Re: what is normal ?

Previous message: Yang Lee: "RE: [logs] pix log parsing"
Next in thread: Raistlin: "Re: [logs] Re: what is normal ?"
Reply: Raistlin: "Re: [logs] Re: what is normal ?"
Reply: Jon Stearley: "Re: [logs] Re: what is normal ?"
Reply: John Rowan Littell: "Re: [logs] Re: what is normal ?"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

partainat_private

"Jon Stearley" <jrstearat_private> writes:

> i'm using the teiresias algorithm
> (http://cbcsrv.watson.ibm.com/Tspd.html) to classify log lines.

(OK, I'm scared :-)

Just an idle possibly-related thought: could any of the
principles of Bayesian spam filtering (quite the rage in
some circles...) be applied to logging?

(Best to go googling if you want real info on Bayesian spam
filtering, but the rough user-interface is: you train the
filter on a big pile of spam, and then on a big pile of
non-spam ('ham'); thereafter, it tells you whether messages
look more like the one or t'other.)

I'm guessing that a typical syslog message lacks enough info
to play the Bayesian game.  But I suppose you could feed it
chunks of logs ({1,5,10} {secs,mins}) and it could at least
eliminate the chunks that were entirely "uninteresting".

Please bear in mind that I don't know what I'm talking about.

Will
_______________________________________________
LogAnalysis mailing list
LogAnalysisat_private
http://lists.shmoo.com/mailman/listinfo/loganalysis