[logs] Re: on log standards

From: Christina Noren (cfrln@private)
Date: Wed Aug 30 2006 - 10:41:03 PDT


Morty,


>   Host software is often created in an unstructured way, with ad-hoc
>   logging infrastructure that is added during debugging stages.  Log
>   content standards sound great, but in practice, they would probably
>   not work well with the way software is actually written.
>

I think you hit the nail on the head with this observation.

Logging standardization is wishful thinking on the part of log  
analysis vendors. Log standardization has really only worked out for  
web server access logs and network device SNMP and will never work  
out in the software world.

Most software companies and software developers put in error messages  
on an ad hoc basis. Only a code audit could possibly inventory all  
potential messages. Since most code is proprietary, those writing  
parsers for the *content* rather than *header* portion of log  
messages from a specific application are most often taking educated  
guesses about the messages that a product *will* put out based on the  
evidence of log samples collected from running systems. I've been  
part of concerted efforts to document error logs in both commercial  
software companies and with in-house development teams and they don't  
work.

Not to mention the amount of dependence on legacy software with  
legacy logging.

The challenge is to create useful tools for searching, reporting and  
alerting on logs that don't depend on either standardization or real- 
time interpretation/normalization by the log tool.

Some of the techniques for this are:

- full-text indexing of raw content
- automated machine categorization of event patterns - the machine  
can't know that this content means that a process exited abnormally,  
but it can tell that it's different content than another event saying  
that a batch job completed
- exposing hooks for users to add semantic knowledge as they use the  
log data and have that knowledge added to the raw data incrementally  
- interpretation must be fluid and adaptable in this way
- noticing meta-patterns in a log data set of correlations,  
abnormalities, etc. based on the automated categorizations

On Aug 28, 2006, at 5:39 PM, Mordechai T. Abzug wrote:

> On Fri, Aug 25, 2006 at 10:57:30PM -0500, Anton Chuvakin wrote:
>
>> So I was thinking a lot about log standards and taxonomies and the
>> release of CEF inspired me to finally finish my brief article on log
>> standards - check it out:
>
>> http://chuvakin.blogspot.com/2006/08/on-common-event-format-cef.html
>
> Comments:
>
> * There is one standard for log content that is in widespread use: the
>   MIBs used for SNMP traps.  SNMP traps are not used too widely in the
>   host world, but they're fairly widespread in the network world.  The
>   SNMP world had standardization from day one, so it's useful to look
>   at how standardization has helped SNMP.
>
>   Note: SNMP is more than just traps/events, but we can ignore the
>   other aspects for this discussion.
>
> * Even in the SNMP trap world, the standardizations often prove
>   insufficient.  That is, there are multiple standardized components:
>   there is a standard way to describe SNMP PDUs in machine-readable
>   format, i.e. ASN.1 and SMI to write MIBs, and then there are
>   actually standard MIBs, so that all platforms can express certain
>   common events in a vendor-independent way.  The former is mostly
>   what you would call a form standard, while the latter is mostly what
>   you would call a content standard.  But the content component is, in
>   practice, of limited utility, because most vendors end up wanting
>   various events that are not standard events.
>
>   In more concrete terms:
>
>   * There is a standard way to express "interface down".  Any device,
>     regardless of vendor, can send an SNMP trap saying "interface
>     down", and the NMS (network management station) can understand it
>     without knowing anything about that particular vendor.
>
>   * But if a device wants to say "SONET problem with certain
>     vendor-specific flags", it is likely that no standard trap exists.
>     The vendor is going to utilize a custom trap.  SNMP allows for
>     this.  This custom trap can be defined using a MIB written in
>     standard format, so the NMS station can read in the MIB and then
>     immediately parse it, but actually understanding what to do with
>     it would still require trap-specific handling by the application
>     or by the administrator.
>
>   SNMP provides vendor-specific traps for a very good reason.  While
>   the standard MIBs provide a lot of useful traps, they cannot begin
>   to cover all the possible cases of existing technologies.  New
>   technologies that vendors use to differentiate themselves from each
>   other with necessarily exist and need management before they are
>   standardized.
>
>   The lesson of SNMP trap is that having standard content can be
>   useful, but in practice, sooner or later, you will have to allow for
>   a standard format, and let vendors extend the content.
>
> * In the host software world, life is worse than in the network world.
>   Host software is often created in an unstructured way, with ad-hoc
>   logging infrastructure that is added during debugging stages.  Log
>   content standards sound great, but in practice, they would probably
>   not work well with the way software is actually written.
>
> * The release of "CEF" seems like a non-event.  Looks like a
>   unilateral "standard" issued by one minor vendor without a lot of
>   buy-in from third-party vendors.  They haven't even really released
>   it; they're asking people to send them email to get a copy.  No
>   thanks.
>
> Morty
> _______________________________________________
> LogAnalysis mailing list
> LogAnalysis@private
> http://lists.shmoo.com/mailman/listinfo/loganalysis

_______________________________________________
LogAnalysis mailing list
LogAnalysis@private
http://lists.shmoo.com/mailman/listinfo/loganalysis



This archive was generated by hypermail 2.1.3 : Wed Aug 30 2006 - 19:25:17 PDT