On Mon, Sep 20, 2004 at 12:02:39PM -0700, cadams@private wrote:
> On Thu, Sep 16, 2004 at 11:14:32AM -0500, Ed Schmollinger wrote:
> > On Thu, Sep 16, 2004 at 12:33:12AM +0200, Mike Blomgren wrote:
> > > I'm using sort in the standard 'cat <file> | awk '{"compute..."}' | sort |
> > > uniq -c | sort -n -r' type analysis.
> >
> > You can get rid of the multiple sorts/uniq thing by doing it all at
> > once:
>
> Or by using GNU sort's -u option, which after getting rid of the
> unnecessary use of cat leaves:
>
> awk ... | sort -u -n -r
This didn't seem to give correct output for me. It only printed a
single line (the most frequent one?) and it didn't include the number of
times that the line appeared.
I was under the impression that what we are after is output that is
sorted by frequency of unique inputs. For example, a log that looks
like:
--- CUT HERE ---
cat
dog
cat
horse
horse
cat
horse
cat
--- CUT HERE ---
Would turn into output that looks like:
--- CUT HERE ---
1 dog
3 horse
4 cat
--- CUT HERE ---
A simple condensing sort (sort -u) would give us:
--- CUT HERE ---
cat
dog
horse
--- CUT HERE ---
And 'sort -u -n -r' (this is GNU sort) just prints:
--- CUT HERE ---
cat
--- CUT HERE ---
Am I missing something?
--
Ed Schmollinger - schmolli@private
_______________________________________________ LogAnalysis mailing list LogAnalysis@private http://lists.shmoo.com/mailman/listinfo/loganalysis
This archive was generated by hypermail 2.1.3 : Tue Sep 21 2004 - 13:32:06 PDT