Newsgroups: news.software.b
Path: utzoo!henry
From: henry@zoo.toronto.edu (Henry Spencer)
Subject: Re: article "header" contains non-header line
Message-ID: <1991Apr17.210919.11815@zoo.toronto.edu>
Date: Wed, 17 Apr 1991 21:09:19 GMT
References: <1991Mar25.220106.25166@zoo.toronto.edu> <1991Mar28.080325.7729@Daisy.EE.UND.AC.ZA> <1991Mar28.165240.13757@zoo.toronto.edu> <5299@pkmab.se> <1991Apr16.200219.8743@zoo.toronto.edu> <scs.671836683@wotan.iti.org>
Organization: U of Toronto Zoology

In article <scs.671836683@wotan.iti.org> scs@iti.org (Steve Simmons) writes:
>>>> The above suggestions work fine for one or two articles, but less well for
>>>>ten thousand "obviously bad" articles.
>
>You already have a solution, Henry.  In the present daily reporting of
>news activity you list only "top 5" sites for various bad things.  Why
>not keep the first 20 bad articles (or the first 20K) and throw the rest
>away?  It's a good safe compromise between nothing and overflow.

A reasonable approach, but it's awkward to pin down and to implement.  It
can't be the first 20 out of each batch, since that puts no bound on the
total!  And coordinating between batches is hard, because each is processed
by a separate relaynews.  Newsdaily has it easy; it does the "top 5" out of
the whole log file.
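
For illustration only, here is one way a cap shared across separate
processes might be sketched: a small counter file, guarded by an advisory
lock, that each process consults before keeping a bad article.  This is not
how relaynews works; the function name claim_slot, the counter file, and
the use of flock-style locking are all assumptions made up for the example.

```python
import fcntl
import os


def claim_slot(counter_path, limit):
    """Return True if one more bad article may be kept under a shared cap.

    Hypothetical helper, not part of C News.  The count lives in a small
    file so that separate processes (one per batch) see the same total.
    """
    fd = os.open(counter_path, os.O_RDWR | os.O_CREAT, 0o644)
    try:
        # Exclusive advisory lock: only one process updates at a time.
        fcntl.flock(fd, fcntl.LOCK_EX)
        raw = os.read(fd, 32).strip()
        count = int(raw) if raw else 0
        if count >= limit:
            return False  # cap already reached; caller discards the article
        # Rewrite the file with the incremented count.
        os.lseek(fd, 0, os.SEEK_SET)
        os.ftruncate(fd, 0)
        os.write(fd, str(count + 1).encode())
        return True
    finally:
        fcntl.flock(fd, fcntl.LOCK_UN)
        os.close(fd)
```

Something would still have to reset the counter periodically (say, whenever
the daily report runs), and every bad article now costs a lock and a file
update, which is part of why this is awkward.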

We might be able to work out something along these lines, but I'm not
making any promises just yet.
-- 
And the bean-counter replied,           | Henry Spencer @ U of Toronto Zoology
"beans are more important".             |  henry@zoo.toronto.edu  utzoo!henry
