Received: from spf5.us4.outblaze.com (spf5.us4.outblaze.com [205.158.62.27]) by sdf.lonestar.org (8.12.10/8.12.10) with ESMTP id i9S49325011313 for ; Thu, 28 Oct 2004 04:09:06 GMT Received: from lists.gnu.org (lists.gnu.org [199.232.76.165]) by spf5.us4.outblaze.com (Postfix) with ESMTP id AE3C677230 for ; Thu, 28 Oct 2004 04:09:02 +0000 (GMT) Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.33) id 1CN1ij-00020S-0r for migo@homemail.com; Thu, 28 Oct 2004 00:17:05 -0400 Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.33) id 1CN1iJ-00020B-36 for gnu-arch-users@gnu.org; Thu, 28 Oct 2004 00:16:39 -0400 Received: from exim by lists.gnu.org with spam-scanned (Exim 4.33) id 1CN1iI-0001zl-7E for gnu-arch-users@gnu.org; Thu, 28 Oct 2004 00:16:38 -0400 Received: from [199.232.76.173] (helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.33) id 1CN1iI-0001zi-3Z for gnu-arch-users@gnu.org; Thu, 28 Oct 2004 00:16:38 -0400 Received: from [129.255.60.186] (helo=ct.radiology.uiowa.edu) by monty-python.gnu.org with esmtp (Exim 4.34) id 1CN1aM-0007Ld-Mg for gnu-arch-users@gnu.org; Thu, 28 Oct 2004 00:08:26 -0400 Received: from [192.168.1.11] (12-217-241-0.client.mchsi.com [12.217.241.0]) by ct.radiology.uiowa.edu (8.11.6/8.11.6) with ESMTP id i9S48O300427; Wed, 27 Oct 2004 23:08:24 -0500 Message-ID: <418070AF.1070503@johnmeinel.com> Date: Wed, 27 Oct 2004 23:08:15 -0500 From: John Meinel User-Agent: Mozilla Thunderbird 0.8 (Windows/20040913) X-Accept-Language: en-us, en MIME-Version: 1.0 To: James Blackwell Subject: Re: [Gnu-arch-users] Re: Potential flaw in patch-log pruning in proposal References: <20041025210724.GA19744@merconline.com> <200410272144.i9RLiYnd074868@xl2.seyza.com> <418023CD.1000504@johnmeinel.com> <20041028023254.876A91436C5@comet.merconline.com> In-Reply-To: <20041028023254.876A91436C5@comet.merconline.com> X-Enigmail-Version: 0.86.0.0 X-Enigmail-Supports: pgp-inline, pgp-mime Cc: gnu-arch-users@gnu.org X-BeenThere: gnu-arch-users@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: a discussion list for all things arch-ish List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Content-Type: multipart/mixed; boundary="===============0403824764==" Sender: gnu-arch-users-bounces+migo=homemail.com@gnu.org Errors-To: gnu-arch-users-bounces+migo=homemail.com@gnu.org Status: RO Content-Length: 4507 Lines: 126 This is an OpenPGP/MIME signed message (RFC 2440 and 3156) --===============0403824764== Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="------------enigCB26501047AE0A2D49DB291F" This is an OpenPGP/MIME signed message (RFC 2440 and 3156) --------------enigCB26501047AE0A2D49DB291F Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit James Blackwell wrote: > John Meinel wrote: > > >>I think the idea that the patch name gets kept, but you can optionally >>keep the patch-log itself is a good idea. So the '{arch}/=merged' file >>would contain the equivalent of >>tla logs --merges > > > There's a catch to doing this. I've meant to (and put off for) quite a > bit to build a command that lets you easily see in which revisions a > "file" was changed. > Doesn't fai have this with fai revisions --modified filename? Basically you just go back and parse the patch logs, right. > >>Or possibly with the summaries as well. (This isn't as important, and >>might cause file size to go up again.) >> >>Then when tla wants to check if a patch has already been merged, it can >>look for the patch-log, and secondly look for the entry in =merged. > > > Aye. I think this would be "good enough" for replay --skip-present. > Sure, and I think that's what we would be stuck with. But remember, you still have the patch log on the main branch (whatever the name of the merge is). So you can still say when the official tree was modified. And if you keep something like tla logs --merges Then you can also see what other revisions contributed to it. You don't know exactly which did what, but you do keep context sensitive, and you can locate it relative to just a couple packages. > >>It might be nicer if =merged was some sort of indexed file format, so >>that lookups could be faster, but greping even a 120K file doesn't seem >>very expensive. > > > Though I'm in favor of this, actually it can get to be quite a bit more > expensive than this. Though it wouldn't be this bad for small/medium > sized projects, larger heavily developed projects will see a > concatenated patch-log on the order of 5-6 megabytes per working > copy/revision in the library. > I don't think =merged is going to be all the patch logs "cat"ed together. It's supposed to be just the fully qualified revision name. I believe that's what Matthieu stated. The sum of all the patch logs was > 7 MB for all of the logs, but for just the list of revisions it is only 120Kb. I was assuming xtla is at least medium sized. Perhaps you are thinking more along the lines of a "gcc". I'm not really sure how to scale that big. One other possibility is to use ".zip" files instead of tarballs. zlib has native support for them as of something like 1.1 or 1.2. Basically they store each file compressed individually, with an index to find each one. You get better compression with a tarball because you can compress larger chunks, but you don't get indexing. .zip has traditionally been a windows thing, but since zlib supports it (and python started using it as an alternative to storing your scripts in individual files). I see it as becoming useful in more places. 7zip is an interesting compromise. You have an index, but only to the chunk level, where each chunk could be more than one file. But really, for something like patch logs, all you need is one indexed file that lets you quickly find if one exists, and then you can pack the actual text of the log however you want. John =:-> --------------enigCB26501047AE0A2D49DB291F Content-Type: application/pgp-signature; name="signature.asc" Content-Description: OpenPGP digital signature Content-Disposition: attachment; filename="signature.asc" -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.2.4 (Cygwin) Comment: Using GnuPG with Thunderbird - http://enigmail.mozdev.org iD8DBQFBgHCyJdeBCYSNAAMRAs1dAJ9FbHjIgNgTXVdhYNARbiyiWmoYcwCeNr3D C0Z5TqTSksmusEZHyn5vSkA= =T8Sg -----END PGP SIGNATURE----- --------------enigCB26501047AE0A2D49DB291F-- --===============0403824764== Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Disposition: inline _______________________________________________ Gnu-arch-users mailing list Gnu-arch-users@gnu.org http://lists.gnu.org/mailman/listinfo/gnu-arch-users GNU arch home page: http://savannah.gnu.org/projects/gnu-arch/ --===============0403824764==--