Received: from spf5.us4.outblaze.com (spf5.us4.outblaze.com [205.158.62.27]) by sdf.lonestar.org (8.12.10/8.12.10) with ESMTP id iB6IpK1M001069 for ; Mon, 6 Dec 2004 18:51:20 GMT Received: from lists.gnu.org (lists.gnu.org [199.232.76.165]) by spf5.us4.outblaze.com (Postfix) with ESMTP id EB0CD7704F for ; Mon, 6 Dec 2004 18:51:20 +0000 (GMT) Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.33) id 1CbO6w-0002Ch-6C for migo@homemail.com; Mon, 06 Dec 2004 14:01:26 -0500 Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.33) id 1CbO5z-0001t4-BM for gnu-arch-users@gnu.org; Mon, 06 Dec 2004 14:00:27 -0500 Received: from exim by lists.gnu.org with spam-scanned (Exim 4.33) id 1CbO5y-0001s0-A4 for gnu-arch-users@gnu.org; Mon, 06 Dec 2004 14:00:26 -0500 Received: from [199.232.76.173] (helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.33) id 1CbO5x-0001rY-Su for gnu-arch-users@gnu.org; Mon, 06 Dec 2004 14:00:25 -0500 Received: from [205.149.2.136] (helo=xl2.seyza.com) by monty-python.gnu.org with esmtp (TLSv1:DES-CBC3-SHA:168) (Exim 4.34) id 1CbNvm-0003tW-R9 for gnu-arch-users@gnu.org; Mon, 06 Dec 2004 13:49:56 -0500 Received: from xl2.seyza.com (localhost.seyza.com [127.0.0.1]) by xl2.seyza.com (8.12.10/8.12.10) with ESMTP id iB6IoXqe036472; Mon, 6 Dec 2004 10:50:33 -0800 (PST) (envelope-from lord@xl2.seyza.com) Received: (from lord@localhost) by xl2.seyza.com (8.12.10/8.12.10/Submit) id iB6IoWmS036469; Mon, 6 Dec 2004 10:50:32 -0800 (PST) (envelope-from lord) Date: Mon, 6 Dec 2004 10:50:32 -0800 (PST) Message-Id: <200412061850.iB6IoWmS036469@xl2.seyza.com> From: Thomas Lord To: stephen@xemacs.org In-reply-to: <87pt1o9kvb.fsf@tleepslib.sk.tsukuba.ac.jp> (stephen@xemacs.org) Subject: Re: [Gnu-arch-users] Arch Versus CVS Versus Subversoin References: <20041205000613.PNEI7152.lakermmtao09.cox.net@nonerjsnum1tkq> <26E1F314-4654-11D9-AD55-000A957659CC@spy.net> <20041205023828.GA11443@suffields.me.uk> <87pt1o9kvb.fsf@tleepslib.sk.tsukuba.ac.jp> Cc: gnu-arch-users@gnu.org X-BeenThere: gnu-arch-users@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: a discussion list for all things arch-ish List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: gnu-arch-users-bounces+migo=homemail.com@gnu.org Errors-To: gnu-arch-users-bounces+migo=homemail.com@gnu.org Status: RO Content-Length: 3162 Lines: 81 > From: "Stephen J. Turnbull" > True, but not a very helpful way to express it. > Summary: "binary" files are not readable by humans or programs, so you > can't expect their "diffs" to be either. The only thing a "binary" > diff is useful for is exact patching, ie, replacing an exact copy of > the "old" file with an exact copy of the "new" file. Arch implements > this by storing a copy of each. If the old file in the target > workspace is identical to the old file in the archive, then it is > replaced with the new file by using "cp" (or the equivalent) rather > than "patch" or the xdelta equivalent. Thus, archives are larger than > theoretically necessary, but Arch does implement binary diffing and > patching as accurately as is possible. Another point is that we have seen zero (`0') evidence that any significant saving would arrive, under any real-world usage conditions, from the use of binary deltas. Suppose that magically, or whatever, all technical issues re merging and so forth are resolved. You can now commit with changes to binary files and those parts of those changesets will be O(xdelta-result) in size. We win a bullet point but.... Where's the convincing class of scenarios that show me, as project maintainer, I should do what I can to raise the priority of binary delta compression? Note that I am apt to evaluate costs in dollars, not (directly) bytes and nanoseconds. Who is editting these BLOB files that change in such reliably small xdelta increments? What is it about the xdelta format that ensures this (and, hence, how fragile is the situation)? I have heard suggestions like "people editting sound files" but, supposing that the on-disk format of such files is a compressed one, is that really true? It seems to me that the relationship of compression to binary files reveals the implausibility that binary delta compression will be significantly useful: Programmers often use binary files for space compression. Many forms of compression share the familiar chaotic property of formats like `gzip': small changes to the "content" that generates the compressed form generated global changes to the compressed form. [[cartouche! The better a job binary-file-format designers do at space compression -- i.e., the more reason there is to have most binary file formats *at all* -- /the less useful it is to have generic binary delta compression in a revision control systems!/ ]] Don't get me wrong: I can think of plenty of applications for non-textual diffs. I just don't see any huge wins for generic binary diffs. Does anyone else? I hate it when people ask for features that don't make sense but then it takes 3 years to figure out how to articulate why the original request doesn't make sense. Don't you? -t _______________________________________________ Gnu-arch-users mailing list Gnu-arch-users@gnu.org http://lists.gnu.org/mailman/listinfo/gnu-arch-users GNU arch home page: http://savannah.gnu.org/projects/gnu-arch/