From tejblum@arc.hq.cti.ru  Tue Sep 16 11:15:37 1997
Received: from yandex.hq.cti.ru (arc.hq.cti.ru [194.67.85.53])
          by hub.freebsd.org (8.8.7/8.8.7) with ESMTP id LAA10313
          for <FreeBSD-gnats-submit@freebsd.org>; Tue, 16 Sep 1997 11:15:32 -0700 (PDT)
Received: (from tejblum@localhost)
	by yandex.hq.cti.ru (8.8.7/8.8.5) id WAA14422;
	Tue, 16 Sep 1997 22:15:18 +0400 (MSD)
Message-Id: <199709161815.WAA14422@yandex.hq.cti.ru>
Date: Tue, 16 Sep 1997 22:15:18 +0400 (MSD)
From: Dmitrij Tejblum <tejblum@arc.hq.cti.ru>
Reply-To: tejblum@arc.hq.cti.ru
To: FreeBSD-gnats-submit@freebsd.org
Subject: Typo in utf2(4) man page
X-Send-Pr-Version: 3.2

>Number:         4555
>Category:       docs
>Synopsis:       Typo in utf2(4) man page
>Confidential:   no
>Severity:       non-critical
>Priority:       low
>Responsible:    steve
>State:          closed
>Quarter:
>Keywords:
>Date-Required:
>Class:          doc-bug
>Submitter-Id:   current-users
>Arrival-Date:   Tue Sep 16 11:20:00 PDT 1997
>Closed-Date:    Tue Nov 4 20:20:07 PST 1997
>Last-Modified:  Tue Nov  4 20:20:58 PST 1997
>Originator:     Dmitrij Tejblum
>Release:        FreeBSD 2.2-STABLE i386
>Organization:
CompTek
>Environment:


>Description:

utf2(4) claims that runes 0x0080-0x03ff (00000bbb.bbbbbbbb) encoded with 2
                                    ^         ^^^
bytes. Of course, it really means 0x0080-0x07ff.

>How-To-Repeat:


>Fix:
	
--- /usr/src/lib/libc/locale/utf2.4	Tue Sep 16 18:20:11 1997
+++ utf2.4	Tue Sep 16 21:45:31 1997
@@ -60,7 +60,7 @@
 encoding is represented by the following table:
 .Bd -literal
 [0x0000 - 0x007f] [00000000.0bbbbbbb] -> 0bbbbbbb
-[0x0080 - 0x03ff] [00000bbb.bbbbbbbb] -> 110bbbbb, 10bbbbbb
+[0x0080 - 0x07ff] [00000bbb.bbbbbbbb] -> 110bbbbb, 10bbbbbb
 [0x0400 - 0xffff] [bbbbbbbb.bbbbbbbb] -> 1110bbbb, 10bbbbbb, 10bbbbbb
 .Ed
 .Pp

>Release-Note:
>Audit-Trail:

From: "Daniel O'Callaghan" <danny@FreeBSD.ORG>
To: freebsd-gnats-submit@freebsd.org, tejblum@arc.hq.cti.ru
Cc:  Subject: Re: docs/4555: Typo in utf2(4) man page
Date: Tue, 23 Sep 1997 13:43:58 +1100

 Hello,  you wrote:
 
 >      utf2(4) claims that runes 0x0080-0x03ff (00000bbb.bbbbbbbb) encoded with 2
 >                                          ^         ^^^
 >      bytes. Of course, it really means 0x0080-0x07ff.
 > 
 > How-To-Repeat
 > Fix
 > 
 >      --- /usr/src/lib/libc/locale/utf2.4     Tue Sep 16 18:20:11 1997
 >      +++ utf2.4      Tue Sep 16 21:45:31 1997
 >      @@ -60,7 +60,7 @@
 >       encoding is represented by the following table:
 >       .Bd -literal
 >       [0x0000 - 0x007f] [00000000.0bbbbbbb] -> 0bbbbbbb
 >      -[0x0080 - 0x03ff] [00000bbb.bbbbbbbb] -> 110bbbbb, 10bbbbbb
 >      +[0x0080 - 0x07ff] [00000bbb.bbbbbbbb] -> 110bbbbb, 10bbbbbb
 >       [0x0400 - 0xffff] [bbbbbbbb.bbbbbbbb] -> 1110bbbb, 10bbbbbb, 10bbbbbb
 
 Look closely and you'll see that the ranges are 0x0000-0x007f,
 0x0080-0x03ff and
 0x0400 -0xffff, making three contiguous ranges.  Your patch would
 suggest an overlap
 as the middle range would end at 0x07ff, after the third range starts at
 0x0400.
 
 Danny

From: Dmitrij Tejblum <tejblum@arc.hq.cti.ru>
To: "Daniel O'Callaghan" <danny@freebsd.org>
Cc: freebsd-gnats-submit@freebsd.org
Subject: Re: docs/4555: Typo in utf2(4) man page 
Date: Tue, 23 Sep 1997 18:42:42 +0400

 "Daniel O'Callaghan" wrote:
 > Look closely and you'll see that the ranges are 0x0000-0x007f,
 > 0x0080-0x03ff and
 > 0x0400 -0xffff, making three contiguous ranges.  Your patch would
 > suggest an overlap
 > as the middle range would end at 0x07ff, after the third range starts at
 > 0x0400.
 > 
 
 Oops, you are right. Here is a better patch.
 
 --- /usr/src/lib/libc/locale/utf2.4	Tue Sep 16 18:20:11 1997
 +++ utf2.4	Tue Sep 23 18:37:32 1997
 @@ -60,8 +60,8 @@
  encoding is represented by the following table:
  .Bd -literal
  [0x0000 - 0x007f] [00000000.0bbbbbbb] -> 0bbbbbbb
 -[0x0080 - 0x03ff] [00000bbb.bbbbbbbb] -> 110bbbbb, 10bbbbbb
 -[0x0400 - 0xffff] [bbbbbbbb.bbbbbbbb] -> 1110bbbb, 10bbbbbb, 10bbbbbb
 +[0x0080 - 0x07ff] [00000bbb.bbbbbbbb] -> 110bbbbb, 10bbbbbb
 +[0x0800 - 0xffff] [bbbbbbbb.bbbbbbbb] -> 1110bbbb, 10bbbbbb, 10bbbbbb
  .Ed
  .Pp
  If more than a single representation of a value exists (for example,
 
 Dima
 
 
State-Changed-From-To: open->closed 
State-Changed-By: steve 
State-Changed-When: Tue Nov 4 20:20:07 PST 1997 
State-Changed-Why:  
Suggested patch applied, thanks! 
>Unformatted:
