From jylefort@brutele.be  Tue Nov 16 23:36:32 2004
Return-Path: <jylefort@brutele.be>
Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125])
	by hub.freebsd.org (Postfix) with ESMTP id E9C4316A4D0
	for <FreeBSD-gnats-submit@freebsd.org>; Tue, 16 Nov 2004 23:36:31 +0000 (GMT)
Received: from gateway.lefort.net (212.68.242.203.brutele.be [212.68.242.203])
	by mx1.FreeBSD.org (Postfix) with ESMTP id 1BD2543D45
	for <FreeBSD-gnats-submit@freebsd.org>; Tue, 16 Nov 2004 23:36:31 +0000 (GMT)
	(envelope-from jylefort@brutele.be)
Received: from jsite.lefort.net (jsite.lefort.net [192.168.1.2])
	by gateway.lefort.net (Postfix) with ESMTP id 6DA6E5551
	for <FreeBSD-gnats-submit@freebsd.org>; Wed, 17 Nov 2004 00:36:29 +0100 (CET)
Received: by jsite.lefort.net (Postfix, from userid 1000)
	id 1524B22E18; Wed, 17 Nov 2004 00:36:28 +0100 (CET)
Message-Id: <20041116233628.1524B22E18@jsite.lefort.net>
Date: Wed, 17 Nov 2004 00:36:28 +0100 (CET)
From: Jean-Yves Lefort <jylefort@brutele.be>
Reply-To: Jean-Yves Lefort <jylefort@brutele.be>
To: FreeBSD-gnats-submit@freebsd.org
Cc:
Subject: regexec() hangs with UTF-8 locales
X-Send-Pr-Version: 3.113
X-GNATS-Notify:

>Number:         74020
>Category:       bin
>Synopsis:       regexec() hangs with UTF-8 locales
>Confidential:   no
>Severity:       serious
>Priority:       medium
>Responsible:    tjr
>State:          closed
>Quarter:        
>Keywords:       
>Date-Required:  
>Class:          sw-bug
>Submitter-Id:   current-users
>Arrival-Date:   Tue Nov 16 23:40:23 GMT 2004
>Closed-Date:    Sun Dec 19 04:11:48 GMT 2004
>Last-Modified:  Sun Dec 19 04:11:48 GMT 2004
>Originator:     Jean-Yves Lefort
>Release:        FreeBSD 5.3-RELEASE i386
>Organization:
>Environment:
System: FreeBSD jsite.lefort.net 5.3-RELEASE FreeBSD 5.3-RELEASE #0: Fri Nov 12 15:27:39 CET 2004 jylefort@jsite.lefort.net:/usr/obj/usr/src/sys/JSITE i386
>Description:
In some situations, regexec() hangs.
>How-To-Repeat:
Compile this:

--- cut ---
#include <locale.h>
#include <sys/types.h>
#include <regex.h>
#include <assert.h>

int
main (int argc, char **argv)
{
  int status;
  regex_t test_re;
  regmatch_t pmatch[3];

  setlocale(LC_ALL, "");

  status = regcomp(&test_re, "foo=(.*) bar=(.*)", REG_EXTENDED);
  assert(status == 0);

  /* if the locale encoding is UTF-8, this call hangs */
  regexec(&test_re, "foo=one bar=two\302\251", test_re.re_nsub + 1, pmatch, 0);

  return 0;
}
--- cut ---

Works fine when executed with a non UTF-8 locale:

	$ LANG=en_US.ISO8859-1 ./test
	$

Hangs when executed with an UTF-8 locale:

	$ LANG=en_US.UTF-8 ./test
	<yawn>
>Fix:
>Release-Note:
>Audit-Trail:
Responsible-Changed-From-To: freebsd-bugs->tjr 
Responsible-Changed-By: ache 
Responsible-Changed-When: Fri Nov 19 20:38:22 GMT 2004 
Responsible-Changed-Why:  
To multibyte support author 

http://www.freebsd.org/cgi/query-pr.cgi?pr=74020 
State-Changed-From-To: open->analyzed 
State-Changed-By: tjr 
State-Changed-When: Sat Nov 20 03:14:18 GMT 2004 
State-Changed-Why:  
Problem confirmed and understood; patch on the way shortly. 

http://www.freebsd.org/cgi/query-pr.cgi?pr=74020 
State-Changed-From-To: analyzed->patched 
State-Changed-By: tjr 
State-Changed-When: Sun Nov 21 03:15:09 GMT 2004 
State-Changed-Why:  
Fixed in -current, will be MFC'd after 4 weeks. 

http://www.freebsd.org/cgi/query-pr.cgi?pr=74020 
State-Changed-From-To: patched->closed 
State-Changed-By: tjr 
State-Changed-When: Sun Dec 19 04:11:21 GMT 2004 
State-Changed-Why:  
Now fixed in 5-STABLE; thanks for the report! 

http://www.freebsd.org/cgi/query-pr.cgi?pr=74020 
>Unformatted:
