From nobody@FreeBSD.org  Wed Mar 29 22:56:14 2006
Return-Path: <nobody@FreeBSD.org>
Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125])
	by hub.freebsd.org (Postfix) with ESMTP id 0A73E16A422
	for <freebsd-gnats-submit@FreeBSD.org>; Wed, 29 Mar 2006 22:56:14 +0000 (UTC)
	(envelope-from nobody@FreeBSD.org)
Received: from www.freebsd.org (www.freebsd.org [216.136.204.117])
	by mx1.FreeBSD.org (Postfix) with ESMTP id B888A43D45
	for <freebsd-gnats-submit@FreeBSD.org>; Wed, 29 Mar 2006 22:56:13 +0000 (GMT)
	(envelope-from nobody@FreeBSD.org)
Received: from www.freebsd.org (localhost [127.0.0.1])
	by www.freebsd.org (8.13.1/8.13.1) with ESMTP id k2TMuCOK047927
	for <freebsd-gnats-submit@FreeBSD.org>; Wed, 29 Mar 2006 22:56:12 GMT
	(envelope-from nobody@www.freebsd.org)
Received: (from nobody@localhost)
	by www.freebsd.org (8.13.1/8.13.1/Submit) id k2TMuCaY047926;
	Wed, 29 Mar 2006 22:56:12 GMT
	(envelope-from nobody)
Message-Id: <200603292256.k2TMuCaY047926@www.freebsd.org>
Date: Wed, 29 Mar 2006 22:56:12 GMT
From: Nigel Weeks <nweeks@examiner.com.au>
To: freebsd-gnats-submit@FreeBSD.org
Subject: System freeze irrespective of load on Promise FastTrak100 TX2pro IDE RAID
X-Send-Pr-Version: www-2.3

>Number:         95087
>Category:       i386
>Synopsis:       [hang] System freeze irrespective of load on Promise FastTrak100 TX2pro IDE RAID
>Confidential:   no
>Severity:       serious
>Priority:       medium
>Responsible:    gavin
>State:          closed
>Quarter:        
>Keywords:       
>Date-Required:  
>Class:          sw-bug
>Submitter-Id:   current-users
>Arrival-Date:   Wed Mar 29 23:00:30 GMT 2006
>Closed-Date:    Wed Mar 11 16:59:39 UTC 2009
>Last-Modified:  Wed Mar 11 16:59:39 UTC 2009
>Originator:     Nigel Weeks
>Release:        Release 6.0
>Organization:
The Examiner Newspaper
>Environment:
FreeBSD 6.0-RELEASE #1: Wed Mar 29 15:35:39 EST 2006     nweeks@oken.opi-lex.rpl.local:/usr/obj/usr/src/sys/WWW

>Description:
The system seems to hang, with errors being written to console regarding ar0 (Promise IDE RAID controller), mentioning g_vfs_done errors.

Background Information.
DMESG:
Copyright (c) 1992-2005 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
        The Regents of the University of California. All rights reserved.
FreeBSD 6.0-RELEASE #1: Wed Mar 29 15:35:39 EST 2006
    nweeks@oken.opi-lex.rpl.local:/usr/obj/usr/src/sys/WWW
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: Pentium III/Pentium III Xeon/Celeron (501.14-MHz 686-class CPU)
  Origin = "GenuineIntel"  Id = 0x672  Stepping = 2
  Features=0x387fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CM
OV,PAT,PSE36,PN,MMX,FXSR,SSE>
real memory  = 402587648 (383 MB)
avail memory = 384348160 (366 MB)
ACPI APIC Table: <Award  >
FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
 cpu0 (BSP): APIC ID:  0
 cpu1 (AP): APIC ID:  1
ioapic0 <Version 1.1> irqs 0-23 on motherboard
npx0: [FAST]
npx0: <math processor> on motherboard
npx0: INT 16 interface
acpi0: <Award AWRDACPI> on motherboard
acpi0: Overriding SCI Interrupt from IRQ 9 to IRQ 20
    ACPI-0438: *** Error: Looking up [\\_PR_.CPU0] in namespace, AE_NOT_FOUND
SearchNode 0xc17c6260 StartNode 0xc17c6260 ReturnNode 0
acpi0: Power Button (fixed)
pci_link0: <ACPI PCI Link LNKA> irq 0 on acpi0
pci_link1: <ACPI PCI Link LNKB> irq 0 on acpi0
pci_link2: <ACPI PCI Link LNKC> irq 10 on acpi0
pci_link3: <ACPI PCI Link LNKD> irq 11 on acpi0
Timecounter "ACPI-safe" frequency 3579545 Hz quality 1000
acpi_timer0: <24-bit timer at 3.579545MHz> port 0x4008-0x400b on acpi0
cpu0: <ACPI CPU> on acpi0
cpu1: <ACPI CPU> on acpi0
acpi_button0: <Power Button> on acpi0
pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff,0x4000-0x4041,0x5000-0x500f on ac
pi0
pci0: <ACPI PCI bus> on pcib0
agp0: <Intel 82443BX (440 BX) host to PCI bridge> mem 0xd0000000-0xd0ffffff at d
evice 0.0 on pci0
pcib1: <PCI-PCI bridge> at device 1.0 on pci0
pci1: <PCI bus> on pcib1
isab0: <PCI-ISA bridge> at device 7.0 on pci0
isa0: <ISA bus> on isab0
atapci0: <Intel PIIX4 UDMA33 controller> port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x37
6,0xf000-0xf00f at device 7.1 on pci0
ata0: <ATA channel 0> on atapci0
ata1: <ATA channel 1> on atapci0
uhci0: <Intel 82371AB/EB (PIIX4) USB controller> port 0xd000-0xd01f irq 19 at de
vice 7.2 on pci0
uhci0: [GIANT-LOCKED]
usb0: <Intel 82371AB/EB (PIIX4) USB controller> on uhci0
usb0: USB revision 1.0
uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
pci0: <bridge> at device 7.3 (no driver attached)
pci0: <display, VGA> at device 8.0 (no driver attached)
dc0: <82c169 PNIC 10/100BaseTX> port 0xd400-0xd4ff mem 0xd3010000-0xd30100ff irq
 18 at device 10.0 on pci0
miibus0: <MII bus> on dc0
bmtphy0: <BCM5201 10/100baseTX PHY> on miibus0
bmtphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
dc0: Ethernet address: 00:a0:cc:51:65:f5
atapci1: <Promise PDC20270 UDMA100 controller> port 0xd800-0xd807,0xdc00-0xdc03,
0xe000-0xe007,0xe400-0xe403,0xe800-0xe80f mem 0xd3000000-0xd300ffff irq 19 at de
vice 11.0 on pci0
ata2: <ATA channel 0> on atapci1
ata3: <ATA channel 1> on atapci1
acpi_tz0: <Thermal Zone> on acpi0
    ACPI-0438: *** Error: Looking up [\\_PR_.CPU0] in namespace, AE_NOT_FOUND
SearchNode 0xc17c6260 StartNode 0xc17c6260 ReturnNode 0
acpi_tz0: _CRT value is absurd, ignored (500.6C)
fdc0: <floppy drive controller> port 0x3f2-0x3f5,0x3f7 irq 6 drq 2 on acpi0
fdc0: [FAST]
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
sio0: type 16550A
sio1: <16550A-compatible COM port> port 0x2f8-0x2ff irq 3 on acpi0
sio1: type 16550A
ppc0: <ECP parallel printer port> port 0x378-0x37f,0x778-0x77b irq 7 drq 3 on ac
pi0
ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode
ppc0: FIFO with 16/16/16 bytes threshold
ppbus0: <Parallel port bus> on ppc0
plip0: <PLIP network interface> on ppbus0
lpt0: <Printer> on ppbus0
lpt0: Interrupt-driven port
ppi0: <Parallel I/O> on ppbus0
atkbdc0: <Keyboard controller (i8042)> port 0x60,0x64 irq 1 on acpi0
atkbd0: <AT Keyboard> irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
pmtimer0 on isa0
orm0: <ISA Option ROMs> at iomem 0xc0000-0xc7fff,0xc8000-0xcbfff,0xcc000-0xd47ff
 on isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
Timecounters tick every 1.000 msec
ipfw2 (+ipv6) initialized, divert loadable, rule-based forwarding disabled, defa
ult to deny, logging unlimited
    ACPI-0438: *** Error: Looking up [\\_PR_.CPU0] in namespace, AE_NOT_FOUND
SearchNode 0xc17c6260 StartNode 0xc17c6260 ReturnNode 0
acpi_tz0: _CRT value is absurd, ignored (500.6C)
    ACPI-0438: *** Error: Looking up [\\_PR_.CPU0] in namespace, AE_NOT_FOUND
SearchNode 0xc17c6260 StartNode 0xc17c6260 ReturnNode 0
acpi_tz0: _CRT value is absurd, ignored (500.6C)
acd0: CDROM <CD-ROM 40X/AKU/T02> at ata1-master PIO4
ad4: 38166MB <Seagate ST340015A 3.01> at ata2-master UDMA100
ad6: 38166MB <Seagate ST340015A 3.01> at ata3-master UDMA100
ar0: 38146MB <Promise Fasttrak RAID1> status: READY
ar0: disk0 READY (master) using ad6 at ata3-master
ar0: disk1 READY (mirror) using ad4 at ata2-master
SMP: AP CPU #1 Launched!
Trying to mount root from ufs:/dev/ar0s1a
WARNING: / was not properly dismounted
WARNING: /tmp was not properly dismounted
WARNING: /usr was not properly dismounted
/usr: mount pending error: blocks 1004 files 224
WARNING: /var was not properly dismounted
dc0: failed to force tx and rx to idle state
dc0: failed to force tx and rx to idle state

(This was immediately after the crash, hence the filesystem warnings...)


`kldstat` shows kernel, and acpi.ko are loaded

Kernel compiled with 
options SMP
options IPFIREWALL
options IPFIREWALL_VERBOSE
options DUMMYNET
options HZ=1000

It must be noted that freezes also occured on the non-SMP GENERIC kernel(before I'd done the recompile)

Tried putting system under load to repeat problem, with the following:
`cd /usr/src`
`tar cfz ../hog.tgz *`  (to simply put CPU and HDD under load)

No effect.

Put CPU and kernel process queue under more load with some simple c++:
#include <iostream>

main() {
  while(1)
  {
    // Do a tight loop of nuthin
  }
  return 0;
}

and launched 50 of them - no effect


Then re-launched tar as well to strain disk subsystem. `top` output now:

last pid:   688;  load averages: 52.06, 51.09, 39.89    up 0+00:42:09  09:31:26
80 processes:  53 running, 27 sleeping
CPU states: 98.7% user,  0.0% nice,  0.9% system,  0.4% interrupt,  0.0% idle
Mem: 24M Active, 250M Inact, 62M Wired, 15M Cache, 47M Buf, 17M Free
Swap: 2048M Total, 2048M Free

Next test: leave it idle for a while

Well, I've had four coffees, and I'm getting tired of waiting...submitting this PR.



>How-To-Repeat:
Tried, tried real hard, but it just happens...
>Fix:

>Release-Note:
>Audit-Trail:
State-Changed-From-To: open->feedback 
State-Changed-By: gavin 
State-Changed-When: Wed Mar 11 15:52:59 UTC 2009 
State-Changed-Why:  
To submitter: apologies for this PR sitting idle for so long.  Do you know 
if this problem was ever resolved?  If you still have this system and are 
still interested in the issue could you please retest with either 7.1-RELEASE 
or 6.4-RELEASE and see if they work for you?  I believe there were several 
fixes to the controller chip you have after 6.0 was released which may have 
resolved this issue. 


Responsible-Changed-From-To: freebsd-i386->gavin 
Responsible-Changed-By: gavin 
Responsible-Changed-When: Wed Mar 11 15:52:59 UTC 2009 
Responsible-Changed-Why:  
Track 

http://www.freebsd.org/cgi/query-pr.cgi?pr=95087 
State-Changed-From-To: feedback->closed 
State-Changed-By: gavin 
State-Changed-When: Wed Mar 11 16:59:22 UTC 2009 
State-Changed-Why:  
Mail to submitter bounces 

http://www.freebsd.org/cgi/query-pr.cgi?pr=95087 
>Unformatted:
