From hsu@clinet.fi  Sun Jan 28 20:25:31 1996
Received: from hauki.clinet.fi (root@hauki.clinet.fi [194.100.0.1])
          by freefall.freebsd.org (8.7.3/8.7.3) with ESMTP id UAA27256
          for <FreeBSD-gnats-submit@freebsd.org>; Sun, 28 Jan 1996 20:25:25 -0800 (PST)
Received: from katiska.clinet.fi (root@katiska.clinet.fi [194.100.0.4]) by hauki.clinet.fi (8.7.3/8.6.4) with ESMTP id GAA29991 for <FreeBSD-gnats-submit@freebsd.org>; Mon, 29 Jan 1996 06:25:23 +0200 (EET)
Received: (root@localhost) by katiska.clinet.fi (8.7.3/8.6.4) id GAA14928; Mon, 29 Jan 1996 06:25:22 +0200 (EET)
Message-Id: <199601290425.GAA14928@katiska.clinet.fi>
Date: Mon, 29 Jan 1996 06:25:22 +0200 (EET)
From: Heikki Suonsivu <hsu@clinet.fi>
Reply-To: hsu@clinet.fi
To: FreeBSD-gnats-submit@freebsd.org
Subject: NCR SCSI driver gives assertion errors and disks become inaccessible
X-Send-Pr-Version: 3.2

>Number:         976
>Category:       kern
>Synopsis:       NCR SCSI driver gives assertion errors and disks become inaccessible
>Confidential:   no
>Severity:       serious
>Priority:       high
>Responsible:    se
>State:          closed
>Quarter:
>Keywords:
>Date-Required:
>Class:          sw-bug
>Submitter-Id:   current-users
>Arrival-Date:   Sun Jan 28 20:30:02 PST 1996
>Closed-Date:    Sat Dec 14 06:42:10 PST 1996
>Last-Modified:  Sat Dec 14 06:46:37 PST 1996
>Originator:     Heikki Suonsivu
>Release:        FreeBSD 2.2-CURRENT i386
>Organization:
Clinet, Espoo, Finland
>Environment:

Jan 29 01:05:12 katiska /kernel: FreeBSD 2.2-CURRENT #7: Mon Jan  8 04:58:16 EET 1996
Jan 29 01:05:12 katiska /kernel:     hsu@katiska.clinet.fi:/usr/current/src/sys/compile/CLINETSERVER
Jan 29 01:05:12 katiska /kernel: CPU: Pentium (90.19-MHz 586-class CPU)
Jan 29 01:05:12 katiska /kernel:   Origin = "GenuineIntel"  Id = 0x524  Stepping=4
Jan 29 01:05:12 katiska /kernel:   Features=0x1bf<FPU,VME,PSE,MCE,CX8,APIC>
Jan 29 01:05:12 katiska /kernel: real memory  = 67108864 (65536K bytes)
Jan 29 01:05:12 katiska /kernel: avail memory = 63537152 (62048K bytes)
Jan 29 01:05:12 katiska /kernel: DEVFS: ready for devices
Jan 29 01:05:12 katiska /kernel: Probing for devices on the PCI bus:
Jan 29 01:05:12 katiska /kernel: chip0 <Intel 82434NX (Neptune) PCI cache memory controller> rev 17 on pci0:0
Jan 29 01:05:12 katiska /kernel: chip1 <Intel 82378IB PCI-ISA bridge> rev 67 on pci0:2
Jan 29 01:05:12 katiska /kernel: de0 <Digital DC21040 Ethernet> rev 35 int a irq 11 on pci0:6
Jan 29 01:05:12 katiska /kernel: de0: DC21040 [10Mb/s] pass 2.3 Ethernet address 00:c0:95:ec:47:a3
Jan 29 01:05:12 katiska /kernel: de0: enabling Thinwire/AUI port
Jan 29 01:05:12 katiska /kernel: ncr0 <ncr 53c810 scsi> rev 2 int a irq 9 on pci0:12
Jan 29 01:05:12 katiska /kernel: ncr0 waiting for scsi devices to settle
Jan 29 01:05:12 katiska /kernel: (ncr0:0:0): "SEAGATE ST15230N 0298" type 0 fixed SCSI 2
Jan 29 01:05:13 katiska /kernel: sd0(ncr0:0:0): Direct-Access 
Jan 29 01:05:13 katiska /kernel: sd0(ncr0:0:0): FAST SCSI-2 100ns (10 Mb/sec) offset 8.
Jan 29 01:05:13 katiska /kernel: 4095MB (8386733 512 byte sectors)
Jan 29 01:05:13 katiska /kernel: sd0(ncr0:0:0): with 3992 cyls, 19 heads, and an average 110 sectors/track
Jan 29 01:05:13 katiska /kernel: (ncr0:3:0): "SEAGATE ST31200N 9348" type 0 fixed SCSI 2
Jan 29 01:05:13 katiska /kernel: sd3(ncr0:3:0): Direct-Access 
Jan 29 01:05:13 katiska /kernel: sd3(ncr0:3:0): FAST SCSI-2 100ns (10 Mb/sec) offset 8.
Jan 29 01:05:13 katiska /kernel: 1011MB (2072435 512 byte sectors)
Jan 29 01:05:13 katiska /kernel: sd3(ncr0:3:0): with 2700 cyls, 9 heads, and an average 85 sectors/track
Jan 29 01:05:13 katiska /kernel: (ncr0:4:0): "HP C1533A 9503" type 1 removable SCSI 2
Jan 29 01:05:13 katiska /kernel: st4(ncr0:4:0): Sequential-Access 
Jan 29 01:05:13 katiska /kernel: st4(ncr0:4:0): FAST SCSI-2 100ns (10 Mb/sec) offset 8.
Jan 29 01:05:13 katiska /kernel: density code 0x24, 512-byte blocks, write-enabled
Jan 29 01:05:13 katiska /kernel: ncr1 <ncr 53c810 scsi> rev 1 int a irq 9 on pci0:14
Jan 29 01:05:13 katiska /kernel: ncr1 waiting for scsi devices to settle
Jan 29 01:05:13 katiska /kernel: (ncr1:1:0): "SEAGATE ST15230N 0638" type 0 fixed SCSI 2
Jan 29 01:05:13 katiska /kernel: sd7(ncr1:1:0): Direct-Access 
Jan 29 01:05:13 katiska /kernel: sd7(ncr1:1:0): FAST SCSI-2 100ns (10 Mb/sec) offset 8.
Jan 29 01:05:13 katiska /kernel: 4095MB (8386733 512 byte sectors)
Jan 29 01:05:13 katiska /kernel: sd7(ncr1:1:0): with 3992 cyls, 19 heads, and an average 110 sectors/track
Jan 29 01:05:13 katiska /kernel: (ncr1:2:0): "SEAGATE ST15230N 0638" type 0 fixed SCSI 2
Jan 29 01:05:14 katiska /kernel: sd8(ncr1:2:0): Direct-Access 
Jan 29 01:05:14 katiska /kernel: sd8(ncr1:2:0): FAST SCSI-2 100ns (10 Mb/sec) offset 8.
Jan 29 01:05:14 katiska /kernel: 4095MB (8386733 512 byte sectors)
Jan 29 01:05:14 katiska /kernel: sd8(ncr1:2:0): with 3992 cyls, 19 heads, and an average 110 sectors/track
Jan 29 01:05:14 katiska /kernel: Probing for devices on the ISA bus:
Jan 29 01:05:14 katiska /kernel: vt0 at 0x60-0x6f irq 1 on motherboard
Jan 29 01:05:14 katiska /kernel: vt0: generic, 80/132 col, mono, 8 scr, mf2-kbd, [R3.20-b24]
Jan 29 01:05:14 katiska /kernel: ed0 not found at 0x280
Jan 29 01:05:14 katiska /kernel: lpt0 at 0x378-0x37f irq 7 on isa
Jan 29 01:05:14 katiska /kernel: lpt0: Interrupt-driven port
Jan 29 01:05:14 katiska /kernel: lp0: TCP/IP capable interface
Jan 29 01:05:14 katiska /kernel: lpt1 not found at 0xffffffff
Jan 29 01:05:14 katiska /kernel: sio0 at 0x3f8-0x3ff irq 4 on isa
Jan 29 01:05:14 katiska /kernel: sio0: type 16550A
Jan 29 01:05:14 katiska /kernel: sio1 at 0x2f8-0x2ff irq 3 on isa
Jan 29 01:05:14 katiska /kernel: sio1: type 16550A
Jan 29 01:05:14 katiska /kernel: pca0 on motherboard
Jan 29 01:05:15 katiska /kernel: pca0: PC speaker audio driver
Jan 29 01:05:15 katiska /kernel: cy0 not found
Jan 29 01:05:15 katiska /kernel: bt0 not found at 0x330
Jan 29 01:05:15 katiska /kernel: aha0 not found at 0x330
Jan 29 01:05:15 katiska /kernel: wdc0 not found at 0x1f0
Jan 29 01:05:15 katiska /kernel: fdc0 at 0x3f0-0x3f7 irq 6 drq 2 on isa
Jan 29 01:05:15 katiska /kernel: fdc0: NEC 72065B
Jan 29 01:05:15 katiska /kernel: fd0: 1.44MB 3.5in
Jan 29 01:05:15 katiska /kernel: matcdc0 not found at 0x230
Jan 29 01:05:15 katiska /kernel: npx0 on motherboard
Jan 29 01:05:15 katiska /kernel: npx0: INT 16 interface
Jan 29 01:05:15 katiska /kernel: changing root device to sd0a
Jan 29 01:05:15 katiska /kernel: devfs ready to run
Jan 29 01:05:15 katiska /kernel: new masks: bio c0000240, tty c003089a, net c003089a

>Description:

The kernel starts printing these messages:

Jan 28 22:30:49 katiska /kernel: assertion "cp" failed: file "../../pci/ncr.c", line 5544
Jan 28 22:30:49 katiska /kernel: sd8(ncr1:2:0): COMMAND FAILED (4 28) @f1a9be00.
Jan 28 22:30:49 katiska /kernel: assertion "cp" failed: file "../../pci/ncr.c", line 5544
Jan 28 22:30:49 katiska /kernel: sd8(ncr1:2:0): COMMAND FAILED (4 28) @f1a9be00.
Jan 28 22:30:49 katiska /kernel: assertion "cp" failed: file "../../pci/ncr.c", line 5544
Jan 28 22:30:49 katiska /kernel: sd8(ncr1:2:0): COMMAND FAILED (4 28) @f1a9be00.
Jan 28 22:30:50 katiska /kernel: assertion "cp" failed: file "../../pci/ncr.c", line 5544
Jan 28 22:30:50 katiska /kernel: sd8(ncr1:2:0): COMMAND FAILED (4 28) @f1a9be00.
Jan 28 22:30:50 katiska /kernel: assertion "cp" failed: file "../../pci/ncr.c", line 5544
Jan 28 22:30:50 katiska /kernel: sd8(ncr1:2:0): COMMAND FAILED (4 28) @f1a9be00.
Jan 28 22:30:50 katiska /kernel: assertion "cp" failed: file "../../pci/ncr.c", line 5544
Jan 28 22:30:50 katiska /kernel: sd8(ncr1:2:0): COMMAND FAILED (4 28) @f1a9be00.

After that, the disk reports I/O errors on every access.  Even a reboot
does not help, because the motherboard BIOS does not touch the second
controller at all; I have to power-cycle the machine to make things work
again.  The disk apparently gets into a state from which the FreeBSD driver
is not able to recover it.  The disk has been swapped several times, but
every replacement has been a Seagate.  I have tried a 2G Barracuda and two
different revisions of 4G Hawks (and yes, the mounting is well ventilated).
We have 4G Hawks on Suns at the local university and they don't seem to
give any trouble, but the load there is not as heavy.  This is a
load-related problem, as it only happens to news disks (the 2G Barracuda I
took out worked fine as a home directory disk until its 6-month MTBF was
reached).

>How-To-Repeat:

	Run a busy news server on Seagate disks (I think someone else
reported a similar problem with another brand, but I am not sure).

>Fix:
>Release-Note:
>Audit-Trail:
Responsible-Changed-From-To: freebsd-bugs->se 
Responsible-Changed-By: gpalmer 
Responsible-Changed-When: Mon Apr 8 19:10:45 PDT 1996 
Responsible-Changed-Why:  
Stefan Esser <se@FreeBSD.ORG> maintains the NCR driver 
State-Changed-From-To: open->closed 
State-Changed-By: se 
State-Changed-When: Sat Dec 14 06:42:10 PST 1996 
State-Changed-Why:  
This appears to have been a problem with a QUEUE FULL status
being returned by some drives under very heavy load. If a
drive shows this behaviour, then tags (tagged command queuing)
shouldn't currently be used with it. This will be fixed when Justin
Gibbs' new generic SCSI layer is used, since it will know how to
deal with resource shortages.
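
For illustration only, here is a minimal sketch of how a driver could react
to QUEUE FULL by throttling the number of outstanding tagged commands. It is
not code from ncr.c or from the new generic SCSI layer: the structure and the
throttle_* functions are hypothetical names. Only the status value 0x28 for
QUEUE FULL comes from the SCSI-2 standard. The "28" in the
"COMMAND FAILED (4 28)" messages above would be consistent with that status,
assuming the second number is the SCSI status byte printed in hexadecimal.

```c
#include <assert.h>
#include <stdbool.h>

/* SCSI-2 status byte values. */
#define SCSI_STATUS_GOOD       0x00
#define SCSI_STATUS_QUEUE_FULL 0x28

/* Hypothetical per-target bookkeeping; not the ncr driver's real data. */
struct tag_throttle {
    int max_tags;     /* current limit on outstanding tagged commands */
    int outstanding;  /* commands issued to the drive and not yet completed */
};

/* True if another tagged command may be sent to the drive. */
static bool throttle_can_issue(const struct tag_throttle *t)
{
    return t->outstanding < t->max_tags;
}

/* Account for a command that has just been sent to the drive. */
static void throttle_issue(struct tag_throttle *t)
{
    assert(throttle_can_issue(t));
    t->outstanding++;
}

/*
 * Account for a completed command. Returns true if the command was
 * rejected with QUEUE FULL and has to be queued again by the caller.
 */
static bool throttle_complete(struct tag_throttle *t, int scsi_status)
{
    assert(t->outstanding > 0);
    t->outstanding--;

    if (scsi_status != SCSI_STATUS_QUEUE_FULL)
        return false;

    /*
     * The drive could not accept this command, so its queue holds at
     * most the commands that are still outstanding. Lower the limit to
     * that number, but never below one.
     */
    if (t->outstanding < t->max_tags)
        t->max_tags = t->outstanding > 0 ? t->outstanding : 1;
    return true;
}
```

With a limit of 4 and three commands issued, a QUEUE FULL completion leaves
two commands outstanding and lowers the limit to 2, so nothing more is sent
until one of them completes. A real driver would also need some policy for
raising the limit again, which this sketch leaves out.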
>Unformatted:
