From ip@albatross.mcc.ac.uk Thu Apr 22 04:48:20 1999
Return-Path: <ip@albatross.mcc.ac.uk>
Received: from serenity.mcc.ac.uk (serenity.mcc.ac.uk [130.88.200.93])
	by hub.freebsd.org (Postfix) with ESMTP id B63711599F
	for <FreeBSD-gnats-submit@freebsd.org>; Thu, 22 Apr 1999 04:44:49 -0700 (PDT)
	(envelope-from ip@albatross.mcc.ac.uk)
Received: from albatross.mcc.ac.uk ([130.88.202.16])
	by serenity.mcc.ac.uk with esmtp (Exim 1.92 #3)
	for FreeBSD-gnats-submit@freebsd.org
	id 10aHs4-000KCl-00; Thu, 22 Apr 1999 12:42:20 +0100
Received: (from ip@localhost)
	by albatross.mcc.ac.uk (8.9.2/8.9.1) id MAA37347;
	Thu, 22 Apr 1999 12:42:19 +0100 (BST)
	(envelope-from ip)
Message-Id: <199904221142.MAA37347@albatross.mcc.ac.uk>
Date: Thu, 22 Apr 1999 12:42:19 +0100 (BST)
From: ip@mcc.ac.uk
Sender: ip@albatross.mcc.ac.uk
Reply-To: ip@mcc.ac.uk
To: FreeBSD-gnats-submit@freebsd.org
Subject: 3.1-STABLE panics under heavy SCSI load, possibly aha 1542 related
X-Send-Pr-Version: 3.2

>Number:         11280
>Category:       kern
>Synopsis:       3.1-STABLE panics under heavy SCSI load, possibly aha 1542 related
>Confidential:   no
>Severity:       serious
>Priority:       high
>Responsible:    freebsd-bugs
>State:          closed
>Quarter:        
>Keywords:       
>Date-Required:  
>Class:          sw-bug
>Submitter-Id:   current-users
>Arrival-Date:   Thu Apr 22 04:50:02 PDT 1999
>Closed-Date:    Fri May 28 15:10:29 PDT 1999
>Last-Modified:  Fri May 28 15:11:06 PDT 1999
>Originator:     Ian Pallfreeman
>Release:        FreeBSD 3.1-STABLE i386
>Organization:
>Environment:

Box runs mainly sendmail, majordomo & mhonarc. Occasional bursts of traffic
to 20,000+ list addresses or processes growing to 200MB+, deliberately
throttled with old, slow disks and 1542 controller. Problem may also be
occuring on big squid boxes (similar symptoms), but owners are linux 
weenies and haven't a clue about ddb. Note that crash dump doesn't work.

>Description:

Fatal trap 12: page fault while in kernel mode
fault virtual address   = 0x18
fault code              = supervisor write, page not present
instruction pointer     = 0x8:0xf012519d
stack pointer           = 0x10:0xf4631aa4
frame pointer           = 0x10:0xf4631abc
code segment            = base 0x0, limit 0xfffff, type 0x1b
                        = DPL 0, pres 1, def32 1, gran 1
processor eflags        = interrupt enabled, resume, IOPL = 0
current process         = 85623 (sendmail)
interrupt mask          = cam 
kernel: type 12 trap, code=0
Stopped at      ahaexecuteccb+0x2b5:    movl    %edi,0x18(%eax)
db> trace
ahaexecuteccb(f3cb3508,f4631ad4,f,0,5000) at ahaexecuteccb+0x2b5
bus_dmamap_load(f07b8d00,f07b8880,f3090000,0,f0124ee8) at bus_dmamap_load+0x2e3
ahaaction(f07b87c0,f07c2200,1,f07b7b70,f07c2200) at ahaaction+0x334
xpt_run_dev_sendq(f07b8780,f07ddc00,0,f07c2200,80000000) at xpt_run_dev_sendq+0x1da
xpt_action(f07c2200,80000000) at xpt_action+0x1c5
dastart(f07da000,f07c2200,f07c2200,f07b7b70,1,80000000) at dastart+0x1cc
xpt_run_dev_allocq(f07b8780,f07ddc00,f1c23058,80000000,80000000) at xpt_run_dev_allocq+0xa7
xpt_schedule(f07da000,1,80000000,f07ddc00,f1c23058) at xpt_schedule+0xbe
dastrategy(f1c23058,f1c23058,f4631cf8,f0164209,f4631d20) at dastrategy+0x84
spec_strategy(f4631d20,f4631d04,f01ba501,f4631d20,f4631d3c) at spec_strategy+0x3e
spec_vnoperate(f4631d20,f4631d3c,f01b9f07,f4631d20,60000054) at spec_vnoperate+0x15
ufs_vnoperatespec(f4631d20,60000054,f1c23058,80000000,f09a2100) at ufs_vnoperatespec+0x15
ufs_strategy(f4631d60,f4631d6c,f0151173,f4631d60,f1c24c18) at ufs_strategy+0xd3
ufs_vnoperate(f4631d60) at ufs_vnoperate+0x15
bwrite(f1c23058,f4631d84,f01555a9,f4631d9c,f4631d90) at bwrite+0xaf
vop_stdbwrite(f4631d9c,f4631d90,f01ba4d1,f4631d9c,f4631da4) at vop_stdbwrite+0xe
vop_defaultop(f4631d9c,f4631da4,f0151370,f4631d9c,f020c298) at vop_defaultop+0x15
ufs_vnoperate(f4631d9c,f020c298,f1c23058,f4631dcc,f015528b) at ufs_vnoperate+0x15
bawrite(f1c23058,f1c24c18,f43ee440,f4631f44,80000000) at bawrite+0x2c
cluster_wbuild(f43ee440,2000,24,8,f1c24c18) at cluster_wbuild+0x397
cluster_write(f1c24c18,48000,0) at cluster_write+0x304
ffs_write(f4631f08,f4631f94,2000,f447ab00,f07b9600) at ffs_write+0x56f
vn_write(f0838d40,f4631f44,f08d9200,f447ab00,f020e1b4) at vn_write+0xef
write(f447ab00,f4631f94) at write+0x96
syscall(27,27,9203000,2812a4a4,efbfbc80) at syscall+0x127
Xint0x80_syscall() at Xint0x80_syscall+0x2c
db> show registers
cs                 0x8
ds                0x10
es                0x10
ss                0x10
eax                  0
ecx                  0
edx         0xc00c0000
ebx         0xf3cb3508
esp         0xf4631aa4
ebp         0xf4631abc
esi         0xf07b9e00
edi         0xf07c2214
eip         0xf012519d  ahaexecuteccb+0x2b5
efl            0x10286
ahaexecuteccb+0x2b5:    movl    %edi,0x18(%eax)
db> ps
  pid   proc     addr    uid  ppid  pgrp  flag stat wmesg   wchan   cmd
86138 f447b4a0 f4609000   99 86136 86135 004184  3  piperd f4375ac0 sendmail
86136 f447a580 f463e000   99 86135 86135 004004  2                  perl
86135 f435a1e0 f445c000    1 86132 86135 004084  3    wait f435a1e0 sh
86132 f447a160 f464a000    0     1   167 000184  3    wait f447a160 sendmail
85623 f447ab00 f4630000   99     1 85623 000104  2                  sendmail
85520 f435a760 f443c000   99 85512 85520 004086  3   ttyin f021f5cc bash
85512 f447b340 f460d000  132 85511 85512 004086  3    wait f447b340 bash
85511 f4655ba0 f465f000    0   160 85511 004184  2                  rlogind
58462 f447c260 f4564000   99  1958 58462 004086  3   ttyin f021f3e4 bash
52552 f435bc00 f43da000 65534   200   200 000184  3   lockf f08bfa80 httpd
51145 f447c520 f454c000 65534   200   200 000184  3   lockf f09ccd80 httpd
49354 f447cd60 f452f000 65534   200   200 000184  3   lockf f08998c0 httpd
47541 f435ace0 f444b000 65534   200   200 000184  3   lockf f09aac00 httpd
45822 f447e200 f44ad000 65534   200   200 000184  3  accept f42d1176 httpd
45420 f447df40 f44b4000 65534   200   200 000184  3   lockf f09cbcc0 httpd
32748 f435b100 f4423000 65534   200   200 000184  3   lockf f097e940 httpd
 2852 f447e0a0 f44b1000    0  2847  2852 004086  3   ttyin f021f4d8 bash
 2847 f447ee60 f4480000  503  2846  2847 004082  3    wait f447ee60 bash
 2846 f447d2e0 f44f7000    0   160  2846 004184  2                  rlogind
 1958 f435a340 f4469000    0   326  1958 004082  3    wait f435a340 bash
  326 f435b7e0 f43fc000  503   325   326 004082  3    wait f435b7e0 bash
  325 f435d8e0 f436d000    0   160   325 004184  3  select f0234f48 rlogind
  247 f435baa0 f43e7000    0     1   247 004082  3   ttyin f0233d98 getty
  246 f435c5a0 f43ba000    0     1   246 004082  3   ttyin f02312e0 getty
  245 f435c860 f43ae000    0     1   245 004082  3   ttyin f02311ec getty
  206 f435c2e0 f43c4000 1001     1     5 004186  3  select f0234f48 msql2d
  200 f435c440 f43c0000    0     1   200 000184  3  select f0234f48 httpd
  167 f435c9c0 f43aa000    0     1   167 000184  2                  sendmail
  163 f435cb20 f43a6000    0     1   163 000084  3  nanslp f021f360 cron
  160 f435d200 f4397000    0     1   160 000084  3  select f0234f48 inetd
  134 f435cc80 f43a3000    0     1   130 000080  3  nfsidl f0237028 nfsiod
  133 f435cde0 f43a0000    0     1   130 000080  3  nfsidl f0237024 nfsiod
  132 f435cf40 f439d000    0     1   130 000080  3  nfsidl f0237020 nfsiod
  131 f435d0a0 f439a000    0     1   130 000084  3  nfsidl f023701c nfsiod
  119 f435d4c0 f4386000    1     1   119 000184  3  select f0234f48 portmap
  115 f435d360 f4389000    0     1   115 000084  3  select f0234f48 xntpd
  105 f435d620 f4383000    0     1   105 000084  2                  syslogd
    4 f435da40 f4368000    0     0     0 000204  3  syncer f0234ef8 syncer
    3 f435dba0 f4366000    0     0     0 000204  3  psleep f022e764 vmdaemon
    2 f435dd00 f4364000    0     0     0 000204  3  psleep f0217760 pagedaemon
    1 f435de60 f4362000    0     0     1 004084  3    wait f435de60 init
    0 f023422c f0281000    0     0     0 000204  3   sched f023422c swapper
db> panic
panic: from debugger
(da0:aha0:0:0:0): Synchronize cache failed, status == 0xb, scsi status == 0x0
(da1:aha0:0:3:0): Synchronize cache failed, status == 0xb, scsi status == 0x0

dumping to dev 20409, offset 712212
dump Aborting dump due to I/O error.
status == 0xb, scsi status == 0x0
i/o error
Automatic reboot in 15 seconds - press a key on the console to abort

/kernel text=0x108a8a data=0x12c1c+0x1df4c syms=[0x4+0x1c050+0x4+0x1db51]
Copyright (c) 1992-1999 FreeBSD Inc.
Copyright (c) 1982, 1986, 1989, 1991, 1993
        The Regents of the University of California. All rights reserved.
FreeBSD 3.1-STABLE #6: Mon Apr 19 13:31:13 BST 1999
    ip@karma:/usr/src/sys/compile/VERACITY
Timecounter "i8254"  frequency 1193182 Hz
Timecounter "TSC"  frequency 90000600 Hz
CPU: Pentium/P54C (90.00-MHz 586-class CPU)
  Origin = "GenuineIntel"  Id = 0x525  Stepping=5
  Features=0x1bf<FPU,VME,DE,PSE,TSC,MSR,MCE,CX8>
real memory  = 67108864 (65536K bytes)
avail memory = 62734336 (61264K bytes)
Preloaded elf kernel "kernel" at 0xf0276000.
Probing for devices on PCI bus 0:
chip0: <Intel 82439TX System Controller (MTXC)> rev 0x01 on pci0.0.0
chip1: <Intel 82371AB PCI to ISA bridge> rev 0x01 on pci0.7.0
ide_pci0: <Intel PIIX4 Bus-master IDE controller> rev 0x01 on pci0.7.1
chip2: <Intel 82371AB Power management controller> rev 0x01 on pci0.7.3
Probing for devices on the ISA bus:
sc0 on isa
sc0: VGA color <16 virtual consoles, flags=0x0>
atkbdc0 at 0x60-0x6f on motherboard
atkbd0 irq 1 on isa
sio0 at 0x3f8-0x3ff irq 4 flags 0x10 on isa
sio0: type 16550A, console
fdc0 at 0x3f0-0x3f7 irq 6 drq 2 on isa
fdc0: FIFO enabled, 8 bytes threshold
fd0: 1.44MB 3.5in
wdc0 at 0x1f0-0x1f7 irq 14 flags 0x80ff80ff on isa
wdc0: unit 0 (wd0): <ST1144AT>, 32-bit
wd0: 124MB (255255 sectors), 1001 cyls, 15 heads, 17 S/T, 512 B/S
wdc1 at 0x170-0x177 irq 15 flags 0xa0ff80ff on isa
wdc1: unit 0 (wd2): <ST3660A>, 32-bit, multi-block-16
wd2: 520MB (1065456 sectors), 1057 cyls, 16 heads, 63 S/T, 512 B/S
wdc1: unit 1 (wd3): <SAMSUNG VG34323A   (4.32GB)>, DMA, 32-bit, multi-block-16
wd3: 4124MB (8446032 sectors), 14896 cyls, 9 heads, 63 S/T, 512 B/S
aha0 at 0x330-0x333 irq 11 drq 5 on isa
aha0: AHA-1540/1542 64 head BIOS FW Rev. 0.5 (ID=41) SCSI Host Adapter, SCSI ID 
7, 16 CCBs
1 3C5x9 board(s) on ISA found at 0x300
ep0 at 0x300-0x30f irq 10 on isa
ep0: aui/utp/bnc[*UTP*] address 00:20:af:38:ef:f8
vga0 at 0x3b0-0x3df maddr 0xa0000 msize 131072 on isa
npx0 on motherboard
npx0: INT 16 interface
Intel Pentium detected, installing workaround for F00F bug
Waiting 5 seconds for SCSI devices to settle
chada0 at aha0 bus 0 target 0 lun 0
da0: <CONNER CP30200  SUN0207 4234> Fixed Direct Access SCSI-2 device 
da0: 3.300MB/s transfers
da0: 203MB (416108 512 byte sectors: 64H 32S/T 203C)
da1 at aha0 bus 0 target 3 lun 0
da1: <SEAGATE ST1480   SUN0424 6266> Fixed Direct Access SCSI-2 device 
da1: 3.300MB/s transfers
da1: 411MB (843284 512 byte sectors: 64H 32S/T 411C)
nging root device to wd0s1a
WARNING: / was not properly dismounted

>How-To-Repeat:
>Fix:
>Release-Note:
>Audit-Trail:

From: Andy Farkas <andyf@speednet.com.au>
To: ip@mcc.ac.uk
Cc: FreeBSD-gnats-submit@FreeBSD.ORG
Subject: Re: kern/11280: 3.1-STABLE panics under heavy SCSI load, possibly
 aha 1542 related
Date: Thu, 22 Apr 1999 23:52:01 +1000 (EST)

 > >Number:         11280
 > >Category:       kern
 > >Synopsis:       3.1-STABLE panics under heavy SCSI load, possibly aha 1542 related
 > >Confidential:   yes
 > >Severity:       serious
 > >Priority:       high
 > >Responsible:    freebsd-bugs
 > >State:          open
 > >Quarter:        
 > >Keywords:       
 > >Date-Required:
 > >Class:          sw-bug
 > >Submitter-Id:   current-users
 > >Arrival-Date:   Thu Apr 22 04:50:02 PDT 1999
 > >Closed-Date:
 > >Last-Modified:
 > >Originator:     Ian Pallfreeman
 > >Release:        FreeBSD 3.1-STABLE i386
 > >Organization:
 > >Environment:
 > 
 > Box runs mainly sendmail, majordomo & mhonarc. Occasional bursts of traffic
 > to 20,000+ list addresses or processes growing to 200MB+, deliberately
 > throttled with old, slow disks and 1542 controller. Problem may also be
 > occuring on big squid boxes (similar symptoms), but owners are linux 
 > weenies and haven't a clue about ddb. Note that crash dump doesn't work.
 > 
 
 
 I also had a system crash on me during a copy (cp -pR) between two disks
 attached to an aha-1542 scsi card!  I must say, our hardware seems awfully
 similar:
 
 Copyright (c) 1992-1999 FreeBSD Inc.
 Copyright (c) 1982, 1986, 1989, 1991, 1993
 	The Regents of the University of California. All rights reserved.
 FreeBSD 3.0-STABLE #0: Sat Feb  6 12:53:42 EST 1999
     root@zippy.zippynet.iol.net.au:/usr/src/sys/compile/GENERIC
 Timecounter "i8254"  frequency 1193182 Hz
 CPU: Pentium/P55C (233.87-MHz 586-class CPU)
   Origin = "GenuineIntel"  Id = 0x543  Stepping=3
   Features=0x8001bf<FPU,VME,DE,PSE,TSC,MSR,MCE,CX8,MMX>
 real memory  = 67108864 (65536K bytes)
 config> quit
 avail memory = 61935616 (60484K bytes)
 Preloaded elf kernel "kernel" at 0xf0338000.
 Probing for devices on PCI bus 0:
 chip0: <Intel 82437VX PCI cache memory controller> rev 0x02 on pci0.0.0
 chip1: <Intel 82371SB PCI to ISA bridge> rev 0x01 on pci0.7.0
 ide_pci0: <Intel PIIX3 Bus-master IDE controller> rev 0x00 on pci0.7.1
 fxp0: <Intel EtherExpress Pro 10/100B Ethernet> rev 0x05 int a irq 11 on pci0.19.0
 fxp0: Ethernet address 00:a0:c9:cf:89:b1
 vga0: <S3 Trio graphics accelerator> rev 0x54 int a irq 15 on pci0.20.0
 Probing for devices on the ISA bus:
 sc0 on isa
 sc0: VGA color <16 virtual consoles, flags=0x0>
 ed0 not found at 0x280
 fe0 not found at 0x300
 atkbdc0 at 0x60-0x6f on motherboard
 atkbd0 irq 1 on isa
 psm0 not found
 sio0 at 0x3f8-0x3ff irq 4 flags 0x10 on isa
 sio0: type 16550A
 sio1 at 0x2f8-0x2ff irq 3 on isa
 sio1: type 16550A
 lpt0 at 0x378-0x37f irq 7 on isa
 lpt0: Interrupt-driven port
 lp0: TCP/IP capable interface
 fdc0 at 0x3f0-0x3f7 irq 6 drq 2 on isa
 fdc0: FIFO enabled, 8 bytes threshold
 fd0: 1.44MB 3.5in
 wdc0 at 0x1f0-0x1f7 irq 14 on isa
 wdc0: unit 0 (wd0): <QUANTUM TRB850A>
 wd0: 810MB (1660176 sectors), 1647 cyls, 16 heads, 63 S/T, 512 B/S
 wdc1 not found at 0x170
 wt0 not found at 0x300
 mcd0 not found at 0x300
 matcdc0 not found at 0x230
 scd0 not found at 0x230
 ie0: unknown board_id: f000
 ie0 not found at 0x300
 ep0 not found at 0x300
 ex0 not found
 le0 not found at 0x300
 lnc0 not found at 0x280
 ze0 not found at 0x300
 zp0 not found at 0x300
 cs0 not found at 0x300
 adv0 not found at 0x330
 bt0 not found at 0x134
 aha0 at 0x330-0x333 irq 10 drq 5 on isa
 aha0: AHA-1542CF FW Rev. C.0 (ID=45) SCSI Host Adapter, SCSI ID 7, 16 CCBs
 vga0 at 0x3b0-0x3df maddr 0xa0000 msize 131072 on isa
 npx0 on motherboard
 npx0: INT 16 interface
 Intel Pentium detected, installing workaround for F00F bug
 Waiting 15 seconds for SCSI devices to settle
 changing root device to wd0s1a
 da1 at aha0 bus 0 target 1 lun 0
 da1: <MICROP 4110-09NB_Nov18b TN0F> Fixed Direct Access SCSI-2 device 
 da1: 5.0MB/s transfers (5.0MHz, offset 8)
 da1: 1002MB (2053880 512 byte sectors: 64H 32S/T 1002C)
 da0 at aha0 bus 0 target 0 lun 0
 da0: <QUANTUM FIREBALL_TM2110S 300X> Fixed Direct Access SCSI-2 device 
 da0: 5.0MB/s transfers (5.0MHz, offset 8)
 da0: 2014MB (4124736 512 byte sectors: 64H 32S/T 2014C)
 WARNING: / was not properly dismounted
 
 
 -- 
   
  :{ andyf@speednet.com.au
   
         Andy Farkas
     System Administrator
    Speed Internet Services
  http://www.speednet.com.au/
   
 
 
 
State-Changed-From-To: open->closed 
State-Changed-By: nrahlstr 
State-Changed-When: Fri May 28 15:10:29 PDT 1999 
State-Changed-Why:  
Closed at request of originator, possible hardware error. 
>Unformatted:
