From nobody  Mon Aug 18 13:06:15 1997
Received: (from nobody@localhost)
          by hub.freebsd.org (8.8.5/8.8.5) id NAA14664;
          Mon, 18 Aug 1997 13:06:15 -0700 (PDT)
Message-Id: <199708182006.NAA14664@hub.freebsd.org>
Date: Mon, 18 Aug 1997 13:06:15 -0700 (PDT)
From: nick@webignite.com
To: freebsd-gnats-submit@freebsd.org
Subject: Dump backup utility completely crashes the machine 25% of the time.
X-Send-Pr-Version: www-1.0

>Number:         4333
>Category:       bin
>Synopsis:       Dump backup utility completely crashes the machine 25% of the time.
>Confidential:   no
>Severity:       critical
>Priority:       high
>Responsible:    gibbs
>State:          closed
>Quarter:        
>Keywords:       
>Date-Required:  
>Class:          sw-bug
>Submitter-Id:   current-users
>Arrival-Date:   Mon Aug 18 13:10:00 PDT 1997
>Closed-Date:    Mon Nov 2 11:29:48 PST 1998
>Last-Modified:  Mon Nov  2 11:30:30 PST 1998
>Originator:     Nick Tonkin
>Release:        2.2.1-Release
>Organization:
Web-Ignite Corp.
>Environment:
FreeBSD olympus.webignite.com 2.2.1-RELEASE FreeBSD 2.2.1-RELEASE #0: Thu May 29 13:03:40 PDT 1997     nick@olympus.webignite.com:/usr/src/sys/compile/052997.2  i386
>Description:
When using the "dump" utility to backup the filesystems,
a SCSI error appears to occur and the entire machine crashes.

This happens not every time, but about once every four or five
uses of "dump." The dump command issued as root is `dump Nusd 5000 42500 F`
where N is the dump level and F is the file system. It makes no
difference what the level of dump is, or which filesystem is being backed up.

I have had no other problems with the SCSI devices (two hard disk drives).

The SCSI tape drive is a Seagate DDS-2, model CTD8000H-S
The machine is a Dell Poweredge Pentium Pro 200 w/. 96Mb RAM


The error message when the machine crashes is as follows:

st0(ahc 0:6:0):SCB0x3 - timed out while idle, LASTPHASE == 0x1,SCSISIGI == 0x0
SEQ ADDR == 0x5
st0(ahc 0:6:0): Queueing an Abort SCB
st0(ahc 0:6:0): SCB0x3 - timed out while idle, LAST PHASE == 0x1, SCSISIGI == 0x0
SEQ ADDR == 0x5
st0(ahc 0:6:0): no longer in timeout
ahc0: Issued Channel A Bus Reset. 2 SCBs aborted

and then the machine has to be physically powered down and up again.
>How-To-Repeat:
Hmm, just keep running dump every day and within a week at the outside, it'll happen.
>Fix:

>Release-Note:
>Audit-Trail:
Responsible-Changed-From-To: freebsd-bugs->gibbs 
Responsible-Changed-By: gibbs 
Responsible-Changed-When: Mon Aug 18 21:09:14 PDT 1997 
Responsible-Changed-Why:  
My driver.  My PR.: 

State-Changed-From-To: open->closed 
State-Changed-By: gibbs 
State-Changed-When: Mon Nov 2 11:29:48 PST 1998 
State-Changed-Why:  
Believed to be fixed by CAM. 
>Unformatted:
This problem if not caused by a timeout value in the SCSI tape driver being
too short is most likely fixed in rev 1.123 (rev 1.81.2.26 on the RELENG_2_2
branch).  Please upgrade to a release of FreeBSD that contains these changes
and report back.  Information on installing 2.2-stable (which has this
fix) can be obtained from releng22.FreeBSD.org.
