From nobody@FreeBSD.org  Sun Sep 15 11:10:00 2002
Return-Path: <nobody@FreeBSD.org>
Received: from mx1.FreeBSD.org (mx1.FreeBSD.org [216.136.204.125])
	by hub.freebsd.org (Postfix) with ESMTP id 51A2937B400
	for <freebsd-gnats-submit@FreeBSD.org>; Sun, 15 Sep 2002 11:10:00 -0700 (PDT)
Received: from www.freebsd.org (www.FreeBSD.org [216.136.204.117])
	by mx1.FreeBSD.org (Postfix) with ESMTP id E90E643E4A
	for <freebsd-gnats-submit@FreeBSD.org>; Sun, 15 Sep 2002 11:09:59 -0700 (PDT)
	(envelope-from nobody@FreeBSD.org)
Received: from www.freebsd.org (localhost [127.0.0.1])
	by www.freebsd.org (8.12.4/8.12.4) with ESMTP id g8FI9xOT003221
	for <freebsd-gnats-submit@FreeBSD.org>; Sun, 15 Sep 2002 11:09:59 -0700 (PDT)
	(envelope-from nobody@www.freebsd.org)
Received: (from nobody@localhost)
	by www.freebsd.org (8.12.4/8.12.4/Submit) id g8FI9xR0003220;
	Sun, 15 Sep 2002 11:09:59 -0700 (PDT)
Message-Id: <200209151809.g8FI9xR0003220@www.freebsd.org>
Date: Sun, 15 Sep 2002 11:09:59 -0700 (PDT)
From: "Paul G. LeDuc" <paul.leduc@appliedheuristics.com>
To: freebsd-gnats-submit@FreeBSD.org
Subject: /kernel: ad0: UDMA ICRC READ ERROR 
X-Send-Pr-Version: www-1.0

>Number:         42813
>Category:       i386
>Synopsis:       /kernel: ad0: UDMA ICRC READ ERROR
>Confidential:   no
>Severity:       critical
>Priority:       high
>Responsible:    freebsd-bugs
>State:          closed
>Quarter:        
>Keywords:       
>Date-Required:  
>Class:          sw-bug
>Submitter-Id:   current-users
>Arrival-Date:   Sun Sep 15 11:10:01 PDT 2002
>Closed-Date:    Mon May 05 05:50:25 PDT 2003
>Last-Modified:  Mon May 05 05:50:25 PDT 2003
>Originator:     Paul G. LeDuc
>Release:        4.2
>Organization:
Applied Heuristics
>Environment:
FreeBSD cbs1.vault.appliedheuristics.com 4.2-RELEASE FreeBSD 4.2-RELEASE #0: Mon Nov 20 13:02:55 GMT 2000     jkh@bento.FreeBSD.org:/usr/src/sys/compile/GENERIC  i386
>Description:
I receive the following messages for the hard drive:

Sep 15 10:08:38 cbs1 /kernel: ad0: UDMA ICRC READ ERROR blk# 4763052 retrying
Sep 15 10:08:39 cbs1 last message repeated 2 times
Sep 15 10:08:39 cbs1 /kernel: ad0: UDMA ICRC READ ERROR blk# 4763052 falling back to PIO mode


If the system is rebooted, the problem clears up for a while but then returns.  The system eventually becomes unusable.  System information follows:

Sep 15 10:07:40 cbs1 /kernel: Rebooting...
Sep 15 10:07:40 cbs1 /kernel: Copyright (c) 1992-2000 The FreeBSD Project.
Sep 15 10:07:40 cbs1 /kernel: Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
Sep 15 10:07:40 cbs1 /kernel: The Regents of the University of California. All rights reserved.
Sep 15 10:07:40 cbs1 /kernel: FreeBSD 4.2-RELEASE #0: Mon Nov 20 13:02:55 GMT 2000
Sep 15 10:07:40 cbs1 /kernel: jkh@bento.FreeBSD.org:/usr/src/sys/compile/GENERIC
Sep 15 10:07:40 cbs1 /kernel: Timecounter "i8254"  frequency 1193182 Hz
Sep 15 10:07:40 cbs1 /kernel: CPU: AMD Athlon(tm) Processor (805.64-MHz 686-class CPU)
Sep 15 10:07:40 cbs1 /kernel: Origin = "AuthenticAMD"  Id = 0x621  Stepping = 1
Sep 15 10:07:40 cbs1 /kernel: Features=0x183f9ff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR>
Sep 15 10:07:40 cbs1 /kernel: AMD Features=0xc0400000<AMIE,DSP,3DNow!>
Sep 15 10:07:40 cbs1 /kernel: real memory  = 402587648 (393152K bytes)
Sep 15 10:07:40 cbs1 /kernel: config> di pcic0
Sep 15 10:07:40 cbs1 /kernel: config> di sn0
Sep 15 10:07:40 cbs1 /kernel: config> di lnc0
Sep 15 10:07:40 cbs1 /kernel: config> di ie0
Sep 15 10:07:40 cbs1 /kernel: config> di fe0
Sep 15 10:07:40 cbs1 /kernel: config> di ed0
Sep 15 10:07:40 cbs1 /kernel: config> di cs0
Sep 15 10:07:40 cbs1 /kernel: config> q
Sep 15 10:07:40 cbs1 /kernel: avail memory = 387465216 (378384K bytes)
Sep 15 10:07:40 cbs1 /kernel: Preloaded elf kernel "kernel" at 0xc0436000.
Sep 15 10:07:40 cbs1 /kernel: Preloaded userconfig_script "/boot/kernel.conf" at 0xc043609c.
Sep 15 10:07:40 cbs1 /kernel: Pentium Pro MTRR support enabled
Sep 15 10:07:40 cbs1 /kernel: md0: Malloc disk
Sep 15 10:07:40 cbs1 /kernel: npx0: <math processor> on motherboard
Sep 15 10:07:40 cbs1 /kernel: npx0: INT 16 interface
Sep 15 10:07:40 cbs1 /kernel: pcib0: <AMD-751 host to PCI bridge> on motherboard
Sep 15 10:07:40 cbs1 /kernel: pci0: <PCI bus> on pcib0
Sep 15 10:07:40 cbs1 /kernel: pcib1: <AMD-751 PCI-PCI (AGP) bridge> at device 1.0 on pci0
Sep 15 10:07:40 cbs1 /kernel: pci1: <PCI bus> on pcib1
Sep 15 10:07:40 cbs1 /kernel: pci1: <NVidia Riva TNT2 graphics accelerator> at 5.0 irq 11
Sep 15 10:07:40 cbs1 /kernel: isab0: <VIA 82C686 PCI-ISA bridge> at device 4.0 on pci0
Sep 15 10:07:40 cbs1 /kernel: isa0: <ISA bus> on isab0
Sep 15 10:07:40 cbs1 /kernel: atapci0: <VIA 82C686 ATA66 controller> port 0x1060-0x106f at device 4.1 on pci0
Sep 15 10:07:40 cbs1 /kernel: ata0: at 0x1f0 irq 14 on atapci0
Sep 15 10:07:40 cbs1 /kernel: uhci0: <VIA 83C572 USB controller> port 0x1040-0x105f irq 9 at device 4.2 on pci0
Sep 15 10:07:40 cbs1 /kernel: usb0: <VIA 83C572 USB controller> on uhci0
Sep 15 10:07:40 cbs1 /kernel: usb0: USB revision 1.0
Sep 15 10:07:40 cbs1 /kernel: uhub0: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
Sep 15 10:07:40 cbs1 /kernel: uhub0: 2 ports with 2 removable, self powered
Sep 15 10:07:40 cbs1 /kernel: chip1: <VIA 82C686 ACPI interface> at device 4.4 on pci0
Sep 15 10:07:40 cbs1 /kernel: fxp0: <Intel Pro 10/100B/100+ Ethernet> port 0x1000-0x103f mem 0xf4000000-0xf40fffff,0xf4100000-0xf4100fff irq 9 at device 16.0 on pci0
Sep 15 10:07:40 cbs1 /kernel: fxp0: Ethernet address 00:02:b3:4c:6a:f2
Sep 15 10:07:40 cbs1 /kernel: fdc0: <NEC 72065B or clone> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
Sep 15 10:07:40 cbs1 /kernel: fdc0: FIFO enabled, 8 bytes threshold
Sep 15 10:07:40 cbs1 /kernel: fd0: <1440-KB 3.5" drive> on fdc0 drive 0
Sep 15 10:07:40 cbs1 /kernel: atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
Sep 15 10:07:40 cbs1 /kernel: atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0
Sep 15 10:07:40 cbs1 /kernel: kbd0 at atkbd0
Sep 15 10:07:40 cbs1 /kernel: psm0: <PS/2 Mouse> irq 12 on atkbdc0
Sep 15 10:07:40 cbs1 /kernel: psm0: model Generic PS/2 mouse, device ID 0
Sep 15 10:07:40 cbs1 /kernel: vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
Sep 15 10:07:40 cbs1 /kernel: sc0: <System console> at flags 0x100 on isa0
Sep 15 10:07:40 cbs1 /kernel: sc0: VGA <16 virtual consoles, flags=0x300>
Sep 15 10:07:40 cbs1 /kernel: sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
Sep 15 10:07:40 cbs1 /kernel: sio0: type 16550A
Sep 15 10:07:40 cbs1 /kernel: sio1 at port 0x2f8-0x2ff irq 3 on isa0
Sep 15 10:07:40 cbs1 /kernel: sio1: type 16550A
Sep 15 10:07:40 cbs1 /kernel: ppc0: <Parallel port> at port 0x378-0x37f irq 7 on isa0
Sep 15 10:07:40 cbs1 /kernel: ppc0: Generic chipset (EPP/NIBBLE) in COMPATIBLE mode
Sep 15 10:07:40 cbs1 /kernel: plip0: <PLIP network interface> on ppbus0
Sep 15 10:07:40 cbs1 /kernel: lpt0: <Printer> on ppbus0
Sep 15 10:07:40 cbs1 /kernel: lpt0: Interrupt-driven port
Sep 15 10:07:40 cbs1 /kernel: ppi0: <Parallel I/O> on ppbus0
Sep 15 10:07:40 cbs1 /kernel: ad0: 76319MB <WDC WD800JB-00CRA1> [155061/16/63] at ata0-master UDMA66
Sep 15 10:07:40 cbs1 /kernel: acd0: CDROM <CDU5211> at ata0-slave using PIO4
Sep 15 10:07:40 cbs1 /kernel: Mounting root from ufs:/dev/ad0s1a
Sep 15 10:07:40 cbs1 named[122]: starting.  named 8.2.3-T6B Mon Nov 20 11:27:49 GMT 2000 	jkh@bento.FreeBSD.org:/usr/obj/usr/src/usr.sbin/named
Sep 15 10:07:40 cbs1 named[122]: limit files set to fdlimit (1024)
Sep 15 10:07:40 cbs1 named[123]: Ready to answer queries.
Sep 15 10:07:52 cbs1 login: ROOT LOGIN (root) ON ttyv0
Sep 15 10:08:38 cbs1 /kernel: ad0: UDMA ICRC READ ERROR blk# 4763052 retrying
Sep 15 10:08:39 cbs1 last message repeated 2 times
Sep 15 10:08:39 cbs1 /kernel: ad0: UDMA ICRC READ ERROR blk# 4763052 falling back to PIO mode

This box also runs Samba.  As the above errors worsen, I receive microuptime() ... time gone backwards errors as well.  I do not know if this is related or not.

>How-To-Repeat:
Seems to happen during any steady disk activity.
>Fix:
      
>Release-Note:
>Audit-Trail:
State-Changed-From-To: open->feedback 
State-Changed-By: sos 
State-Changed-When: Sun Sep 15 11:53:00 PDT 2002 
State-Changed-Why:  
You should definitely upgrade to at least 4.6.2; the VIA support 
code has changed quite a bit since 4.2. 

http://www.freebsd.org/cgi/query-pr.cgi?pr=42813 

From: Ulrich 'Q' Spoerlein <q@uni.de>
To: freebsd-gnats-submit@FreeBSD.org
Cc:  
Subject: Re: i386/42813: /kernel: ad0: UDMA ICRC READ ERROR
Date: Mon, 16 Sep 2002 19:19:52 +0200

 On Sun, 15 Sep 2002 11:53:52 -0700 (PDT), you wrote:
 
 >Synopsis: /kernel: ad0: UDMA ICRC READ ERROR
 >
 >State-Changed-From-To: open->feedback
 >State-Changed-By: sos
 >State-Changed-When: Sun Sep 15 11:53:00 PDT 2002
 >State-Changed-Why: 
 >You should definitely upgrade to at least 4.6.2; the VIA support
 >code has changed quite a bit since 4.2.
 
 i'm _not_ the originator of this PR, but i witnessed similar behaviour
 when i switched harddrives in two of our servers.
 
 the original setup was:
 Machine A 4.6.1-RELEASE-p10:
 atapci0: <AcerLabs Aladdin ATA33 controller> port 0xb000-0xb00f irq 0 at device 15.0 on pci0
 ata0: at 0x1f0 irq 14 on atapci0
 ata1: at 0x170 irq 15 on atapci0
 ad0: 9787MB <QUANTUM FIREBALL CX10.2A> [19885/16/63] at ata0-master UDMA33
 
 (irq 0 ?????)
 
 Machine B 4.7-PRERELEASE:
 atapci0: <AcerLabs Aladdin ATA33 controller> port 0xf000-0xf00f at device 15.0 on pci0
 ata0: at 0x1f0 irq 14 on atapci0
 ata1: at 0x170 irq 15 on atapci0
 ad0: 9671MB <IBM-DTTA-351010> [19650/16/63] at ata0-master UDMA33
 
 both setups were running PIO Mode (IIRC), but since the hardware on
 machine A was causing Signal 11s and core dumps we decided to switch
 harddisks and see if the problem still exists (i don't trust Ali
 hardware...)
 
 so now the Quantum HDD is inside the 4.7-PRERELEASE machine and this is
 what i get at startup:
 ad0: 9787MB <QUANTUM FIREBALL CX10.2A> [19885/16/63] at ata0-master UDMA33
 Mounting root from ufs:/dev/ad0s1a
 ad0s1a: UDMA ICRC error reading fsbn 192 of 96-127 (ad0s1 bn 192; cn 0 tn 3 sn 3) retrying
 ad0s1a: UDMA ICRC error reading fsbn 192 of 96-127 (ad0s1 bn 192; cn 0 tn 3 sn 3) retrying
 ad0s1a: UDMA ICRC error reading fsbn 72128 of 36064-36191 (ad0s1 bn 72128; cn 114 tn 4 sn 56) retrying
 ad0s1a: UDMA ICRC error reading fsbn 72128 of 36064-36191 (ad0s1 bn 72128; cn 114 tn 4 sn 56) retrying
 ad0s1a: UDMA ICRC error reading fsbn 72128 of 36064-36191 (ad0s1 bn 72128; cn 114 tn 4 sn 56) retrying
 ad0s1a: UDMA ICRC error reading fsbn 72128 of 36064-36191 (ad0s1 bn 72128; cn 114 tn 4 sn 56) falling back to PIO mode
 
 this didn't show up when the quantum was in the 4.6.1 machine, and it
 didn't show up with the IBM drive under 4.7-pre
 
 i'm pretty sure that UDMA with Ali chipsets is just plain broken, but
 wouldn't it be better to run PIO as the default with these chipsets?
 or is anyone running UDMA with one of these things?
 
 PS: sorry for double-post :(
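
A side note on reading the ICRC messages quoted above: the "bn" value and the "cn/tn/sn" triple in each message appear to describe the same sector, once as a linear block number and once as cylinder/track/sector. The sketch below only checks that arithmetic. The helper name is made up for illustration, and the figure of 10 tracks per cylinder is an assumption inferred from the two messages (the probe line reports 16 heads, which does not reproduce bn 72128); it may simply reflect a different geometry in the disklabel.

```python
def chs_to_block(cn, tn, sn, tracks_per_cyl, sectors_per_track):
    """Convert cylinder/track/sector to a linear block number.

    Hypothetical helper: sn is treated as a zero-based offset within the track.
    """
    return (cn * tracks_per_cyl + tn) * sectors_per_track + sn


SECTORS_PER_TRACK = 63  # agrees with the [19885/16/63] probe line
TRACKS_PER_CYL = 10     # ASSUMPTION: inferred to fit the messages, not stated anywhere

# (cn, tn, sn) triples and the bn values copied from the two distinct messages
reported = {
    (0, 3, 3): 192,
    (114, 4, 56): 72128,
}

for (cn, tn, sn), bn in reported.items():
    computed = chs_to_block(cn, tn, sn, TRACKS_PER_CYL, SECTORS_PER_TRACK)
    print(f"cn {cn} tn {tn} sn {sn}: computed {computed}, reported {bn}")
```

With the probe line's 16 heads the second triple would come out as 115220 instead, so the cn/tn/sn values should not be read against the drive's reported geometry without checking.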
State-Changed-From-To: feedback->closed 
State-Changed-By: sos 
State-Changed-When: Mon May 5 05:49:25 PDT 2003 
State-Changed-Why:  
Please try -current (5.1) as the chipset support has been 
changed significantly there. 

http://www.freebsd.org/cgi/query-pr.cgi?pr=42813 
>Unformatted:
