From nobody@FreeBSD.org  Sat Sep  9 07:07:35 2006
Return-Path: <nobody@FreeBSD.org>
Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125])
	by hub.freebsd.org (Postfix) with ESMTP id 30CC316A407
	for <freebsd-gnats-submit@FreeBSD.org>; Sat,  9 Sep 2006 07:07:35 +0000 (UTC)
	(envelope-from nobody@FreeBSD.org)
Received: from www.freebsd.org (www.freebsd.org [216.136.204.117])
	by mx1.FreeBSD.org (Postfix) with ESMTP id D6AB943D67
	for <freebsd-gnats-submit@FreeBSD.org>; Sat,  9 Sep 2006 07:07:34 +0000 (GMT)
	(envelope-from nobody@FreeBSD.org)
Received: from www.freebsd.org (localhost [127.0.0.1])
	by www.freebsd.org (8.13.1/8.13.1) with ESMTP id k8977YLO071602
	for <freebsd-gnats-submit@FreeBSD.org>; Sat, 9 Sep 2006 07:07:34 GMT
	(envelope-from nobody@www.freebsd.org)
Received: (from nobody@localhost)
	by www.freebsd.org (8.13.1/8.13.1/Submit) id k8977Y2x071601;
	Sat, 9 Sep 2006 07:07:34 GMT
	(envelope-from nobody)
Message-Id: <200609090707.k8977Y2x071601@www.freebsd.org>
Date: Sat, 9 Sep 2006 07:07:34 GMT
From: Michael Graziano <mikeg@bsd-box.net>
To: freebsd-gnats-submit@FreeBSD.org
Subject: bce driver - "Error mapping mbuf into TX chain!" [Tentative Patch]
X-Send-Pr-Version: www-2.3

>Number:         103059
>Category:       kern
>Synopsis:       [bce] [patch] "Error mapping mbuf into TX chain!" (tentative patch)
>Confidential:   no
>Severity:       serious
>Priority:       high
>Responsible:    vwe
>State:          closed
>Quarter:        
>Keywords:       
>Date-Required:  
>Class:          sw-bug
>Submitter-Id:   current-users
>Arrival-Date:   Sat Sep 09 07:10:20 GMT 2006
>Closed-Date:    Sat Jan 31 11:12:25 UTC 2009
>Last-Modified:  Sat Jan 31 11:12:25 UTC 2009
>Originator:     Michael Graziano
>Release:        6.1-RELEASE-p6
>Organization:
Invision.com, Inc.
>Environment:
FreeBSD walnut.invision.net 6.1-RELEASE-p6 FreeBSD 6.1-RELEASE-p6 #1: Thu Sep  7 19:40:58 EDT 2006     root@larch.invision.net:/usr/obj/usr/src/sys/walnut.invision.net  amd64

Hardware: Dell 2950 (SMP)
>Description:
Various types of network activity can provoke the following error from the
bce driver:

bce0: Error mapping mbuf into TX chain!

This error knocks the network device out and requires a reboot to restore
network functionality.

This has been discussed at length on the -STABLE list, but I was unable to
find other PRs on it.  See 
http://www.mail-archive.com/freebsd-stable@freebsd.org/msg80749.html for
lots of details.


Relevant $ FreeBSD $ tags (bce files):

if_bce.c:__FBSDID("$FreeBSD: src/sys/dev/bce/if_bce.c,v 1.2.4.2 2006/04/13 22:42:07 ps Exp $");
if_bcefw.h: * $FreeBSD: src/sys/dev/bce/if_bcefw.h,v 1.1.4.1 2006/04/12 18:09:27 ps Exp $
if_bcereg.h: * $FreeBSD: src/sys/dev/bce/if_bcereg.h,v 1.1.4.2 2006/05/04 07:31:56 scottl Exp $

>How-To-Repeat:
Various types of network activity can provoke the error.  

I am able to reproduce it reliably with fairly light HTTP traffic (webmin
login scree).  Others have been able to reproduce it by piping dd over SSH.

See email thread referenced in description.
>Fix:
This is rumored to be fixed in -STABLE.  If this is the case, the changes
should be merged to RELENG_6_1.  If you need a guinnea pig to verify the
fix, I can install a -STABLE kernel on walnut on Monday for testing purposes.


If this is not fixed in -STABLE, the patch below at least masks the problem
in my case (not sure if it's a real fix though - it seems like it should
just be pushing the problem back for a while)

--- if_bcereg.h~        Thu May  4 03:31:56 2006
+++ if_bcereg.h Fri Sep  8 13:43:26 2006
@@ -4602,7 +4602,7 @@
 
 #define BCE_TX_TIMEOUT                                 5
 
-#define BCE_MAX_SEGMENTS                               8
+#define BCE_MAX_SEGMENTS                               32
 #define BCE_DMA_ALIGN                                  8
 #define BCE_DMA_BOUNDARY                               0
>Release-Note:
>Audit-Trail:

From: Bill Moran <wmoran@collaborativefusion.com>
To: bug-followup@FreeBSD.org, mikeg@bsd-box.net
Cc: seklecki@collaborativefusion.com
Subject: kern/103059: [bce] [patch] "Error mapping mbuf into TX chain!"
 (tentative patch)
Date: Tue, 19 Sep 2006 13:50:30 -0400

 Working with a Dell 2950, I can confirm that this problem goes
 away when the system is upgrade to 6.1-STABLE
 
 uname -a
 FreeBSD  6.2-PRERELEASE FreeBSD 6.2-PRERELEASE #0: Tue Sep 19 11:11:50 UTC 2006     root@:/usr/obj/usr/src/sys/GENERIC  amd64
 
 This same system was exhibiting the mentioned problem frequently
 with 6.1-RELEASE.
 
 -- 
 Bill Moran
 Collaborative Fusion Inc.

From: Michael Graziano <mikeg@bsd-box.net>
To: Bill Moran <wmoran@collaborativefusion.com>
Cc: bug-followup@FreeBSD.org, seklecki@collaborativefusion.com
Subject: Re: kern/103059: [bce] [patch] "Error mapping mbuf into TX chain!"
 (tentative patch)
Date: Tue, 19 Sep 2006 21:22:46 -0400

 Good enough for me :) I'll drop a 6-STABLE kernel on it tomorrow  
 afternoon and verify
 
 -MG
 
 On Sep 19, 2006, at 1:50 PM, Bill Moran wrote:
 
 >
 > Working with a Dell 2950, I can confirm that this problem goes
 > away when the system is upgrade to 6.1-STABLE
 >
 > uname -a
 > FreeBSD  6.2-PRERELEASE FreeBSD 6.2-PRERELEASE #0: Tue Sep 19  
 > 11:11:50 UTC 2006     root@:/usr/obj/usr/src/sys/GENERIC  amd64
 >
 > This same system was exhibiting the mentioned problem frequently
 > with 6.1-RELEASE.
 >
 > -- 
 > Bill Moran
 > Collaborative Fusion Inc.
 
State-Changed-From-To: open->feedback 
State-Changed-By: bms 
State-Changed-When: Mon Sep 25 17:55:58 UTC 2006 
State-Changed-Why:  
Any joy here? I'm not familiar with the chip and don't have one, however, 
the last round of commits on RELENG_6 seem to extend locking for the 
vlan related registers. 

Perhaps this helps? 

http://www.freebsd.org/cgi/query-pr.cgi?pr=103059 

From: Peter Schuller <peter.schuller@infidyne.com>
To: bug-followup@freebsd.org,
 mikeg@bsd-box.net
Cc:  
Subject: Re: kern/103059: [bce] [patch] "Error mapping mbuf into TX chain!" (tentative patch)
Date: Sat, 27 Jan 2007 07:01:33 +0100

 I was setting up, or about to set up, two Dell 2950 servers with BCM5708 and 
 6.1 on them. Both servers survived "pkg_add -v -r cvsup-without-gui". Both 
 servers crashed within seconds on "pkg_add -v -r zsh".
 
 Both servers would crash within seconds trying to run cvsup. I also 
 experienced crashes during interactive shell work. I could however transfer 
 large amounts of data, seemingly without statistically making things worse.
 
 I do not have console access to them, so I cannot verify that I was seeing the 
 mbuf mapping error message, but I did verify through an at job that the 
 machine was definitely up and running even after network connectivity died.
 
 Both machines are now upgraded to 6.2 and so far the problem has not shown up. 
 Given old behavior I would have crashed them 50 times by now.
 
 -- 
 / Peter Schuller
 
 PGP userID: 0xE9758B7D or 'Peter Schuller <peter.schuller@infidyne.com>'
 Key retrieval: Send an E-Mail to getpgpkey@scode.org
 E-Mail: peter.schuller@infidyne.com Web: http://www.scode.org
 

From: Peter Schuller <peter.schuller@infidyne.com>
To: bug-followup@freebsd.org,
 mikeg@bsd-box.net
Cc:  
Subject: Re: kern/103059: [bce] [patch] "Error mapping mbuf into TX chain!" (tentative patch)
Date: Sat, 27 Jan 2007 07:55:00 +0100

 To be clear, s/crashed/seemingly crashed/. As far as I can tell there was 
 never any crash involved, just the NIC death.
 
 -- 
 / Peter Schuller
 
 PGP userID: 0xE9758B7D or 'Peter Schuller <peter.schuller@infidyne.com>'
 Key retrieval: Send an E-Mail to getpgpkey@scode.org
 E-Mail: peter.schuller@infidyne.com Web: http://www.scode.org
 
State-Changed-From-To: feedback->open 
State-Changed-By: linimon 
State-Changed-When: Sun Mar 2 06:01:13 UTC 2008 
State-Changed-Why:  
Feedback was received some time ago. 

http://www.freebsd.org/cgi/query-pr.cgi?pr=103059 
Responsible-Changed-From-To: freebsd-bugs->freebsd-net 
Responsible-Changed-By: vwe 
Responsible-Changed-When: Wed Jan 14 22:24:09 UTC 2009 
Responsible-Changed-Why:  

Over to maintainer(s). 

http://www.freebsd.org/cgi/query-pr.cgi?pr=103059 
State-Changed-From-To: open->closed 
State-Changed-By: vwe 
State-Changed-When: Sat Jan 31 11:09:15 UTC 2009 
State-Changed-Why:  
Michael, 
we think this issue is fixed by r159411 and r164327. If you think 
this issue is still one, please check your problem with a more recent 
release as 6.1 has been EOL'd. 


Responsible-Changed-From-To: freebsd-net->vwe 
Responsible-Changed-By: vwe 
Responsible-Changed-When: Sat Jan 31 11:09:15 UTC 2009 
Responsible-Changed-Why:  
track 

http://www.freebsd.org/cgi/query-pr.cgi?pr=103059 
>Unformatted:
