From nobody@FreeBSD.ORG  Sat Nov 11 20:49:38 2000
Return-Path: <nobody@FreeBSD.ORG>
Received: by hub.freebsd.org (Postfix, from userid 32767)
	id 014C437B479; Sat, 11 Nov 2000 20:49:37 -0800 (PST)
Message-Id: <20001112044937.014C437B479@hub.freebsd.org>
Date: Sat, 11 Nov 2000 20:49:37 -0800 (PST)
From: lew@lppi.com
Sender: nobody@FreeBSD.ORG
To: freebsd-gnats-submit@FreeBSD.org
Subject: procfs corruption in FreeBSD 4.1.1-REL (ps gives "bad namelist")
X-Send-Pr-Version: www-1.0

>Number:         22779
>Category:       kern
>Synopsis:       procfs corruption in FreeBSD 4.1.1-REL (ps gives "bad namelist")
>Confidential:   no
>Severity:       critical
>Priority:       high
>Responsible:    freebsd-bugs
>State:          closed
>Quarter:        
>Keywords:       
>Date-Required:  
>Class:          sw-bug
>Submitter-Id:   current-users
>Arrival-Date:   Sat Nov 11 20:50:01 PST 2000
>Closed-Date:    Sat Nov 11 22:43:38 PST 2000
>Last-Modified:  Sat Nov 11 22:44:48 PST 2000
>Originator:     Lew Payne
>Release:        4.1.1-RELEASE
>Organization:
Lew Payne Publishing, Inc.
>Environment:
FreeBSD nfs1.techhosting.net 4.1.1-RELEASE FreeBSD 4.1.1-RELEASE #0:
lewis@nfs1.techhosting.net:/usr/src/sys/compile/NFS  i386

>Description:
procfs seems to become corrupt over time.  This causes "ps" and "w" to fail with a "bad namelist" error.  Once this happens, it is impossible to manage processes; it is like managing the system with a blindfold on.  When it does, nothing fixes it: not a reboot, not re-genning the kernel, not even booting good old kernel.GENERIC.  Can someone please, please help me with this?  I'm also running NFS (some machines as clients, some as servers) on the boxes, and the problem doesn't seem to discriminate between servers and clients.
>How-To-Repeat:
Install 4.1.1-REL and these ports, then just let it run for a week:
bzip2-1.0.1             jpeg-6b                 mysql-client-3.22.32
gdbm-1.8.0              libtool-1.3.4           mysql-server-3.22.32
gettext-0.10.35         lynx-2.8.3.1            rsync-2.4.6
gmake-3.79.1            mtr-0.42
or just write me for telnet access into my cluster.

>Fix:
Reinstalling FreeBSD from scratch temporarily solves the problem, but after several days of uptime, the problem resurfaces.  Please help!

>Release-Note:
>Audit-Trail:
State-Changed-From-To: open->closed 
State-Changed-By: ps 
State-Changed-When: Sat Nov 11 22:43:38 PST 2000 
State-Changed-Why:  
User was deleting /dev/null and not realizing it. 
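
For anyone hitting the same symptom, the check below is a rough sketch of how to confirm the diagnosis above. It assumes Python is available on the host, which the report does not mention, and the function name describe_dev_null is made up for illustration. It reports whether /dev/null is still a character device. If the node has been deleted, the next shell redirection to /dev/null run as root will typically recreate it as a regular file.

```python
import os
import stat


def describe_dev_null(path="/dev/null"):
    """Classify the path as 'ok', 'missing', or 'not-a-device'.

    Hypothetical helper for illustration; not part of any FreeBSD tool.
    """
    try:
        # lstat so that a symlink put in place of the device is not followed
        st = os.lstat(path)
    except FileNotFoundError:
        return "missing"
    if stat.S_ISCHR(st.st_mode):
        # A healthy /dev/null is a character special file
        return "ok"
    # Regular file, symlink, directory, etc. where the device node should be
    return "not-a-device"


if __name__ == "__main__":
    print("/dev/null status:", describe_dev_null())
```

A result other than "ok" suggests looking for a script or cron job, run as root, that removes or renames its output file when that file is /dev/null.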

http://www.freebsd.org/cgi/query-pr.cgi?pr=22779 
>Unformatted:
