Newsgroups: comp.sys.apollo
Path: utzoo!utgpu!news-server.csri.toronto.edu!helios.physics.utoronto.ca!alchemy.chem.utoronto.ca!system
From: system@alchemy.chem.utoronto.ca (System Admin (Mike Peterson))
Subject: Re: TCP/IP hangup
Message-ID: <1991Feb13.162340.16774@alchemy.chem.utoronto.ca>
Organization: University of Toronto Chemistry Department
References: <9102111305.AA09778@apo.esiee.fr>
Date: Wed, 13 Feb 1991 16:23:40 GMT

In article <9102111305.AA09778@apo.esiee.fr> bonnetf@apo.esiee.fr (bonnet-franck) writes:
>We are in touble with our TCP/IP gateway machine.
>The problem is the following :
> Sometimes ( generally during the week-end ...) this machine
>seems to hang TCP/IP traffic without any logical reason. 
>   ... lots of stuff deleted ...
>I precise that this machine is also a big file server for all our students so 
>it is not easy to stop it at any time ... I know it is not very smart to do 
>this but we have no choice at this time.
>
>- Is it a known bug ?

Yes, but is partially patched by one of the SR10.2 Domain/OS patches.
Sorry I don't remember which one. Make sure you have the proper Ethernet
microcode (the SR10.2 or later version) on ALL your Ethernet nodes.

>- Does the 10.3 solve this trouble ?

No. We still see this at SR10.3 on a DN4500 running X and NFS (both heavy
(ab)users of TCP/IP). It used to hang once a month or so, but since SR10.3
and NFS, it is once a week. On our DN10000, it used to be once a
week, but as of SR10.3.p + NFS + USENET news, our MTBH (mean time between hangs)
is about 2 days. Our system is also a file server for all our users (150),
and when it dies, it is a big pain, but rebooting is the only solution I
know of.

>- WHAT CAN I DO ???                      

Complain to Apollo if you have a software support contract.
-- 
Mike Peterson, System Administrator, U/Toronto Department of Chemistry
E-mail: system@alchemy.chem.utoronto.ca
Tel: (416) 978-7094                  Fax: (416) 978-8775
