Coda File System

cannot start venus from time to time

From: Florian Schaefer <listbox_at_netego.de>
Date: Sun, 14 Sep 2003 14:57:09 +0200
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Hello,

over the last weeks the same thing happened here several times, venus
6.0.2 got a "fatal signal (11)". Every time I just did a "venus -init"
and everything worked again.

This time I tried to gather some information before wiping everything
with a new initialization.

This is what I got (sorry for the lengthy output, I didn't know what
could be helpful):

- ----------[ /usr/coda/etc/console ]-----------------

Date: Sun 09/14/2003

14:23:26 Coda Venus, version 6.0.2

14:23:27 /usr/coda/LOG size is 832000 bytes
14:23:27 /usr/coda/DATA size is 3322276 bytes
14:23:27 Loading RVM data
14:23:28 Last init was Wed Sep	3 16:17:53 2003
14:23:28 Last shutdown was clean
14:23:28 Starting RealmDB scan
14:23:28	Found 2 realms
14:23:28 starting VDB scan
14:23:28	5 volume replicas
14:23:28	3 replicated volumes
14:23:28	0 CML entries allocated
14:23:28	0 CML entries on free-list
14:23:28 starting FSDB scan (1250, 30000) (25, 75, 4)
14:23:30	582 cache files in table (27762 blocks)
14:23:30	668 cache files on free-list
14:23:30 starting HDB scan
14:23:30	0 hdb entries in table
14:23:30	0 hdb entries on free-list
14:23:30 Mounting root volume...
14:23:30 Venus starting...
14:23:30 /coda now mounted.

14:24:19 Fatal Signal (11); pid 620 becoming a zombie...
14:24:19 You may use gdb to attach to 620

- ----------[ /usr/coda/etc/venus.log ]-----------------

[ X(00) : 0000 : 14:23:27 ] Coda Venus, version 6.0.2
[ X(00) : 0000 : 14:23:27 ] Logfile initialized with LogLevel = 0 at Sun
Sep 14 14:23:27 2003

[ X(00) : 0000 : 14:23:27 ] E StatsInit()
[ X(00) : 0000 : 14:23:27 ] L StatsInit()
[ X(00) : 0000 : 14:23:28 ] BeginRvmFlush (1, 60, F)
[ X(00) : 0000 : 14:23:28 ] EndRvmFlush
[ X(00) : 0000 : 14:23:28 ] BeginRvmTruncate (1, 220, F)
[ X(00) : 0000 : 14:23:28 ] EndRvmTruncate
[ X(00) : 0000 : 14:23:30 ] BeginRvmFlush (1, 1276, F)
[ X(00) : 0000 : 14:23:30 ] EndRvmFlush
[ X(00) : 0000 : 14:23:30 ] BeginRvmTruncate (11, 1436, F)
[ X(00) : 0000 : 14:23:30 ] EndRvmTruncate
[ X(00) : 0000 : 14:23:30 ] E adv_daemon::adv_daemon: AdviceServer    

[ A(18) : 0000 : 14:23:30 ] adv_daemon::main()

[ H(07) : 0000 : 14:23:30 ] HDBDaemon about to sleep on hdbdaemon_sync

[ W(20) : 0000 : 14:23:30 ] FidToNodeid: called for volume root
(50328f88.ff000001)!!!

[ D(21) : 0000 : 14:23:35 ] WAITING(SRVRQ):

[ V(05) : 0000 : 14:23:35 ] userent::Connect:
ViceGetAttrPlusSHA(floyd.netego.de)
[ V(05) : 0000 : 14:23:35 ] userent::Connect: ViceGetAttrPlusSHA() -> 22
[ V(05) : 0000 : 14:23:35 ] userent::Connect: VGAPlusSHA_Supported -> 1

[ D(21) : 0000 : 14:23:35 ] WAIT OVER, elapsed = 24.3
[ D(21) : 0000 : 14:23:35 ] userent::Connect:
ViceGetAttrPlusSHA(floyd.netego.de)
[ D(21) : 0000 : 14:23:35 ] userent::Connect: ViceGetAttrPlusSHA() -> 22
[ D(21) : 0000 : 14:23:35 ] userent::Connect: VGAPlusSHA_Supported -> 1

[ T(01) : 0001 : 14:23:45 ] BeginRvmFlush (1, 5256, T)
[ T(01) : 0001 : 14:23:45 ] EndRvmFlush

[ W(20) : 0000 : 14:24:19 ] *****  FATAL SIGNAL (11) *****

- ----------[ backtrace, sorry no debug information ]-----------------

(gdb) bt
#0  0x4019c944 in sigsuspend () from /lib/libc.so.6
#1  0x080b7f38 in strcpy ()
#2  <signal handler called>
#3  0x08070518 in strcpy ()
#4  0x081324f0 in ?? ()
#5  0x080b1ad3 in strcpy ()
#6  0x080b6a16 in strcpy ()
#7  0x080acf25 in strcpy ()
#8  0x4007ce9a in Create_Process_Part2 () at lwp.c:796

- ----------[ rvmutl output ]-----------------

root_at_zini:~ # rvmutl 
* o /usr/coda/LOG
* status
Status of log:		 /usr/coda/LOG

  log created on:	 Wen Sep  3 2003 16:17:52.756569
  log created with:	 RVM Interface Version 1.3  7 Mar 1994
			 RVM Log Version  1.4 Oct 17, 1997 
			 RVM Statistics Version 1.1 8 Dec 1992
  status last written:	 Sun Sep 14 2003 14:27:29.170394
  last truncation:	 Sun Sep 14 2003 14:23:30.186930

  log head offset:	     461808

  log tail offset:	     467360
  log empty:		 false

  space used by records:       5552
  space available:	     824912
  status area size:	       1536
  total log size:	     832000

  first record number:	       1060
  last record number:	       1062
  first timestamp:	 null
  last	timestamp:	 null
  first trans. uname:	 null
  last	trans. uname:	 null

I tried a "recover" of the LOG file and "i /usr/coda/LOG 20M" but both
hadn't helped me any further.

This is all the information I got, if more is required I will have to
wait for the next crash. ;-)
Can anyone tell me what is going wrong here?

Ciao
Florian

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.2 (GNU/Linux)
Comment: Get my DSA key from: www.netego.de/hpc?p=download&l=en

iD8DBQE/ZGWk+2lxodi1OoURAoFdAKCEJI1XycEwFtG14irSIunt9iM/9ACggtoz
3JRatPM0W5C2IaBe8TVcLXs=
=cCMZ
-----END PGP SIGNATURE-----
Received on 2003-09-14 09:01:25