Coda File System

Codasrv crash - Reason?

From: Markus Wiesecke <wiesecke_at_uni-bielefeld.de>
Date: Tue, 07 Sep 2004 16:26:35 +0200
Hello,

I am running a codaserver which I have inherited from a colleage leaving
the group. Thus I am not very experienced with this stuff. 
>From team to time, the codasrv crashes, and I do not see any reason for
I, for example tonight. The CodaSrv-Process was still running, but no
longer answering queries.
The LogFile from the time reads as follows (sorry if I am pasting to
much, but I do not want to miss the point):

18:06:33 Starting SmonDaemon timer
18:08:37 VGetVnode: vnode 1000004.3d70 is not allocated
18:08:38 VGetVnode: vnode 1000004.3d70 is not allocated
18:08:38 VGetVnode: vnode 1000004.4890 is not allocated
18:08:38 VGetVnode: vnode 1000004.471c is not allocated
18:08:38 VGetVnode: vnode 1000004.479a is not allocated
18:08:38 VGetVnode: vnode 1000004.479a is not allocated
18:08:39 VGetVnode: vnode 1000004.47da is not allocated
18:08:39 VGetVnode: vnode 1000004.47da is not allocated
18:09:17 Unbinding RPC2 connection 10373
18:09:17 Worker1: Unbinding RPC connection 9459
18:09:17 VGetVnode: vnode 1000004.896 is not allocated
18:09:18 VGetVnode: vnode 1000004.38dc is not allocated
18:13:17 Unbinding RPC2 connection 835
18:17:17 Unbinding RPC2 connection 12137
18:17:22 Callback failed RPC2_DEAD (F) for ws 129.70.138.34:2430
18:17:22 Unbinding RPC2 connection 15064
18:17:22 Unbinding RPC2 connection 7681
18:17:22 Unbinding RPC2 connection 6572
18:21:22 Unbinding RPC2 connection 14910
18:25:22 Unbinding RPC2 connection 16031
18:29:22 Unbinding RPC2 connection 2426
18:30:01 Worker2: Unbinding RPC connection 2998
18:30:01 Worker4: Unbinding RPC connection 14161
18:30:30 Worker1: Unbinding RPC connection 4661
18:30:30 Worker1: Unbinding RPC connection 16204
18:30:31 Worker4: Unbinding RPC connection 13531
18:30:31 ****** WARNING entry at 0x8169cb0 already has deqing set!

18:30:31 VGetVnode: memory vnode was snatched away
18:30:31 VGetVnode: memory vnode was snatched away
18:30:31 VGetVnode: vnode 1000004.4816 is not allocated
18:30:31 Worker0: Unbinding RPC connection 12040
18:30:31 Worker2: Unbinding RPC connection 4647
18:30:32 Worker3: Unbinding RPC connection 9453
18:30:32 Worker3: Unbinding RPC connection 10106
18:30:32 VGetVnode: memory vnode was snatched away
18:30:32 VGetVnode: memory vnode was snatched away
18:30:33 CheckRemoveSemantics: 1000004.5.3, VCP error (198)
18:30:33 Entering VFlushVnode for vnode 0x5
18:30:33 VGetVnode: vnode 1000004.48cc is not allocated
18:30:34 Worker1: Unbinding RPC connection 4335
18:30:34 Worker1: Unbinding RPC connection 3127
18:30:35 VGetVnode: memory vnode was snatched away
18:30:35 VGetVnode: memory vnode was snatched away
18:30:35 Worker3: Unbinding RPC connection 3000
18:30:37 VGetVnode: vnode 1000004.475c is not allocated
18:30:37 ****** WARNING entry at 0x8169cb0 already has deqing set!

18:37:22 Callback failed RPC2_NAKED (F) for ws 129.70.139.98:32811
18:37:22 Unbinding RPC2 connection 6351
18:37:22 Unbinding RPC2 connection 703
18:37:22 Unbinding RPC2 connection 16197
18:37:22 Unbinding RPC2 connection 12497
18:37:22 Unbinding RPC2 connection 12717
18:37:22 Unbinding RPC2 connection 16041
18:37:23 Callback failed RPC2_NAKED (F) for ws 129.70.139.45:2430
18:37:23 Unbinding RPC2 connection 6302
18:37:23 Unbinding RPC2 connection 11380
18:37:23 Unbinding RPC2 connection 9748
18:37:23 Unbinding RPC2 connection 13578
18:37:23 Unbinding RPC2 connection 3662
18:37:23 Unbinding RPC2 connection 10411
18:57:24 Callback failed RPC2_NAKED (F) for ws 129.70.139.61:32771
18:57:24 Unbinding RPC2 connection 887
18:57:24 Unbinding RPC2 connection 9650
18:57:24 Unbinding RPC2 connection 11319
18:57:24 Unbinding RPC2 connection 16090
18:57:24 Unbinding RPC2 connection 5157
19:06:33 SmonDaemon timer expired
19:06:33 Entered CheckRVMResStat
19:06:33 Starting SmonDaemon timer
20:06:33 SmonDaemon timer expired
20:06:33 Entered CheckRVMResStat
20:06:33 Starting SmonDaemon timer
21:06:33 SmonDaemon timer expired
21:06:33 Entered CheckRVMResStat
21:06:33 Starting SmonDaemon timer
22:06:33 SmonDaemon timer expired
22:06:33 Entered CheckRVMResStat
22:06:33 Starting SmonDaemon timer
23:06:33 SmonDaemon timer expired
23:06:33 Entered CheckRVMResStat
23:06:33 Starting SmonDaemon timer

Date: Tue 09/07/2004

00:01:25 Callback failed RPC2_NAKED (F) for ws 129.70.139.44:2430
00:01:25 Unbinding RPC2 connection 14135
00:01:25 Unbinding RPC2 connection 526
00:01:25 Unbinding RPC2 connection 2178
00:01:25 Unbinding RPC2 connection 8101
00:01:25 Unbinding RPC2 connection 16309
00:01:25 Unbinding RPC2 connection 15611
00:01:27 Callback failed RPC2_NAKED (F) for ws 129.70.139.164:2430
00:01:27 Unbinding RPC2 connection 9048
00:01:27 Unbinding RPC2 connection 13632
00:01:27 Unbinding RPC2 connection 10905
00:01:27 Unbinding RPC2 connection 1433
00:01:27 Unbinding RPC2 connection 13859
00:01:27 Unbinding RPC2 connection 8083
00:01:27 Callback failed RPC2_NAKED (F) for ws 129.70.139.75:2430
00:01:27 Unbinding RPC2 connection 4059
00:01:27 Unbinding RPC2 connection 5977
00:01:27 Unbinding RPC2 connection 8838
00:01:27 Unbinding RPC2 connection 3734
00:01:27 Unbinding RPC2 connection 2072
00:01:27 Unbinding RPC2 connection 4220
00:01:27 Unbinding RPC2 connection 695
00:01:27 Unbinding RPC2 connection 7908
00:01:27 Callback failed RPC2_NAKED (F) for ws 129.70.139.166:2430
00:01:27 Unbinding RPC2 connection 924
00:01:27 Unbinding RPC2 connection 4534
00:01:27 Unbinding RPC2 connection 183
00:01:27 Unbinding RPC2 connection 9683
00:01:27 Unbinding RPC2 connection 16065
00:01:27 Unbinding RPC2 connection 8610
00:01:27 Unbinding RPC2 connection 8702
00:06:33 SmonDaemon timer expired
00:06:33 Entered CheckRVMResStat
00:06:33 Starting SmonDaemon timer
01:06:33 SmonDaemon timer expired
01:06:33 Entered CheckRVMResStat
02:06:33 SmonDaemon timer expired
02:06:33 Entered CheckRVMResStat
02:06:33 Starting SmonDaemon timer
03:06:33 SmonDaemon timer expired
03:06:33 Entered CheckRVMResStat
03:06:33 Starting SmonDaemon timer
04:06:33 SmonDaemon timer expired
04:06:33 Entered CheckRVMResStat
04:06:33 Starting SmonDaemon timer
05:06:33 SmonDaemon timer expired
05:06:33 Entered CheckRVMResStat
05:06:33 Starting SmonDaemon timer
06:06:33 SmonDaemon timer expired
06:06:33 Entered CheckRVMResStat
06:06:33 Starting SmonDaemon timer
07:06:33 SmonDaemon timer expired
07:06:33 Entered CheckRVMResStat
07:06:33 Starting SmonDaemon timer
08:06:33 SmonDaemon timer expired
08:06:33 Entered CheckRVMResStat
08:06:33 Starting SmonDaemon timer
08:18:00 Shutdown received
08:18:17 Shutting down the File Server Tue Sep  7 08:18:17 2004

The Shutdown was initiaded by the "codasrv.init stop" script this
morning.

Can you see any reason for the crash? The IP 129.70.138.34 belongs to a
laptop, which I suppose that it was shut down at the time the RPC2_DEAD
appeared in the logs - may this be a reason for a crash?

Greetings and thanks in advance
Markus
-- 
Universitšt Bielefeld                        Raum: M6-111
Technische Fakultšt                 Telefon:0521.106-2949 
AG Angewandte Informatik           Telefax: 0521.106-2992
Dipl.-Inform. Markus Wiesecke        Mobil: 0172.521 3093 
33594 Bielefeld  Email: mwieseck_at_techfak.uni-bielefeld.de
Received on 2004-09-07 10:28:26