From ajk@tesla.physics.purdue.edu Fri Oct 30 18:49:09 1998
Received: from tesla.physics.purdue.edu (tesla.physics.purdue.edu [128.210.146.199])
        by hub.freebsd.org (8.8.8/8.8.8) with ESMTP id SAA29969
        for ; Fri, 30 Oct 1998 18:49:08 -0800 (PST)
        (envelope-from ajk@tesla.physics.purdue.edu)
Received: (from root@localhost)
        by tesla.physics.purdue.edu (8.9.1/8.9.1) id CAA02366;
        Sat, 31 Oct 1998 02:49:16 -0500 (EST)
        (envelope-from ajk)
Message-Id: <199810310749.CAA02366@tesla.physics.purdue.edu>
Date: Sat, 31 Oct 1998 02:49:16 -0500 (EST)
From: ajk@purdue.edu
Reply-To: ajk@purdue.edu
To: FreeBSD-gnats-submit@freebsd.org
Subject: Another nfsd panic
X-Send-Pr-Version: 3.2

>Number:         8515
>Category:       kern
>Synopsis:       Another nfsd panic
>Confidential:   no
>Severity:       critical
>Priority:       high
>Responsible:    freebsd-bugs
>State:          closed
>Quarter:
>Keywords:
>Date-Required:
>Class:          sw-bug
>Submitter-Id:   current-users
>Arrival-Date:   Fri Oct 30 18:50:00 PST 1998
>Closed-Date:    Fri Nov 13 01:51:24 PST 1998
>Last-Modified:  Fri Nov 13 01:53:00 PST 1998
>Originator:     Andrew J. Korty
>Release:        FreeBSD 3.0-RELEASE i386
>Organization:
Purdue University Physics Department
>Environment:

FreeBSD tesla.physics.purdue.edu 3.0-RELEASE FreeBSD 3.0-RELEASE #4: Thu Oct 29 16:26:03 EST 1998 root@tesla.physics.purdue.edu:/usr/src/sys/compile/TESLA i386

>Description:

Kernel panics in nfsd. For all I know, it could only happen on
machines running SMP, but I don't think so. I did the following
stack trace with an a.out GDB from an old -CURRENT. Btw, how are
we supposed to debug kernels without an a.out GDB?

# /sys/compile/TESLA 501 # /tmp/gdb -k kernel.debug /var/crash/vmcore.0
GDB is free software and you are welcome to distribute copies of it
 under certain conditions; type "show copying" to see the conditions.
There is absolutely no warranty for GDB; type "show warranty" for details.
GDB 4.16 (i386-unknown-freebsd), Copyright 1996 Free Software Foundation, Inc...
IdlePTD 3182592
initial pcb at 29a20c
panicstr: from debugger
panic messages:
---
Fatal trap 12: page fault while in kernel mode
mp_lock = 00000002; cpuid = 0; lapic.id = 01000000
fault virtual address   = 0xf1148b74
fault code              = supervisor read, page not present
instruction pointer     = 0x8:0xf014e018
stack pointer           = 0x10:0xf9c5ed90
frame pointer           = 0x10:0xf9c5edb0
code segment            = base 0x0, limit 0xfffff, type 0x1b
                        = DPL 0, pres 1, def32 1, gran 1
processor eflags        = interrupt enabled, resume, IOPL = 0
current process         = 128 (nfsd)
interrupt mask          = <- SMP: XXX
panic: from debugger
mp_lock = 00000002; cpuid = 0; lapic.id = 01000000
boot() called on cpu#0

syncing disks... 13 13 6 done

dumping to dev 20401, offset 1067920
dump 512 511 510 509 508 507 506 505 504 503 502 501 500 499 498 497 496 495 494 493 492 491 490 489 488 487 486 485 484 483 482 481 480 479 478 477 476 475 474 473 472 471 470 469 468 467 466 465 464 463 462 461 460 459 458 457 456 455 454 453 452 451 450 449 448 447 446 445 444 443 442 441 440 439 438 437 436 435 434 433 432 431 430 429 428 427 426 425 424 423 422 421 420 419 418 417 416 415 414 413 412 411 410 409 408 407 406 405 404 403 402 401 400 399 398 397 396 395 394 393 392 391 390 389 388 387 386 385 384 383 382 381 380 379 378 377 376 375 374 373 372 371 370 369 368 367 366 365 364 363 362 361 360 359 358 357 356 355 354 353 352 351 350 349 348 347 346 345 344 343 342 341 340 339 338 337 336 335 334 333 332 331 330 329 328 327 326 325 324 323 322 321 320 319 318 317 316 315 314 313 312 311 310 309 308 307 306 305 304 303 302 301 300 299 298 297 296 295 294 293 292 291 290 289 288 287 286 285 284 283 282 281 280 279 278 277 276 275 274 273 272 271 270 269 268 267 266 265 264 263 262 261 260 259 258 257 256 255 254 253 252 251 250 249 248 247 246 245 244 243 242 241 240 239 238 237 236 235 234 233 232 231 230 229 228 227 226 225 224 223 222 221 220 219 218 217 216 215 214 213 212 211 210 209 208 207 206 205 204 203 202 201 200 199 198 197 196 195 194 193 192 191 190 189 188 187 186 185 184 183 182 181 180 179 178 177 176 175 174 173 172 171 170 169 168 167 166 165 164 163 162 161 160 159 158 157 156 155 154 153 152 151 150 149 148 147 146 145 144 143 142 141 140 139 138 137 136 135 134 133 132 131 130 129 128 127 126 125 124 123 122 121 120 119 118 117 116 115 114 113 112 111 110 109 108 107 106 105 104 103 102 101 100 99 98 97 96 95 94 93 92 91 90 89 88 87 86 85 84 83 82 81 80 79 78 77 76 75 74 73 72 71 70 69 68 67 66 65 64 63 62 61 60 59 58 57 56 55 54 53 52 51 50 49 48 47 46 45 44 43 42 41 40 39 38 37 36 35 34 33 32 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1
---
#0  boot (howto=256) at ../../kern/kern_shutdown.c:268
268             dumppcb.pcb_cr3 = rcr3();
(kgdb) bt
#0  boot (howto=256) at ../../kern/kern_shutdown.c:268
#1  0xf0151c08 in panic (fmt=0xf012220c "from debugger") at ../../kern/kern_shutdown.c:430
#2  0xf012222d in db_panic (addr=-267067368, have_addr=0, count=-1, modif=0xf9c5ec08 "") at ../../ddb/db_command.c:432
#3  0xf01220f5 in db_command (last_cmdp=0xf027f804, cmd_table=0xf027f664, aux_cmd_tablep=0xf0297468) at ../../ddb/db_command.c:332
#4  0xf01222a2 in db_command_loop () at ../../ddb/db_command.c:454
#5  0xf0124f1b in db_trap (type=12, code=0) at ../../ddb/db_trap.c:71
#6  0xf022953a in kdb_trap (type=12, code=0, regs=0xf9c5ed54) at ../../i386/i386/db_interface.c:157
#7  0xf023dc9d in trap_fatal (frame=0xf9c5ed54) at ../../i386/i386/trap.c:874
#8  0xf023d699 in trap_pfault (frame=0xf9c5ed54, usermode=0) at ../../i386/i386/trap.c:772
#9  0xf023d26b in trap (frame={tf_es = -265814000, tf_ds = -107675632, tf_edi = 0, tf_esi = -265796656, tf_ebp = -104469072, tf_isp = -104469124, tf_ebx =
72, tf_edx = -250556416, tf_ecx = 1, tf_eax = -250311820, tf_trapno = 12, tf_err = 0, tf_eip = -267067368, tf_cs = 8, tf_eflags = 66054, tf_esp = 72, tf_ss = -241240320}) at ../../i386/i386/trap.c:396
#10 0xf014e018 in free (addr=0x0, type=0xf02843d0) at ../../kern/kern_malloc.c:269
#11 0xf01d2b0e in nfsrv_dorec (slp=0xf1971100, nfsd=0xf1793000, ndp=0xf9c5ee20) at ../../nfs/nfs_socket.c:2235
#12 0xf01d7094 in nfssvc_nfsd (nsd=0xf9c5ee7c, argp=0x8074ffccannot read proc at 0
) at ../../nfs/nfs_syscalls.c:537
#13 0xf01d6c59 in nfssvc (p=0xf9c46880, uap=0xf9c5ef84) at ../../nfs/nfs_syscalls.c:342
#14 0xf023dfd8 in syscall (frame={tf_es = 39, tf_ds = 39, tf_edi = 6, tf_esi = 0, tf_ebp = -272638520, tf_isp = -104468524, tf_ebx = 0, tf_edx = -272638920, tf_ecx = 0, tf_eax = 155, tf_trapno = 12, tf_err = 2, tf_eip = 134519552, tf_cs = 31, tf_eflags = 646, tf_esp = -272638916, tf_ss = 39}) at ../../i386/i386/trap.c:1031
#15 0xf022a23c in Xint0x80_syscall ()
cannot read proc at 0
(kgdb) frame 10
#10 0xf014e018 in free (addr=0x0, type=0xf02843d0) at ../../kern/kern_malloc.c:269
269             kup = btokup(addr);
(kgdb) list
264     #ifdef DIAGNOSTIC
265             if ((char *)addr < kmembase || (char *)addr >= kmemlimit) {
266                     panic("free: address %p out of range", (void *)addr);
267             }
268     #endif
269             kup = btokup(addr);
270             size = 1 << kup->ku_indx;
271             kbp = &bucket[kup->ku_indx];
272             s = splmem();
273     #ifdef DIAGNOSTIC
(kgdb) frame 11
#11 0xf01d2b0e in nfsrv_dorec (slp=0xf1971100, nfsd=0xf1793000, ndp=0xf9c5ee20) at ../../nfs/nfs_socket.c:2235
2235                    FREE(nam, M_SONAME);
(kgdb) list
2230            nd->nd_md = nd->nd_mrep = m;
2231            nd->nd_nam2 = nam;
2232            nd->nd_dpos = mtod(m, caddr_t);
2233            error = nfs_getreq(nd, nfsd, TRUE);
2234            if (error) {
2235                    FREE(nam, M_SONAME);
2236                    free((caddr_t)nd, M_NFSRVDESC);
2237                    return (error);
2238            }
2239            *ndp = nd;
(kgdb) frame 12
#12 0xf01d7094 in nfssvc_nfsd (nsd=0xf9c5ee7c, argp=0x8074ffccannot read proc at 0
) at ../../nfs/nfs_syscalls.c:537
537                     error = nfsrv_dorec(slp, nfsd, &nd);
(kgdb) list
532                             nfsrv_rcv(slp->ns_so, (caddr_t)slp,
533                                     M_WAIT);
534                             nfs_sndunlock(&slp->ns_solock,
535                                     &slp->ns_solock);
536                     }
537                     error = nfsrv_dorec(slp, nfsd, &nd);
538                     cur_usec = nfs_curusec();
539                     if (error && slp->ns_tq.lh_first &&
540                         slp->ns_tq.lh_first->nd_time <= cur_usec) {
541                             error = 0;
(kgdb) frame 13
#13 0xf01d6c59 in nfssvc (p=0xf9c46880, uap=0xf9c5ef84) at ../../nfs/nfs_syscalls.c:342
342                     error = nfssvc_nfsd(nsd, uap->argp, p);
(kgdb) list
337                             }
338                         }
339                     }
340                     if ((uap->flag & NFSSVC_AUTHINFAIL) && (nfsd = nsd->nsd_nfsd))
341                             nfsd->nfsd_flag |= NFSD_AUTHFAIL;
342                     error = nfssvc_nfsd(nsd, uap->argp, p);
343             }
344     #endif /* NFS_NOSERVER */
345             if (error == EINTR || error == ERESTART)
346                     error = 0;
(kgdb) frame 14
#14 0xf023dfd8 in syscall (frame={tf_es = 39, tf_ds = 39, tf_edi = 6, tf_esi = 0, tf_ebp = -272638520, tf_isp = -104468524, tf_ebx = 0, tf_edx = -272638920, tf_ecx = 0, tf_eax = 155, tf_trapno = 12, tf_err = 2, tf_eip = 134519552, tf_cs = 31, tf_eflags = 646, tf_esp = -272638916, tf_ss = 39}) at ../../i386/i386/trap.c:1031
1031            error = (*callp->sy_call)(p, args);
(kgdb) list
1026            p->p_retval[0] = 0;
1027            p->p_retval[1] = frame.tf_edx;
1028
1029            STOPEVENT(p, S_SCE, callp->sy_narg);
1030
1031            error = (*callp->sy_call)(p, args);
1032
1033            switch (error) {
1034
1035            case 0:
(kgdb) frame 15
#15 0xf022a23c in Xint0x80_syscall ()
(kgdb) list
1036                    /*
1037                     * Reinitialize proc pointer `p' as it may be different
1038                     * if this is a child returning from fork syscall.
1039                     */
1040                    p = curproc;
1041                    frame.tf_eax = p->p_retval[0];
1042                    frame.tf_edx = p->p_retval[1];
1043                    frame.tf_eflags &= ~PSL_C;
1044                    break;
1045
(kgdb) quit

>How-To-Repeat:

Start the KDE on a machine that NFS-mounts your home directory off
the machine you want to panic.

>Fix:

Unknown.
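
One possible reading of the trace: frame 10 shows free() entered with
addr=0x0 from the FREE(nam, M_SONAME) call at nfs_socket.c:2235, so nam
was apparently NULL when nfs_getreq() returned an error. The userland
sketch below only models that error path with a NULL guard. It is not
the change committed to nfs_socket.c, which is not shown in this
report; free_if_set(), dorec_error_path() and guarded_frees are
hypothetical names made up for illustration.

```c
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical counter, only here so the behavior can be observed. */
static int guarded_frees = 0;

/*
 * Hypothetical stand-in for the kernel FREE() macro: release the
 * pointer only when it is set, so a NULL never reaches the allocator.
 */
static void
free_if_set(void *p)
{
	if (p != NULL) {
		free(p);
		guarded_frees++;
	}
}

/*
 * Hypothetical model of the error path at nfs_socket.c:2234-2238.
 * nam may be NULL (as the trace suggests); nd is always allocated.
 * Returns the error unchanged, or 0 when there was no error, in
 * which case the caller keeps ownership of both pointers.
 */
int
dorec_error_path(void *nam, void *nd, int error)
{
	if (error != 0) {
		free_if_set(nam);	/* guarded: nam may be NULL */
		free_if_set(nd);
		return (error);
	}
	return (0);
}
```

Calling dorec_error_path(NULL, malloc(16), 5) returns 5 and releases
only nd, which is the case the trace appears to hit.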
>Release-Note:
>Audit-Trail:
State-Changed-From-To: open->closed
State-Changed-By: dfr
State-Changed-When: Fri Nov 13 01:51:24 PST 1998
State-Changed-Why:
Fixed in revision 1.47 of nfs_socket.c
>Unformatted: