From nobody@FreeBSD.org Thu Jun 8 07:51:02 2006
Return-Path:
Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125])
    by hub.freebsd.org (Postfix) with ESMTP id 989D916DA48
    for ; Thu, 8 Jun 2006 05:21:19 +0000 (UTC)
    (envelope-from nobody@FreeBSD.org)
Received: from www.freebsd.org (www.freebsd.org [216.136.204.117])
    by mx1.FreeBSD.org (Postfix) with ESMTP id 6548B43D46
    for ; Thu, 8 Jun 2006 05:21:19 +0000 (GMT)
    (envelope-from nobody@FreeBSD.org)
Received: from www.freebsd.org (localhost [127.0.0.1])
    by www.freebsd.org (8.13.1/8.13.1) with ESMTP id k585LI9Q007908
    for ; Thu, 8 Jun 2006 05:21:18 GMT
    (envelope-from nobody@www.freebsd.org)
Received: (from nobody@localhost)
    by www.freebsd.org (8.13.1/8.13.1/Submit) id k585LIj9007907;
    Thu, 8 Jun 2006 05:21:18 GMT
    (envelope-from nobody)
Message-Id: <200606080521.k585LIj9007907@www.freebsd.org>
Date: Thu, 8 Jun 2006 05:21:18 GMT
From: Lars Köller
To: freebsd-gnats-submit@FreeBSD.org
Subject: alternate system clock dies
X-Send-Pr-Version: www-2.3

>Number: 98694
>Category: kern
>Synopsis: alternate system clock dies
>Confidential: no
>Severity: serious
>Priority: medium
>Responsible: remko
>State: closed
>Quarter:
>Keywords:
>Date-Required:
>Class: sw-bug
>Submitter-Id: current-users
>Arrival-Date: Thu Jun 08 08:00:35 GMT 2006
>Closed-Date: Sat Feb 09 11:48:39 UTC 2008
>Last-Modified: Sat Feb 09 11:48:39 UTC 2008
>Originator: Lars Köller
>Release: 5.5-RELEASE-p1, i386
>Organization:
University Bielefeld
>Environment:
FreeBSD xxxxxxx 5.5-RELEASE-p1 FreeBSD 5.5-RELEASE-p1 #1: Thu Jun 1 14:41:54 CEST 2006 yyyyyyyy :/opt/tmp/obj/usr/src/sys/PMX i386
>Description:
Server Hardware: FSC RX300 (Fujitsu Siemens Computers)

After an uptime of approx. 5 days I get the following message when running systat -vmstat:

The alternate system clock has died! Reverting to ``pigs'' display.

The "CPU states" line in top is also broken (all values are zero), as are the vmstat CPU values.

Everything runs well with FreeBSD 4.1 on the same hardware. BIOS, BMC, etc. are up to date.

The problem is critical for us, because Sophos PureMessage collects several statistical OS data of the server with vmstat, etc. These values are all broken after the "clock died".

A reboot fixes the problem for some days.

Kernel config is:

#
# PMX-Server kernel configuration file for FreeBSD/i386 SMP
#
include GENERIC
#
ident PMX
#
# To make an SMP kernel, the next line is needed
#
options SMP # Symmetric MultiProcessor Kernel
#
# asr old ioctls support, needed by raidutils
#
options ASR_COMPAT
#
#
#
options MAXSSIZ="(2000*1024*1024)"
options MAXDSIZ="(2000*1024*1024)"
options DFLDSIZ="(2000*1024*1024)"
>How-To-Repeat:
>Fix:
>Release-Note:
>Audit-Trail:

From: Lars Köller
To: bug-followup@FreeBSD.org, lars.koeller@uni-bielefeld.de
Cc:
Subject: Re: kern/98694: alternate system clock dies
Date: Fri, 09 Jun 2006 11:39:48 +0200

Some additional information:

I tried to start the system without ACPI (loader option), but then the kernel panics during probe.

Here is the dmesg.boot of the running system:

Copyright (c) 1992-2006 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
    The Regents of the University of California. All rights reserved.
FreeBSD 5.5-RELEASE-p1 #1: Thu Jun 1 14:41:54 CEST 2006
    root@pmx1.hrz.uni-bielefeld.de:/opt/tmp/obj/usr/src/sys/PMX
ACPI APIC Table:
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: Intel(R) Xeon(TM) CPU 2.40GHz (2400.10-MHz 686-class CPU)
  Origin = "GenuineIntel"  Id = 0xf27  Stepping = 7
  Features=0xbfebfbff
  Hyperthreading: 2 logical CPUs
real memory = 2147483648 (2048 MB)
avail memory = 2095931392 (1998 MB)
FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs
 cpu0 (BSP): APIC ID: 6
 cpu1 (AP): APIC ID: 0
 cpu2 (AP): APIC ID: 1
 cpu3 (AP): APIC ID: 7
ioapic0 irqs 0-15 on motherboard
ioapic1 irqs 16-31 on motherboard
npx0: on motherboard
npx0: INT 16 interface
acpi0: on motherboard
acpi0: Power Button (fixed)
Timecounter "ACPI-safe" frequency 3579545 Hz quality 1000
acpi_timer0: <32-bit timer at 3.579545MHz> port 0xf008-0xf00b on acpi0
cpu0: on acpi0
cpu1: on acpi0
cpu2: on acpi0
cpu3: on acpi0
acpi_button0: on acpi0
pcib0: on acpi0
pci0: on pcib0
pci0: at device 4.0 (no driver attached)
atapci0: port 0x1400-0x140f,0x376,0x170-0x177,0x3f6,0x1f0-0x1f7 at device 15.1 on pci0
ata0: channel #0 on atapci0
ata1: channel #1 on atapci0
ohci0: mem 0xf8021000-0xf8021fff irq 9 at device 15.2 on pci0
usb0: OHCI version 1.0, legacy support
usb0: SMM does not respond, resetting
usb0: on ohci0
usb0: USB revision 1.0
uhub0: (0x1166) OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 4 ports with 4 removable, self powered
isab0: at device 15.3 on pci0
isa0: on isab0
pcib1: on acpi0
pci1: on pcib1
pcib2: on acpi0
pci2: on pcib2
bge0: mem 0xfa000000-0xfa00ffff,0xfa010000-0xfa01ffff irq 18 at device 0.0 on pci2
miibus0: on bge0
brgphy0: on miibus0
brgphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseTX, 1000baseTX-FDX, auto
bge0: Ethernet address: 00:30:05:2f:dc:9f
bge1: mem 0xfa020000-0xfa02ffff,0xfa030000-0xfa03ffff irq 19 at device 0.1 on pci2
miibus1: on bge1
brgphy1: on miibus1
brgphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseTX, 1000baseTX-FDX, auto
bge1: Ethernet address: 00:30:05:2f:dc:a0
pcib3: on acpi0
pci3: on pcib3
asr0: mem 0xfc000000-0xfdffffff,0xfb000000-0xfbffffff,0xfa500000-0xfa5fffff irq 24 at device 8.0 on pci3
asr0: ADAPTEC 2010S FW Rev. FS11, 2 channel, 256 CCBs, Protocol I2O
pcib4: on acpi0
pci4: on pcib4
atkbdc0: port 0x64,0x60 irq 1 on acpi0
atkbd0: irq 1 on atkbdc0
kbd0 at atkbd0
psm0: irq 12 on atkbdc0
psm0: model IntelliMouse, device ID 3
fdc0: port 0x3f7,0x3f0-0x3f5 irq 6 drq 2 on acpi0
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
sio0: type 16550A, console
orm0: at iomem 0xc9000-0xcefff,0xc0000-0xc8fff on isa0
pmtimer0 on isa0
ppc0: parallel port not found.
sc0: at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
sio1: configured irq 3 not in bitmap of probed irqs 0
sio1: port may not be enabled
vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
Timecounters tick every 10.000 msec
acd0: DVDROM at ata1-master PIO4
ses0 at asr0 bus 0 target 8 lun 0
ses0: Fixed Processor SCSI-2 device
ses0: SAF-TE Compliant Device
da0 at asr0 bus 0 target 0 lun 0
da0: Fixed Direct Access SCSI-2 device
da0: Tagged Queueing Enabled
da0: 34700MB (71065600 512 byte sectors: 255H 63S/T 4423C)
da1 at asr0 bus 0 target 3 lun 0
da1: Fixed Direct Access SCSI-2 device
da1: Tagged Queueing Enabled
da1: 69400MB (142131200 512 byte sectors: 255H 63S/T 8847C)
SMP: AP CPU #3 Launched!
SMP: AP CPU #2 Launched!
SMP: AP CPU #1 Launched!
Mounting root from ufs:/dev/da0s1a
ipfw2 initialized, divert disabled, rule-based forwarding disabled, default to deny, logging disabled
Accounting enabled

State-Changed-From-To: open->feedback
State-Changed-By: remko
State-Changed-When: Mon Dec 25 20:41:28 UTC 2006
State-Changed-Why:
Hello, does this occur on 6.x as well? What happens if you use only the default GENERIC configuration? And what about an SMP kernel (without the extra options you have)?

Responsible-Changed-From-To: freebsd-bugs->remko
Responsible-Changed-By: remko
Responsible-Changed-When: Mon Dec 25 20:41:28 UTC 2006
Responsible-Changed-Why:
Grab the PR to trace the feedback.

http://www.freebsd.org/cgi/query-pr.cgi?pr=98694

From: Lars Köller
To: bug-followup@FreeBSD.org, lars.koeller@uni-bielefeld.de
Cc:
Subject: Re: kern/98694: alternate system clock dies
Date: Tue, 09 Jan 2007 16:52:42 +0100

Hi,

all tests lead to the same result. When I opened the case, there was no option to check 6.1, because it was not supported by Sophos (the machines run PureMessage). Since the support matrix has changed and we will have a test machine at hand soon, we will give it a try.

Best regards

Lars

--
Dr. Lars Köller \ Tel: +49 521/106-4964, FAX: +49 521/106-154964
Systeme und \ http://www.uni-bielefeld.de/hrz/raum/plan.htm
Serverdienste \ mailto://lars.koeller@uni-bielefeld.de

State-Changed-From-To: feedback->open
State-Changed-By: linimon
State-Changed-When: Tue Jun 12 07:40:26 UTC 2007
State-Changed-Why:
The problem seems to still exist.

http://www.freebsd.org/cgi/query-pr.cgi?pr=98694

From: Lars Köller
To: bug-followup@FreeBSD.org, lars.koeller@uni-bielefeld.de
Cc:
Subject: Re: kern/98694: alternate system clock dies
Date: Tue, 12 Jun 2007 10:23:56 +0200

Hello,

for several weeks now we have been running the supported FreeBSD 6.1 version, and the problem with the clock has vanished on the FSC RX300S1 servers. So this PR can be closed.

Thanks

Lars

--
Dr. Lars Köller, Room: V0-318, Tel: +49 521/106-4964, FAX: -154964
http://www.uni-bielefeld.de/hrz/raum/plan.htm

State-Changed-From-To: open->closed
State-Changed-By: remko
State-Changed-When: Sat Feb 9 11:48:37 UTC 2008
State-Changed-Why:
Lars reported in 2007 that we can close this ticket.

http://www.freebsd.org/cgi/query-pr.cgi?pr=98694
>Unformatted: