From nobody@FreeBSD.org Wed Mar 29 22:56:14 2006 Return-Path: Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 0A73E16A422 for ; Wed, 29 Mar 2006 22:56:14 +0000 (UTC) (envelope-from nobody@FreeBSD.org) Received: from www.freebsd.org (www.freebsd.org [216.136.204.117]) by mx1.FreeBSD.org (Postfix) with ESMTP id B888A43D45 for ; Wed, 29 Mar 2006 22:56:13 +0000 (GMT) (envelope-from nobody@FreeBSD.org) Received: from www.freebsd.org (localhost [127.0.0.1]) by www.freebsd.org (8.13.1/8.13.1) with ESMTP id k2TMuCOK047927 for ; Wed, 29 Mar 2006 22:56:12 GMT (envelope-from nobody@www.freebsd.org) Received: (from nobody@localhost) by www.freebsd.org (8.13.1/8.13.1/Submit) id k2TMuCaY047926; Wed, 29 Mar 2006 22:56:12 GMT (envelope-from nobody) Message-Id: <200603292256.k2TMuCaY047926@www.freebsd.org> Date: Wed, 29 Mar 2006 22:56:12 GMT From: Nigel Weeks To: freebsd-gnats-submit@FreeBSD.org Subject: System freeze irrespective of load on Promise FastTrak100 TX2pro IDE RAID X-Send-Pr-Version: www-2.3 >Number: 95087 >Category: i386 >Synopsis: [hang] System freeze irrespective of load on Promise FastTrak100 TX2pro IDE RAID >Confidential: no >Severity: serious >Priority: medium >Responsible: gavin >State: closed >Quarter: >Keywords: >Date-Required: >Class: sw-bug >Submitter-Id: current-users >Arrival-Date: Wed Mar 29 23:00:30 GMT 2006 >Closed-Date: Wed Mar 11 16:59:39 UTC 2009 >Last-Modified: Wed Mar 11 16:59:39 UTC 2009 >Originator: Nigel Weeks >Release: Release 6.0 >Organization: The Examiner Newspaper >Environment: FreeBSD 6.0-RELEASE #1: Wed Mar 29 15:35:39 EST 2006 nweeks@oken.opi-lex.rpl.local:/usr/obj/usr/src/sys/WWW >Description: The system seems to hang, with errors being written to console regarding ar0 (Promise IDE RAID controller), mentioning g_vfs_done errors. Background Information. DMESG: Copyright (c) 1992-2005 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 6.0-RELEASE #1: Wed Mar 29 15:35:39 EST 2006 nweeks@oken.opi-lex.rpl.local:/usr/obj/usr/src/sys/WWW Timecounter "i8254" frequency 1193182 Hz quality 0 CPU: Pentium III/Pentium III Xeon/Celeron (501.14-MHz 686-class CPU) Origin = "GenuineIntel" Id = 0x672 Stepping = 2 Features=0x387fbff real memory = 402587648 (383 MB) avail memory = 384348160 (366 MB) ACPI APIC Table: FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs cpu0 (BSP): APIC ID: 0 cpu1 (AP): APIC ID: 1 ioapic0 irqs 0-23 on motherboard npx0: [FAST] npx0: on motherboard npx0: INT 16 interface acpi0: on motherboard acpi0: Overriding SCI Interrupt from IRQ 9 to IRQ 20 ACPI-0438: *** Error: Looking up [\\_PR_.CPU0] in namespace, AE_NOT_FOUND SearchNode 0xc17c6260 StartNode 0xc17c6260 ReturnNode 0 acpi0: Power Button (fixed) pci_link0: irq 0 on acpi0 pci_link1: irq 0 on acpi0 pci_link2: irq 10 on acpi0 pci_link3: irq 11 on acpi0 Timecounter "ACPI-safe" frequency 3579545 Hz quality 1000 acpi_timer0: <24-bit timer at 3.579545MHz> port 0x4008-0x400b on acpi0 cpu0: on acpi0 cpu1: on acpi0 acpi_button0: on acpi0 pcib0: port 0xcf8-0xcff,0x4000-0x4041,0x5000-0x500f on ac pi0 pci0: on pcib0 agp0: mem 0xd0000000-0xd0ffffff at d evice 0.0 on pci0 pcib1: at device 1.0 on pci0 pci1: on pcib1 isab0: at device 7.0 on pci0 isa0: on isab0 atapci0: port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x37 6,0xf000-0xf00f at device 7.1 on pci0 ata0: on atapci0 ata1: on atapci0 uhci0: port 0xd000-0xd01f irq 19 at de vice 7.2 on pci0 uhci0: [GIANT-LOCKED] usb0: on uhci0 usb0: USB revision 1.0 uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub0: 2 ports with 2 removable, self powered pci0: at device 7.3 (no driver attached) pci0: at device 8.0 (no driver attached) dc0: <82c169 PNIC 10/100BaseTX> port 0xd400-0xd4ff mem 0xd3010000-0xd30100ff irq 18 at device 10.0 on pci0 miibus0: on dc0 bmtphy0: on miibus0 bmtphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto dc0: Ethernet address: 00:a0:cc:51:65:f5 atapci1: port 0xd800-0xd807,0xdc00-0xdc03, 0xe000-0xe007,0xe400-0xe403,0xe800-0xe80f mem 0xd3000000-0xd300ffff irq 19 at de vice 11.0 on pci0 ata2: on atapci1 ata3: on atapci1 acpi_tz0: on acpi0 ACPI-0438: *** Error: Looking up [\\_PR_.CPU0] in namespace, AE_NOT_FOUND SearchNode 0xc17c6260 StartNode 0xc17c6260 ReturnNode 0 acpi_tz0: _CRT value is absurd, ignored (500.6C) fdc0: port 0x3f2-0x3f5,0x3f7 irq 6 drq 2 on acpi0 fdc0: [FAST] fd0: <1440-KB 3.5" drive> on fdc0 drive 0 sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0 sio0: type 16550A sio1: <16550A-compatible COM port> port 0x2f8-0x2ff irq 3 on acpi0 sio1: type 16550A ppc0: port 0x378-0x37f,0x778-0x77b irq 7 drq 3 on ac pi0 ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode ppc0: FIFO with 16/16/16 bytes threshold ppbus0: on ppc0 plip0: on ppbus0 lpt0: on ppbus0 lpt0: Interrupt-driven port ppi0: on ppbus0 atkbdc0: port 0x60,0x64 irq 1 on acpi0 atkbd0: irq 1 on atkbdc0 kbd0 at atkbd0 atkbd0: [GIANT-LOCKED] pmtimer0 on isa0 orm0: at iomem 0xc0000-0xc7fff,0xc8000-0xcbfff,0xcc000-0xd47ff on isa0 sc0: at flags 0x100 on isa0 sc0: VGA <16 virtual consoles, flags=0x300> vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 Timecounters tick every 1.000 msec ipfw2 (+ipv6) initialized, divert loadable, rule-based forwarding disabled, defa ult to deny, logging unlimited ACPI-0438: *** Error: Looking up [\\_PR_.CPU0] in namespace, AE_NOT_FOUND SearchNode 0xc17c6260 StartNode 0xc17c6260 ReturnNode 0 acpi_tz0: _CRT value is absurd, ignored (500.6C) ACPI-0438: *** Error: Looking up [\\_PR_.CPU0] in namespace, AE_NOT_FOUND SearchNode 0xc17c6260 StartNode 0xc17c6260 ReturnNode 0 acpi_tz0: _CRT value is absurd, ignored (500.6C) acd0: CDROM at ata1-master PIO4 ad4: 38166MB at ata2-master UDMA100 ad6: 38166MB at ata3-master UDMA100 ar0: 38146MB status: READY ar0: disk0 READY (master) using ad6 at ata3-master ar0: disk1 READY (mirror) using ad4 at ata2-master SMP: AP CPU #1 Launched! Trying to mount root from ufs:/dev/ar0s1a WARNING: / was not properly dismounted WARNING: /tmp was not properly dismounted WARNING: /usr was not properly dismounted /usr: mount pending error: blocks 1004 files 224 WARNING: /var was not properly dismounted dc0: failed to force tx and rx to idle state dc0: failed to force tx and rx to idle state (This was immediately after the crash, hence the filesystem warnings...) `kldstat` shows kernel, and acpi.ko are loaded Kernel compiled with options SMP options IPFIREWALL options IPFIREWALL_VERBOSE options DUMMYNET options HZ=1000 It must be noted that freezes also occured on the non-SMP GENERIC kernel(before I'd done the recompile) Tried putting system under load to repeat problem, with the following: `cd /usr/src` `tar cfz ../hog.tgz *` (to simply put CPU and HDD under load) No effect. Put CPU and kernel process queue under more load with some simple c++: #include main() { while(1) { // Do a tight loop of nuthin } return 0; } and launched 50 of them - no effect Then re-launched tar as well to strain disk subsystem. `top` output now: last pid: 688; load averages: 52.06, 51.09, 39.89 up 0+00:42:09 09:31:26 80 processes: 53 running, 27 sleeping CPU states: 98.7% user, 0.0% nice, 0.9% system, 0.4% interrupt, 0.0% idle Mem: 24M Active, 250M Inact, 62M Wired, 15M Cache, 47M Buf, 17M Free Swap: 2048M Total, 2048M Free Next test: leave it idle for a while Well, I've had four coffees, and I'm getting tired of waiting...submitting this PR. >How-To-Repeat: Tried, tried real hard, but it just happens... >Fix: >Release-Note: >Audit-Trail: State-Changed-From-To: open->feedback State-Changed-By: gavin State-Changed-When: Wed Mar 11 15:52:59 UTC 2009 State-Changed-Why: To submitter: apologies for this PR sitting idle for so long. Do you know if this problem was ever resolved? If you still have this system and are still interested in the issue could you please retest with either 7.1-RELEASE or 6.4-RELEASE and see if they work for you? I believe there were several fixes to the controller chip you have after 6.0 was released which may have resolved this issue. Responsible-Changed-From-To: freebsd-i386->gavin Responsible-Changed-By: gavin Responsible-Changed-When: Wed Mar 11 15:52:59 UTC 2009 Responsible-Changed-Why: Track http://www.freebsd.org/cgi/query-pr.cgi?pr=95087 State-Changed-From-To: feedback->closed State-Changed-By: gavin State-Changed-When: Wed Mar 11 16:59:22 UTC 2009 State-Changed-Why: Mail to submitter bounces http://www.freebsd.org/cgi/query-pr.cgi?pr=95087 >Unformatted: