Last updated: dreamcraft, Sunday, May 20, 2012, 11:54:33
I swapped the CPU for a quad-core Q9650 and kept an eye on the system for a while.
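Last week, however, one of the four installed drives, a Seagate SATA 250GB HDD, suddenly could no longer be mounted and started producing DMA errors.
I tried various DMA modes, but it seems to be a timing issue: the drive sometimes mounts and sometimes does not. I tried to recover it through trial and error, but at this stage nothing worked.
Right at that moment, a time-sale notice arrived by email from Clevery in Akihabara. It listed a 2TB Western Digital HDD for 9,350 yen! I ordered it on impulse.
The 2TB HDD arrived last Saturday, so I immediately swapped it in for the Seagate 250GB HDD, put the 250GB HDD into a removable case, and booted.
To get the newly added Western Digital 2TB hard disk recognized, I went through /usr/sbin/sysinstall→Configure→FDISK→LABEL, setting up each in turn, and it was recognized without trouble.
As for the 250GB HDD, it was recognized and mounted properly, as if nothing had happened.
So what was wrong?
All I did with the 250GB drive was move its SATA connection over to the removable case, so what on earth happened?
In any case, I used rsync to copy the data on the 250GB HDD to the 2TB HDD.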
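A copy like this can be wrapped in a small helper. This is only a sketch: the function name copy_disk and the mount points in the example are hypothetical, and the exact rsync options used for the original copy are not recorded here.

```shell
# Copy the contents of one mounted filesystem into another with rsync.
# RSYNC may be overridden (for example RSYNC=echo) to print the command
# instead of running it. Plain POSIX sh, so src/dst are global variables.
copy_disk() {
    src=$1
    dst=$2
    # -a: archive mode (permissions, times, symlinks), -H: keep hard links,
    # -v: list files as they are copied.
    # The trailing slash on the source copies its contents, not the directory itself.
    ${RSYNC:-rsync} -aHv "$src/" "$dst/"
}

# Example with hypothetical mount points:
#   copy_disk /mnt/old250 /mnt/new2tb
```

Adding -n to the options would make rsync do a dry run first, which is a cheap way to confirm the source and destination are the right way round before copying from a drive that may be failing.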
Case closed. Or so I would like to say, but not knowing the cause bothers me.
So I investigated a little further.
First, the usual dmesg.
Copyright (c) 1992-2011 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
    The Regents of the University of California. All rights reserved.
FreeBSD is a registered trademark of The FreeBSD Foundation.
FreeBSD 8.2-RELEASE #0: Fri Feb 18 02:24:46 UTC 2011
    root@almeida.cse.buffalo.edu:/usr/obj/usr/src/sys/GENERIC i386
module cdce already present!
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: Intel(R) Core(TM)2 Quad CPU Q9650 @ 3.00GHz (2999.67-MHz 686-class CPU)
  Origin = "GenuineIntel" Id = 0x1067a Family = 6 Model = 17 Stepping = 10
  Features=0xbfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE>
  Features2=0x408e3fd<SSE3,DTES64,MON,DS_CPL,VMX,SMX,EST,TM2,SSSE3,CX16,xTPR,PDCM,SSE4.1,XSAVE>
  AMD Features=0x20100000<NX,LM>
  AMD Features2=0x1<LAHF>
  TSC: P-state invariant
real memory = 6442450944 (6144 MB)
avail memory = 3404795904 (3247 MB)
ACPI APIC Table: <A_M_I_ OEMAPIC >
FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs
FreeBSD/SMP: 1 package(s) x 4 core(s)
 cpu0 (BSP): APIC ID: 0
 cpu1 (AP): APIC ID: 1
 cpu2 (AP): APIC ID: 2
 cpu3 (AP): APIC ID: 3
ioapic0 <Version 2.0> irqs 0-23 on motherboard
kbd1 at kbdmux0
acpi0: <A_M_I_ OEMXSDT> on motherboard
acpi0: [ITHREAD]
acpi0: Power Button (fixed)
acpi0: reservation of fed08000, 1000 (3) failed
acpi0: reservation of fed1c000, 4000 (3) failed
acpi0: reservation of fed20000, 20000 (3) failed
acpi0: reservation of fed50000, 40000 (3) failed
acpi0: reservation of ffc00000, 200000 (3) failed
acpi0: reservation of fec00000, 1000 (3) failed
acpi0: reservation of fee00000, 1000 (3) failed
acpi0: reservation of e0000000, 10000000 (3) failed
acpi0: reservation of 0, a0000 (3) failed
acpi0: reservation of 100000, cff00000 (3) failed
Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000
acpi_timer0: <24-bit timer at 3.579545MHz> port 0x808-0x80b on acpi0
cpu0: <ACPI CPU> on acpi0
ACPI Warning: Incorrect checksum in table [OEMB] - 0xF7, should be 0xEE (20101013/tbutils-354)
cpu1: <ACPI CPU> on acpi0
cpu2: <ACPI CPU> on acpi0
cpu3: <ACPI CPU> on acpi0
pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff on acpi0
pci0: <ACPI PCI bus> on pcib0
pcib1: <ACPI PCI-PCI bridge> irq 16 at device 6.0 on pci0
pci1: <ACPI PCI bus> on pcib1
vgapci0: <VGA-compatible display> port 0xb000-0xb0ff mem 0xd0000000-0xdfffffff,0xfe8e0000-0xfe8effff irq 16 at device 0.0 on pci1
pci1: <multimedia, HDA> at device 0.1 (no driver attached)
uhci0: <Intel 82801JI (ICH10) USB controller USB-D> port 0xa800-0xa81f irq 16 at device 26.0 on pci0
uhci0: [ITHREAD]
uhci0: LegSup = 0x2f00
usbus0: <Intel 82801JI (ICH10) USB controller USB-D> on uhci0
uhci1: <Intel 82801JI (ICH10) USB controller USB-E> port 0xa880-0xa89f irq 21 at device 26.1 on pci0
uhci1: [ITHREAD]
uhci1: LegSup = 0x2f00
usbus1: <Intel 82801JI (ICH10) USB controller USB-E> on uhci1
uhci2: <Intel 82801JI (ICH10) USB controller USB-F> port 0xac00-0xac1f irq 18 at device 26.2 on pci0
uhci2: [ITHREAD]
uhci2: LegSup = 0x2f00
usbus2: <Intel 82801JI (ICH10) USB controller USB-F> on uhci2
ehci0: <Intel 82801JI (ICH10) USB 2.0 controller USB-B> mem 0xfe7ffc00-0xfe7fffff irq 18 at device 26.7 on pci0
ehci0: [ITHREAD]
usbus3: EHCI version 1.0
usbus3: <Intel 82801JI (ICH10) USB 2.0 controller USB-B> on ehci0
pci0: <multimedia, HDA> at device 27.0 (no driver attached)
pcib2: <ACPI PCI-PCI bridge> irq 17 at device 28.0 on pci0
pci4: <ACPI PCI bus> on pcib2
pcib3: <ACPI PCI-PCI bridge> irq 17 at device 28.4 on pci0
pci3: <ACPI PCI bus> on pcib3
atapci0: <Marvell 88SX6121 UDMA133 controller> port 0xdc00-0xdc07,0xd880-0xd883,0xd800-0xd807,0xd480-0xd483,0xd400-0xd40f mem 0xfeaffc00-0xfeafffff irq 16 at device 0.0 on pci3
atapci0: [ITHREAD]
atapci1: <AHCI SATA controller> on atapci0
atapci1: [ITHREAD]
atapci1: AHCI v1.00 controller with 3 3Gbps ports, PM supported
ata2: <ATA channel 0> on atapci1
ata2: [ITHREAD]
ata3: <ATA channel 1> on atapci1
ata3: [ITHREAD]
ata4: <ATA channel 0> on atapci0
ata4: [ITHREAD]
pcib4: <ACPI PCI-PCI bridge> irq 16 at device 28.5 on pci0
pci2: <ACPI PCI bus> on pcib4
mskc0: <Marvell Yukon 88E8056 Gigabit Ethernet> port 0xc800-0xc8ff mem 0xfe9fc000-0xfe9fffff irq 17 at device 0.0 on pci2
msk0: <Marvell Technology Group Ltd. Yukon EC Ultra Id 0xb4 Rev 0x03> on mskc0
msk0: Ethernet address: 00:26:18:8f:e1:c2
miibus0: <MII bus> on msk0
e1000phy0: <Marvell 88E1149 Gigabit PHY> PHY 0 on miibus0
e1000phy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-master, 1000baseT-FDX, 1000baseT-FDX-master, auto, auto-flow
mskc0: [ITHREAD]
uhci3: <Intel 82801JI (ICH10) USB controller USB-A> port 0xa080-0xa09f irq 23 at device 29.0 on pci0
uhci3: [ITHREAD]
uhci3: LegSup = 0x2f00
usbus4: <Intel 82801JI (ICH10) USB controller USB-A> on uhci3
uhci4: <Intel 82801JI (ICH10) USB controller USB-B> port 0xa400-0xa41f irq 19 at device 29.1 on pci0
uhci4: [ITHREAD]
uhci4: LegSup = 0x2f00
usbus5: <Intel 82801JI (ICH10) USB controller USB-B> on uhci4
uhci5: <Intel 82801JI (ICH10) USB controller USB-C> port 0xa480-0xa49f irq 18 at device 29.2 on pci0
uhci5: [ITHREAD]
uhci5: LegSup = 0x2f00
usbus6: <Intel 82801JI (ICH10) USB controller USB-C> on uhci5
ehci1: <Intel 82801JI (ICH10) USB 2.0 controller USB-A> mem 0xfe7ff800-0xfe7ffbff irq 23 at device 29.7 on pci0
ehci1: [ITHREAD]
usbus7: EHCI version 1.0
usbus7: <Intel 82801JI (ICH10) USB 2.0 controller USB-A> on ehci1
pcib5: <ACPI PCI-PCI bridge> at device 30.0 on pci0
pci5: <ACPI PCI bus> on pcib5
pcm0: <Envy24HT audio (Generic)> port 0xec00-0xec1f,0xe880-0xe8ff irq 16 at device 0.0 on pci5
pcm0: [GIANT-LOCKED]
pcm0: [ITHREAD]
pcm0: system configuration
  SubVendorID: 0x1412, SubDeviceID: 0x2401
  XIN2 Clock Source: 49.152MHz(192kHz*256)
  MPU-401 UART(s) #: 1
  ADC #: 1
  DAC #: 3
  Multi-track converter type: I2S(48KHz support, 24bit resolution, ID#0x0)
  S/PDIF(IN/OUT): 1/1 ID# 0x00
  GPIO(mask/dir/state): 0xff/0xff/0xff
skc0: <Marvell Gigabit Ethernet> port 0xe400-0xe4ff mem 0xfebfc000-0xfebfffff irq 18 at device 2.0 on pci5
skc0: Marvell Yukon Lite Gigabit Ethernet rev. (0x9)
sk0: <Marvell Semiconductor, Inc. Yukon> on skc0
sk0: Ethernet address: 00:26:18:8f:e6:bf
miibus1: <MII bus> on sk0
e1000phy1: <Marvell 88E1011 Gigabit PHY> PHY 0 on miibus1
e1000phy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-master, 1000baseT-FDX, 1000baseT-FDX-master, auto
skc0: [ITHREAD]
fwohci0: <Lucent FW322/323> mem 0xfebfb000-0xfebfbfff irq 19 at device 3.0 on pci5
fwohci0: [ITHREAD]
fwohci0: OHCI version 1.0 (ROM=1)
fwohci0: No. of Isochronous channels is 8.
fwohci0: EUI64 00:1e:8c:00:01:fd:6e:6f
fwohci0: Phy 1394a available S400, 2 ports.
fwohci0: Link S400, max_rec 2048 bytes.
firewire0: <IEEE1394(FireWire) bus> on fwohci0
dcons_crom0: <dcons configuration ROM> on firewire0
dcons_crom0: bus_addr 0x177c000
fwe0: <Ethernet over FireWire> on firewire0
if_fwe0: Fake Ethernet address: 02:1e:8c:fd:6e:6f
fwe0: Ethernet address: 02:1e:8c:fd:6e:6f
fwip0: <IP over FireWire> on firewire0
fwip0: Firewire address: 00:1e:8c:00:01:fd:6e:6f @ 0xfffe00000000, S400, maxrec 2048
fwohci0: Initiate bus reset
fwohci0: fwohci_intr_core: BUS reset
fwohci0: fwohci_intr_core: node_id=0x00000000, SelfID Count=1, CYCLEMASTER mode
isab0: <PCI-ISA bridge> at device 31.0 on pci0
isa0: <ISA bus> on isab0
atapci2: <Intel ICH10 SATA300 controller> port 0x9c00-0x9c07,0x9880-0x9883,0x9800-0x9807,0x9480-0x9483,0x9400-0x941f mem 0xfe7fe800-0xfe7fefff irq 19 at device 31.2 on pci0
atapci2: [ITHREAD]
atapci2: AHCI called from vendor specific driver
atapci2: AHCI v1.20 controller with 6 3Gbps ports, PM supported
ata5: <ATA channel 0> on atapci2
ata5: [ITHREAD]
ata6: <ATA channel 1> on atapci2
ata6: [ITHREAD]
ata7: <ATA channel 2> on atapci2
ata7: [ITHREAD]
ata8: <ATA channel 3> on atapci2
ata8: [ITHREAD]
ata9: <ATA channel 4> on atapci2
ata9: [ITHREAD]
ata10: <ATA channel 5> on atapci2
ata10: [ITHREAD]
pci0: <serial bus, SMBus> at device 31.3 (no driver attached)
acpi_button0: <Power Button> on acpi0
atrtc0: <AT realtime clock> port 0x70-0x71 irq 8 on acpi0
fdc0: <floppy drive controller (FDE)> port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on acpi0
fdc0: [FILTER]
acpi_hpet0: <High Precision Event Timer> iomem 0xfed00000-0xfed003ff on acpi0
Timecounter "HPET" frequency 14318180 Hz quality 900
uart0: <16550 or compatible> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
uart0: [FILTER]
pmtimer0 on isa0
orm0: <ISA Option ROM> at iomem 0xc0000-0xcefff pnpid ORM0000 on isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
ata0 at port 0x1f0-0x1f7,0x3f6 irq 14 on isa0
ata0: [ITHREAD]
ata1 at port 0x170-0x177,0x376 irq 15 on isa0
ata1: [ITHREAD]
atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
atkbd0: <AT Keyboard> irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
atkbd0: [ITHREAD]
ppc0: parallel port not found.
est0: <Enhanced SpeedStep Frequency Control> on cpu0
p4tcc0: <CPU Frequency Thermal Control> on cpu0
est1: <Enhanced SpeedStep Frequency Control> on cpu1
p4tcc1: <CPU Frequency Thermal Control> on cpu1
est2: <Enhanced SpeedStep Frequency Control> on cpu2
p4tcc2: <CPU Frequency Thermal Control> on cpu2
est3: <Enhanced SpeedStep Frequency Control> on cpu3
p4tcc3: <CPU Frequency Thermal Control> on cpu3
Timecounters tick every 1.000 msec
vboxdrv: fAsync=0 offMin=0x4bf offMax=0x66f
firewire0: 1 nodes, maxhop <= 0 cable IRM irm(0) (me)
firewire0: bus manager 0
usbus0: 12Mbps Full Speed USB v1.0
usbus1: 12Mbps Full Speed USB v1.0
usbus2: 12Mbps Full Speed USB v1.0
usbus3: 480Mbps High Speed USB v2.0
usbus4: 12Mbps Full Speed USB v1.0
usbus5: 12Mbps Full Speed USB v1.0
usbus6: 12Mbps Full Speed USB v1.0
usbus7: 480Mbps High Speed USB v2.0
ad10: 953869MB <Hitachi HDS721010CLA332 JP4OA39C> at ata5-master UDMA100 SATA 3Gb/s
ugen0.1: <Intel> at usbus0
uhub0: <Intel UHCI root HUB, class 9/0, rev 1.00/1.00, addr 1> on usbus0
ugen1.1: <Intel> at usbus1
uhub1: <Intel UHCI root HUB, class 9/0, rev 1.00/1.00, addr 1> on usbus1
ugen2.1: <Intel> at usbus2
uhub2: <Intel UHCI root HUB, class 9/0, rev 1.00/1.00, addr 1> on usbus2
ugen3.1: <Intel> at usbus3
uhub3: <Intel EHCI root HUB, class 9/0, rev 2.00/1.00, addr 1> on usbus3
ugen4.1: <Intel> at usbus4
uhub4: <Intel UHCI root HUB, class 9/0, rev 1.00/1.00, addr 1> on usbus4
ugen5.1: <Intel> at usbus5
uhub5: <Intel UHCI root HUB, class 9/0, rev 1.00/1.00, addr 1> on usbus5
ugen6.1: <Intel> at usbus6
uhub6: <Intel UHCI root HUB, class 9/0, rev 1.00/1.00, addr 1> on usbus6
ugen7.1: <Intel> at usbus7
uhub7: <Intel EHCI root HUB, class 9/0, rev 2.00/1.00, addr 1> on usbus7
ad12: 1907729MB <ST2000DM001 9YN164 CC4C> at ata6-master UDMA100 SATA 3Gb/s
ad14: 476940MB <WDC WD5001AALS-00L3B2 01.03B01> at ata7-master UDMA100 SATA 3Gb/s
acd0: DVDR <TSSTcorp CDDVDW SH-S243D/SB00> at ata8-master UDMA100 SATA 1.5Gb/s
ad18: 476940MB <Seagate ST3500418AS CC38> at ata9-master UDMA100 SATA 3Gb/s
SMP: AP CPU #2 Launched!
SMP: AP CPU #1 Launched!
SMP: AP CPU #3 Launched!
uhub0: 2 ports with 2 removable, self powered
uhub1: 2 ports with 2 removable, self powered
uhub2: 2 ports with 2 removable, self powered
uhub4: 2 ports with 2 removable, self powered
uhub5: 2 ports with 2 removable, self powered
uhub6: 2 ports with 2 removable, self powered
Root mount waiting for: usbus7 usbus3
Root mount waiting for: usbus7 usbus3
Root mount waiting for: usbus7 usbus3
uhub3: 6 ports with 6 removable, self powered
uhub7: 6 ports with 6 removable, self powered
ugen3.2: <Generic> at usbus3
umass0: <Generic Mass Storage Device, class 0/0, rev 2.00/1.00, addr 2> on usbus3
umass0: SCSI over Bulk-Only; quirks = 0x0000
Root mount waiting for: usbus7 usbus3
ugen7.2: <Sonix Technology Co., Ltd.> at usbus7
umass0:0:0:-1: Attached to scbus0
(probe0:umass-sim0:0:0:0): TEST UNIT READY. CDB: 0 0 0 0 0 0
(probe0:umass-sim0:0:0:0): CAM status: SCSI Status Error
(probe0:umass-sim0:0:0:0): SCSI status: Check Condition
(probe0:umass-sim0:0:0:0): SCSI sense: NOT READY csi:0,aa,55,40 asc:3a,0 (Medium not present)
da0 at umass-sim0 bus 0 scbus0 target 0 lun 0
da0: <Generic USB SD Reader 1.00> Removable Direct Access SCSI-0 device
da0: 40.000MB/s transfers
da0: Attempt to query device size failed: NOT READY, Medium not present
Root mount waiting for: usbus7
(probe0:umass-sim0:0:0:1): TEST UNIT READY. CDB: 0 20 0 0 0 0
(probe0:umass-sim0:0:0:1): CAM status: SCSI Status Error
(probe0:umass-sim0:0:0:1): SCSI status: Check Condition
(probe0:umass-sim0:0:0:1): SCSI sense: NOT READY csi:0,aa,55,40 asc:3a,0 (Medium not present)
da1 at umass-sim0 bus 0 scbus0 target 0 lun 1
da1: <Generic USB CF Reader 1.01> Removable Direct Access SCSI-0 device
da1: 40.000MB/s transfers
da1: Attempt to query device size failed: NOT READY, Medium not present
(probe0:umass-sim0:0:0:2): TEST UNIT READY. CDB: 0 40 0 0 0 0
(probe0:umass-sim0:0:0:2): CAM status: SCSI Status Error
(probe0:umass-sim0:0:0:2): SCSI status: Check Condition
(probe0:umass-sim0:0:0:2): SCSI sense: NOT READY csi:0,aa,55,40 asc:3a,0 (Medium not present)
da2 at umass-sim0 bus 0 scbus0 target 0 lun 2
da2: <Generic USB SM Reader 1.02> Removable Direct Access SCSI-0 device
da2: 40.000MB/s transfers
da2: Attempt to query device size failed: NOT READY, Medium not present
(probe0:umass-sim0:0:0:3): TEST UNIT READY. CDB: 0 60 0 0 0 0
(probe0:umass-sim0:0:0:3): CAM status: SCSI Status Error
(probe0:umass-sim0:0:0:3): SCSI status: Check Condition
(probe0:umass-sim0:0:0:3): SCSI sense: NOT READY csi:0,aa,55,40 asc:3a,0 (Medium not present)
da3 at umass-sim0 bus 0 scbus0 target 0 lun 3
da3: <Generic USB MS Reader 1.03> Removable Direct Access SCSI-0 device
da3: 40.000MB/s transfers
da3: Attempt to query device size failed: NOT READY, Medium not present
ugen7.3: <Apple Inc.> at usbus7
Root mount waiting for: usbus7
Trying to mount root from ufs:/dev/ad10s1a v
ugen5.2: <Full-Speed Mouse> at usbus5
ums0: <Full-Speed Mouse Full-Speed Mouse, class 0/0, rev 1.10/0.07, addr 2> on usbus5
ums0: 6 buttons and [XYZT] coordinates ID=17
WARNING: TMPFS is considered to be a highly experimental feature in FreeBSD.
ugen5.3: <CYKB23> at usbus5
ukbd0: <CYKB23 USB Keyboard, class 0/0, rev 1.10/1.01, addr 3> on usbus5
kbd2 at ukbd0
ums1: <CYKB23 USB Keyboard, class 0/0, rev 1.10/1.01, addr 3> on usbus5
vboxnet0: Ethernet address: 0a:00:27:00:00:00
sk0: link state changed to UP
drm0: <ATI Radeon HD 4350> on vgapci0
info: [drm] MSI enabled 1 message(s)
vgapci0: child drm0 requested pci_enable_busmaster
info: [drm] Initialized radeon 1.31.0 20080613
info: [drm] Setting GART location based on new memory map
info: [drm] Loading RV710 Microcode
info: [drm] Resetting GPU
info: [drm] writeback test succeeded in 1 usecs
drm0: [ITHREAD]
ugen3.3: <OLYMPUS> at usbus3
ugen3.3: <OLYMPUS> at usbus3 (disconnected)
ugen7.3: <Apple Inc.> at usbus7 (disconnected)
ugen7.3: <Apple Inc.> at usbus7
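Since the full boot log is long, the disk-related lines can be pulled out with a small filter. This is a minimal sketch: the function name disk_lines is made up for illustration, and the pattern assumes the adN, acdN and ataN device names that appear in the output above.

```shell
# Keep only lines that start with an ATA disk (adN), ATAPI drive (acdN)
# or ATA channel (ataN) name, reading dmesg-style text on stdin.
disk_lines() {
    grep -E '^(ad|acd|ata)[0-9]+[: ]'
}

# On the live system:
#   dmesg | disk_lines
```

Controller lines such as atapci2 are deliberately left out, because the pattern requires a digit right after the device prefix.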
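Next, to learn more about the condition of the HDD, I installed smartmontools.
smartmontools is used here to isolate the HDD fault.
Installation itself is easy.
Install it from /usr/ports/sysutils/smartmontools.
For configuration, this site is easy to follow.
Then, running something like
smartctl -a /dev/ad18 > /home/20120519_smartmontools_ad18
writes the report out as a text file.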
smartctl 5.40 2010-10-16 r3189 [FreeBSD 8.2-RELEASE i386] (local build)
Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF INFORMATION SECTION ===
Model Family:     Seagate Barracuda 7200.12 family
Device Model:     ST3500418AS
Serial Number:    6VMCNX0H
Firmware Version: CC38
User Capacity:    500,107,862,016 bytes
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   8
ATA Standard is:  ATA-8-ACS revision 4
Local Time is:    Sun May 20 11:39:52 2012 JST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
                                        was completed without error.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      ( 0)   The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                 ( 600) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        ( 1)   minutes.
Extended self-test routine
recommended polling time:        ( 85)  minutes.
Conveyance self-test routine
recommended polling time:        ( 2)   minutes.
SCT capabilities:              (0x103f) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   118   100   006    Pre-fail  Always       -       185256179
  3 Spin_Up_Time            0x0003   099   097   000    Pre-fail  Always       -       0
  4 Start_Stop_Count        0x0032   055   055   020    Old_age   Always       -       46466
  5 Reallocated_Sector_Ct   0x0033   100   100   036    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000f   068   060   030    Pre-fail  Always       -       7082564
  9 Power_On_Hours          0x0032   082   082   000    Old_age   Always       -       16496
 10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   077   077   020    Old_age   Always       -       23557
183 Runtime_Bad_Block       0x0032   100   100   000    Old_age   Always       -       0
184 End-to-End_Error        0x0032   100   100   099    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   099   099   000    Old_age   Always       -       1
188 Command_Timeout         0x0032   100   099   000    Old_age   Always       -       46
189 High_Fly_Writes         0x003a   100   100   000    Old_age   Always       -       0
190 Airflow_Temperature_Cel 0x0022   062   055   045    Old_age   Always       -       38 (Min/Max 31/45)
194 Temperature_Celsius     0x0022   038   045   000    Old_age   Always       -       38 (0 15 0 0)
195 Hardware_ECC_Recovered  0x001a   034   017   000    Old_age   Always       -       185256179
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
240 Head_Flying_Hours       0x0000   100   253   000    Old_age   Offline      -       76239964507136
241 Total_LBAs_Written      0x0000   100   253   000    Old_age   Offline      -       1315511503
242 Total_LBAs_Read         0x0000   100   253   000    Old_age   Offline      -       1046973247

SMART Error Log Version: 1
ATA Error Count: 12 (device log contains only the most recent five errors)
        CR = Command Register [HEX]
        FR = Features Register [HEX]
        SC = Sector Count Register [HEX]
        SN = Sector Number Register [HEX]
        CL = Cylinder Low Register [HEX]
        CH = Cylinder High Register [HEX]
        DH = Device/Head Register [HEX]
        DC = Device Command Register [HEX]
        ER = Error register [HEX]
        ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 12 occurred at disk power-on lifetime: 16289 hours (678 days + 17 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 38 01 00 00  Error: UNC at LBA = 0x00000138 = 312

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 01 38 01 00 e0 00      00:00:24.647  READ DMA
  c8 00 01 37 01 00 e0 00      00:00:24.646  READ DMA
  c8 00 01 36 01 00 e0 00      00:00:24.645  READ DMA
  c8 00 01 35 01 00 e0 00      00:00:24.645  READ DMA
  c8 00 01 34 01 00 e0 00      00:00:24.645  READ DMA

Error 11 occurred at disk power-on lifetime: 16289 hours (678 days + 17 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 38 01 00 00  Error: UNC at LBA = 0x00000138 = 312

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 20 1f 01 00 e0 00      00:00:21.418  READ DMA
  c8 00 20 7f 51 14 e6 00      00:00:21.394  READ DMA
  25 00 04 ff ff ff 4f 00      00:00:21.380  READ DMA EXT
  25 00 08 ff ff ff 4f 00      00:00:21.370  READ DMA EXT
  25 00 08 ff ff ff 4f 00      00:00:21.362  READ DMA EXT

Error 10 occurred at disk power-on lifetime: 16289 hours (678 days + 17 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 38 01 00 00  Error: UNC at LBA = 0x00000138 = 312

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 01 38 01 00 e0 00      00:00:17.961  READ DMA
  c8 00 01 37 01 00 e0 00      00:00:17.960  READ DMA
  c8 00 01 36 01 00 e0 00      00:00:17.960  READ DMA
  c8 00 01 35 01 00 e0 00      00:00:17.960  READ DMA
  c8 00 01 34 01 00 e0 00      00:00:17.960  READ DMA

Error 9 occurred at disk power-on lifetime: 16289 hours (678 days + 17 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 38 01 00 00  Error: UNC at LBA = 0x00000138 = 312

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 20 1f 01 00 e0 00      00:00:14.683  READ DMA
  25 00 04 ff ff ff 4f 00      00:00:14.682  READ DMA EXT
  25 00 04 ff ff ff 4f 00      00:00:14.682  READ DMA EXT
  25 00 04 ff ff ff 4f 00      00:00:14.675  READ DMA EXT
  25 00 20 ff ff ff 4f 00      00:00:14.664  READ DMA EXT

Error 8 occurred at disk power-on lifetime: 16289 hours (678 days + 17 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 38 01 00 00  Error: UNC at LBA = 0x00000138 = 312

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 01 38 01 00 e0 00      00:00:11.560  READ DMA
  c8 00 01 37 01 00 e0 00      00:00:11.559  READ DMA
  c8 00 01 36 01 00 e0 00      00:00:11.559  READ DMA
  c8 00 01 35 01 00 e0 00      00:00:11.559  READ DMA
  c8 00 01 34 01 00 e0 00      00:00:11.559  READ DMA

SMART Self-test log structure revision number 1
Num  Test_Description    Status                   Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed without error        00%            16488  -
# 2  Extended offline    Completed without error        00%            16469  -
# 3  Short offline       Completed without error        00%            16464  -
# 4  Short offline       Completed without error        00%            16440  -
# 5  Short offline       Completed without error        00%            16416  -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
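Looking at this, errors on DMA reads do show up here as well. The log only notes that the device was "active or idle" when each error occurred, which describes the state of the drive rather than a cause. The error itself is recorded as UNC (uncorrectable data) at LBA 312, each time within about 25 seconds of power-up. Incidentally, smartctl identifies ad18 as an ST3500418AS with a capacity of 500,107,862,016 bytes, so the drive appears to be a 500GB model rather than the 250GB I wrote above.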
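To help tell a cabling problem from a media problem, a few attributes in the saved report are worth reading side by side. The following is a minimal sketch, assuming the report file still has its line breaks; smart_raw is a hypothetical helper name, and it relies on RAW_VALUE starting in the tenth whitespace-separated column of the attribute table.

```shell
# Print the RAW_VALUE column for one named attribute from
# `smartctl -a` output read on stdin.
smart_raw() {
    # Column 2 is ATTRIBUTE_NAME, column 10 is the start of RAW_VALUE.
    awk -v name="$1" '$2 == name { print $10 }'
}

# Example against the file saved earlier:
#   smart_raw UDMA_CRC_Error_Count < /home/20120519_smartmontools_ad18
```

In the report above, UDMA_CRC_Error_Count, Reallocated_Sector_Ct and Current_Pending_Sector are all 0, while Reported_Uncorrect is 1 and Command_Timeout is 46. A zero CRC count is often taken as a hint that the SATA link itself was not corrupting data, although that is a rule of thumb rather than proof.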
It looks like I need to spend a bit more time investigating.
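One narrow follow-up check is to read the sector named in the error log directly. The sketch below assumes 512-byte sectors, and read_lba is a hypothetical helper name; dd exits with a non-zero status if the read fails.

```shell
# Try to read one 512-byte sector at the given LBA and discard the data.
# Returns dd's exit status: 0 if the read succeeded, non-zero otherwise.
read_lba() {
    dev=$1
    lba=$2
    dd if="$dev" of=/dev/null bs=512 skip="$lba" count=1 2>/dev/null
}

# Example for the sector reported in the SMART error log:
#   read_lba /dev/ad18 312 && echo "LBA 312 is readable"
```

The self-test log already shows an extended offline test completing without error at 16469 hours, after the errors logged at 16289 hours, so a clean read of LBA 312 would be consistent with that. A fresh extended test can be started with smartctl -t long /dev/ad18 to check again.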