  | |  | Less than kelpful kernel disk messages | Less than kelpful kernel disk messages 2004-11-07 - By nathan r. hruby
Back
Ok,
So I have a box with 3 SCSI disks. Overnight one of the disks, it seems,
has started to keel over. What I can 't determine (nor can Google seem to
really assist me) is if these are total failures, or if they are
recoverable.
What the error messages say to me is:
- Hey, I had a write error
- Hey, I think I can recover from this, and add a bad block to the list
- Hey, I tried to add a bad block to the disk, but that failed
- Hey, the md drivers saw these errors and are going to run your md 's in
degraded mode just to be safe.
What I don 't know is if the disklist errors are (or should be) fatal or not.
The other partitions on the problem disk seem to be fine...
AFIK, I should be able to put those failed partitions on the problem disk
back into the md and they should resync fine and all should be happy, but
It 'd be nice if someone could confirm / deny my analysis of these
messages.
Machine is a Compaq ML350 with the Compaq OEM 3960D SCSI controller
(AIC7XXX driver) with (argh!) Compaq disks:
Vendor: COMPAQ Model: BD036659CC Rev: 3B00
Type: Direct-Access ANSI SCSI revision: 03
Kernel messages follow:
-- ---
Nov 7 04:38:02 xon kernel: Info fld=0x23936ae, Current sd08:16: sense key Recovered Error
Nov 7 04:38:02 xon kernel: Additional sense indicates Write error - recovered with auto reallocation
Nov 7 04:50:34 xon kernel: SCSI disk error : host 0 channel 0 id 1 lun 0 return code = 8000002
Nov 7 04:50:34 xon kernel: Current sd08:17: sense key Hardware Error
Nov 7 04:50:34 xon kernel: Additional sense indicates Defect list error
Nov 7 04:50:34 xon kernel: I/O error: dev 08:17, sector 1048616
Nov 7 04:50:34 xon kernel: raid5: Disk failure on sdb7, disabling device. Operation continuing on 2 devices
Nov 7 04:50:34 xon kernel: md4: no spare disk to reconstruct array! -- continuing in degraded mode
Nov 7 05:50:04 xon kernel: SCSI disk error : host 0 channel 0 id 1 lun 0 return code = 8000002
Nov 7 05:50:04 xon kernel: Current sd08:16: sense key Hardware Error
Nov 7 05:50:04 xon kernel: Additional sense indicates Defect list error
Nov 7 05:50:04 xon kernel: I/O error: dev 08:16, sector 20480
Nov 7 05:50:04 xon kernel: raid5: Disk failure on sdb6, disabling device. Operation continuing on 2 devices
Nov 7 05:50:04 xon kernel: md1: no spare disk to reconstruct array! -- continuing in degraded mode
Nov 7 05:50:04 xon kernel: md4: no spare disk to reconstruct array! -- continuing in degraded mode
-- ---
Thanks y 'all!
-n
--
-- ---- ---- ---- ---- ---- ---- ---- -----
nathan hruby <nhruby@(protected) >
uga enterprise information technology services
production systems support
metaphysically wrinkle-free
-- ---- ---- ---- ---- ---- ---- ---- -----
--
Taroon-list mailing list
Taroon-list@(protected)
http://www.redhat.com/mailman/listinfo/taroon-list
|
|
 |