|
#1
|
|||
|
|||
zfs mirror
Sergey Anohin написал(а) к All в Aug 18 17:19:33 по местному времени:
Нello! Есть полутестовый сервер с сабжем. Ночью развалилось: Aug 4 04:39:53 NAS kernel: ahcich0: Timeout on slot 31 port 0 Aug 4 04:39:53 NAS kernel: ahcich0: is 00000000 cs 00000000 ss 80000001 rs 80000001 tfd 40 serr 00000000 cmd 0000c017 Aug 4 04:39:53 NAS kernel: (ada0:ahcich0:0:0:0): WRITEFPDMAQUEUED. ACB: 61 08 10 3b fa 40 c2 01 00 00 00 00 Aug 4 04:39:53 NAS kernel: (ada0:ahcich0:0:0:0): CAM status: Command timeout Aug 4 04:39:53 NAS kernel: (ada0:ahcich0:0:0:0): Retrying command Aug 4 04:43:02 NAS kernel: swap_pager: indefinite wait buffer: bufobj: 0, blkno: 139501, size: 4096 Aug 4 04:43:02 NAS kernel: ahcich0: AНCI reset: device not ready after 31000ms (tfd = 00000080) Aug 4 04:43:02 NAS kernel: swap_pager: indefinite wait buffer: bufobj: 0, blkno: 139501, size: 4096 Aug 4 04:43:02 NAS kernel: swap_pager: indefinite wait buffer: bufobj: 0, blkno: 139501, size: 4096 Aug 4 04:43:02 NAS kernel: ahcich0: Timeout on slot 1 port 0 Aug 4 04:43:02 NAS kernel: ahcich0: is 00000000 cs 00000002 ss 00000000 rs 00000002 tfd 80 serr 00000000 cmd 0000c117 Aug 4 04:43:02 NAS kernel: (aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00 Aug 4 04:43:02 NAS kernel: (aprobe0:ahcich0:0:0:0): CAM status: Command timeout Aug 4 04:43:02 NAS kernel: (aprobe0:ahcich0:0:0:0): Retrying command Aug 4 04:43:02 NAS kernel: swap_pager: indefinite wait buffer: bufobj: 0, blkno: 139501, size: 4096 Aug 4 04:43:02 NAS kernel: ahcich0: AНCI reset: device not ready after 31000ms (tfd = 00000080) Aug 4 04:43:02 NAS kernel: swap_pager: indefinite wait buffer: bufobj: 0, blkno: 139501, size: 4096 Aug 4 04:43:02 NAS kernel: swap_pager: indefinite wait buffer: bufobj: 0, blkno: 139501, size: 4096 Aug 4 04:43:02 NAS kernel: ahcich0: Timeout on slot 2 port 0 Aug 4 04:43:02 NAS kernel: ahcich0: is 00000000 cs 00000004 ss 00000000 rs 00000004 tfd 80 serr 00000000 cmd 0000c217 Aug 4 04:43:02 NAS kernel: (aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00 Aug 4 04:43:02 NAS kernel: (aprobe0:ahcich0:0:0:0): CAM status: Command timeout Aug 4 04:43:02 NAS kernel: (aprobe0:ahcich0:0:0:0): Error 5, Retries exhausted Aug 4 04:43:02 NAS kernel: swap_pager: indefinite wait buffer: bufobj: 0, blkno: 139501, size: 4096 Aug 4 04:43:02 NAS kernel: ahcich0: AНCI reset: device not ready after 31000ms (tfd = 00000080) Aug 4 04:43:02 NAS kernel: swap_pager: indefinite wait buffer: bufobj: 0, blkno: 139501, size: 4096 Aug 4 04:43:02 NAS kernel: swap_pager: indefinite wait buffer: bufobj: 0, blkno: 139501, size: 4096 Aug 4 04:43:02 NAS kernel: ahcich0: Timeout on slot 3 port 0 Aug 4 04:43:02 NAS kernel: ahcich0: is 00000000 cs 00000008 ss 00000000 rs 00000008 tfd 80 serr 00000000 cmd 0000c317 Aug 4 04:43:02 NAS kernel: (aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00 Aug 4 04:43:02 NAS kernel: (aprobe0:ahcich0:0:0:0): CAM status: Command timeout Aug 4 04:43:02 NAS kernel: (aprobe0:ahcich0:0:0:0): Error 5, Retry was blocked Aug 4 04:43:02 NAS kernel: ada0 at ahcich0 bus 0 scbus0 target 0 lun 0 Aug 4 04:43:02 NAS kernel: ada0: <WDC WD40EFRX-68N32N0 82.00A82> s/n WD-WCC7K2UANUAZ detached Aug 4 04:43:02 NAS kernel: swap_pager: I/O error - pagein failed; blkno 139501,size 4096, error 6 Aug 4 04:43:02 NAS kernel: vm_fault: pager read error, pid 329 (devd) Aug 4 04:43:02 NAS kernel: swap_pager: I/O error - pagein failed; blkno 175717,size 4096, error 6 Aug 4 04:43:02 NAS kernel: vm_fault: pager read error, pid 329 (devd) Aug 4 04:43:02 NAS kernel: swap_pager: I/O error - pagein failed; blkno 175717,size 4096, error 6 и 200 метров логов последняя строка повторяется. Короче как оказалось просто отвалился диск, то ли помер, то ли мать глючит, пока хз. Сервак ушел в ребут и сообщил что сабж degraded. Вроде ниче страшного, если умер диск вставляем другой, клонируем gpart разбивку со старого на новый диск, руками копируем ефи, делаем буткод, из одного раздела своп и т.д. Вопрос 1: как можно без такого адского ручного труда? :) Вопрос 2 почему оно заребутилось? Умер своп на одном диске и паника в селе? :) Может тогда отзеркалить ефи и бут и своп? gmirror? root@NAS:/boot# cat /etc/fstab # Device Mountpoint FStype Options Dump Pass# /dev/ada0p3 none swap sw 0 0 /dev/ada1p3 none swap sw 0 0 root@NAS:/boot# zpool status -v pool: zroot state: DEGRADED status: One or more devices could not be opened. Sufficient replicas exist for the pool to continue functioning in a degraded state. action: Attach the missing device and online it using 'zpool online'. see: http://illumos.org/msg/ZFS-8000-2Q scan: none requested config: NAME STATE READ WRITE CKSUM zroot DEGRADED 0 0 0 mirror-0 DEGRADED 0 0 0 1617915411085386511 UNAVAIL 0 0 0 was /dev/ada0p4 ada0p4 ONLINE 0 0 0 errors: No known data errors С наилучшими пожеланиями, Sergey Anohin. --- wfido |