[2025-09-29 20:27:34+03:00] [2638db8098a64f8f870bd026f8a4884f6f10fd83]
Topics: [zfs]

ZFS showed me that one of my disks has started to fail

On my work computer, after a routine scrub, ZFS reported that one of the
disks is not entirely healthy:

    # zpool status
      pool: zroot
     state: ONLINE
    status: One or more devices has experienced an unrecoverable error.  An
            attempt was made to correct the error.  Applications are unaffected.
    action: Determine if the device needs to be replaced, and clear the errors
            using 'zpool clear' or replace the device with 'zpool replace'.
       see: https://openzfs.github.io/openzfs-docs/msg/ZFS-8000-9P
      scan: scrub repaired 2M in 00:59:21 with 0 errors on Mon Sep  1 22:54:16 2025
    config:

            NAME                STATE     READ WRITE CKSUM
            zroot               ONLINE       0     0     0
              mirror-0          ONLINE       0     0     0
                gpt/SGWD0-ROOT  ONLINE      16     0     2
                gpt/SGWD1-ROOT  ONLINE       0     0     0
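
The same check can be automated, for example from a periodic job. Below
is a minimal sketch, not a finished tool: the function name
devices_with_errors is made up for illustration, it only looks at the
config table of zpool status, and it skips abbreviated counters such as
1.2K.

```python
def devices_with_errors(status_text):
    """Return {name: (read, write, cksum)} for rows with a non-zero counter."""
    flagged = {}
    in_config = False
    for line in status_text.splitlines():
        fields = line.split()
        if fields[:1] == ["NAME"]:
            in_config = True          # the header row starts the device table
            continue
        if not in_config or len(fields) < 5:
            continue
        name, _state, *counters = fields[:5]
        try:
            read, write, cksum = (int(c) for c in counters)
        except ValueError:
            continue                  # not a plain-integer counter row
        if read or write or cksum:
            flagged[name] = (read, write, cksum)
    return flagged


# Config table copied from the zpool status output above.
SAMPLE = """\
        NAME                STATE     READ WRITE CKSUM
        zroot               ONLINE       0     0     0
          mirror-0          ONLINE       0     0     0
            gpt/SGWD0-ROOT  ONLINE      16     0     2
            gpt/SGWD1-ROOT  ONLINE       0     0     0
"""

print(devices_with_errors(SAMPLE))
```

For the table above, only gpt/SGWD0-ROOT is reported, with 16 read
errors and 2 checksum errors.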

Sure enough, dmesg shows read errors:

    [2463378] (ada0:ahcich0:0:0:0): RES: 41 40 17 ff 2d 40 07 00 00 00 00
    [2463378] (ada0:ahcich0:0:0:0): Retrying command, 2 more tries remain
    [2463380] (ada0:ahcich0:0:0:0): READ_FPDMA_QUEUED. ACB: 60 00 78 f7 2d 40 07 00 00 08 00 00
    [2463380] (ada0:ahcich0:0:0:0): CAM status: ATA Status Error
    [2463380] (ada0:ahcich0:0:0:0): ATA status: 41 (DRDY ERR), error: 40 (UNC )
    [2463380] (ada0:ahcich0:0:0:0): RES: 41 40 17 ff 2d 40 07 00 00 00 00
    [2463380] (ada0:ahcich0:0:0:0): Retrying command, 1 more tries remain
    [2463381] (ada0:ahcich0:0:0:0): READ_FPDMA_QUEUED. ACB: 60 00 78 f7 2d 40 07 00 00 08 00 00
    [2463381] (ada0:ahcich0:0:0:0): CAM status: ATA Status Error
    [2463381] (ada0:ahcich0:0:0:0): ATA status: 41 (DRDY ERR), error: 40 (UNC )
    [2463381] (ada0:ahcich0:0:0:0): RES: 41 40 17 ff 2d 40 07 00 00 00 00
    [2463381] (ada0:ahcich0:0:0:0): Retrying command, 0 more tries remain
    [2463383] (ada0:ahcich0:0:0:0): READ_FPDMA_QUEUED. ACB: 60 00 78 f7 2d 40 07 00 00 08 00 00
    [2463383] (ada0:ahcich0:0:0:0): CAM status: ATA Status Error
    [2463383] (ada0:ahcich0:0:0:0): ATA status: 41 (DRDY ERR), error: 40 (UNC )
    [2463383] (ada0:ahcich0:0:0:0): RES: 41 40 17 ff 2d 40 07 00 00 00 00
    [2463383] (ada0:ahcich0:0:0:0): Error 5, Retries exhausted
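
The ACB and RES byte dumps can be decoded to see which sector the drive
gave up on. The sketch below rests on an assumption about the byte
order that FreeBSD's CAM layer prints (command or status first, then
the low LBA bytes, the device register, then the high LBA bytes), and
on NCQ reads carrying the sector count in the features registers. The
helper names are made up for illustration.

```python
def _bytes(hex_string):
    return [int(part, 16) for part in hex_string.split()]


def _lba48(b):
    # Assumed layout: b[2..4] hold LBA bits 0-23, b[5] is the device
    # register, b[6..8] hold LBA bits 24-47.
    return b[2] | b[3] << 8 | b[4] << 16 | b[6] << 24 | b[7] << 32 | b[8] << 40


def decode_acb(acb_hex):
    """Return (command, first_lba, sector_count) from an ACB dump."""
    b = _bytes(acb_hex)
    if len(b) != 12:
        raise ValueError("expected 12 ACB bytes")
    # Assumption: for NCQ commands the count lives in features/features_exp.
    count = b[1] | b[9] << 8
    return b[0], _lba48(b), count


def decode_res(res_hex):
    """Return (status, error, lba) from a RES dump."""
    b = _bytes(res_hex)
    if len(b) != 11:
        raise ValueError("expected 11 RES bytes")
    return b[0], b[1], _lba48(b)


# Byte strings copied from the dmesg lines above.
command, first_lba, count = decode_acb("60 00 78 f7 2d 40 07 00 00 08 00 00")
status, error, bad_lba = decode_res("41 40 17 ff 2d 40 07 00 00 00 00")
print(f"request: {count} sectors starting at LBA {first_lba}")
print(f"failed at LBA {bad_lba} (status 0x{status:02x}, error 0x{error:02x})")
```

If the layout assumption holds, the request was a 2048-sector read
starting at LBA 120452984, and the drive reported the uncorrectable
error at LBA 120454935, which lies inside that range.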

SMART attributes of the healthy disk:

    ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
      1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
      3 Spin_Up_Time            0x0027   141   139   021    Pre-fail  Always       -       3908
      4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       165
      5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
      7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
      9 Power_On_Hours          0x0032   010   010   000    Old_age   Always       -       66297
     10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
     11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0
     12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       165
    192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       70
    193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       110
    194 Temperature_Celsius     0x0022   099   085   000    Old_age   Always       -       44
    196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
    197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
    198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       0
    199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
    200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0

SMART attributes of the disk with the errors:

    ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
      1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       10
      3 Spin_Up_Time            0x0027   139   138   021    Pre-fail  Always       -       4008
      4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       405
      5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
      7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
      9 Power_On_Hours          0x0032   001   001   000    Old_age   Always       -       88255
     10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
     11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0
     12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       403
    192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       146
    193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       258
    194 Temperature_Celsius     0x0022   097   083   000    Old_age   Always       -       46
    196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
    197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
    198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       0
    199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
    200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       10
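
Spotting the difference between the two tables by eye is tedious, so
here is a rough sketch that compares raw values and lists the
attributes that are zero on the healthy disk but not on the suspect
one. The function names are made up for illustration, and the parser
assumes RAW_VALUE is a plain integer in the tenth column, which is true
for these tables but not for every drive.

```python
def parse_raw_values(table):
    """Map ATTRIBUTE_NAME to RAW_VALUE for each attribute row."""
    values = {}
    for line in table.splitlines():
        fields = line.split()
        # Attribute rows start with a numeric ID and have at least 10 columns.
        if len(fields) >= 10 and fields[0].isdigit():
            values[fields[1]] = int(fields[9])
    return values


def newly_nonzero(healthy, suspect):
    """Attributes whose raw value is 0 on `healthy` but non-zero on `suspect`."""
    return {
        name: raw
        for name, raw in suspect.items()
        if raw != 0 and healthy.get(name) == 0
    }


# A few rows copied from the two tables above.
HEALTHY = """
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0
"""
SUSPECT = """
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       10
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       10
"""

print(newly_nonzero(parse_raw_values(HEALTHY), parse_raw_values(SUSPECT)))
```

On these rows it reports Raw_Read_Error_Rate and Multi_Zone_Error_Rate,
both with a raw value of 10.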

It has been running for more than ten years (Power_On_Hours is 88255).
Time to retire it. zpool detach, put in a new disk, zpool attach, and
after a while a nice healthy mirror is back in service.
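
Spelled out, the swap is roughly the command sequence below. This is
only a sketch that prints the commands instead of running them. The
pool and device names come from the zpool status output above, except
gpt/SGWD2-ROOT, which is a hypothetical label for the new disk. It also
assumes the new disk has already been partitioned and labelled.

```python
import shlex


def mirror_swap_commands(pool, failing, survivor, replacement):
    """Build (but do not run) the commands for swapping one mirror half."""
    return [
        # 1. Drop the failing half; the pool keeps running on the survivor.
        ["zpool", "detach", pool, failing],
        # (The physical disk swap happens here.)
        # 2. Attach the new device next to the survivor; resilvering starts.
        ["zpool", "attach", pool, survivor, replacement],
        # 3. Watch the resilver progress.
        ["zpool", "status", pool],
    ]


# gpt/SGWD2-ROOT is a made-up label for the replacement disk.
commands = mirror_swap_commands(
    "zroot", "gpt/SGWD0-ROOT", "gpt/SGWD1-ROOT", "gpt/SGWD2-ROOT"
)
for argv in commands:
    print(shlex.join(argv))
```

Printing rather than executing keeps the sketch harmless. Between
detach and the end of the resilver the pool has no redundancy, so it is
worth reading the commands over before typing them.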
