Alert/Error message help

Status
Not open for further replies.

takkischitt

Explorer
Joined
Jan 20, 2014
Messages
70
Long test results for ada0

Code:
=== START OF INFORMATION SECTION ===
Model Family:     Hitachi Deskstar P7K500
Device Model:     Hitachi HDP725050GLA360
Serial Number:    ****
LU WWN Device Id: ****
Firmware Version: GM4OA5CA
User Capacity:    500,107,862,016 bytes [500 GB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    7200 rpm
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ATA8-ACS T13/1699-D revision 4
SATA Version is:  SATA 2.6, 3.0 Gb/s
Local Time is:    Thu Oct 22 09:48:43 2015 BST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x84)    Offline data collection activity
                    was suspended by an interrupting command from host.
                    Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0)    The previous self-test routine completed
                    without error or no self-test has ever 
                    been run.
Total time to complete Offline 
data collection:         ( 7890) seconds.
Offline data collection
capabilities:             (0x5b) SMART execute Offline immediate.
                    Auto Offline data collection on/off support.
                    Suspend Offline collection upon new
                    command.
                    Offline surface scan supported.
                    Self-test supported.
                    No Conveyance Self-test supported.
                    Selective Self-test supported.
SMART capabilities:            (0x0003)    Saves SMART data before entering
                    power-saving mode.
                    Supports SMART auto save timer.
Error logging capability:        (0x01)    Error logging supported.
                    General Purpose Logging supported.
Short self-test routine 
recommended polling time:     (   1) minutes.
Extended self-test routine
recommended polling time:     ( 131) minutes.
SCT capabilities:           (0x003d)    SCT Status supported.
                    SCT Error Recovery Control supported.
                    SCT Feature Control supported.
                    SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000b   100   100   016    Pre-fail  Always       -       0
  2 Throughput_Performance  0x0005   133   133   054    Pre-fail  Offline      -       139
  3 Spin_Up_Time            0x0007   119   119   024    Pre-fail  Always       -       313 (Average 333)
  4 Start_Stop_Count        0x0012   097   097   000    Old_age   Always       -       15766
  5 Reallocated_Sector_Ct   0x0033   100   100   005    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000b   100   100   067    Pre-fail  Always       -       0
  8 Seek_Time_Performance   0x0005   100   100   020    Pre-fail  Offline      -       288
  9 Power_On_Hours          0x0012   093   093   000    Old_age   Always       -       49410
 10 Spin_Retry_Count        0x0013   100   100   060    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       106
192 Power-Off_Retract_Count 0x0032   086   086   000    Old_age   Always       -       17495
193 Load_Cycle_Count        0x0012   086   086   000    Old_age   Always       -       17495
194 Temperature_Celsius     0x0002   187   187   000    Old_age   Always       -       32 (Min/Max 17/44)
196 Reallocated_Event_Count 0x0032   100   100   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0022   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0008   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x000a   200   200   000    Old_age   Always       -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error       00%     49392         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

Long test results for ada1

Code:
=== START OF INFORMATION SECTION ===
Model Family:     Hitachi Deskstar P7K500
Device Model:     Hitachi HDP725050GLA360
Serial Number:    ****
LU WWN Device Id: ****
Firmware Version: GM4OA5CA
User Capacity:    500,107,862,016 bytes [500 GB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    7200 rpm
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ATA8-ACS T13/1699-D revision 4
SATA Version is:  SATA 2.6, 3.0 Gb/s
Local Time is:    Thu Oct 22 09:51:17 2015 BST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x84)    Offline data collection activity
                    was suspended by an interrupting command from host.
                    Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0)    The previous self-test routine completed
                    without error or no self-test has ever 
                    been run.
Total time to complete Offline 
data collection:         ( 7890) seconds.
Offline data collection
capabilities:             (0x5b) SMART execute Offline immediate.
                    Auto Offline data collection on/off support.
                    Suspend Offline collection upon new
                    command.
                    Offline surface scan supported.
                    Self-test supported.
                    No Conveyance Self-test supported.
                    Selective Self-test supported.
SMART capabilities:            (0x0003)    Saves SMART data before entering
                    power-saving mode.
                    Supports SMART auto save timer.
Error logging capability:        (0x01)    Error logging supported.
                    General Purpose Logging supported.
Short self-test routine 
recommended polling time:     (   1) minutes.
Extended self-test routine
recommended polling time:     ( 131) minutes.
SCT capabilities:           (0x003d)    SCT Status supported.
                    SCT Error Recovery Control supported.
                    SCT Feature Control supported.
                    SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000b   100   100   016    Pre-fail  Always       -       0
  2 Throughput_Performance  0x0005   133   133   054    Pre-fail  Offline      -       139
  3 Spin_Up_Time            0x0007   119   119   024    Pre-fail  Always       -       313 (Average 333)
  4 Start_Stop_Count        0x0012   096   096   000    Old_age   Always       -       18321
  5 Reallocated_Sector_Ct   0x0033   100   100   005    Pre-fail  Always       -       2
  7 Seek_Error_Rate         0x000b   100   100   067    Pre-fail  Always       -       0
  8 Seek_Time_Performance   0x0005   100   100   020    Pre-fail  Offline      -       0
  9 Power_On_Hours          0x0012   093   093   000    Old_age   Always       -       49412
 10 Spin_Retry_Count        0x0013   100   100   060    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       106
192 Power-Off_Retract_Count 0x0032   084   084   000    Old_age   Always       -       20060
193 Load_Cycle_Count        0x0012   084   084   000    Old_age   Always       -       20060
194 Temperature_Celsius     0x0002   181   181   000    Old_age   Always       -       33 (Min/Max 18/47)
196 Reallocated_Event_Count 0x0032   100   100   000    Old_age   Always       -       2
197 Current_Pending_Sector  0x0022   100   100   000    Old_age   Always       -       3
198 Offline_Uncorrectable   0x0008   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x000a   200   200   000    Old_age   Always       -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error       00%     49397         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
 

danb35

Hall of Famer
Joined
Aug 16, 2011
Messages
15,504
OK, your results look fine. My reason for suggesting you manually run a long test was to see if there were other bad sectors that the drive just hadn't found yet, but it doesn't sound like it. ada1 is showing a few bad sectors, which has the potential to damage data given your pool configuration, but it is only a few. ada0 appears to be fine.
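For anyone wanting to repeat this comparison without reading the whole table by eye, here is a minimal Python sketch that pulls the sector-related attributes out of the attribute table and reports the ones with a nonzero raw value. The function name, the list of watched attributes, and the sample text (copied from the ada1 output above) are my own choices for illustration, not anything smartctl provides.

```python
import re

# Attributes whose raw value would ideally stay at zero on a healthy drive
# (an assumed watch list, chosen for this example).
WATCHED = (
    "Reallocated_Sector_Ct",
    "Reallocated_Event_Count",
    "Current_Pending_Sector",
    "Offline_Uncorrectable",
)

# One row of the attribute table:
# ID, name, flag, value, worst, thresh, type, updated, when_failed, raw value.
ROW = re.compile(
    r"^\s*(\d+)\s+(\S+)\s+0x[0-9a-fA-F]+\s+\d+\s+\d+\s+\d+"
    r"\s+\S+\s+\S+\s+\S+\s+(\d+)"
)

def watched_nonzero(smart_text):
    """Return {attribute_name: raw_value} for watched attributes above zero."""
    flagged = {}
    for line in smart_text.splitlines():
        m = ROW.match(line)
        if m and m.group(2) in WATCHED and int(m.group(3)) > 0:
            flagged[m.group(2)] = int(m.group(3))
    return flagged

# Rows copied from the ada1 report in this thread.
ADA1_SAMPLE = """\
  5 Reallocated_Sector_Ct   0x0033   100   100   005    Pre-fail  Always       -       2
196 Reallocated_Event_Count 0x0032   100   100   000    Old_age   Always       -       2
197 Current_Pending_Sector  0x0022   100   100   000    Old_age   Always       -       3
198 Offline_Uncorrectable   0x0008   100   100   000    Old_age   Offline      -       0
"""

print(watched_nonzero(ADA1_SAMPLE))
```

Run against the same rows from ada0, where every one of these raw values is 0, the function returns an empty dictionary.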
 

takkischitt

Explorer
Joined
Jan 20, 2014
Messages
70
Great stuff! Many thanks for all the help from everyone. I'm feeling pretty happy now that once I set it up with the 500GB mirror and have the added security of the offsite backup, the system should be fairly protected from failures.

I take it 'Current_Pending_Sector' is the field in the results to keep an eye on?

Also just noticing the 'Power_On_Hours' is working out at around 5 years in total. I really don't think those drives are that old. Is this field usually accurate?

The only other thing that I am considering is adding a UPS, but I will search the forum for recommendations and further information.
 

Ericloewe

Server Wrangler
Moderator
Joined
Feb 15, 2014
Messages
20,194
takkischitt said:
Also just noticing the 'Power_On_Hours' is working out at around 5 years in total. I really don't think those drives are that old. Is this field usually accurate?
To within general quartz watch standards, I'd expect.
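
The arithmetic behind the "around 5 years" figure can be checked with a short Python sketch. It assumes the raw value is a plain count of hours, which is how smartctl displays it for these drives, and uses 365.25 days per year. Keep in mind this is time spent powered on, not calendar age, so a drive that has run around the clock would reach this figure in a little under six years.

```python
# Convert the Power_On_Hours raw values from the reports above into years.
HOURS_PER_YEAR = 24 * 365.25  # assumption: average year length, hours counted 1:1

def power_on_years(raw_hours):
    """Return the powered-on time in years for a raw hour count."""
    return raw_hours / HOURS_PER_YEAR

for name, hours in (("ada0", 49410), ("ada1", 49412)):
    print(f"{name}: {hours} h is about {power_on_years(hours):.2f} years powered on")
```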
 

takkischitt

Explorer
Joined
Jan 20, 2014
Messages
70
I have swapped out the bad 500GB drive as it died before I got the chance to deal with it. I fitted a new 500GB drive in its place and removed/deleted the previous volume. I think I have it set up as a mirror...

Capture.png


Does this look right? Not sure why it's got compression on one of the drives; I didn't see any settings for that.
 

danb35

Hall of Famer
Joined
Aug 16, 2011
Messages
15,504
That looks entirely right. Compression is enabled by default when you create the pool, and then the default setting for any datasets is to inherit the settings of their parents. What you've posted doesn't show whether you have a mirror, though. Click on the first entry for Files, then click on the Volume Status button at the bottom (it looks like a blank sheet of notebook paper), and give us what that shows.
 

Bidule0hm

Server Electronics Sorcerer
Joined
Aug 5, 2013
Messages
3,710
It's not one of the drives, it's the root dataset, and the first line is your pool. Compression is enabled by default, and if you don't know whether you need to disable it, you should leave it enabled.
 

takkischitt

Explorer
Joined
Jan 20, 2014
Messages
70
Seems like it's set up...

Capture.png


The shortcuts I have on my PCs to access the various folders that were on the previous volume should still work, as long as the location has the same path on the new volume. Would that be correct? The system config is still the same, IP, etc.

I'm having problems copying some folders/files back on, as WinSCP is giving me an error. The folder/file paths are very long; could this be the cause of the error? Is there any way to restore the backup without changing all the folder and file names to make them shorter, if that is the problem?
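
One way to test the long-path theory before renaming anything is to list the entries in the backup that exceed a length limit. The following is a rough Python sketch, not a confirmed diagnosis of the WinSCP error. The function name is made up for this example, and the default limits (255 bytes per name, 1024 bytes per full path) are assumptions based on the usual FreeBSD values; the path on the server will also differ from the local one, so adjust the limits to suit.

```python
import os

def find_long_paths(root, max_path=1024, max_name=255):
    """Yield (path, reason) for entries under root that exceed the byte limits."""
    for dirpath, dirnames, filenames in os.walk(root):
        for name in dirnames + filenames:
            full = os.path.join(dirpath, name)
            # Limits are measured in bytes, so encode before counting.
            if len(os.fsencode(name)) > max_name:
                yield full, "name too long"
            elif len(os.fsencode(full)) > max_path:
                yield full, "path too long"
```

Calling `list(find_long_paths(backup_folder))`, where `backup_folder` is a placeholder for wherever the backup lives, gives the entries worth checking against the ones WinSCP rejects.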
 

danb35

Hall of Famer
Joined
Aug 16, 2011
Messages
15,504
OK, yes, you do have a mirror set up. As long as the share names are the same, and all the data is in the same locations, your client machines shouldn't know the difference.
 

takkischitt

Explorer
Joined
Jan 20, 2014
Messages
70
danb35 said:
As long as the share names are the same, and all the data is in the same locations, your client machines shouldn't know the difference.

That's what I thought, but for some reason my shortcuts aren't working/connecting.
 

takkischitt

Explorer
Joined
Jan 20, 2014
Messages
70
What does this mean?

If you deleted the pool, that would explain why your shortcuts aren't working.

I recreated the volume and named it the same as before under mnt, but this time it's a mirror (2 x 500GB) instead of joining the two drives (1TB).
 