Volume corruption

Status
Not open for further replies.

leang521

Dabbler
Joined
Mar 18, 2013
Messages
18
Help! A serious problem has occurred.
One hard drive reported a SMART error, so I sent it in for repair and the manufacturer exchanged it for a new hard disk.
I put the new hard disk into the system. The first time, I did not insert it in the original slot, and the volume showed an error with an undetectable capacity.
I shut down and moved the disk to the original position.
The volume then displayed properly, but it was working in degraded mode.
I chose to replace the missing hard disk. The volume then showed three hard disks, one of which was the missing disk, listed as a long string of numbers.
(At this point the system may have been copying data, but I did not understand that at the time.)
Being impatient, I tried to remove the missing disk, but there was no response.
I then tried to remove the new disk, and that is when disaster struck: the system reported an error no matter what I clicked. Even a reboot would not work.
In the end I had to press the reset button on the machine.
After booting again, the volume cannot be found and its capacity is unknown.
What should I do? I can see the volume but not the data, and all my data is on it.
 

ProtoSD

MVP
Joined
Jul 1, 2011
Messages
3,348
You're not very likely to get help using the English you posted; very little of it makes sense. You would be better off using Google Translate or one of the foreign language sub-forums. You also need to follow the rules and post complete details about your hardware.

Good luck! :)
 

leang521

Dabbler
Joined
Mar 18, 2013
Messages
18
Sorry. I come from China, and my English is poor.
There is very little about FreeNAS on forums in China, so I cannot get enough help there.
Because of the language barrier, I do not fully understand the specific rules of this forum.

My hardware configuration is as follows.
Intel E3200, P35 motherboard, 3GB RAM
Two WD Green 2.0TB hard drives

Please try to help me.
 

leang521

Dabbler
Joined
Mar 18, 2013
Messages
18
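I built a ZFS mirror from two WD Green 2.0TB hard drives. One of the drives reported an error: SMART C6=1.
I pulled the bad drive, and the system showed the volume working in a degraded state, with one drive not visible.
I returned the bad drive to the manufacturer and was given a new drive in exchange.
After I plugged in the new drive, the system showed two drives and one volume.
Volume management showed one good drive and one offline drive (a string of numbers).
I ran a replace on the offline drive, and the system reported success.
At this point the volume showed two good drives and one offline drive.
I removed the offline drive, and the system reported success.
But the volume status still showed two good drives and one offline drive.
(I later guessed that the system may have been rebuilding the mirror data at this point.)
Because I did not understand this at the time, I thought the replacement had not succeeded.
So I used the remove command on the new drive as well, intending to redo the replacement.
Then the system got stuck in errors.
Whatever operation I tried, it reported that a system error had occurred.
At that point, unfortunately, I hard-reset the system.
After the restart the system no longer ran normally. The volume status showed an unknown capacity, and volume management would not open.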


The following is the Google translation:
I set up a ZFS mirror with two Western Digital Green 2.0TB hard drives. One hard disk reported an error, SMART C6 = 1.
I unplugged the bad hard drive, and the system showed it was working in a degraded state. One of the hard drives was not visible.
The bad hard disk was returned to the depot and exchanged for a new hard disk.
When I plugged in the new hard drive, the system displayed two hard drives and one volume. Volume management showed one good hard drive and one offline hard disk (a string of numbers).
I replaced the offline hard disk, and the system displayed success. The volume then showed two good hard drives and one offline hard disk.
I removed the offline hard disk. However, the volume status still showed two good hard drives and one offline hard disk.
(Later, I guessed the system may have been doing mirrored data recovery.)
But because I did not understand this, I thought the replacement was not successful. I used the remove command on the new hard disk too, intending to replace the hard disk again.
Then the system fell into errors. Every operation I carried out showed that a system error had occurred.
Unfortunately, I then did a hardware restart of the system.
The system did not work properly after the restart. The volume status displayed an unknown capacity, and Volume Manager would not open.




I asked my friend to help me translate it:
I made a ZFS mirror with two Western Digital Green 2.0TB hard disks, and one disk reported the error SMART C6=1. So I removed it, and the system showed it was working in a degraded state, with one of the disks not visible. I had to exchange it for a new hard disk.
After I installed the new one, the system displayed two disks and one volume. The volume manager showed one disk and one offline disk.
After I replaced the offline disk, the manager showed two disks and an offline disk. So I removed the offline disk. The system reported success, but the volume state still showed two disks and an offline disk.
I thought the replace command had not worked, so I removed the new disk in order to try the replace command again. After that, the system showed errors whatever I did.
I had to reboot the system, and it crashed after the reboot.
Now the volume state shows an unknown capacity, and I cannot open the volume manager either. Please help me.
 

leang521

Dabbler
Joined
Mar 18, 2013
Messages
18
I reinstalled the system and tried to import the volume, but it failed with:
Apr 12 16:43:15 freenas manage.py: [middleware.exceptions:38] [MiddlewareError:The volume "NAS1" failed to import, for futher details check pool status]
I then tried the command "zpool import".
It showed the ZFS volume as having three hard disks, two of them missing, with "State: DEGRADED".
I suspect that, because of the way ZFS works, even a mirror cannot function with two members missing.
I went back to the original system with all the hard disks attached.
The "Alert System" showed this message:
WARNING: The volume NAS1 (ZFS) status is UNKNOWN: One or more devices has experienced an error resulting in data corruption. Applications may be affected.Restore the file in question if possible. Otherwise restore the entire pool from backup.

- - - Updated - - -

Now my question is: how can I restore the volume to a healthy state and keep the data safe?
 

leang521

Dabbler
Joined
Mar 18, 2013
Messages
18
I have just noticed the following two warnings in the log at startup. Are they related?

Apr 13 13:57:48 freenas root: /etc/rc: WARNING: Dump device does not exist. Savecore not run.
Apr 13 13:57:48 freenas root: /etc/rc: WARNING: failed precmd routine for vmware_guestd
 

leang521

Dabbler
Joined
Mar 18, 2013
Messages
18
I went and read the ZFS documentation. I think the data was damaged because I forced a reboot while the replacement hard disk was still resynchronizing. The output of the "zpool status -v" command is as follows:

[root@freenas ~]# zpool status -v NAS1 |more
  pool: NAS1
 state: DEGRADED
status: One or more devices has experienced an error resulting in data
        corruption. Applications may be affected.
action: Restore the file in question if possible. Otherwise restore the
        entire pool from backup.
   see: http://www.sun.com/msg/ZFS-8000-8A
  scan: resilvered 26.5G in 0h5m with 28818 errors on Thu Apr 11 23:29:03 2013
config:

        NAME                                              STATE     READ WRITE CKSUM
        NAS1                                              DEGRADED     0     0     0
          mirror-0                                        DEGRADED     0     0     0
            replacing-0                                   DEGRADED     0     0     0
              13770178437660062256                        UNAVAIL      0     0     0  was /dev/gptid/3df8be2d-988a-11e2-83d9-001d7de86a8c
              gptid/c82e1cfb-a2bb-11e2-a83f-001d7de86a8c  ONLINE       0     0     0
            gptid/3ead7eef-988a-11e2-83d9-001d7de86a8c    ONLINE       0     0     0

errors: Permanent errors have been detected in the following files:

        111.JPG
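
For anyone reading along, the two things worth checking in that output are the scan line and the replacing-0 entry. The following is only a rough sketch, not a FreeNAS tool: replace_state is a made-up helper name, and it assumes that a running resilver is reported with the words "resilver in progress" on the scan line. It only reads status text from standard input, so it changes nothing on the pool.

```shell
#!/bin/sh
# Hypothetical helper (not part of FreeNAS or ZFS): reads `zpool status`
# text on stdin and prints one word describing the replacement state.
# Assumption: a running resilver appears as "resilver in progress".
replace_state() {
    awk '
        /resilver in progress/       { resilvering = 1 }
        $1 ~ /^replacing-[0-9]+$/    { replacing = 1 }
        END {
            if (resilvering)    print "resilvering"   # data is still being copied
            else if (replacing) print "replacing"     # replace started but never completed
            else                print "idle"          # no replacement pending
        }
    '
}

# Usage, with the pool name from this thread:
#   zpool status NAS1 | replace_state
```

A result of "replacing" with no resilver running, as in the output above, suggests the replacement never completed, which fits the 28818 errors reported on the scan line.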
 

leang521

Dabbler
Joined
Mar 18, 2013
Messages
18
The problem is now resolved. Yesterday I backed up all the data, then ran the zpool scrub command again over the whole mirror, which brought the file system back to normal. After that I removed the hard disk that had dropped out.
In the end about 8GB of data was lost. Fortunately, that data was unimportant.
This incident shows that although ZFS is known for being very stable, it is not very easy to use, and human error can result in data loss. When replacing a hard disk, be sure to let ZFS finish the job; do not forcibly remove a hard drive, as that may bring the system down. Even if a system error occurs, try not to hard-reboot (I had no other choice, because the machine has no graphics card).
I had previously read many reports of ZFS crashing after a loss of power or a hard reboot, and was quite surprised by them. After this incident, it seems entirely possible. So my next step is to put the system behind a UPS to keep it stable.
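
The order of operations described above can be written down as a short checklist. This is a sketch only: replace_plan is a hypothetical helper that prints the commands instead of running them, the device names in the usage line are placeholders, and it assumes the replace action in the GUI corresponds to zpool replace on the command line.

```shell
#!/bin/sh
# Hypothetical helper: prints (does not run) the commands for swapping a
# failed mirror member, in the order they should be issued.
# Arguments: pool name, old device (or its numeric GUID), new device.
replace_plan() {
    pool=$1
    old=$2
    new=$3
    echo "zpool replace $pool $old $new"   # start the replacement
    echo "zpool status -v $pool"           # repeat until the resilver has finished
    echo "zpool scrub $pool"               # verify the whole mirror afterwards
    echo "zpool status -v $pool"           # check the scrub result and error list
}

# Example with placeholder device names:
#   replace_plan NAS1 13770178437660062256 gptid/NEW-DISK
```

The first zpool status is the step that was skipped here: it should be repeated until the scan line shows the resilver has finished before anything else is done to the pool. If the old device is still listed under replacing-0 after a clean scrub, zpool detach with the pool name and the old device's GUID is the command that removes it.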

Another strange thing is that my serial port only works at 9600 baud. I use a ZigBee adapter to make the serial link wireless. I do not know whether it is the module or the system that limits it to 9600, and it feels slow when there is a lot of output. For the time being it does not affect use, so I will sort it out later.
 