How worried should I be - One or more devices has experienced an unrecoverable error

Status
Not open for further replies.

Neil Whitworth

Dabbler
Joined
Nov 14, 2013
Messages
30
This morning I woke up to find that I had lost power to the computer room (a.k.a the loft) and everything was off. Since this was the first real test of my UPS I wanted to make sure all my data was safe and started a scrub, which almost imediatly reported some errors.

  • /var/log/messages seams to show the system shutting down cleanly during power outage.
  • S.M.A.R.T data appears OK to me
  • long tests started via smartctl -t long /dev/ada0..4 (~40% complete so far)
Have I missed anything? Should I be woried about the errors?


Code:
~# zpool status
  pool: tank
state: ONLINE
status: One or more devices has experienced an unrecoverable error.  An
        attempt was made to correct the error.  Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
        using 'zpool clear' or replace the device with 'zpool replace'.
   see: http://illumos.org/msg/ZFS-8000-9P
  scan: resilvered 112K in 0h0m with 0 errors on Tue Sep 22 09:10:35 2015
config:

        NAME                                            STATE     READ WRITE CKSUM
        tank                                            ONLINE       0     0     0
          raidz2-0                                      ONLINE       0     0     0
            gptid/8dfdd010-78d1-11e3-977c-002590d473f7  ONLINE       0     0     0
            gptid/8e5bdb5b-78d1-11e3-977c-002590d473f7  ONLINE       3     2     0
            gptid/8ebb0e81-78d1-11e3-977c-002590d473f7  ONLINE       0     0     0
            gptid/fe671ac7-7923-11e3-abca-002590d473f7  ONLINE       0     0     0

errors: No known data errors

Code:
Sep 21 07:04:48 monarth kernel: ugen1.3: <EATON> at usbus1 (disconnected)
Sep 21 07:04:50 monarth kernel: ugen1.3: <EATON> at usbus1
Sep 21 07:04:56 monarth root: Unknown USB device: vendor 0x0463 product 0xffff bus uhub2
Sep 21 07:04:56 monarth kernel: ugen1.3: <EATON> at usbus1 (disconnected)
Sep 21 07:04:56 monarth upsd[2737]: Data for UPS [ups] is stale - check driver
Sep 21 07:04:58 monarth upsmon[2745]: Poll UPS [ups] failed - Data stale
Sep 21 07:04:58 monarth upsmon[2745]: Communications with UPS ups lost
Sep 21 07:04:58 monarth kernel: ugen1.3: <EATON> at usbus1
Sep 21 07:05:03 monarth upsmon[2745]: Poll UPS [ups] failed - Data stale
Sep 21 07:05:05 monarth root: Unknown USB device: vendor 0x0463 product 0xffff bus uhub2
Sep 21 07:05:08 monarth upsmon[2745]: Poll UPS [ups] failed - Data stale
Sep 21 07:05:13 monarth upsmon[2745]: Poll UPS [ups] failed - Data stale
Sep 21 07:05:13 monarth root: Unknown USB device: vendor 0x0463 product 0xffff bus uhub2
Sep 21 07:05:16 monarth upsd[2737]: UPS [ups] data is no longer stale
Sep 21 07:05:18 monarth upsmon[2745]: Communications with UPS ups established
Sep 21 07:05:31 monarth kernel: ugen1.3: <EATON> at usbus1 (disconnected)
Sep 21 07:05:33 monarth kernel: ugen1.3: <EATON> at usbus1
Sep 21 07:05:40 monarth root: Unknown USB device: vendor 0x0463 product 0xffff bus uhub2
Sep 21 07:05:45 monarth upsd[2737]: Data for UPS [ups] is stale - check driver
Sep 21 07:05:48 monarth upsmon[2745]: Poll UPS [ups] failed - Data stale
Sep 21 07:05:48 monarth upsmon[2745]: Communications with UPS ups lost
Sep 21 07:05:48 monarth root: Unknown USB device: vendor 0x0463 product 0xffff bus uhub2
Sep 21 07:05:49 monarth upsd[2737]: UPS [ups] data is no longer stale
Sep 21 07:05:53 monarth upsmon[2745]: Communications with UPS ups established
Sep 21 07:05:53 monarth upsmon[2745]: UPS ups on battery
Sep 21 07:06:23 monarth upssched-cmd: issuing shutdown
Sep 21 07:06:23 monarth upsmon[2745]: Executing automatic power-fail shutdown
Sep 21 07:06:23 monarth upsmon[2745]: Auto logout and shutdown proceeding
Sep 21 07:06:53 monarth shutdown: power-down by root:
Sep 21 07:07:01 monarth kernel: .
Sep 21 07:07:01 monarth nmbd[3091]: [2015/09/21 07:07:01.435370,  0] nmbd/nmbd.c:66(terminate)
Sep 21 07:07:01 monarth kernel: .
Sep 21 07:07:01 monarth nmbd[3091]:  Got SIGTERM: going down...
Sep 21 07:07:01 monarth nmbd[3091]: [2015/09/21 07:07:01.448019,  0] libsmb/nmblib.c:856(send_udp)
Sep 21 07:07:01 monarth nmbd[3091]:  Packet send failed to 192.168.100.255(138) ERRNO=Network is down
Sep 21 07:07:01 monarth nmbd[3091]: [2015/09/21 07:07:01.507229,  0] libsmb/nmblib.c:856(send_udp)
Sep 21 07:07:01 monarth nmbd[3091]:  Packet send failed to 192.168.100.255(138) ERRNO=Network is down
Sep 21 07:07:01 monarth upsd[2737]: mainloop: Interrupted system call
Sep 21 07:07:01 monarth ntpd[2596]: ntpd exiting on signal 15
Sep 21 07:07:01 monarth kernel: .
Sep 21 07:07:10 monarth last message repeated 2 times
Sep 21 07:07:10 monarth syslogd: exiting on signal 15
Sep 22 09:07:14 monarth syslogd: kernel boot file is /boot/kernel/kernel
Sep 22 09:07:14 monarth kernel: Copyright (c) 1992-2013 The FreeBSD Project.
Sep 22 09:07:14 monarth kernel: Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
Sep 22 09:07:14 monarth kernel: The Regents of the University of California. All rights reserved.
Sep 22 09:07:14 monarth kernel: FreeBSD is a registered trademark of The FreeBSD Foundation.
Sep 22 09:07:14 monarth kernel: FreeBSD 9.2-RELEASE #0 r+2315ea3: Fri Dec 20 12:48:50 PST 2013
Sep 22 09:07:14 monarth kernel: root@build.ixsystems.com:/tank/home/jkh/checkout/freenas/os-base/amd64/tank/home/jkh/checkout/freenas/FreeBSD/src/sys/FREENAS.amd64 amd64
Sep 22 09:07:14 monarth kernel: gcc version 4.2.1 20070831 patched [FreeBSD]
Sep 22 09:07:14 monarth kernel: CPU: Intel(R) Xeon(R) CPU E3-1220 V2 @ 3.10GHz (3100.08-MHz K8-class CPU)

FreeNAS Version: FreeNAS-9.2.0-RELEASE-x64 (ab098f4)
Hardware: Supermicro 5017C-MTF ( X9SCL-F/E3-1220 V2 @ 3.10GHz/16 GB ECC RAM)
HDD: 4 x Seagate ST4000VN000 4TB (RaidZ2)
UPS : Eaton 5p 850i Rack1u 850va/600w
 

Attachments

  • messages.zip
    5.6 KB · Views: 303
  • smartctl.zip
    8.5 KB · Views: 271

Bidule0hm

Server Electronics Sorcerer
Joined
Aug 5, 2013
Messages
3,710
You should investigate the error but no need to worry about your data, it's safe ;)
 
Status
Not open for further replies.
Top