SMART service not starting/Smartctl issues/IBM M1015

Status
Not open for further replies.

randomrat

Cadet
Joined
Feb 28, 2014
Messages
5
OK, so I'm having issues other than my previous "Incorrect volume size" problem.
Treat this as a separate problem, though they might be related... I'll let an expert be the judge of that. Anyway...

This all came about when I was reading into WD Green's Load Cycle Count on many forums as two of my WD30EZRS drives in my old server recently started to spew SMART issues.
I did an advance RMA and got some refurb WD30EZRS drives back from them, which I then stuck into my FreeNAS box. The disk is totally unconfigured in the sense that it is not in any volume - it's just there waiting to be used...
I thought it would be a wise time to make sure SMART is working and fully configured on my FreeNAS box...

Every time I attempt to enable the SMART service, I get "The service could not be started". It keeps happening no matter how many reboots I do.

I've been racking my incompetent brain over this and reading up on why it might not be working. Everything below is what I've done whilst researching this problem.

First things first, I entered the shell to use smartctl
"smartctl -a /dev/mfisyspd0"
Code:
[root@ODYSSEY] ~# smartctl -a /dev/mfisyspd0
smartctl 6.2 2013-07-26 r3841 [FreeBSD 9.2-RELEASE-p3 amd64] (local build)
Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org
 
/dev/mfisyspd0: Unable to detect device type
Please specify device type with the -d option.
 
Use smartctl -h to get a usage summary

Perplexed by this, I used the -dauto option to see what would happen
Code:
[root@ODYSSEY] ~# smartctl -a -dauto /dev/mfisyspd0
smartctl 6.2 2013-07-26 r3841 [FreeBSD 9.2-RELEASE-p3 amd64] (local build)
Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org
 
/dev/mfisyspd0: Unable to detect device type
Please specify device type with the -d option.
 
Use smartctl -h to get a usage summary

...Ok, still nothing...I tried the -dsat flag next
Code:
[root@ODYSSEY] ~# smartctl -a -dsat /dev/mfisyspd0
smartctl 6.2 2013-07-26 r3841 [FreeBSD 9.2-RELEASE-p3 amd64] (local build)
Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org
 
Smartctl open device: /dev/mfisyspd0 [SAT] failed: No such file or directory

...*sigh*...Then tried "smartctl --scan"
Code:
[root@ODYSSEY] ~# smartctl --scan
[root@ODYSSEY] ~#

No output whatsoever?..There has to be something wrong.
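
For anyone who wants to script this kind of check, here is a rough Python sketch that sorts smartctl output into the failure modes seen above. The function name and the labels it returns are made up for illustration; only the message strings come from the actual runs.

```python
def classify_smartctl_output(text: str) -> str:
    """Return a coarse label for a smartctl run, based on its printed output."""
    if "Unable to detect device type" in text:
        return "unknown-device-type"
    if "open device" in text and "failed" in text:
        return "open-failed"
    if "START OF INFORMATION SECTION" in text:
        return "ok"
    return "unrecognised"


# Sample lines copied from the runs above.
auto_run = "/dev/mfisyspd0: Unable to detect device type"
sat_run = "Smartctl open device: /dev/mfisyspd0 [SAT] failed: No such file or directory"

print(classify_smartctl_output(auto_run))
print(classify_smartctl_output(sat_run))
```

Both labels point the same way: smartctl is not getting a usable path to the disk through this device node.
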
After looking through x number of forums, opening the smartd.conf file seemed like something of value.
Code:
[root@ODYSSEY] ~# cat /usr/local/etc/smartd.conf
################################################
# smartd.conf generated by /etc/rc.d/ix-smartd
################################################
[root@ODYSSEY] ~#

...or not? In case I haven't mentioned it already: I created some SMART tests in System -> SMART Tests, saved the settings and checked again. Still nada.
Here's "gpart show" if it's of any help
Code:
[root@ODYSSEY] ~# gpart show
=>        34  7814037101  mfisyspd0  GPT  (3.7T)
          34          94            - free -  (47k)
        128    4194304          1  freebsd-swap  (2.0G)
    4194432  7809842703          2  freebsd-zfs  (3.7T)
 
=>        34  7814037101  mfisyspd1  GPT  (3.7T)
          34          94            - free -  (47k)
        128    4194304          1  freebsd-swap  (2.0G)
    4194432  7809842703          2  freebsd-zfs  (3.7T)
 
=>        34  7814037101  mfisyspd2  GPT  (3.7T)
          34          94            - free -  (47k)
        128    4194304          1  freebsd-swap  (2.0G)
    4194432  7809842703          2  freebsd-zfs  (3.7T)
 
=>        34  7814037101  mfisyspd3  GPT  (3.7T)
          34          94            - free -  (47k)
        128    4194304          1  freebsd-swap  (2.0G)
    4194432  7809842703          2  freebsd-zfs  (3.7T)
 
=>      63  15359937  da0  MBR  (7.3G)
        63  1930257    1  freebsd  (942M)
  1930320        63      - free -  (31k)
  1930383  1930257    2  freebsd  [active]  (942M)
  3860640      3024    3  freebsd  (1.5M)
  3863664    41328    4  freebsd  (20M)
  3904992  11455008      - free -  (5.5G)
 
=>      0  1930257  da0s1  BSD  (942M)
        0      16        - free -  (8.0k)
      16  1930241      1  !0  (942M)
 
=>      0  1930257  da0s2  BSD  (942M)
        0      16        - free -  (8.0k)
      16  1930241      1  !0  (942M)


and "gpart status"
Code:
[root@ODYSSEY] ~# gpart status
      Name  Status  Components
mfisyspd0p1      OK  mfisyspd0
mfisyspd0p2      OK  mfisyspd0
mfisyspd1p1      OK  mfisyspd1
mfisyspd1p2      OK  mfisyspd1
mfisyspd2p1      OK  mfisyspd2
mfisyspd2p2      OK  mfisyspd2
mfisyspd3p1      OK  mfisyspd3
mfisyspd3p2      OK  mfisyspd3
      da0s1      OK  da0
      da0s2      OK  da0
      da0s3      OK  da0
      da0s4      OK  da0
    da0s1a      OK  da0s1
    da0s2a      OK  da0s2

By now, I was totally lost, tired and had begun to contemplate the meaning of life itself. Had a coffee break and started again.
I decided to move my sights onto the IBM M1015 RAID card thinking it's doing something abnormal
"mfiutil show adapter"
Code:
[root@ODYSSEY] ~# mfiutil show adapter
mfi0 Adapter:
    Product Name: ServeRAID M1015 SAS/SATA Controller
  Serial Number: SP12529397
        Firmware: 20.10.1-0045
    RAID Levels: JBOD, RAID0, RAID1, RAID10
  Battery Backup: not present
          NVRAM: 32K
  Onboard Memory: 0M
  Minimum Stripe: 8k
  Maximum Stripe: 64k


"mfiutil show firmware"
Code:
[root@ODYSSEY] ~# mfiutil show firmware
mfi0 Firmware Package Version: 20.10.1-0045
mfi0 Firmware Images:
Name  Version                        Date         Time         Status
BIOS  4.25.00_4.11.05.00_0x0501A000  6/21/2011    6/21/2011    active
PCLI  03.02-015:#%00008              Oct 12 2010  14:13:05     active
BCON  4.0-49-e_37-Rel                May 24 2011  14:28:34     active
NVDT  3.09.03-0022                   May 24 2011  07:07:57     active
APP   2.120.104-1284                 May 24 2011  13:37:35     active
BTBL  2.02.00.00-0001                Aug 18 2010  11:44:44     active


"mfiutil show drives"
Code:
[root@ODYSSEY] ~# mfiutil show drives
mfi0 Physical Drives:
17 (  0.0) JBOD <WDC WD40EFRX-68W 0A80 serial=WD-WCCremovedthisbit> SATA E1:S3
18 (  0.0) JBOD <WDC WD40EFRX-68W 0A80 serial=WD-WCCremovedthisbit> SATA E1:S2
19 (  0.0) JBOD <WDC WD40EFRX-68W 0A80 serial=WD-WCCremovedthisbit> SATA E1:S1
20 (  0.0) JBOD <WDC WD40EFRX-68W 0A80 serial=WD-WCCremovedthisbit> SATA E1:S0
21 (  0.0) JBOD <WDC WD30EZRX-00S 0A80 serial=WD-WCCremovedthisbit> SATA E1:S4
22 (  0.0) JBOD <WDC WD30EZRS-00J 0A80 serial=WD-WCAremovedthisbit> SATA E1:S5


After way more googling around, I actually ran into something interesting. I stopped moping...

http://forums.freenas.org/index.php...y-showing-as-2-2tb-on-intel-m1015-card.16622/

Cyberjock replied somewhere down in that thread "M1015s aren't mfi devices. It's an mps device".
...Erm...all my devices are showing as mfisyspd0, mfisyspd1, mfisyspd2 etc. and I'm darn confident I have IBM M1015s too...

Here is a chunk of my dmesg.
Code:
mfi0: <Drake Skinny> port 0xd800-0xd8ff mem 0xf8d9c000-0xf8d9ffff,0xf8dc0000-0xf8dfffff i132
mfi0: Using MSI
mfi0: Megaraid SAS driver Ver 4.23
pcib12: <ACPI PCI-PCI bridge> at device 7.0 on pci128
pci133: <ACPI PCI bus> on pcib12
mfi1: <Drake Skinny> port 0xe800-0xe8ff mem 0xf8e9c000-0xf8e9ffff,0xf8ec0000-0xf8efffff i133
mfi1: Using MSI
mfi1: Megaraid SAS driver Ver 4.23
pcib13: <ACPI PCI-PCI bridge> at device 9.0 on pci128
pci134: <ACPI PCI bus> on pcib13
mfi2: <Drake Skinny> port 0xf800-0xf8ff mem 0xf8f9c000-0xf8f9ffff,0xf8fc0000-0xf8ffffff i134
mfi2: Using MSI
mfi2: Megaraid SAS driver Ver 4.23

b***h what?! o_O Why is it talking about Megaraid?!
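
A quick way to see which driver claimed the cards is to pull the attach lines out of dmesg. This is only a sketch in Python: the regular expression assumes an mps card would announce itself in the same "name0: <description>" style as the mfi lines above, which is not shown anywhere in this thread, and the function name is made up.

```python
import re

# Matches attach lines such as "mfi0: <Drake Skinny> port ...".
# Assumption: an mps-attached card prints a line of the same shape.
ATTACH_RE = re.compile(r"^(mfi|mps)(\d+): <([^>]*)>", re.MULTILINE)


def controllers_by_driver(dmesg_text: str) -> dict:
    """Group controller attach lines by the driver that claimed them."""
    found = {}
    for driver, unit, description in ATTACH_RE.findall(dmesg_text):
        found.setdefault(driver, []).append((int(unit), description))
    return found


# Lines copied from the dmesg chunk above.
sample = """\
mfi0: <Drake Skinny> port 0xd800-0xd8ff mem 0xf8d9c000-0xf8d9ffff,0xf8dc0000-0xf8dfffff i132
mfi0: Using MSI
mfi0: Megaraid SAS driver Ver 4.23
mfi1: <Drake Skinny> port 0xe800-0xe8ff mem 0xf8e9c000-0xf8e9ffff,0xf8ec0000-0xf8efffff i133
mfi2: <Drake Skinny> port 0xf800-0xf8ff mem 0xf8f9c000-0xf8f9ffff,0xf8fc0000-0xf8ffffff i134
"""
result = controllers_by_driver(sample)
print(result)
```

With the lines above, all three cards land under "mfi" and nothing lands under "mps".
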

The guy in that forum who had the problems did a re-flash of his cards and got them working by using firmware from servethehome. Thing is, I got my firmware from servethehome too?

Can somebody confirm that this is actually the problem and tell me whether it needs fixing with another reflash?
And if so, may I have some spoonfeeding to ensure my flashes go OK?

Then I found some even more interesting news
http://lists.freebsd.org/pipermail/freebsd-current/2012-September/036483.html
In a nutshell some guy said..
Code:
>> However, I could not get smartctl to see an mfisyspd volume (it claimed
>> there was no such file...?) and so I flashed the controller back to mps
>> for now. A shame, because I really like the mfi driver better, and
>> mfiutil worked great (even to flash firmware updates).


So as it turns out, smartctl can't get at SMART data through the mfisyspd devices that the mfi driver creates...

Thanks to all those who took the time to read this.
It might have looked like a total waste of a post, since I ended up finding the potential solution myself. Still, it's a good feeling for a noob like me to have finally found something of value by putting the time and effort into reading around.
 

randomrat

Cadet
Joined
Feb 28, 2014
Messages
5
Update:
When I initially did my FreeNAS build I used the following link and sas2008.zip download to cross flash my M1015
http://www.servethehome.com/ibm-serveraid-m1015-part-4/

I'm thinking I should now download the P14 firmware from the LSI website and "downgrade" my M1015s?
How can I ensure that my cards use mps instead of mfi?
 

cyberjock

Inactive Account
Joined
Mar 25, 2012
Messages
19,526
Yeah.. your stuff is broke. Not broken as in need to be replaced, but you need to flash the IT firmware. It's also important to flash your card with the same phase version firmware as the driver. See the release notes for 9.2.1 to determine what the correct version is for the latest versions.
 

randomrat

Cadet
Joined
Feb 28, 2014
Messages
5
You were right. My stuff was broke, but not broken as in it needed to be replaced.
In a nutshell flashing the firmware cured so many issues.

For the purposes of "completeness" here are the steps I took to solve my problem...Please double check these if you are going to follow them and seek other approval.

Download FreeNAS OS/Config/M1015 FW
1) Start with a fresh download of FreeNAS 9.2.1.2 x64 RELEASE from the website and burn it to CD, or write it to a USB stick with win32diskimager.
2) Download the config file: System -> Settings -> General -> Save Config (save to another computer).
3) Look through the release notes of the FreeNAS version you are downloading and check what the mps driver version is. In this case it's 16.
This means the phase version firmware of the M1015s should be P16.
4) Download mobilenvidia's SAS2008.zip from here: http://forums.laptopvideo2go.com/topic/29059-sas2008-lsi92409211-firmware-files/
Extract the folder to the desktop and name it "sas2008".
5) Head to the LSI website -> Support -> Downloads
Component type: Storage, Product family: Host Bus Adapters, Product: LSI SAS 9211-8i, Asset type: All Asset types -> Search
Expand the firmware tab and select Archive.
Download "9211_8i_Package_P16_IR_IT_Firmware_BIOS_for_MSDOS_Windows".
Extract the folder to the desktop.
6) Within the extracted "9211_8i......Windows" folder, find and copy the following files:
"2118it.bin" from /Firmware/HBA_9211_8i_IT
"sas2flsh.exe" from /sas2flash_dos_rel
"mptsas2.rom" and "mptbios.txt" from /sasbios_rel
7) Paste these files into the extracted sas2008 folder, replacing the existing ones.
8) Download rufus and create an MS-DOS memory stick. Uncheck the "Create extended label and icon files" checkbox.
9) Copy the modified sas2008 folder onto the memory stick.
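
The rule about matching the firmware phase to the mps driver version can be written down as a tiny check. This is a sketch only: it assumes both versions are dotted strings whose leading number is the phase (something like "16.00.00.00"), a format that is not shown anywhere in this thread, and the function names are made up.

```python
def phase_of(version: str) -> int:
    """Take the leading number of a dotted version string, e.g. "16.00.00.00" -> 16."""
    return int(version.split(".")[0])


def firmware_matches_driver(firmware_version: str, driver_version: str) -> bool:
    """True when firmware and driver share the same phase number."""
    return phase_of(firmware_version) == phase_of(driver_version)


# Hypothetical version strings for illustration; read the real ones from your own system.
print(firmware_matches_driver("16.00.00.00", "16.00.00.00-fbsd"))
print(firmware_matches_driver("14.00.00.00", "16.00.00.00-fbsd"))
```

The second call is the P14 firmware against a phase 16 driver, which is the mismatch to avoid.
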

Upgrading the M1015
If you are upgrading multiple cards, remove all but one. I did each card separately.
1) Shut down the box if it is running.
2) Unplug any devices attached to the M1015s.
3) Connect the USB flash drive to the server.
4) Press F11 and boot from the flash drive.
5) Type "cd sas2008" and hit enter.
6) Type "megarec -writesbr 0 sbrempty.bin" and hit enter.
7) Type "megarec -cleanflash 0" and hit enter.
8) Reboot the system.
9) Press F11 and boot from the flash drive.
10) Type "sas2flash.efi -o -f 2118it.bin"
11) Type "sas2flash.efi -o -sasadd 500605Bxxxxxxxxx" (x = the digits of your card's SAS address)
12) Type "sas2flash.efi -listall" to see if your card was flashed properly to the new version.
13) Type "exit" and shut down.
14) Remove the USB flash drive.
15) Repeat for any other cards.
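
Since a typo in the -sasadd step is easy to make, here is a small Python sketch that checks an address before you type it in. It assumes the address is 16 hex digits starting with 500605B, as in the step above, and it strips spaces, colons and hyphens in case you copied the address with separators. The function names are made up and the address in the example is invented.

```python
import re

# 16 hex digits, starting with the 500605B prefix shown in the -sasadd step.
SAS_ADDRESS_RE = re.compile(r"^500605B[0-9A-F]{9}$", re.IGNORECASE)


def normalise_sas_address(raw: str) -> str:
    """Strip separators, upper-case the address, and reject anything malformed."""
    cleaned = re.sub(r"[\s:\-]", "", raw).upper()
    if not SAS_ADDRESS_RE.match(cleaned):
        raise ValueError(f"not a 16-digit 500605B... SAS address: {raw!r}")
    return cleaned


def sasadd_command(raw: str) -> str:
    """Build the -sasadd command line exactly as it is typed in the steps above."""
    return f"sas2flash.efi -o -sasadd {normalise_sas_address(raw)}"


# Made-up address for illustration only.
print(sasadd_command("500605b-0-1234-5678"))
```
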

Reinstall FreeNAS from CD or USB and re-upload your config file to the system.



Code:
[root@ODYSSEY] ~# smartctl -a /dev/da0
smartctl 6.2 2013-07-26 r3841 [FreeBSD 9.2-RELEASE-p3 amd64] (local build)
Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org
 
=== START OF INFORMATION SECTION ===
Model Family:    Western Digital Caviar Green (AF)
Device Model:    WDC WD30EZRS-00J99B0
Serial Number:    WD-WCAWxxxxxxxx
LU WWN Device Id: 5 0014ee 25b8d78fe
Firmware Version: 80.00A80
User Capacity:    3,000,592,982,016 bytes [3.00 TB]
Sector Sizes:    512 bytes logical, 4096 bytes physical
Device is:        In smartctl database [for details use: -P show]
ATA Version is:  ATA8-ACS (minor revision not indicated)
SATA Version is:  SATA 2.6, 3.0 Gb/s
Local Time is:    Thu Mar  6 21:07:34 2014 GMT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
 
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
 
General SMART Values:
Offline data collection status:  (0x80) Offline data collection activity
                                        was never started.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (  0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (51900) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (  2) minutes.
Extended self-test routine
recommended polling time:        ( 498) minutes.
Conveyance self-test routine
recommended polling time:        (  5) minutes.
SCT capabilities:              (0x3035) SCT Status supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.
 
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG    VALUE WORST THRESH TYPE      UPDATED  FAILED RAW_VALUE
  1 Raw_Read_Error_Rate    0x002f  100  253  051    Pre-fail  Always          0
  3 Spin_Up_Time            0x0027  153  150  021    Pre-fail  Always          9308
  4 Start_Stop_Count        0x0032  100  100  000    Old_age  Always          17
  5 Reallocated_Sector_Ct  0x0033  200  200  140    Pre-fail  Always          0
  7 Seek_Error_Rate        0x002e  100  253  000    Old_age  Always          0
  9 Power_On_Hours          0x0032  100  100  000    Old_age  Always          35
10 Spin_Retry_Count        0x0032  100  253  000    Old_age  Always          0
11 Calibration_Retry_Count 0x0032  100  253  000    Old_age  Always          0
12 Power_Cycle_Count      0x0032  100  100  000    Old_age  Always          17
192 Power-Off_Retract_Count 0x0032  200  200  000    Old_age  Always          16
193 Load_Cycle_Count        0x0032  200  200  000    Old_age  Always          52
194 Temperature_Celsius    0x0022  125  108  000    Old_age  Always          27
196 Reallocated_Event_Count 0x0032  200  200  000    Old_age  Always          0
197 Current_Pending_Sector  0x0032  200  200  000    Old_age  Always          0
198 Offline_Uncorrectable  0x0030  100  253  000    Old_age  Offline        0
199 UDMA_CRC_Error_Count    0x0032  200  200  000    Old_age  Always          0
200 Multi_Zone_Error_Rate  0x0008  100  253  000    Old_age  Offline        0
 
SMART Error Log Version: 1
No Errors Logged
 
SMART Self-test log structure revision number 1
No self-tests have been logged.  [To run self-tests, use: smartctl -t]
 
 
SMART Selective self-test log data structure revision number 1
SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
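
Since the Load Cycle Count is what started all this, here is a rough Python sketch that pulls the attribute table out of a "smartctl -a" report so a value like that can be watched over time. It assumes the raw value is a plain integer in the last column, as in the output above; some drives print extra text there and would need more careful parsing. The function name is made up.

```python
def parse_smart_attributes(report: str) -> dict:
    """Map attribute name -> (normalised value, raw value) from a smartctl -a report."""
    attributes = {}
    in_table = False
    for line in report.splitlines():
        fields = line.split()
        if fields[:2] == ["ID#", "ATTRIBUTE_NAME"]:
            in_table = True
            continue
        if in_table:
            # The table ends at the first line that does not start with a numeric ID.
            if not fields or not fields[0].isdigit():
                break
            # Assumes a plain integer in the last column, as in the output above.
            attributes[fields[1]] = (int(fields[3]), int(fields[-1]))
    return attributes


# Rows copied from the output above.
sample = """\
ID# ATTRIBUTE_NAME          FLAG    VALUE WORST THRESH TYPE      UPDATED  FAILED RAW_VALUE
  9 Power_On_Hours          0x0032  100  100  000    Old_age  Always          35
193 Load_Cycle_Count        0x0032  200  200  000    Old_age  Always          52
194 Temperature_Celsius    0x0022  125  108  000    Old_age  Always          27

SMART Error Log Version: 1
"""
attrs = parse_smart_attributes(sample)
print(attrs["Load_Cycle_Count"])
```
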

Done :)
 

enjoywithme

Dabbler
Joined
Dec 23, 2014
Messages
13
I have the same problem with my FreeNAS 9.10 box, built on a Dell R730 with an H730 RAID card. I configured the RAID card's mode as non-RAID. The SMART service cannot be started, and "smartctl -a /dev/mfisyspd1" reports "Unable to detect device type".
Does this card not support SMART info passthrough?
 

maglin

Patron
Joined
Jun 20, 2015
Messages
299
Sounds like your card doesn't support Passthrough. Get something else or see if you can flash it to IT mode.


 

enjoywithme

Dabbler
Joined
Dec 23, 2014
Messages
13
I checked the Dell documentation. It says "Value/performance RAID and non-RAID (pass-through) solution for high-density servers (1U or 2U) with the flexibility to use both SATA and SAS HDDs, SSDs, and pass-through drive configurations". So Dell's "non-RAID" mode is what it calls pass-through, and it may not be true pass-through.

I found this article, which explains that the H730 does not really support pass-through in FreeBSD.
http://serverfault.com/questions/723676/zfs-on-enterprise-raid-pass-through-and-zfs-on-freebsd-root
 

Mirfster

Doesn't know what he's talking about
Joined
Oct 2, 2015
Messages
3,215
As far as I know, the H700 family (H700, H710, and H730) does not work and cannot be cross-flashed to a true HBA/IT mode. The same is true for the H800s. Get an LSI 9211-8i, IBM M1015, PERC H200, PERC H310, etc. that can actually be cross-flashed or set to IT mode.
 