TrueNAS Unrecoverable Error...But Really?

I’m running TrueNAS Core 13.0-U6.2. I have a pool that consists of two 2TB M.2 SSDs. I received a Critical Alert:

“One or more devices has experienced an unrecoverable error. An attempt was made to correct the error. Applications are unaffected.”

I pulled the drive and connected it to another machine, and there seems to be nothing wrong with it. The SMART test passes and there are no bad sectors. Both M.2 drives are about two months old. They are connected via cheap PCIe adapter cards I picked up on Amazon, so maybe it has something to do with that?

Questions:

If there is something wrong with the drive, how do I find out what it is?

If, as I suspect, there is nothing wrong with the drive, how do I add it back to the pool?

How do I prevent this from happening in the future?

I’ve had that happen with a spinning drive. I put it back in, onlined it again, and it ran for years until I swapped out all the drives. Sometimes a drive gets flagged in error, but the system is usually pretty good at finding a real issue and flagging it for your attention.

If it’s under warranty, see if you can exchange it. If not, it might be wise to get a replacement ready. The other times I’ve had this alert, it was 100% correct: after I moved the data off to safeguard it, the questionable drive died completely.

So, it says the pool is unhealthy, but when I click the status, both drives are online with no errors.

It’s probably safe to dismiss that alert and see if it comes back. I’d back up the data elsewhere if possible. If it’s a boot mirror, save your configuration so you can do a clean install and import the config to get back up and running.
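
If you want to watch whether the errors actually come back after dismissing the alert, the per-device READ/WRITE/CKSUM counters in `zpool status` are the thing to look at. Below is a minimal Python sketch that picks out any row with a nonzero counter. The pool name `tank`, the device names, and the sample output are made up for illustration, and the parsing assumes the usual `NAME STATE READ WRITE CKSUM` table layout, so treat it as a starting point rather than a finished tool.

```python
import subprocess

# Hypothetical sample of `zpool status` output; pool and device names are invented.
SAMPLE = """\
  pool: tank
 state: ONLINE
config:

\tNAME        STATE     READ WRITE CKSUM
\ttank        ONLINE       0     0     0
\t  mirror-0  ONLINE       0     0     0
\t    nvd0p2  ONLINE       0     0     3
\t    nvd1p2  ONLINE       0     0     0

errors: No known data errors
"""


def devices_with_errors(status_text):
    """Return {name: (read, write, cksum)} for table rows with a nonzero counter."""
    flagged = {}
    in_table = False
    for line in status_text.splitlines():
        fields = line.split()
        if fields[:5] == ["NAME", "STATE", "READ", "WRITE", "CKSUM"]:
            in_table = True  # the device table starts after this header row
            continue
        if not in_table:
            continue
        if not fields:
            break  # assumption: the first blank line after the header ends the table
        if len(fields) < 5:
            continue  # section labels such as "logs" or "cache" carry no counters
        name, _state, read, write, cksum = fields[:5]
        # Counters are compared as strings, so abbreviated values still count as nonzero.
        if (read, write, cksum) != ("0", "0", "0"):
            flagged[name] = (read, write, cksum)
    return flagged


def read_pool_status(pool):
    """Run `zpool status <pool>` and return its text (needs the zpool CLI on the host)."""
    result = subprocess.run(
        ["zpool", "status", pool], capture_output=True, text=True, check=True
    )
    return result.stdout
```

On the NAS itself you would call `devices_with_errors(read_pool_status("tank"))` with your own pool name. If the counters stay at zero over the next few scrubs, a flaky adapter card or a one-off glitch becomes more plausible than a failing drive; if they keep climbing on the same device, that points back at the drive or its slot.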
