Replaced disk and get "unsuitable for rebuilding RAID group"

Device: TS-853A
Firmware: QTS 5.2.1.2930

Recently my disk #1 showed up with error and was recommended to be replaced. I put in the new disk, it shows up with a green checkmark, yet there’s a “(!)” icon which shows:

The installed disk is unsuitable for rebuilding RAID group 1. Install another disk to rebuild the RAID group

Before that, my disk #1 was of this type:

1.82 TB - WD20EFRX-68EUZN0

So, yes, until before I replaced my old disk I was running:

  • #1 : 1.82 TB - WD20EFRX-68EUZN0
  • #2 : 5.46 TB - WD6003FFBX-68MU3N0

Also: before I had the #2 5.46TB disk I too had there a 1.82TB disk, which however already had to be replaced years ago. Knowing that I could not use the additional disk space yet, I however started to replace with bigger drivers eventually assuming all need to be replaced, ending up with all drives at 5.46TB at some point.
Which now happened for RAID 1, but the initial setup was for 1.82TB:

In the end I face two problems:

  1. my latest disk is not accepted at all
  2. eventually I want to resize to 5.46TB

My first goal is to fix 1): get the new disk accepted so that my RAID is not degraded and my data safe.

I found Why is my disk labeled as unsuitable for rebuilding the RAID group? | QNAP but I cannot rebuild the raid group:

I also cannot expand it:

What do I need to do to solve this?

thanks!

Have you tried extracting this disk, formatting it from another machine, and integrating it (without switching off the NAS) into the nas?

Have you tried extracting

You mean taking out of the system etc.?

No, not really, because I just received it fresh and put it into the system.

I just pulled out my old, put in the new. I’m also not sure I’ve a way for putting it somewhere else. And format with what FS exactly?

It feels like this does not really address anything I’m witnessing.

I might mention I didn’t do anything different then all the previous disks I already replaced.

Probably best raise a ticket with QNAP Support.

Backup your raid1 data if you haven’t and open a ticket with support.

On Qnaps, the only raid1’s I use are all on M.2’s for ‘system’ pool and on hero systems for ZILwriteLog, but numerous times on raid6’s and a couple times on raid5’s I’ve done the ‘occasionally drop in larger disks as budget allows until last member disk has been replaced with the larger size and and when that last rebuild completes the raid capacity enlarges’. Based on that experience, I would have expected when you dropped that brand new 6TB into HDD1 slot it should immediately begin to rebuild, and after the hours that takes when the rebuild completes the capacity expansion should automatically begin and eventually let you know your raid1 capacity has gone 1.8TB–>5.4TB.

Since the raid1 was perfectly happy with exact same model-size working in HDD2 slot, a brand new one of those in HDD1 slot should be perfectly ‘suitable for rebuilding RAID group 1.’ …unless new disk came doa… It says ‘Ready’ in the snippet you provide, but maybe look at disk details and SMART info to verify no HDD hardware issues rendering it ‘unsuitable’

That same snippet shows HDD1 as ‘Data’ (vs. ‘Free’) and the Stor Pool 1 Mgmt display shows HDD1 as ‘Not Member’. Those seem to imply your new HDD1 disk is recognized by QTS as a 'Data" disk, but just not part of this storage pool… I believe that’s why one respondent suggested moving new HDD to different machine to format and then trying again.

With your firmware level I believe QTS altered behavior when disks are inserted or first used where you used to get a pop-up warning ‘all data will be destroyed. OK?’ it now skips that pop-up and gives just ‘info bubble’ “not suitable…” if QTS thinks that drive might already be part of a different QTS pool, leaving you to have to take further action on your own to be able to use it. If you have a brand new drive never inserted in a qnap slot before, that shouldn’t be an issue, hence my suggestion to open a support ticket.

If you have data backed up and can’t wait on support, don’t have a different machine to format hdd1 drive, something to consider - You have a TS-853A…

If you have an empty bay in #3-8, move disk from HDD1 slot to empty bay, then StorMan->Disks selecting that bay disk should show as ‘Free’ and on ‘Action’ pull-down do ‘securely erase’. Once erase completes, move disk back to HDD1 slot, and hopefully you see ‘start rebuilding raid 1’ in event log.

If you don’t have an empty bay, possibly one of the pool(s) in bays#3-8 could be safely detached and removed to free a bay to work with. While I safely detach and scan to recover pools pretty routinely with no issues, I did just have previously unseen difficulty one of the eight times I did it upgrading nas’s to 5.2.1. In fact first I ever saw ‘inelligible disk’ message. Described in old forum post:

QTS/hero 5.2.1 upgrades on a handful of models, no issue during/after upgrade. On a EC-1280U-RP, upgrade done as on the others: (1) Reboot (2) Safely detach all data volume pools and remove HDD’s (3) Manual firmware download and installation via gui.

As with the other NAS’s, upgrade and subsequent reboot went fine. When things looked ‘OK’, replugged data pool HDDs, but when I tried “Attach and recover” apparently it did not recognize my safely detached pool and recovered nothing. Rebooted and tried again; same result. Re-installed firmware, this time thru Qfinder, again successful, but still can’t find my ‘safely detached pool’ with ‘scan and recover’. Put in some new HDD’s and tried to create a new pool and get ‘pool creation failed’. Since my ‘(System)’ volume has only apps to re-install and logs to be lost upon system re-initialization, I re-initialized from scratch at same 5.2.1 level. Afterwards I could create a new pool with the new HDD’s, but I still couldn’t successfully ‘scan and recover’ my ‘safely detached’ data volume pool.

I suspect at that point while 5.2.1 could not successfully ‘scan and recover’ my old data volume pool, I think it recognized that there was possibly valid data on the data pool hdd’s because when I tried to use these HDD’s to create a new pool (understanding I would be destroying content on the ‘safely detached’ data vol pool hdd’s) 5.2.1 said these disks were ineligible for pool creation (even though disks showed as 'Free"). This all leads me to believe that QNAP support could probably assist in making 'Scan and Recover" work or otherwise successfully recover access to that pool safely detached prior to initial 5.2.1 upgrade.

Weighing prior support response times when they need to login remote to fix (>2 weeks to months) vs. restore times (7-10 days ~75TB’s) decided data restore would have additional advantage of exercising disaster recovery plan not tested for past few years (though ‘free’ in gui, had to securely erase data volume pool hdd’s to allow recreation of data vol pool).

Thank you for your in-depth insights, very appreciated and great community.

In fact, I also have to praise the very impressive support. Upfront: my problem isn’t fixed yet.

But I did open a support ticket and they were fast, friendly and very knowledgeable.

The TL;DR is, as hinted by @theunk7 , everything is supposed to just work here, but ultimately doesn’t.

In short, the support did:

  • get remote access
  • they analyzed the disk, reported back there were no problems and then it started to be accepted by the pool without issues (I never understood this part “why” and couldn’t figure it out from the support discussion)
  • until I f…ked things up myself, because only recognizing the disk was fixed, but the extended use of storage wasn’t, I started the “replace 1 by 1” approach
  • for this I removed the 1st disk and simply inserted it
  • resulting in the same initial situation: now the disk simply was not accepted anymore
  • I contacted support again and they took a deeper look, but ultimately concluded “it doesn’t work, but should”
  • I should remove the disk, format it from scratch, insert it again

I don’t have any external drive for that so I first have to order one to do this :sweat_smile:

In the meantime I left the raid in this state, disk 1 inserted but not properly integrated, and then came the recent QNAP update, which I did. But it hung shutting down the system :scream: I forced the shutdown with the physical button, but then it hung booting. Forced shutdown again, removed disk 1, finally it booted. I left the disk 1 removed for now.

TL;DR 2: waiting for external case to re-format and re-insert disk 1 now.

i’m not sure why are you reformating a drive to put it back in. just remove the partition so its a clean drive, then place the drive into the nas and let the NAS reformat/initialize the drive.

Yeah I think I was suggested a quick format and to get rid of the partitions.

I still have to wait for the external case anyway.

Thx

Per suggestion of the support, and I finally received my external case, I deleted all existing 5 partitions, did a quick NTFS reformat with a single partition and put the disk back into the NAS → immediately recognized and started rebuilding :tada:

Once that is done, I’m still facing the challenge to teach the pool with a configuration of 1.81TB that all disks are 5.46TB now and it should use that space, but let’s wait until the rebuild is finished tomorrow.

Rebuild finished successfully!

Now I checked the storage dialogs and then within the “replace disk one by one” dialog I in fact found out that, since it detected that the storage pool < smallest disk in the raid and already suggested to me I could expand:

Running this now, let’s wait another day :crossed_fingers: :slight_smile:

All solved now, after expanding the capacity I also had to resize the volume and then I finally had all the new storage space available.

Victory at last \o/

Thank you all for your help :bowing_man: