All times are UTC - 5 hours [ DST ]




 Post subject: Re: Increasing "Number of Reported Uncorrectable Errors"
PostPosted: January 10th, 2022, 2:34 
Joined: August 23rd, 2017, 10:02
Posts: 15
Location: Asia
I know this is a fairly old thread, but I'm posting here because my issue is almost the same. I have multiple Seagate BarraCuda 3.5 ST4000DM004 drives, and one of them shows nearly this exact issue, except that this HDD is 2 years old and installed in a regular general-purpose desktop that is shut down maybe once every few weeks or months. A few weeks ago I ran into a problem where this HDD would disappear from Windows, and the only solution was to restart the PC after reconnecting the SATA and power cables (just to be sure). I assumed it was a SATA cable issue, so I changed the SATA cable on the 5th, but 2 days later I found error 153 filling Event Viewer every few seconds to a minute. The UltraDMA CRC Error Count raw value had also jumped from 0 to around 36000.

I changed the SATA cable again, and both the Event Viewer error 153 and the UltraDMA CRC Error Count increase stopped, but now I am seeing the Reported Uncorrectable Errors value increase by 1 per day (it was 1 on the 8th and is 3 now; it was 0 in November last year and became 1, I think, around the time the drive first disappeared). A point to note: while I was using the faulty SATA cable for 2 days (from the 5th till the 8th), there was no increase in the Reported Uncorrectable Errors value.

5th Jan report (just after installing the faulty SATA cable):
Code:
ID Cur Wor Thr RawValues(6) Attribute Name
01 _72 _49 __6 00000CBAE5E7 Read Error Rate
03 _96 _96 __0 000000000000 Spin-Up Time
04 100 100 _20 000000000034 Start/Stop Count
05 _99 _99 _10 000000000000 Reallocated Sectors Count
07 _87 _60 _45 00001AD26343 Seek Error Rate
09 _78 _78 __0 6F1900004D2B Power-On Hours
0A 100 100 _97 000000000000 Spin Retry Count
0C 100 100 _20 000000000033 Power Cycle Count
B7 100 100 __0 000000000000 Vendor Specific
B8 100 100 _99 000000000000 End-to-End Error
BB _99 _99 __0 000000000001 Reported Uncorrectable Errors
BC 100 _99 __0 000000000001 Command Timeout
BD 100 100 __0 000000000000 High Fly Writes
BE _74 _50 _40 00001A14001A Airflow Temperature
BF 100 100 __0 000000000000 G-Sense Error Rate
C0 100 100 __0 00000000014B Power-off Retract Count
C1 100 100 __0 00000000040E Load/Unload Cycle Count
C2 _26 _50 __0 00110000001A Temperature
C3 _83 _64 __0 00000CBAE5E7 Hardware ECC recovered
C5 100 100 __0 000000000000 Current Pending Sector Count
C6 100 100 __0 000000000000 Uncorrectable Sector Count
C7 200 200 __0 000000000000 UltraDMA CRC Error Count
F0 100 253 __0 E8B400004CFA Head Flying Hours
F1 100 253 __0 0005D2092F46 Total Host Writes
F2 100 253 __0 0022F2574A4F Total Host Reads


8th Jan report (just after replacing the faulty SATA cable with another cable):
Code:
ID Cur Wor Thr RawValues(6) Attribute Name
01 _53 _47 __6 0000018C7AA0 Read Error Rate
03 _96 _96 __0 000000000000 Spin-Up Time
04 100 100 _20 000000000035 Start/Stop Count
05 _99 _99 _10 000000000000 Reallocated Sectors Count
07 _87 _60 _45 00001AE9E896 Seek Error Rate
09 _78 _78 __0 202400004D73 Power-On Hours
0A 100 100 _97 000000000000 Spin Retry Count
0C 100 100 _20 000000000034 Power Cycle Count
B7 100 100 __0 000000000000 Vendor Specific
B8 100 100 _99 000000000000 End-to-End Error
BB _99 _99 __0 000000000001 Reported Uncorrectable Errors
BC 100 _95 __0 005F005F0060 Command Timeout
BD 100 100 __0 000000000000 High Fly Writes
BE _68 _50 _40 0000201C0020 Airflow Temperature
BF 100 100 __0 000000000000 G-Sense Error Rate
C0 100 100 __0 00000000014C Power-off Retract Count
C1 100 100 __0 000000000411 Load/Unload Cycle Count
C2 _32 _50 __0 001100000020 Temperature
C3 _74 _64 __0 0000018C7AA0 Hardware ECC recovered
C5 100 100 __0 000000000000 Current Pending Sector Count
C6 100 100 __0 000000000000 Uncorrectable Sector Count
C7 200 194 __0 000000008E91 UltraDMA CRC Error Count
F0 100 253 __0 AAEC00004D42 Head Flying Hours
F1 100 253 __0 0005D3323239 Total Host Writes
F2 100 253 __0 0023206EEB01 Total Host Reads


Today's report:
Code:
ID Cur Wor Thr RawValues(6) Attribute Name
01 _66 _47 __6 0000092B404E Read Error Rate
03 _96 _96 __0 000000000000 Spin-Up Time
04 100 100 _20 000000000035 Start/Stop Count
05 _99 _99 _10 000000000000 Reallocated Sectors Count
07 _87 _60 _45 00001AF67FEF Seek Error Rate
09 _78 _78 __0 040300004DA2 Power-On Hours
0A 100 100 _97 000000000000 Spin Retry Count
0C 100 100 _20 000000000034 Power Cycle Count
B7 100 100 __0 000000000000 Vendor Specific
B8 100 100 _99 000000000000 End-to-End Error
BB _97 _97 __0 000000000003 Reported Uncorrectable Errors
BC 100 _95 __0 005F005F0060 Command Timeout
BD 100 100 __0 000000000000 High Fly Writes
BE _68 _50 _40 0000221C0020 Airflow Temperature
BF 100 100 __0 000000000000 G-Sense Error Rate
C0 100 100 __0 00000000014C Power-off Retract Count
C1 100 100 __0 000000000412 Load/Unload Cycle Count
C2 _32 _50 __0 001100000020 Temperature
C3 _82 _64 __0 0000092B404E Hardware ECC recovered
C5 100 100 __0 000000000000 Current Pending Sector Count
C6 100 100 __0 000000000000 Uncorrectable Sector Count
C7 200 194 __0 000000008E91 UltraDMA CRC Error Count
F0 100 253 __0 8B6700004D71 Head Flying Hours
F1 100 253 __0 0005D41FA7A3 Total Host Writes
F2 100 253 __0 002335BD14D1 Total Host Reads
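
For anyone comparing reports like the three above, here is a minimal Python sketch that converts the hex raw values to decimal and shows what changed between two reports. The decoding rules are assumptions on my part, not something documented by Seagate: I treat the low 32 bits of Power-On Hours as the hour count and split Command Timeout into three 16-bit words, which is how these attributes are commonly displayed for Seagate drives. The helper names (`parse_report`, `low32`, `words16`) are made up for this example, and only a few rows from the 5th Jan and today's reports are included.

```python
def parse_report(text):
    """Parse rows such as 'C7 200 194 __0 000000008E91 UltraDMA CRC Error Count'."""
    attrs = {}
    for line in text.strip().splitlines():
        parts = line.split(None, 5)
        if len(parts) < 6 or parts[0] == "ID":
            continue  # skip the header row and anything malformed
        attr_id, _cur, _worst, _thresh, raw_hex, name = parts
        attrs[int(attr_id, 16)] = (name.strip(), int(raw_hex, 16))
    return attrs


def low32(raw):
    """Keep only the low 32 bits of a 48-bit raw value (assumed hour count)."""
    return raw & 0xFFFFFFFF


def words16(raw):
    """Split a 48-bit raw value into three 16-bit words, high to low (assumption)."""
    return [(raw >> shift) & 0xFFFF for shift in (32, 16, 0)]


# A few rows copied from the 5th Jan report and from today's report.
JAN_5 = """
09 _78 _78 __0 6F1900004D2B Power-On Hours
BB _99 _99 __0 000000000001 Reported Uncorrectable Errors
BC 100 _99 __0 000000000001 Command Timeout
C7 200 200 __0 000000000000 UltraDMA CRC Error Count
"""
TODAY = """
09 _78 _78 __0 040300004DA2 Power-On Hours
BB _97 _97 __0 000000000003 Reported Uncorrectable Errors
BC 100 _95 __0 005F005F0060 Command Timeout
C7 200 194 __0 000000008E91 UltraDMA CRC Error Count
"""

before, after = parse_report(JAN_5), parse_report(TODAY)
for attr_id in sorted(after):
    name, raw_after = after[attr_id]
    raw_before = before[attr_id][1]
    if attr_id == 0x09:
        print(f"{name}: {low32(raw_before)} -> {low32(raw_after)} hours")
    elif attr_id == 0xBC:
        print(f"{name}: {words16(raw_before)} -> {words16(raw_after)}")
    else:
        print(f"{name}: {raw_before} -> {raw_after}")
```

Read this way, Power-On Hours goes from 19755 to 19874, i.e. 119 hours over roughly five days of continuous running, which is what makes the low-32-bit reading plausible. The UltraDMA CRC Error Count of 0x8E91 is 36497 in decimal, the "around 36000" mentioned above.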


 Post subject: Re: Increasing "Number of Reported Uncorrectable Errors"
PostPosted: June 18th, 2023, 15:26 
Joined: December 20th, 2020, 16:11
Posts: 10
Location: Earth
Update: ~1.5 years after moving this "problematic HDD" out of the Synology NAS and into a 24x7 desktop, the situation remains the same:

1) SMART is still reporting "0" for Reallocated_Sector_Ct/Current_Pending_Sector/Offline_Uncorrectable
2) GP (General Purpose)/Device statistics is still showing "33" for Number of Reported Uncorrectable Errors (stopped increasing)

Just for testing, I stored almost 4TB of data in a bunch of large files on this disk and checked the MD5 hashes every 6 months (I had to use NTFS and manually generated MD5 hashes, since BTRFS for Windows is not reliable). No data corruption so far, so it is not bit rot. Since I have only read data from this disk during this time, I suspect that writing data may be causing the issue (SMR firmware bug?).
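
A periodic MD5 check like the one described above could be scripted roughly as follows. This is only a sketch of the idea, not the exact procedure used here: the function names (`md5_of_file`, `build_manifest`, `changed_files`) are hypothetical, and it assumes the manifest from the first run is kept somewhere other than the disk under test.

```python
import hashlib
from pathlib import Path


def md5_of_file(path, chunk_size=1024 * 1024):
    """Hash a file in chunks so multi-GB files never have to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(root):
    """Map each file's path (relative to root) to its MD5 hex digest."""
    root = Path(root)
    return {
        p.relative_to(root).as_posix(): md5_of_file(p)
        for p in sorted(root.rglob("*"))
        if p.is_file()
    }


def changed_files(root, manifest):
    """Return the files whose current hash differs from the stored manifest.

    Files that have gone missing are reported as changed as well.
    """
    current = build_manifest(root)
    return sorted(
        name for name, old_hash in manifest.items()
        if current.get(name) != old_hash
    )
```

Typical use would be to call `build_manifest` once right after copying the data, save the result (for example as JSON), and run `changed_files` against it every few months. An empty list means every file still hashes to the same value.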

Anyway, soon I will move this disk back to the Synology and use it as a separate volume for Surveillance (Btrfs with checksums enabled). This will cause the disk to start getting some write operations (I only save video when movement is detected from a single camera, so it will be a dozen short videos saved every day).

If the uncorrectable errors reported in GP start increasing again, then I will retire this disk for good. Otherwise I will keep it dedicated to Synology Surveillance storage (no problem if I ever lose some data).


 Post subject: Re: Increasing "Number of Reported Uncorrectable Errors"
PostPosted: June 21st, 2023, 3:12 

Joined: May 29th, 2023, 13:54
Posts: 56
Location: /home/mr44er
First this appears:
Code:
0x03  0x040  4             449  ---  Number of High Priority Unload Events


then that:
Code:
0x06  =====  =               =  ===  == Transport Statistics (rev 1) ==
0x06  0x008  4             610  ---  Number of Hardware Resets
0x06  0x010  4              17  ---  Number of ASR Events


compared with:
Code:
12 Power_Cycle_Count       -O--CK   100   100   020    -    43


It could also be a bad connection to the backplane, the cable to the backplane, or a faulty PSU. If you have already changed the complete machine, then maybe the PCB is just bad at the connectors.
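
To make the comparison above concrete, here is a small Python sketch that pulls the counters out of device-statistics lines like the ones quoted and sets them against the power cycle count. The column layout (page, offset, size, value, flags, description) is assumed from the quoted lines only, and `parse_devstat` is a hypothetical helper, not part of any tool.

```python
import re

# page  offset  size  value  flags  description (layout assumed from the quoted lines)
DEVSTAT_LINE = re.compile(
    r"^(0x[0-9a-fA-F]{2})\s+(0x[0-9a-fA-F]{3})\s+(\d+)\s+(\d+)\s+(\S+)\s+(.+)$"
)


def parse_devstat(text):
    """Return {description: value} for every counter line in the text."""
    stats = {}
    for line in text.splitlines():
        match = DEVSTAT_LINE.match(line.strip())
        if match:  # page header rows ('0x06  =====  = ...') do not match
            stats[match.group(6).strip()] = int(match.group(4))
    return stats


SAMPLE = """\
0x03  0x040  4             449  ---  Number of High Priority Unload Events
0x06  =====  =               =  ===  == Transport Statistics (rev 1) ==
0x06  0x008  4             610  ---  Number of Hardware Resets
0x06  0x010  4              17  ---  Number of ASR Events
"""
POWER_CYCLES = 43  # raw value of attribute 12 (Power_Cycle_Count) quoted above

stats = parse_devstat(SAMPLE)
for name, value in stats.items():
    print(f"{name}: {value} ({value / POWER_CYCLES:.1f} per power cycle)")
```

The point of the ratio is simply that 610 hardware resets and 449 high-priority unloads are far more than 43 power cycles would explain on their own, which is why the connection and power path come under suspicion.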

