
WD40EZRX - 20 hours of operation and 13k errors...

petarboj

Hi everyone, can anyone decipher the state of this drive? I bought it new yesterday, and today I looked at the SMART numbers just for form's sake and nearly fell off my chair... What should I do? The general numbers aren't bad and everything looks OK, but 13k logged errors still scare me... I have a warranty, but I don't know whether this is grounds for a warranty claim. hdsentinel (Linux) gives 100/100 and reports no errors. There are no bad sectors. Could the SATA cable have been causing problems?

Code:
smartctl 6.2 2013-07-26 r3841 [x86_64-linux-3.13.0-62-generic] (local build)
Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Device Model:     WDC WD40EZRX-00SPEB0
Serial Number:    WD-WCC4E6NFX8U3
LU WWN Device Id: 5 0014ee 260f61130
Firmware Version: 80.00A80
User Capacity:    4,000,787,030,016 bytes [4.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    5400 rpm
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   ACS-2 (minor revision not indicated)
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 3.0 Gb/s)
Local Time is:    Sun Sep 13 21:05:48 2015 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x80) Offline data collection activity
                                        was never started.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (52080) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        ( 521) minutes.
Conveyance self-test routine
recommended polling time:        (   5) minutes.
SCT capabilities:              (0x7035) SCT Status supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0027   184   184   021    Pre-fail  Always       -       7800
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       10
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       20
 10 Spin_Retry_Count        0x0032   100   253   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   253   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       8
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       5
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       9
194 Temperature_Celsius     0x0022   118   117   000    Old_age   Always       -       34
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   100   253   000    Old_age   Offline      -       0

SMART Error Log Version: 1
ATA Error Count: 13356 (device log contains only the most recent five errors)
        CR = Command Register [HEX]
        FR = Features Register [HEX]
        SC = Sector Count Register [HEX]
        SN = Sector Number Register [HEX]
        CL = Cylinder Low Register [HEX]
        CH = Cylinder High Register [HEX]
        DH = Device/Head Register [HEX]
        DC = Device Command Register [HEX]
        ER = Error register [HEX]
        ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 13356 occurred at disk power-on lifetime: 13 hours (0 days + 13 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  04 61 02 00 00 00 a0  Device Fault; Error: ABRT

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  ef 10 02 00 00 00 a0 00      13:01:00.868  SET FEATURES [Enable SATA feature]
  ec 00 00 00 00 00 a0 00      13:01:00.866  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      13:01:00.863  SET FEATURES [Set transfer mode]
  ef 10 02 00 00 00 a0 00      13:01:00.861  SET FEATURES [Enable SATA feature]
  ec 00 00 00 00 00 a0 00      13:01:00.858  IDENTIFY DEVICE

Error 13355 occurred at disk power-on lifetime: 13 hours (0 days + 13 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  04 61 46 00 00 00 a0  Device Fault; Error: ABRT

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  ef 03 46 00 00 00 a0 00      13:01:00.863  SET FEATURES [Set transfer mode]
  ef 10 02 00 00 00 a0 00      13:01:00.861  SET FEATURES [Enable SATA feature]
  ec 00 00 00 00 00 a0 00      13:01:00.858  IDENTIFY DEVICE
  c8 00 08 00 00 00 e0 00      13:01:00.854  READ DMA
  ef 10 02 00 00 00 a0 00      13:01:00.851  SET FEATURES [Enable SATA feature]

Error 13354 occurred at disk power-on lifetime: 13 hours (0 days + 13 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  04 61 02 00 00 00 a0  Device Fault; Error: ABRT

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  ef 10 02 00 00 00 a0 00      13:01:00.861  SET FEATURES [Enable SATA feature]
  ec 00 00 00 00 00 a0 00      13:01:00.858  IDENTIFY DEVICE
  c8 00 08 00 00 00 e0 00      13:01:00.854  READ DMA
  ef 10 02 00 00 00 a0 00      13:01:00.851  SET FEATURES [Enable SATA feature]
  ec 00 00 00 00 00 a0 00      13:01:00.849  IDENTIFY DEVICE

Error 13353 occurred at disk power-on lifetime: 13 hours (0 days + 13 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  04 61 08 00 00 00 e0  Device Fault; Error: ABRT 8 sectors at LBA = 0x00000000 = 0

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 08 00 00 00 e0 00      13:01:00.854  READ DMA
  ef 10 02 00 00 00 a0 00      13:01:00.851  SET FEATURES [Enable SATA feature]
  ec 00 00 00 00 00 a0 00      13:01:00.849  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      13:01:00.847  SET FEATURES [Set transfer mode]
  ef 10 02 00 00 00 a0 00      13:01:00.844  SET FEATURES [Enable SATA feature]

Error 13352 occurred at disk power-on lifetime: 13 hours (0 days + 13 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  04 61 02 00 00 00 a0  Device Fault; Error: ABRT

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  ef 10 02 00 00 00 a0 00      13:01:00.851  SET FEATURES [Enable SATA feature]
  ec 00 00 00 00 00 a0 00      13:01:00.849  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      13:01:00.847  SET FEATURES [Set transfer mode]
  ef 10 02 00 00 00 a0 00      13:01:00.844  SET FEATURES [Enable SATA feature]
  ec 00 00 00 00 00 a0 00      13:01:00.842  IDENTIFY DEVICE

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed without error       00%        19         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
 
That disk should be fine for now.
 
It says "Error 13355", not 13355 errors, as far as I understand this output?
 
ATA Error Count: 13356

and the last 5 errors are listed below it.
 
I replaced the cable as soon as I saw this, since it was the first thing I suspected. I'll keep an eye on things over the next few days; I hope there won't be another burst of errors...
 
Did you install it as the system drive or as an additional one? Had you restarted the computer many times before that? I ask because it reports errors during initialization, namely "Identify" and "Set Features". If it is not the system drive, the problem may be aggressive head parking. Go to the WD site, find "wdidle", and raise the timer to the maximum.
 
It is not the system drive; it sits in my NAS as storage next to another 2 TB Green, and the system is on an SSD. I disabled head parking as soon as I installed the disk (and then shut down the NAS, waited a bit, and powered it back on). You can see from the output that LCC is 9 and start-stop is 10.
 
@petarboj
I saw that right away, but... 13k errors in 13 h works out to an error every 3-4 seconds. Though somewhere I saw 20 h of operation, and even that comes to one roughly every 5 seconds. That's why I said it. Unless the cable really did mess things up, though again, it reports errors as if you had only just powered it on. Also look at what the colleague above says; I'm on my phone, so this is going a bit "harder". I'll take a look tonight.
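For anyone who wants to check that arithmetic, here is a quick sketch. It assumes the errors were spread evenly over the drive's power-on time, which is only a simplifying assumption, and the function name is made up for illustration.

```python
def seconds_per_error(power_on_hours: float, error_count: int) -> float:
    """Average spacing between errors, assuming they were spread evenly."""
    return power_on_hours * 3600 / error_count


ERROR_COUNT = 13356  # "ATA Error Count" from the smartctl output

# 13 h is when the last error was logged; 20 h is the current Power_On_Hours.
for hours in (13, 20):
    spacing = seconds_per_error(hours, ERROR_COUNT)
    print(f"{hours} h: one error every {spacing:.1f} s")
```

This gives about 3.5 s for 13 h and about 5.4 s for 20 h, matching the rough figures in the post.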
 
The last error occurred when the drive had 13 h (you can see it in the log), and now it has 20 h. As far as I can tell from the log, these last five errors happened ~3-5 ms apart, which means all of them could have occurred within a single minute, around 2-3 restarts of the NAS. I'll try restarting it once more today when I get back from work, but I don't think there will be an error, because I believe I restarted it at around the 15th hour of operation and it reported nothing then... The board, by the way, is an MSI C847MS-E33.

@DariusIII thanks for the link, I'll take a look now ;)
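The "within a single minute" estimate can be sanity-checked against the timestamps in the log. This is a rough sketch only: it assumes the first command listed under each error is the one that triggered it, that all 13356 errors came in one burst with similar spacing, and that the Powered_Up_Time values carry no day prefix. The helper name is invented for illustration.

```python
from datetime import datetime

# Powered_Up_Time of the first command listed under each of the five logged
# errors (13356 down to 13352), copied from the smartctl output.
STAMPS = [
    "13:01:00.868",
    "13:01:00.863",
    "13:01:00.861",
    "13:01:00.854",
    "13:01:00.851",
]
ERROR_COUNT = 13356  # "ATA Error Count" from the same output


def gaps_ms(stamps):
    """Gaps in milliseconds between consecutive timestamps, oldest first."""
    times = sorted(datetime.strptime(s, "%H:%M:%S.%f") for s in stamps)
    return [round((b - a).total_seconds() * 1000) for a, b in zip(times, times[1:])]


gaps = gaps_ms(STAMPS)
mean_gap = sum(gaps) / len(gaps)
print("gaps (ms):", gaps)
# Extrapolation: assumes every error in the burst had this average spacing.
print(f"estimated burst length: {ERROR_COUNT * mean_gap / 1000:.0f} s")
```

With gaps of a few milliseconds, the whole count fits in roughly a minute, which is consistent with a short burst rather than a steady stream of errors.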
 
You're right, sorry. TT chopped up the log lines on me just like that. You haven't had any problems in the last 7 h.
PS
Could this be the famous "burn-in" excuse from the WD forum for every error ;-)
 
So I assume it is also a desktop machine, not just a server. I ask because the OS can cause all sorts of things too. When you stopped LCC with wdidle3, was that done from a Linux environment, and has anything changed for better or worse since then?
 
It is not a desktop but an HTPC/NAS (no desktop environment; the X server and Kodi are started when needed). It was done from Linux, within 10 minutes of installing the drive. The NAS was then shut down and powered on. Nothing clicks, it works as it should, and the write speed is good, so I don't believe it is related to that...
 
I have the same disk, and it really is good. The errors listed relate to the interface, that is, everything from the controller on the motherboard, through the SATA interface and the cable, all the way to the controller on the disk itself. The OS, the motherboard, the cables, or the disk firmware can all cause this. A firmware update for the motherboard might fix things. In this case the disk itself is the last thing I would suspect.
 
I don't suspect the disk either :) I suspected the SATA cable, which I replaced right away. What I'd like to know is whether these errors could somehow have affected the disk itself? I assume not, judging by the other SMART values. To me these look more like communication errors...
I'll see how it goes over the next few days and do a few restarts. I hope there won't be any more problems.
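As a small illustration of "judging by the other SMART values", the sketch below pulls the raw value from a few attribute rows of the output and lists any that are non-zero. Which attributes are worth watching is my assumption, not something stated in the thread, and the parser only handles rows whose raw value is a plain integer.

```python
# Attribute rows copied from the smartctl output; the selection is an assumption.
ROWS = """\
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
"""


def raw_values(table: str) -> dict:
    """Map attribute name -> raw value (tenth column) for each attribute row."""
    result = {}
    for line in table.splitlines():
        fields = line.split()
        # An attribute row starts with a numeric ID and has ten columns.
        if len(fields) >= 10 and fields[0].isdigit():
            result[fields[1]] = int(fields[9])
    return result


values = raw_values(ROWS)
nonzero = {name: raw for name, raw in values.items() if raw != 0}
print("non-zero raw values:", nonzero or "none")
```

For the output posted here, all four raw values are 0.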
 
Keep another cable ready as a replacement, a SATA 3 one, in case the situation doesn't improve.
 
The drive now has 100 hours of operation and there have been no more errors :) So it was most likely the SATA cable. If it helps anyone, it was a red SATA cable with black ends and no metal latch, from a Biostar board :) I need to buy a proper one to keep as a spare :)
 
The only problem with wdidle is that people don't power cycle the drive after using it. That is: run wdidle, shut down the computer, wait a bit, then power the computer on. Restarting the computer does not power cycle the drive. After that everything works as it should. I didn't even use wdidle, but a Linux tool:
http://idle3-tools.sourceforge.net/
which, after the change, points out that a power cycle must be done for everything to be OK.
 
If it's not a problem to piggyback on this thread: I bought a second-hand drive that didn't have a single hour of operation; it now has 5 days. When first powered on it clicked abnormally and Windows didn't see it. At first I thought it had been shaken up on the way home, but I had handled it carefully; then I thought the power rail was weak, which is impossible; I changed it, and after a few restarts things stabilized. Sentinel shows everything is OK regarding bad sectors and there are no data transfer errors, but there are some errors I'm not familiar with whose values keep changing. I only hear the heads clicking when the computer starts; otherwise it is quiet.

Model: Seagate Barracuda ST31000524AS 1TB 7200 RPM 32MB


 