Gentoo Forums
Gentoo Forums
Gentoo Forums
Quick Search: in
[HW]Emask 0x1 (device error) - Sta per morire l'hd?[RISOLTO]
View unanswered posts
View posts from last 24 hours
View posts from last 7 days

 
Reply to topic    Gentoo Forums Forum Index Forum italiano (Italian)
View previous topic :: View next topic  
Author Message
Cazzantonio
Bodhisattva
Bodhisattva


Joined: 20 Mar 2004
Posts: 4486
Location: Somewere around the world

PostPosted: Fri Nov 30, 2007 2:39 pm    Post subject: [HW]Emask 0x1 (device error) - Sta per morire l'hd?[RISOLTO] Reply with quote

HELP!

All'avvio i log mi riportano i seguenti errori:
Code:
EXT3 FS on sda2, internal journal
EXT3-fs: mounted filesystem with ordered data mode.
ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
ata1.00: cmd b0/da:00:00:4f:c2/00:00:00:00:00/00 tag 0 cdb 0x0 data 0
         res 51/04:00:00:4f:c2/00:00:00:00:00/00 Emask 0x1 (device error)
ata1.00: configured for UDMA/100
ata1: EH complete
sd 0:0:0:0: [sda] 234441648 512-byte hardware sectors (120034 MB)
sd 0:0:0:0: [sda] Write Protect is off
sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
ata1.00: cmd b0/da:00:00:4f:c2/00:00:00:00:00/00 tag 0 cdb 0x0 data 0
         res 51/04:00:00:4f:c2/00:00:00:00:00/00 Emask 0x1 (device error)
ata1.00: configured for UDMA/100
ata1: EH complete
sd 0:0:0:0: [sda] 234441648 512-byte hardware sectors (120034 MB)
ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
ata1.00: cmd b0/d0:01:00:4f:c2/00:00:00:00:00/00 tag 0 cdb 0x0 data 512 in
         res 51/04:01:00:4f:c2/00:00:00:00:00/00 Emask 0x1 (device error)
ata1.00: configured for UDMA/100
ata1: EH complete
sd 0:0:0:0: [sda] Write Protect is off
sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
sd 0:0:0:0: [sda] 234441648 512-byte hardware sectors (120034 MB)
sd 0:0:0:0: [sda] Write Protect is off
sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
ata1.00: cmd b0/d1:01:01:4f:c2/00:00:00:00:00/00 tag 0 cdb 0x0 data 512 in
         res 51/04:01:01:4f:c2/00:00:00:00:00/00 Emask 0x1 (device error)
ata1.00: configured for UDMA/100
ata1: EH complete
sd 0:0:0:0: [sda] 234441648 512-byte hardware sectors (120034 MB)
sd 0:0:0:0: [sda] Write Protect is off
sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[drm] Setting GART location based on new memory map
[drm] Loading R300 Microcode
[drm] writeback test succeeded in 1 usecs


E' l'hd che sta morendo?
L'hd di questa macchina ha dati troppo importanti perché possa morire. Sto già facendo un backup. Ditemi qualcosa di carino e positivo vi prego
_________________
Any man's death diminishes me, because I am involved in mankind, and therefore never send to know for whom the bell tolls; it tolls for thee
-John Donne


Last edited by Cazzantonio on Fri Nov 30, 2007 6:47 pm; edited 1 time in total
Back to top
View user's profile Send private message
Scen
Retired Dev
Retired Dev


Joined: 29 Jul 2003
Posts: 2470
Location: Padova, Italy

PostPosted: Fri Nov 30, 2007 3:06 pm    Post subject: Re: [HW] Emask 0x1 (device error) - Sta per morire l'hd? Reply with quote

Cazzantonio wrote:
E' l'hd che sta morendo?
L'hd di questa macchina ha dati troppo importanti perché possa morire. Sto già facendo un backup. Ditemi qualcosa di carino e positivo vi prego

R.I.P. (Restore In Pain) :twisted:

Tornando seri...
Code:

[I] sys-apps/smartmontools
     Available versions:  5.36-r1 5.37 {static}
     Installed versions:  5.37(11:58:05 15/10/2007)(-static)
     Homepage:            http://smartmontools.sourceforge.net/
     Description:         control and monitor storage systems using the Self-Monitoring, Analysis and Reporting Technology System (S.M.A.R.T.)

_________________
I was born in a deep forest/I wish I could live here all my life/I am made from stones and roots/My home, these woods and roads
All my life I loved this sound/Of the woods all around/Eagles flies where the winds blows free
Journey is my destiny
Back to top
View user's profile Send private message
Cazzantonio
Bodhisattva
Bodhisattva


Joined: 20 Mar 2004
Posts: 4486
Location: Somewere around the world

PostPosted: Fri Nov 30, 2007 4:08 pm    Post subject: Reply with quote

Code:
heavensdoor ~ # smartctl -l error /dev/sda
smartctl version 5.37 [i686-pc-linux-gnu] Copyright (C) 2002-6 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF READ SMART DATA SECTION ===
SMART Error Log Version: 1
ATA Error Count: 11 (device log contains only the most recent five errors)
        CR = Command Register [HEX]
        FR = Features Register [HEX]
        SC = Sector Count Register [HEX]
        SN = Sector Number Register [HEX]
        CL = Cylinder Low Register [HEX]
        CH = Cylinder High Register [HEX]
        DH = Device/Head Register [HEX]
        DC = Device Command Register [HEX]
        ER = Error register [HEX]
        ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 11 occurred at disk power-on lifetime: 24 hours (1 days + 0 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  84 51 00 00 00 00 e0  Error: ICRC, ABRT at LBA = 0x00000000 = 0

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  25 03 f0 97 fd 12 e0 00      00:20:30.214  READ DMA EXT
  25 03 10 87 fd 12 e0 00      00:20:30.205  READ DMA EXT
  25 03 f0 97 fc 12 e0 00      00:20:30.186  READ DMA EXT
  25 03 10 87 fc 12 e0 00      00:20:30.174  READ DMA EXT
  25 03 f0 97 fb 12 e0 00      00:20:30.153  READ DMA EXT

Error 10 occurred at disk power-on lifetime: 21 hours (0 days + 21 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  84 51 00 00 00 00 e0  Error: ICRC, ABRT at LBA = 0x00000000 = 0

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  25 03 80 bf 07 3f e0 00      00:56:10.682  READ DMA EXT
  25 03 80 3f 07 3f e0 00      00:56:10.679  READ DMA EXT
  25 03 80 bf 06 3f e0 00      00:56:10.676  READ DMA EXT
  25 03 80 3f 06 3f e0 00      00:56:10.674  READ DMA EXT
  25 03 80 bf 05 3f e0 00      00:56:10.671  READ DMA EXT

Error 9 occurred at disk power-on lifetime: 21 hours (0 days + 21 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  84 51 00 00 00 00 e0  Error: ICRC, ABRT at LBA = 0x00000000 = 0

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  25 03 80 3f 7e 5e e0 00      00:44:48.323  READ DMA EXT
  25 03 80 bf 7d 5e e0 00      00:44:48.361  READ DMA EXT
  25 03 80 3f 7d 5e e0 00      00:44:48.359  READ DMA EXT
  25 03 80 bf 7c 5e e0 00      00:44:48.356  READ DMA EXT
  25 03 80 3f 7c 5e e0 00      00:44:48.353  READ DMA EXT

Error 8 occurred at disk power-on lifetime: 19 hours (0 days + 19 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  84 51 00 00 00 00 e0  Error: ICRC, ABRT at LBA = 0x00000000 = 0

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  25 03 80 9f b2 99 e0 00      01:25:23.393  READ DMA EXT
  25 03 80 1f b2 99 e0 00      01:25:23.390  READ DMA EXT
  25 03 80 9f b1 99 e0 00      01:25:23.387  READ DMA EXT
  25 03 80 1f b1 99 e0 00      01:25:23.383  READ DMA EXT
  25 03 80 9f b0 99 e0 00      01:25:23.380  READ DMA EXT

Error 7 occurred at disk power-on lifetime: 19 hours (0 days + 19 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  84 51 00 00 00 00 e0  Error: ICRC, ABRT at LBA = 0x00000000 = 0

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  25 03 80 af 4b 6a e0 00      01:24:06.276  READ DMA EXT
  25 03 80 2f 4b 6a e0 00      01:24:06.273  READ DMA EXT
  25 03 80 af 4a 6a e0 00      01:24:06.270  READ DMA EXT
  25 03 80 2f 4a 6a e0 00      01:24:06.267  READ DMA EXT
  25 03 80 af 49 6a e0 00      01:24:06.263  READ DMA EXT

_________________
Any man's death diminishes me, because I am involved in mankind, and therefore never send to know for whom the bell tolls; it tolls for thee
-John Donne
Back to top
View user's profile Send private message
djinnZ
Advocate
Advocate


Joined: 02 Nov 2006
Posts: 4831
Location: somewhere in L.O.S.

PostPosted: Fri Nov 30, 2007 4:43 pm    Post subject: Reply with quote

qualcosa di carino e positivo :twisted:

sempre che non hai appena aggiornato al nuovo kernel, soliti problemi con i device ata convertiti ai nuovi etc.

Verifica immediatamente connettori ed alimentazione, soprattutto se è uno di quei primi sata con l'alimentazione AT, come temo.
Potrebbe essere banalmente uno dei due poli di massa allentati, in tal caso, comunque, l'HD non è più affidabile.

Se senti rumore di ferraglia/vibrazioni/stridii al seek od all'avvio ovviamente è morto ma ancora non lo sa (e temo che sia questo il caso).

Per me puoi solo montarlo RO e fare una copia visto che immagino abbia sopra almeno un anno di lavoro.
_________________
scita et risus abundant in ore stultorum sed etiam semper severi insani sunt:wink:
mala tempora currunt...mater stultorum semper pregna est :evil:
Murpy'sLaw:If anything can go wrong, it will - O'Toole's Corollary:Murphy was an optimist :wink:
Back to top
View user's profile Send private message
Cazzantonio
Bodhisattva
Bodhisattva


Joined: 20 Mar 2004
Posts: 4486
Location: Somewere around the world

PostPosted: Fri Nov 30, 2007 6:08 pm    Post subject: Reply with quote

Code:
smartctl version 5.37 [i686-pc-linux-gnu] Copyright (C) 2002-6 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF READ SMART DATA SECTION ===
SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Conveyance offline  Completed without error       00%      2128         -
# 2  Extended offline    Completed without error       00%      2127         -
# 3  Short offline       Completed without error       00%      2126         -

I selftest smart non riportano errori... Potrebbe esser un errore del controller?
Comunque non è un SATA, è un ATA. Lo vede come sda perché i nuovi driver del kernel lo mappano come tale. Non sta facendo rumori e ancora non ho perso dati... ancora... solo quel messaggio terrificante da dmesg durante l'avvio. Durante il funzionamento non si lamenta... solo all'avvio stampa quegli orribili messaggi.
Lo smonto e controllo ma dubito sia un problema di montaggio. Di solito i contatti o funzionano o non funzionano.
_________________
Any man's death diminishes me, because I am involved in mankind, and therefore never send to know for whom the bell tolls; it tolls for thee
-John Donne
Back to top
View user's profile Send private message
Cazzantonio
Bodhisattva
Bodhisattva


Joined: 20 Mar 2004
Posts: 4486
Location: Somewere around the world

PostPosted: Fri Nov 30, 2007 6:33 pm    Post subject: Reply with quote

Tiro un sospiro di sollievo!
Pare che gli errori siano generati da smartctl che viene lanciato all'avvio (come controllo dello stato dell'hd, lo faccio a tutti gli avvii)
Code:
heavensdoor ~ # smartctl -H /dev/sda
smartctl version 5.37 [i686-pc-linux-gnu] Copyright (C) 2002-6 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
Please note the following marginal Attributes:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
255 Unknown_Attribute       0x373f   200   016   063    Pre-fail  Always   In_the_past 69269232549888
 32 Unknown_Attribute       0x2020   032   032   032    Old_age   Offline  FAILING_NOW 95984788262944
 57 Unknown_Attribute       0x0059   000   000   089    Pre-fail  Offline  FAILING_NOW 59593442985024
 65 Unknown_Attribute       0x2031   032   032   049    Pre-fail  Offline  FAILING_NOW 35322350018592
 32 Unknown_Attribute       0x2020   032   032   032    Old_age   Offline  FAILING_NOW 35322350018592
 32 Unknown_Attribute       0x2020   032   032   032    Old_age   Offline  FAILING_NOW 550026354720
249 Unknown_Attribute       0x000d   000   007   013    Pre-fail  Offline  FAILING_NOW 131943408599808
240 Head_Flying_Hours       0x7800   000   000   000    Old_age   Offline  FAILING_NOW 0
104 Unknown_Attribute       0x0934   060   003   052    Old_age   Offline  In_the_past 2113376
128 Unknown_Attribute       0xfe80   255   077   128    Old_age   Offline  In_the_past 16646240

Ora quello che un po' mi preoccupa è quello FAILING_NOW e In_the_past rispetto a questi attributi sconosciuti.
C'è da fidarsi? Sono falsi positivi generati da smartctl? Perché li definisce "marginal" ?
_________________
Any man's death diminishes me, because I am involved in mankind, and therefore never send to know for whom the bell tolls; it tolls for thee
-John Donne
Back to top
View user's profile Send private message
Cazzantonio
Bodhisattva
Bodhisattva


Joined: 20 Mar 2004
Posts: 4486
Location: Somewere around the world

PostPosted: Fri Nov 30, 2007 6:47 pm    Post subject: Reply with quote

Pare che l'errore si risolva lanciando
Code:
smartctl -s on /dev/sda
all'avvio. Strano perché smart risultata abilitato anche senza farlo esplicitamente, tuttavia in questo modo l'errore non si presenta.
Spero che questo significhi che l'hd è in buona salute! :)
_________________
Any man's death diminishes me, because I am involved in mankind, and therefore never send to know for whom the bell tolls; it tolls for thee
-John Donne
Back to top
View user's profile Send private message
Scen
Retired Dev
Retired Dev


Joined: 29 Jul 2003
Posts: 2470
Location: Padova, Italy

PostPosted: Fri Nov 30, 2007 6:57 pm    Post subject: Reply with quote

Cazzantonio wrote:
Pare che l'errore si risolva lanciando
Code:
smartctl -s on /dev/sda
all'avvio. Strano perché smart risultata abilitato anche senza farlo esplicitamente, tuttavia in questo modo l'errore non si presenta.
Spero che questo significhi che l'hd è in buona salute! :)

Potresti utilizzare gli strumenti di diagnostica forniti dal produttore del tuo HD, se provi con Ultimate Boot CD dovresti trovarli più o meno tutti.
_________________
I was born in a deep forest/I wish I could live here all my life/I am made from stones and roots/My home, these woods and roads
All my life I loved this sound/Of the woods all around/Eagles flies where the winds blows free
Journey is my destiny
Back to top
View user's profile Send private message
flocchini
Veteran
Veteran


Joined: 17 May 2003
Posts: 1124
Location: Milano, Italy

PostPosted: Fri Nov 30, 2007 7:54 pm    Post subject: Reply with quote

Scen wrote:

Potresti utilizzare gli strumenti di diagnostica forniti dal produttore del tuo HD, se provi con Ultimate Boot CD dovresti trovarli più o meno tutti.


straquoto, visto che devi eliminare tutti i dubbi per capire se e' o no l'hdd, bootare direttamente da un sistema minimale e' l'idea migliore
_________________
~~ Per amore della rosa si sopportano le spine... ~~
Back to top
View user's profile Send private message
djinnZ
Advocate
Advocate


Joined: 02 Nov 2006
Posts: 4831
Location: somewhere in L.O.S.

PostPosted: Sat Dec 01, 2007 11:29 am    Post subject: Reply with quote

Cazzantonio wrote:
Potrebbe esser un errore del controller?
Comunque non è un SATA, è un ATA.


per questo ti avevo detto di verificare se non era una novità dovuta al passaggio dai vecchi ai nuovi driver.

Cazzantonio wrote:
Lo smonto e controllo ma dubito sia un problema di montaggio. Di solito i contatti o funzionano o non funzionano.


La mia esperienza mi ha insegnato il contrario.

Non so a me riporta un errore di dma e device disconnetted sugli HD del secondo controller ata (a parte i salaci commenti sulla stabilità e le performance del chipset ITE) solo all'avvio ma non al reboot, quindi ho risolto con un maggiore delay al boot.

Però se lo controlli è meglio.
_________________
scita et risus abundant in ore stultorum sed etiam semper severi insani sunt:wink:
mala tempora currunt...mater stultorum semper pregna est :evil:
Murpy'sLaw:If anything can go wrong, it will - O'Toole's Corollary:Murphy was an optimist :wink:
Back to top
View user's profile Send private message
Display posts from previous:   
Reply to topic    Gentoo Forums Forum Index Forum italiano (Italian) All times are GMT
Page 1 of 1

 
Jump to:  
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum