Hardware Error

I

invincible123

Guest
Hi,

I am using OpenSuse 12.1, on newly built machine, Off lately this error message is popping continuously making me unable to work.
Code:
Message from syslogd@linux-hse7 at Jul 24 18:38:57 ...
 kernel:[  723.595032] [Hardware Error]: MC4_STATUS[-|CE|MiscV|-|AddrV|CECC]: 0x9c0240006b080813

Message from syslogd@linux-hse7 at Jul 24 18:38:57 ...
 kernel:[  723.595042] [Hardware Error]: Northbridge Error (node 0): DRAM ECC error detected on the NB.

Message from syslogd@linux-hse7 at Jul 24 18:38:57 ...
 kernel:[  723.595062] [Hardware Error]: cache level: L3/GEN, mem/io: MEM, mem-tx: RD, part-proc: SRC (no timeout)

Message from syslogd@linux-hse7 at Jul 24 18:38:57 ...
 kernel:[  723.605030] [Hardware Error]: MC4_STATUS[-|CE|MiscV|-|AddrV|CECC]: 0x9c0240006b080813

Message from syslogd@linux-hse7 at Jul 24 18:38:57 ...
 kernel:[  723.605039] [Hardware Error]: Northbridge Error (node 0): DRAM ECC error detected on the NB.

Message from syslogd@linux-hse7 at Jul 24 18:38:57 ...
 kernel:[  723.605060] [Hardware Error]: cache level: L3/GEN, mem/io: MEM, mem-tx: RD, part-proc: SRC (no timeout)

Message from syslogd@linux-hse7 at Jul 24 18:38:57 ...
 kernel:[  723.615026] [Hardware Error]: MC4_STATUS[-|CE|MiscV|-|AddrV|CECC]: 0x9c044000ca080813

Message from syslogd@linux-hse7 at Jul 24 18:38:57 ...
 kernel:[  723.615035] [Hardware Error]: Northbridge Error (node 0): DRAM ECC error detected on the NB.

Message from syslogd@linux-hse7 at Jul 24 18:38:57 ...
 kernel:[  723.615055] [Hardware Error]: cache level: L3/GEN, mem/io: MEM, mem-tx: RD, part-proc: SRC (no timeout)

Message from syslogd@linux-hse7 at Jul 24 18:38:57 ...
 kernel:[  723.625030] [Hardware Error]: MC4_STATUS[-|CE|MiscV|-|AddrV|CECC]: 0x9c044000ca080a13

Message from syslogd@linux-hse7 at Jul 24 18:38:57 ...
 kernel:[  723.625043] [Hardware Error]: Northbridge Error (node 0): DRAM ECC error detected on the NB.

Message from syslogd@linux-hse7 at Jul 24 18:38:57 ...
 kernel:[  723.625070] [Hardware Error]: cache level: L3/GEN, mem/io: MEM, mem-tx: RD, part-proc: RES (no timeout)

Message from syslogd@linux-hse7 at Jul 24 18:38:57 ...
 kernel:[  723.635049] [Hardware Error]: MC4_STATUS[-|CE|MiscV|-|AddrV|CECC]: 0x9c0240006b080813

Message from syslogd@linux-hse7 at Jul 24 18:38:57 ...
 kernel:[  723.635062] [Hardware Error]: Northbridge Error (node 0): DRAM ECC error detected on the NB.

Message from syslogd@linux-hse7 at Jul 24 18:38:57 ...
 kernel:[  723.635090] [Hardware Error]: cache level: L3/GEN, mem/io: MEM, mem-tx: RD, part-proc: SRC (no timeout)

Message from syslogd@linux-hse7 at Jul 24 18:38:57 ...
 kernel:[  723.645031] [Hardware Error]: MC4_STATUS[-|CE|MiscV|-|AddrV|CECC]: 0x9c0240006b080a13

Message from syslogd@linux-hse7 at Jul 24 18:38:57 ...
 kernel:[  723.645044] [Hardware Error]: Northbridge Error (node 0): DRAM ECC error detected on the NB.

Message from syslogd@linux-hse7 at Jul 24 18:38:57 ...
 kernel:[  723.645072] [Hardware Error]: cache level: L3/GEN, mem/io: MEM, mem-tx: RD, part-proc: RES (no timeout)

Could anyone please help me out with this?
 


It looks like a RAM error. You aren't giving much information, so we don't know how many RAM sticks you're using or what motherboard etc..

I had bought four 4GB sticks when I built my newest box and had one bad stick in the bunch. I tested each stick in RAM Slot one of the motherboard (your documentation should tell you which slot that is) until I found the bad stick.

With more info, you will probably get better advice.
 
Dear god, that is the most sickening dump I have seen in a very long time.

I would refit the ram sticks into the sockets (since it's newly built), and do a memtest. If that doesn't work after a couple of cycles, I'd say you have a broken stick (Whichever held address 0x9c044000ca080a13 at the time.
 


Top