View previous topic :: View next topic |
Author |
Message |
Bigun Advocate
Joined: 21 Sep 2003 Posts: 2196
|
Posted: Tue Aug 23, 2016 9:08 am Post subject: Something Happened |
|
|
So, I walk into my computer room, and noticed some of the HD lights on one of my towers isn't blinking. I log in and check the torrent program, and it locks up. I shut down the process restart it, it locks up again. So I try to list the contents of what I was sharing, locks up. I check dmesg, and find this:
Code: | [118954.637733] kworker/dying (1028) used greatest stack depth: 5332 bytes left
[150953.331750] kworker/dying (10761) used greatest stack depth: 4988 bytes left
[338220.524869] CIFS VFS: Autodisabling the use of server inode numbers on \\127.0.0.1\Media. This server doesn't seem to support them properly. Hardlinks will not be recognized on this mount. Consider mounting with the "noserverino" option to silence this message.
[1672067.452550] BUG: unable to handle kernel paging request at 20726570
[1672067.452599] IP: [<20726570>] 0x20726570
[1672067.452627] *pdpt = 0000000030290001 *pde = 0000000000000000
[1672067.452666] Oops: 0010 [#1] SMP
[1672067.452690] Modules linked in: xt_mark
[1672067.452719] CPU: 1 PID: 7709 Comm: cifsd Not tainted 4.4.6-gentoo #1
[1672067.452758] Hardware name: Gigabyte Technology Co., Ltd. To be filled by O.E.M./H61M-DS2H, BIOS F5 04/02/2012
[1672067.452815] task: f1247700 ti: f2b5c000 task.ti: f2b5c000
[1672067.452848] EIP: 0060:[<20726570>] EFLAGS: 00010246 CPU: 1
[1672067.452881] EIP is at 0x20726570
[1672067.452902] EAX: c9efd2c0 EBX: f254c000 ECX: c1aa6538 EDX: f254c154
[1672067.452939] ESI: c9efd2c0 EDI: c11ebf47 EBP: f2b5df6c ESP: f2b5df34
[1672067.452976] DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
[1672067.453008] CR0: 80050033 CR2: 20726570 CR3: 313a85c0 CR4: 000406b0
[1672067.453045] Stack:
[1672067.453059] c11db04c f254c038 f1247700 0029de28 00000000 e6e2b6c0 c9efd2c0 f1247700
[1672067.453117] f2b5e000 f254c000 c11daa21 f3169ae0 f254c000 c11daa21 f2b5dfac c1047119
[1672067.453175] 00000000 00000000 f254c000 00000000 00000000 f2b5df88 f2b5df88 00000000
[1672067.453232] Call Trace:
[1672067.453253] [<c11db04c>] ? cifs_demultiplex_thread+0x62b/0x644
[1672067.453289] [<c11daa21>] ? dequeue_mid+0x3f/0x3f
[1672067.453318] [<c11daa21>] ? dequeue_mid+0x3f/0x3f
[1672067.453349] [<c1047119>] kthread+0x9f/0xa4
[1672067.453378] [<c1678789>] ret_from_kernel_thread+0x21/0x38
[1672067.453412] [<c104707a>] ? kthread_create_on_node+0xf7/0xf7
[1672067.453445] Code: Bad EIP value.
[1672067.453469] EIP: [<20726570>] 0x20726570 SS:ESP 0068:f2b5df34
[1672067.453507] CR2: 0000000020726570
[1672067.459231] ---[ end trace 9a81b92b23dad5a3 ]--- |
I see mentioning of network equipment, CIFS, memory addresses, I'm not sure where to point the finger. Going to try a reboot.
*edit*
A reboot seems to have fixed things for now. Unsure what happened, I usually can maintain several months worth of uptime before I decide to compile a new kernel. _________________ "It's ok, they might have guns but we have flowers." - Perpetual Victim |
|
Back to top |
|
|
Roman_Gruber Advocate
Joined: 03 Oct 2006 Posts: 3846 Location: Austro Bavaria
|
Posted: Tue Aug 23, 2016 10:07 am Post subject: |
|
|
Generic advise: Check your cables if they are proper connected or just unplug and replug any cables and plugin cards ....
Looks like a desctop box and maybe something not very well connected can cause random crap messages sometimes. |
|
Back to top |
|
|
Bigun Advocate
Joined: 21 Sep 2003 Posts: 2196
|
Posted: Tue Aug 23, 2016 10:43 am Post subject: |
|
|
tw04l124 wrote: | Generic advise: Check your cables if they are proper connected or just unplug and replug any cables and plugin cards ....
Looks like a desctop box and maybe something not very well connected can cause random crap messages sometimes. |
That tells me all I need to know. I have toddlers that like to watch their tablet on top of that machine. _________________ "It's ok, they might have guns but we have flowers." - Perpetual Victim |
|
Back to top |
|
|
NeddySeagoon Administrator
Joined: 05 Jul 2003 Posts: 54220 Location: 56N 3W
|
Posted: Tue Aug 23, 2016 11:49 am Post subject: |
|
|
Bigun,
Bigun wrote: | I have toddlers ... | That explains almost anything :)
Check for jam sandwiches inserted into any suitably sized (to a todders mind) orifice on the system.. _________________ Regards,
NeddySeagoon
Computer users fall into two groups:-
those that do backups
those that have never had a hard drive fail. |
|
Back to top |
|
|
Hu Moderator
Joined: 06 Mar 2007 Posts: 21607
|
Posted: Tue Aug 23, 2016 11:18 pm Post subject: |
|
|
That eip value looks suspicious. When interpreted as ASCII, it is 'FAH '. That would never be a valid kernel address, but could be a valid exploit address. It could also be caused by the kernel treating freed/corrupted memory as a pointer to a function. |
|
Back to top |
|
|
Bigun Advocate
Joined: 21 Sep 2003 Posts: 2196
|
Posted: Wed Aug 24, 2016 10:51 am Post subject: |
|
|
Hu wrote: | That eip value looks suspicious. When interpreted as ASCII, it is 'FAH '. That would never be a valid kernel address, but could be a valid exploit address. It could also be caused by the kernel treating freed/corrupted memory as a pointer to a function. |
Should I run a RAM check on this? _________________ "It's ok, they might have guns but we have flowers." - Perpetual Victim |
|
Back to top |
|
|
NeddySeagoon Administrator
Joined: 05 Jul 2003 Posts: 54220 Location: 56N 3W
|
Posted: Wed Aug 24, 2016 11:22 am Post subject: |
|
|
Bigun,
It won't do any harm to do a RAM check.
If you don't have ECC RAM, the random bit clearing that happens due to environmental noise will go undetected.
Then you get this sort of random error instead of a warning in dmesg.
In the early days of DRAM, alpha particle induced errors were a major issue. Design changes over the years have helped reduce the error rates but DRAMS have got bigger too. In fact, the alpha particle errors have been designed out but there are other external sources of RAM errors too. _________________ Regards,
NeddySeagoon
Computer users fall into two groups:-
those that do backups
those that have never had a hard drive fail. |
|
Back to top |
|
|
depontius Advocate
Joined: 05 May 2004 Posts: 3509
|
Posted: Wed Aug 24, 2016 3:29 pm Post subject: |
|
|
Back to the toddlers...
When my daughter was three-and-change, we were in a local computer store. I only turned around for a moment, (frequent parent's lament) I saw her clicking the "Confirm" on the delete screen for some file or other, don't know what, on one of their floor computers. (Not bad for random toddler-fiddling?)
When my son was a similar age, I said something to my wife about the roll of masking tape sitting on the floor. My son said, "Tape?" and picked it up, took it over to the cassette player, and tried to put it in.
Toddlers can be a sufficient reason for almost any mysterious malfunction. _________________ .sigs waste space and bandwidth |
|
Back to top |
|
|
1clue Advocate
Joined: 05 Feb 2006 Posts: 2569
|
Posted: Wed Aug 24, 2016 3:54 pm Post subject: |
|
|
I know it sounds a bit ridiculous, but do you have screws holding your cards into the chassis? While reseating your cables and such, you might want to look inside the box too. |
|
Back to top |
|
|
NeddySeagoon Administrator
Joined: 05 Jul 2003 Posts: 54220 Location: 56N 3W
|
Posted: Wed Aug 24, 2016 4:46 pm Post subject: |
|
|
1clue,
Rule 1. Assume nothing. With apologies to Monty Python. _________________ Regards,
NeddySeagoon
Computer users fall into two groups:-
those that do backups
those that have never had a hard drive fail. |
|
Back to top |
|
|
|