Gentoo Forums
Gentoo Forums
Gentoo Forums
Quick Search: in
Something Happened
View unanswered posts
View posts from last 24 hours

 
Reply to topic    Gentoo Forums Forum Index Other Things Gentoo
View previous topic :: View next topic  
Author Message
Bigun
Advocate
Advocate


Joined: 21 Sep 2003
Posts: 2196

PostPosted: Tue Aug 23, 2016 9:08 am    Post subject: Something Happened Reply with quote

So, I walk into my computer room, and noticed some of the HD lights on one of my towers isn't blinking. I log in and check the torrent program, and it locks up. I shut down the process restart it, it locks up again. So I try to list the contents of what I was sharing, locks up. I check dmesg, and find this:

Code:
[118954.637733] kworker/dying (1028) used greatest stack depth: 5332 bytes left
[150953.331750] kworker/dying (10761) used greatest stack depth: 4988 bytes left
[338220.524869] CIFS VFS: Autodisabling the use of server inode numbers on \\127.0.0.1\Media. This server doesn't seem to support them properly. Hardlinks will not be recognized on this mount. Consider mounting with the "noserverino" option to silence this message.
[1672067.452550] BUG: unable to handle kernel paging request at 20726570
[1672067.452599] IP: [<20726570>] 0x20726570
[1672067.452627] *pdpt = 0000000030290001 *pde = 0000000000000000
[1672067.452666] Oops: 0010 [#1] SMP
[1672067.452690] Modules linked in: xt_mark
[1672067.452719] CPU: 1 PID: 7709 Comm: cifsd Not tainted 4.4.6-gentoo #1
[1672067.452758] Hardware name: Gigabyte Technology Co., Ltd. To be filled by O.E.M./H61M-DS2H, BIOS F5 04/02/2012
[1672067.452815] task: f1247700 ti: f2b5c000 task.ti: f2b5c000
[1672067.452848] EIP: 0060:[<20726570>] EFLAGS: 00010246 CPU: 1
[1672067.452881] EIP is at 0x20726570
[1672067.452902] EAX: c9efd2c0 EBX: f254c000 ECX: c1aa6538 EDX: f254c154
[1672067.452939] ESI: c9efd2c0 EDI: c11ebf47 EBP: f2b5df6c ESP: f2b5df34
[1672067.452976]  DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
[1672067.453008] CR0: 80050033 CR2: 20726570 CR3: 313a85c0 CR4: 000406b0
[1672067.453045] Stack:
[1672067.453059]  c11db04c f254c038 f1247700 0029de28 00000000 e6e2b6c0 c9efd2c0 f1247700
[1672067.453117]  f2b5e000 f254c000 c11daa21 f3169ae0 f254c000 c11daa21 f2b5dfac c1047119
[1672067.453175]  00000000 00000000 f254c000 00000000 00000000 f2b5df88 f2b5df88 00000000
[1672067.453232] Call Trace:
[1672067.453253]  [<c11db04c>] ? cifs_demultiplex_thread+0x62b/0x644
[1672067.453289]  [<c11daa21>] ? dequeue_mid+0x3f/0x3f
[1672067.453318]  [<c11daa21>] ? dequeue_mid+0x3f/0x3f
[1672067.453349]  [<c1047119>] kthread+0x9f/0xa4
[1672067.453378]  [<c1678789>] ret_from_kernel_thread+0x21/0x38
[1672067.453412]  [<c104707a>] ? kthread_create_on_node+0xf7/0xf7
[1672067.453445] Code:  Bad EIP value.
[1672067.453469] EIP: [<20726570>] 0x20726570 SS:ESP 0068:f2b5df34
[1672067.453507] CR2: 0000000020726570
[1672067.459231] ---[ end trace 9a81b92b23dad5a3 ]---


I see mentioning of network equipment, CIFS, memory addresses, I'm not sure where to point the finger. Going to try a reboot.

*edit*

A reboot seems to have fixed things for now. Unsure what happened, I usually can maintain several months worth of uptime before I decide to compile a new kernel.
_________________
"It's ok, they might have guns but we have flowers." - Perpetual Victim
Back to top
View user's profile Send private message
Roman_Gruber
Advocate
Advocate


Joined: 03 Oct 2006
Posts: 3846
Location: Austro Bavaria

PostPosted: Tue Aug 23, 2016 10:07 am    Post subject: Reply with quote

Generic advise: Check your cables if they are proper connected or just unplug and replug any cables and plugin cards ....

Looks like a desctop box and maybe something not very well connected can cause random crap messages sometimes.
Back to top
View user's profile Send private message
Bigun
Advocate
Advocate


Joined: 21 Sep 2003
Posts: 2196

PostPosted: Tue Aug 23, 2016 10:43 am    Post subject: Reply with quote

tw04l124 wrote:
Generic advise: Check your cables if they are proper connected or just unplug and replug any cables and plugin cards ....

Looks like a desctop box and maybe something not very well connected can cause random crap messages sometimes.


That tells me all I need to know. I have toddlers that like to watch their tablet on top of that machine.
_________________
"It's ok, they might have guns but we have flowers." - Perpetual Victim
Back to top
View user's profile Send private message
NeddySeagoon
Administrator
Administrator


Joined: 05 Jul 2003
Posts: 54216
Location: 56N 3W

PostPosted: Tue Aug 23, 2016 11:49 am    Post subject: Reply with quote

Bigun,

Bigun wrote:
I have toddlers ...
That explains almost anything :)
Check for jam sandwiches inserted into any suitably sized (to a todders mind) orifice on the system..
_________________
Regards,

NeddySeagoon

Computer users fall into two groups:-
those that do backups
those that have never had a hard drive fail.
Back to top
View user's profile Send private message
Hu
Moderator
Moderator


Joined: 06 Mar 2007
Posts: 21602

PostPosted: Tue Aug 23, 2016 11:18 pm    Post subject: Reply with quote

That eip value looks suspicious. When interpreted as ASCII, it is 'FAH '. That would never be a valid kernel address, but could be a valid exploit address. It could also be caused by the kernel treating freed/corrupted memory as a pointer to a function.
Back to top
View user's profile Send private message
Bigun
Advocate
Advocate


Joined: 21 Sep 2003
Posts: 2196

PostPosted: Wed Aug 24, 2016 10:51 am    Post subject: Reply with quote

Hu wrote:
That eip value looks suspicious. When interpreted as ASCII, it is 'FAH '. That would never be a valid kernel address, but could be a valid exploit address. It could also be caused by the kernel treating freed/corrupted memory as a pointer to a function.


Should I run a RAM check on this?
_________________
"It's ok, they might have guns but we have flowers." - Perpetual Victim
Back to top
View user's profile Send private message
NeddySeagoon
Administrator
Administrator


Joined: 05 Jul 2003
Posts: 54216
Location: 56N 3W

PostPosted: Wed Aug 24, 2016 11:22 am    Post subject: Reply with quote

Bigun,

It won't do any harm to do a RAM check.

If you don't have ECC RAM, the random bit clearing that happens due to environmental noise will go undetected.
Then you get this sort of random error instead of a warning in dmesg.

In the early days of DRAM, alpha particle induced errors were a major issue. Design changes over the years have helped reduce the error rates but DRAMS have got bigger too. In fact, the alpha particle errors have been designed out but there are other external sources of RAM errors too.
_________________
Regards,

NeddySeagoon

Computer users fall into two groups:-
those that do backups
those that have never had a hard drive fail.
Back to top
View user's profile Send private message
depontius
Advocate
Advocate


Joined: 05 May 2004
Posts: 3509

PostPosted: Wed Aug 24, 2016 3:29 pm    Post subject: Reply with quote

Back to the toddlers...

When my daughter was three-and-change, we were in a local computer store. I only turned around for a moment, (frequent parent's lament) I saw her clicking the "Confirm" on the delete screen for some file or other, don't know what, on one of their floor computers. (Not bad for random toddler-fiddling?)

When my son was a similar age, I said something to my wife about the roll of masking tape sitting on the floor. My son said, "Tape?" and picked it up, took it over to the cassette player, and tried to put it in.

Toddlers can be a sufficient reason for almost any mysterious malfunction.
_________________
.sigs waste space and bandwidth
Back to top
View user's profile Send private message
1clue
Advocate
Advocate


Joined: 05 Feb 2006
Posts: 2569

PostPosted: Wed Aug 24, 2016 3:54 pm    Post subject: Reply with quote

I know it sounds a bit ridiculous, but do you have screws holding your cards into the chassis? While reseating your cables and such, you might want to look inside the box too.
Back to top
View user's profile Send private message
NeddySeagoon
Administrator
Administrator


Joined: 05 Jul 2003
Posts: 54216
Location: 56N 3W

PostPosted: Wed Aug 24, 2016 4:46 pm    Post subject: Reply with quote

1clue,

Rule 1. Assume nothing. With apologies to Monty Python.
_________________
Regards,

NeddySeagoon

Computer users fall into two groups:-
those that do backups
those that have never had a hard drive fail.
Back to top
View user's profile Send private message
Display posts from previous:   
Reply to topic    Gentoo Forums Forum Index Other Things Gentoo All times are GMT
Page 1 of 1

 
Jump to:  
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum