Gentoo Forums
Gentoo Forums
Gentoo Forums
Quick Search: in
Trying to troubleshoot crash on new installation (SOLVED)
View unanswered posts
View posts from last 24 hours

 
Reply to topic    Gentoo Forums Forum Index Installing Gentoo
View previous topic :: View next topic  
Author Message
Featherfoot
Veteran
Veteran


Joined: 28 Dec 2002
Posts: 1108
Location: Stuart, Florida

PostPosted: Fri Feb 02, 2018 3:12 am    Post subject: Trying to troubleshoot crash on new installation (SOLVED) Reply with quote

I am rebuilding my amd_64 system after a catastrophe and I am having some problems.

I used a .config file from a system pre-crash to rebuild my system. It appears to mostly complete and then crash. The screen goes blank and after a few seconds there is a lot of whirring on the disk.
I tried to turn off X with the nox option on my boot and it seemed to work as Xorg.0.log stopped appearing in the logs. This implies to me that X is not the problem.

I am chrooting into the gentoo environment from Mint 18.3. I found that dmesg produces the Mint output even when I have chrooted in per the manual. Can I trust the listing in /var/log/dmesg? If not, is there a way to make it happen?

/var/log/sddm.log appeared to be reasonable for a long time then showed junk.
/var/log/messages is a hex dump. If I am looking for fatal messages, should I be looking at the tail?



Is there a way to log the messages so that I know which module could be the problem? Is increasing the dmesg level in the kernel worthwhile?


Last edited by Featherfoot on Sat Feb 03, 2018 2:27 am; edited 1 time in total
Back to top
View user's profile Send private message
Featherfoot
Veteran
Veteran


Joined: 28 Dec 2002
Posts: 1108
Location: Stuart, Florida

PostPosted: Fri Feb 02, 2018 3:15 am    Post subject: Reply with quote

I should have said my kernel is being built by genkernel-3.4.52.4-r2. The kernel is gentoo-sources-4.9.76-r1.

The system is an amd64 system with 16 Gb of memory. It uses an AMD Phenom II X4970 processor. The disk system is a 240 Gb Crucial SSD and a 1 Tb RAID 1 array.
Back to top
View user's profile Send private message
NeddySeagoon
Administrator
Administrator


Joined: 05 Jul 2003
Posts: 54216
Location: 56N 3W

PostPosted: Fri Feb 02, 2018 9:18 am    Post subject: Reply with quote

Featherfoot,

Edit your original post to provide further information until you have a reply. When you respond to yourself your topic falls out of the unanswered post search and helpers may miss it.

When you boot mint, you will always get the mint dmesg. dmesg is the booting kernels log.
Saving logs depends on having somewhere to save them. That only happens if the boot gets as far as remounting root read /write.
In the chroot, /var/log/dmesg will be the leftovers of your last normal boot. Do check the timestamp. If its pre apocalypse, it won't be useful.

For the purposes of the investigation, lets assume that the system boots normally but you can't see it.
Can you boot normally and ssh in to get logs?

In parallel, but on the same theme, please post your
Code:
lspci -nnk
output and your kernel .config file.

Something else to try is genkernel-next. I think its still in testing. Its a complete genkernel rewrite.
_________________
Regards,

NeddySeagoon

Computer users fall into two groups:-
those that do backups
those that have never had a hard drive fail.
Back to top
View user's profile Send private message
Featherfoot
Veteran
Veteran


Joined: 28 Dec 2002
Posts: 1108
Location: Stuart, Florida

PostPosted: Fri Feb 02, 2018 7:57 pm    Post subject: Reply with quote

Thanks for responding. As I mentioned, I booted with and without X and it crashed each time. There wasnt time to ssh in. I looked at the dmesg file in /var/log and it had a bunch of complaints from the open source Nvidia driver which shouldn't have been involved. Apparently you can set dmesg levels in the kernel. Do you know if they do anything useful for a non-kernel xeveloper?
Back to top
View user's profile Send private message
NeddySeagoon
Administrator
Administrator


Joined: 05 Jul 2003
Posts: 54216
Location: 56N 3W

PostPosted: Fri Feb 02, 2018 8:05 pm    Post subject: Reply with quote

Featherfoot,

Lets try some analysis.

Quote:
In parallel, but on the same theme, please post your
Code:
lspci -nnk   

output and your kernel .config file.


I suspect that more detailed kernel logging won't help.

Your kernel .config will need to go to a pastebin.
The dmesg may have a few hints. Put that onto a pastebin too.
_________________
Regards,

NeddySeagoon

Computer users fall into two groups:-
those that do backups
those that have never had a hard drive fail.
Back to top
View user's profile Send private message
Featherfoot
Veteran
Veteran


Joined: 28 Dec 2002
Posts: 1108
Location: Stuart, Florida

PostPosted: Fri Feb 02, 2018 9:21 pm    Post subject: Reply with quote

lspci follows:\
Code:

[gentoo]lspci
[chroot]bopper /usr/src/linux #  lspci -nnk
00:00.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD/ATI] RX780/RX790 Host Bridge [1002:5957]
   Subsystem: Advanced Micro Devices, Inc. [AMD/ATI] RX780/RX790 Host Bridge [1002:5957]
lspci: Unable to load libkmod resources: error -12
00:02.0 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD/ATI] RX780/RD790 PCI to PCI bridge (external gfx0 port A) [1002:5978]
   Kernel driver in use: pcieport
00:09.0 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD/ATI] RD790 PCI to PCI bridge (PCI express gpp port E) [1002:597e]
   Kernel driver in use: pcieport
00:0a.0 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD/ATI] RD790 PCI to PCI bridge (PCI express gpp port F) [1002:597f]
   Kernel driver in use: pcieport
00:11.0 RAID bus controller [0104]: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 SATA Controller [Non-RAID5 mode] [1002:4392] (rev 40)
   Subsystem: Gigabyte Technology Co., Ltd SB7x0/SB8x0/SB9x0 SATA Controller [Non-RAID5 mode] [1458:b002]
   Kernel driver in use: ahci
00:12.0 USB controller [0c03]: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB OHCI0 Controller [1002:4397]
   Subsystem: Gigabyte Technology Co., Ltd SB7x0/SB8x0/SB9x0 USB OHCI0 Controller [1458:5004]
   Kernel driver in use: ohci-pci
00:12.2 USB controller [0c03]: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB EHCI Controller [1002:4396]
   Subsystem: Gigabyte Technology Co., Ltd SB7x0/SB8x0/SB9x0 USB EHCI Controller [1458:5004]
   Kernel driver in use: ehci-pci
00:13.0 USB controller [0c03]: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB OHCI0 Controller [1002:4397]
   Subsystem: Gigabyte Technology Co., Ltd SB7x0/SB8x0/SB9x0 USB OHCI0 Controller [1458:5004]
   Kernel driver in use: ohci-pci
00:13.2 USB controller [0c03]: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB EHCI Controller [1002:4396]
   Subsystem: Gigabyte Technology Co., Ltd SB7x0/SB8x0/SB9x0 USB EHCI Controller [1458:5004]
   Kernel driver in use: ehci-pci
00:14.0 SMBus [0c05]: Advanced Micro Devices, Inc. [AMD/ATI] SBx00 SMBus Controller [1002:4385] (rev 41)
00:14.2 Audio device [0403]: Advanced Micro Devices, Inc. [AMD/ATI] SBx00 Azalia (Intel HDA) [1002:4383] (rev 40)
   Subsystem: Gigabyte Technology Co., Ltd SBx00 Azalia (Intel HDA) [1458:a102]
   Kernel driver in use: snd_hda_intel
00:14.3 ISA bridge [0601]: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 LPC host controller [1002:439d] (rev 40)
   Subsystem: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 LPC host controller [1002:439d]
00:14.4 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD/ATI] SBx00 PCI to PCI Bridge [1002:4384] (rev 40)
00:14.5 USB controller [0c03]: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB OHCI2 Controller [1002:4399]
   Subsystem: Gigabyte Technology Co., Ltd SB7x0/SB8x0/SB9x0 USB OHCI2 Controller [1458:5004]
   Kernel driver in use: ohci-pci
00:15.0 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD/ATI] SB700/SB800/SB900 PCI to PCI bridge (PCIE port 0) [1002:43a0]
   Kernel driver in use: pcieport
00:16.0 USB controller [0c03]: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB OHCI0 Controller [1002:4397]
   Subsystem: Gigabyte Technology Co., Ltd SB7x0/SB8x0/SB9x0 USB OHCI0 Controller [1458:5004]
   Kernel driver in use: ohci-pci
00:16.2 USB controller [0c03]: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB EHCI Controller [1002:4396]
   Subsystem: Gigabyte Technology Co., Ltd SB7x0/SB8x0/SB9x0 USB EHCI Controller [1458:5004]
   Kernel driver in use: ehci-pci
00:18.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Family 10h Processor HyperTransport Configuration [1022:1200]
00:18.1 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Family 10h Processor Address Map [1022:1201]
00:18.2 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Family 10h Processor DRAM Controller [1022:1202]
00:18.3 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Family 10h Processor Miscellaneous Control [1022:1203]
   Kernel driver in use: k10temp
00:18.4 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Family 10h Processor Link Control [1022:1204]
01:00.0 VGA compatible controller [0300]: NVIDIA Corporation GF104 [GeForce GTX 460] [10de:0e22] (rev a1)
   Subsystem: ZOTAC International (MCO) Ltd. GF104 [GeForce GTX 460] [19da:1166]
   Kernel driver in use: nvidia
01:00.1 Audio device [0403]: NVIDIA Corporation GF104 High Definition Audio Controller [10de:0beb] (rev a1)
   Subsystem: ZOTAC International (MCO) Ltd. GF104 High Definition Audio Controller [19da:1166]
   Kernel driver in use: snd_hda_intel
02:00.0 USB controller [0c03]: NEC Corporation uPD720200 USB 3.0 Host Controller [1033:0194] (rev 03)
   Subsystem: Gigabyte Technology Co., Ltd uPD720200 USB 3.0 Host Controller [1458:5007]
   Kernel driver in use: xhci_hcd
03:00.0 SATA controller [0106]: JMicron Technology Corp. JMB363 SATA/IDE Controller [197b:2363] (rev 03)
   Subsystem: Gigabyte Technology Co., Ltd Motherboard [1458:b000]
   Kernel driver in use: ahci
03:00.1 IDE interface [0101]: JMicron Technology Corp. JMB363 SATA/IDE Controller [197b:2363] (rev 03)
   Subsystem: Gigabyte Technology Co., Ltd Motherboard [1458:b000]
   Kernel driver in use: pata_jmicron
04:07.0 Ethernet controller [0200]: Realtek Semiconductor Co., Ltd. RTL8169 PCI Gigabit Ethernet Controller [10ec:8169] (rev 10)
   Subsystem: Realtek Semiconductor Co., Ltd. RTL8169/8110 Family PCI Gigabit Ethernet NIC [10ec:8169]
   Kernel driver in use: r8169
04:0e.0 FireWire (IEEE 1394) [0c00]: Texas Instruments TSB43AB23 IEEE-1394a-2000 Controller (PHY/Link) [104c:8024]
   Subsystem: Gigabyte Technology Co., Ltd Motherboard [1458:1000]
   Kernel driver in use: firewire_ohci
05:00.0 SATA controller [0106]: JMicron Technology Corp. JMB363 SATA/IDE Controller [197b:2363] (rev 02)
   Subsystem: Gigabyte Technology Co., Ltd Motherboard [1458:b000]
   Kernel driver in use: ahci
05:00.1 IDE interface [0101]: JMicron Technology Corp. JMB363 SATA/IDE Controller [197b:2363] (rev 02)
   Subsystem: Gigabyte Technology Co., Ltd Motherboard [1458:b000]
   Kernel driver in use: pata_jmicron
[chroot]bopper /usr/src/linux # 



My kernel config is at: https://pastebin.com/UrTJ8S8F

The listing of /var/log/dmesg (not the output of dmesg follows:https://pastebin.com/pNUwyK9E
Back to top
View user's profile Send private message
jburns
Veteran
Veteran


Joined: 18 Jan 2007
Posts: 1214
Location: Massachusetts USA

PostPosted: Sat Feb 03, 2018 1:29 am    Post subject: Reply with quote

From your dmesg output both the nouveau and the nvidia driver are being loaded.
Back to top
View user's profile Send private message
Featherfoot
Veteran
Veteran


Joined: 28 Dec 2002
Posts: 1108
Location: Stuart, Florida

PostPosted: Sat Feb 03, 2018 1:45 am    Post subject: Reply with quote

I see that too. I probably should remove nouveau from my .config file, but it worked before.

I rebooted and tried to ssh into the gentoo system from another computer. It failed. I was able to get information in /var/log/messages: https://pastebin.com/fyW522zN

There are lot of evbug messages that I need to figure out how to get rid of, but it basically shows the same thing
Back to top
View user's profile Send private message
Featherfoot
Veteran
Veteran


Joined: 28 Dec 2002
Posts: 1108
Location: Stuart, Florida

PostPosted: Sat Feb 03, 2018 2:26 am    Post subject: Reply with quote

I removed nouveau from the kernel and it started to work!

Of course, I have a few things to set up but I can probably solve these problems.

Thank all of you for your help.
Back to top
View user's profile Send private message
Display posts from previous:   
Reply to topic    Gentoo Forums Forum Index Installing Gentoo All times are GMT
Page 1 of 1

 
Jump to:  
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum