View previous topic :: View next topic |
Author |
Message |
opn n00b
Joined: 20 Aug 2012 Posts: 16
|
Posted: Mon Sep 03, 2012 8:10 am Post subject: [SOLVED] nvidia-drivers irq issue lenovo w530 nvidia k1000M |
|
|
Hello everybody,
I think my problem is related to this one: (I already posted there as you can see, but no answers as it's marked as [SOLVED] I think)
https://forums.gentoo.org/viewtopic-p-7118702.html
Difference for me is that I can boot fine until xdm starts gdm and loads the nvidia kernel module.
As stated there I can't startx with "Discrete Graphics"-mode turned on in Bios. Integrated Intel graphics works fine, but I can't use any external monitors what's the expected behaviour. As I really need more than the laptop display sometimes, I'm trying hard to get the nvidia gpu running. I'm pretty sure that my kernel is configured correctly as I triple checked the settings from the nvidia guide.
What I already tried:
- various Bios settings (hyperthreading on/off, intel virtualization on/off, optimus on/off, optimus os detection on/off, disable everything I can even mouse / trackpoint)
- Bios update to newest version (Version 1.08 UEFI: 1.08 / ECP: 1.06)
- different kernel parameters: irqpoll, irqfixup, nox2apic, pci=usepirqmask, acpi_os_name="Microsoft Windows", acpi=off, acpi=noirq, nolapic, noapic and even some more I think that I found as suggestions to solve nvidia optimus problems.
- remove 8gb of the 16gb ram I have as someone wrote in some bugtracker it could be an issue with 16gb of ram
The only kernel parameter that allowed me to boot into x is "acpi=noirq" all other tries didn't have an effect. The downside without irq is (again as said already in the other thread) I lose multicore and hyperthreading and my system runs terribly slow : /
Some informations that may help:
cat /proc/interrupts (discrete graphics mode):
Code: |
SEE MY 3RD POST PLEASE
|
dmesg shows that the graphic card should be on irq 17 but as loading the kernel module fails and results in a completely turned off screen black screen (no backlight) it doesn't appear in the above listing I think.
dmesg (discrete graphics mode):
SEE MY 3RD POST PLEASE
Xorg.0.log:
Code: |
SEE MY 3RD POST PLEASE
|
xorg.conf:
Code: |
SEE MY 3RD POST PLEASE
|
I also tried without setting the BusId implicitly.
lspci -vvv (discrete graphics mode):
SEE MY 3RD POST PLEASE
I tried nvidia-drivers 295.71, 302.17-r1, 304.37-r1 and 304.43 with "X acpi kernel_linux multilib tools" use flags enabled.
I'm running 3.5.3-gentoo 64bit with a self compiled kernel based on http://www.kernel-seeds.org/seeds/64_bit/gentoo/ configuration and tried the following other kernel versions:
[1] linux-3.3.8-gentoo
[2] linux-3.4.4-gentoo
[3] linux-3.4.5-gentoo
[4] linux-3.5.0-gentoo
[5] linux-3.5.1-gentoo
[6] linux-3.5.2-gentoo
I also tried genkernel 3.3.8 some time ago without success.
Yesterday I gave the the xorg-server release candidate a shot and installed version 1.12.99.905 in combination with mesa 8.0.4 or 8.0.4-r1
I spent hours searching around the net for a solution and tried everything I found. Many ubuntu users say they just turn on discrete graphics mode, disable os detection and intel virtualization stuff in bios and install the proprietary driver and it works for them. How can that be?
If you need any more input from me let me know.
I'm really stuck and can't believe there is no solution : /
Cheers
Alex
Last edited by opn on Wed Sep 19, 2012 10:30 pm; edited 3 times in total |
|
Back to top |
|
|
krinn Watchman
Joined: 02 May 2003 Posts: 7470
|
Posted: Mon Sep 03, 2012 10:58 am Post subject: |
|
|
All your info are totally incoherent :
- Showing an xorg.conf with dri disable and a xorg.log with dri enable
- Showing an interrupt list with irq 9 in use and a dmesg with a free irq 9 same for intel_hda that is MSI enable and your log show it has been stop.
I would say first, disable devices in your bios for trying, specially the snd_hda that take irq17 (same as your nvidia as you said) and have trouble with MSI in your log. (by devices i'm not speaking about any cpu features, just devices that take irq)
I would remove/unload vboxdrv too. And re-enable virtualization cpu feature in your bios. Re-read nvidia doc for feature, you are still using xorg opengl in your log.
Try to produce something coherent or with some kind of logic instead of a random infos from your many tries. We are blind, and anytime you will change something your previous infos gaven goes outdate and may even drive us in bad path as we couldn't guess you've made any change. |
|
Back to top |
|
|
opn n00b
Joined: 20 Aug 2012 Posts: 16
|
Posted: Mon Sep 03, 2012 1:41 pm Post subject: |
|
|
Sorry for that. I have tons of text files from the tries and must have gotten confused. Wasn't my intention to post such a crap : / I'll give it another try tomorrow and edit the post then with updated, coherent and new informations as I've got no time left today to give it another shot.
Regarding the snd_hda on irq 17: I think I can't deactivate it. There are plenty of switches for all kinds of devices but none for the onboard audio. I've just searched around a bit and it seems there really is no Bios switch to turn it off. Will it be enough to just remove the snd_hda kernel modules? |
|
Back to top |
|
|
opn n00b
Joined: 20 Aug 2012 Posts: 16
|
Posted: Tue Sep 04, 2012 1:48 pm Post subject: |
|
|
Im back from my next try to give you the new, coherent information.
Setup:
Bios:
- Disabled every device I can, except ethernet so that I can still ssh into the system to provide the informations. I also disabled the virtualization stuff to be sure it's not the cause for the problem. (I enabled it one time to check if it'll work then but it didn't)
- Discrete Graphics Mode, disabled OS detection
Kernel: 3.5.3-gentoo
- without snd_hda_intel configured to be sure it's not sitting on irq 17 (that seems to have worked, see interrupts)
- removed all intel hd configurations to be sure it's not somehow interfering
- checked http://www.gentoo.org/doc/en/nvidia-guide.xml again, everything was set correct and I'm using uvesafb
cat /proc/interrupts (discrete graphics mode):
Code: |
CPU0 CPU1 CPU2 CPU3 CPU4 CPU5 CPU6 CPU7
0: 45 0 0 0 0 0 0 0 IO-APIC-edge timer
1: 1 8 0 0 0 0 0 0 IO-APIC-edge i8042
8: 4 1 0 0 0 0 0 1 IO-APIC-edge rtc0
9: 376 92 13 49 20 25 4 4 IO-APIC-fasteoi acpi
12: 2363 9 1 1 0 1 1 6 IO-APIC-edge i8042
16: 35 0 0 1 0 0 1 0 IO-APIC-fasteoi ehci_hcd:usb1, mmc0
23: 40 1 1 1 0 0 0 0 IO-APIC-fasteoi ehci_hcd:usb2
41: 7463 5703 923 1045 694 223 377 459 PCI-MSI-edge ahci
42: 646 506 17 9 6 16 7 5 PCI-MSI-edge eth0
43: 0 0 0 0 0 0 0 0 PCI-MSI-edge xhci_hcd
NMI: 0 0 0 0 0 0 0 0 Non-maskable interrupts
LOC: 10462 12565 6685 8916 5639 9866 6750 6218 Local timer interrupts
SPU: 0 0 0 0 0 0 0 0 Spurious interrupts
PMI: 0 0 0 0 0 0 0 0 Performance monitoring interrupts
IWI: 0 0 0 0 0 0 0 0 IRQ work interrupts
RTR: 6 0 0 0 0 0 0 0 APIC ICR read retries
RES: 1309 661 673 281 117 153 73 107 Rescheduling interrupts
CAL: 310 292 316 323 314 326 321 321 Function call interrupts
TLB: 1555 1280 999 1485 696 1418 452 613 TLB shootdowns
TRM: 0 0 0 0 0 0 0 0 Thermal event interrupts
THR: 0 0 0 0 0 0 0 0 Threshold APIC interrupts
MCE: 0 0 0 0 0 0 0 0 Machine check exceptions
MCP: 4 4 4 4 4 4 4 4 Machine check polls
ERR: 0
MIS: 0
|
dmesg (discrete graphics mode):
http://pastebin.com/8h0ZZs18
xorg.log:
http://pastebin.com/LhaCfDjq
Code: |
# nvidia-xconfig: X configuration file generated by nvidia-xconfig
# nvidia-xconfig: version 304.43 (buildmeister@swio-display-x86-rhel47-13) Sun Aug 19 21:19:28 PDT 2012
Section "ServerLayout"
Identifier "Layout0"
Screen 0 "Screen0"
InputDevice "Keyboard0" "CoreKeyboard"
InputDevice "Mouse0" "CorePointer"
EndSection
Section "Files"
EndSection
Section "InputDevice"
# generated from data in "/etc/conf.d/gpm"
Identifier "Mouse0"
Driver "mouse"
Option "Protocol"
Option "Device" "/dev/input/mice"
Option "Emulate3Buttons" "no"
Option "ZAxisMapping" "4 5"
EndSection
Section "InputDevice"
# generated from default
Identifier "Keyboard0"
Driver "kbd"
EndSection
Section "Monitor"
Identifier "Monitor0"
VendorName "Unknown"
ModelName "Unknown"
HorizSync 28.0 - 33.0
VertRefresh 43.0 - 72.0
Option "DPMS"
EndSection
Section "Device"
Identifier "Device0"
Driver "nvidia"
VendorName "NVIDIA Corporation"
EndSection
Section "Screen"
Identifier "Screen0"
Device "Device0"
Monitor "Monitor0"
DefaultDepth 24
SubSection "Display"
Depth 24
EndSubSection
EndSection
|
lspci -vvv (discrete graphics mode):
http://pastebin.com/m3PWgeGA
The logs changed a bit, but the problem stays the same. My problem seems to be in dmesg:
Code: | [ 27.210799] NVRM: GPU at 0000:01:00.0 has fallen off the bus.
[ 27.210811] NVRM: os_pci_init_handle: invalid context!
[ 27.210813] NVRM: os_pci_init_handle: invalid context!
[ 27.210818] NVRM: GPU at 0000:01:00.0 has fallen off the bus.
[ 27.210822] NVRM: os_pci_init_handle: invalid context!
[ 27.210824] NVRM: os_pci_init_handle: invalid context!
[ 27.494129] NVRM: RmInitAdapter failed! (0x26:0xffffffff:1183)
[ 27.494153] NVRM: rm_init_adapter(0) failed
[ 27.495414] NVRM: RmInitAdapter failed! (0x23:0x2f:675)
[ 27.495418] NVRM: rm_init_adapter(0) failed
[ 27.495435] nvidia-smi (2185) used greatest stack depth: 3928 bytes left |
Any suggestions now? Sure i googled already and found a bunch of nvidia bug reports but no real workaround :-/
EDIT: Forgot to say that acpi=noirq as kernel parameter still makes the system bootable with 1 cpu / core and glx also works then.
Thanks for helping and sorry for the bad informations I gave yesterday!
Regards
Alex |
|
Back to top |
|
|
madoldman n00b
Joined: 11 Sep 2012 Posts: 2
|
Posted: Tue Sep 11, 2012 9:25 am Post subject: |
|
|
Have you solved the problem?
I have the same problem too..... |
|
Back to top |
|
|
opn n00b
Joined: 20 Aug 2012 Posts: 16
|
Posted: Wed Sep 12, 2012 12:31 pm Post subject: |
|
|
No, I didn't : / Posting here in the forums was my last resort and I'm completely out of ideas. What's your system? W530 K1000M, too?
Meanwhile I installed Windows 7 on a second hdd to see if the nvidia card could be damaged. In Windows everything runs fine in discrete mode and I can even play games.
If you ever find a solution I'd be happy if you let me know. Be sure that I'll do the same! |
|
Back to top |
|
|
GODLiKE n00b
Joined: 11 Jan 2007 Posts: 62
|
Posted: Wed Sep 12, 2012 8:46 pm Post subject: |
|
|
I posted on the other thread you mentioned, have this same issue. After some digging it appears that there are more people with other notebook/video card models who experience the same issue when switching to discrete on the BIOS.
I would be all for using Optimus via Bumblebee here, but that one doesn't work either (in some of my tries, I even saw an Xorg.X.log message saying that the video card was not recognized, and I have the latest NVIDIA blob drivers). Could you try and see if you can get bumblebee running? The relevant ebuilds are in sabayon's overlay. I think maybe these two problems (black screen unless acpi=noirq AND bumblebee not working) are related. _________________ GERÓNIMOOOOOOOOOOOOOOOOOOOO!!! |
|
Back to top |
|
|
opn n00b
Joined: 20 Aug 2012 Posts: 16
|
Posted: Thu Sep 13, 2012 7:40 am Post subject: |
|
|
At least I'm not alone anymore : )
Alright, I'll try to get Bumblebee running and will report back here. Can take a few days though as I'll hardly find time for it at the weekend. |
|
Back to top |
|
|
GODLiKE n00b
Joined: 11 Jan 2007 Posts: 62
|
Posted: Thu Sep 13, 2012 4:25 pm Post subject: |
|
|
What bothered me is that when I tried with Ubuntu, Bumblebee was running flawlessly.
I have tried to mimic Ubuntu's environment, the only thing I did not get to do was to downgrade X to 1.12 or 1.11 (I don't know which one 12.04 uses). However, no success, even with kernel-3.2 _________________ GERÓNIMOOOOOOOOOOOOOOOOOOOO!!! |
|
Back to top |
|
|
opn n00b
Joined: 20 Aug 2012 Posts: 16
|
Posted: Mon Sep 17, 2012 9:35 pm Post subject: |
|
|
Just installed bumblebee and tried to run glxgears:
xorg.log says:
NVIDIA(0): The NVIDIA GPU at PCI:1:0:0 is not supported by the 304.43
but I guess it's just because of dmesg still saying:
Code: | [ 45.574647] NVRM: loading NVIDIA UNIX x86_64 Kernel Module 304.43 Sun Aug 19 20:14:03 PDT 2012
[ 53.545546] NVRM: GPU at 0000:01:00.0 has fallen off the bus.
[ 53.545586] NVRM: os_pci_init_handle: invalid context!
[ 53.545587] NVRM: os_pci_init_handle: invalid context!
[ 53.545591] NVRM: GPU at 0000:01:00.0 has fallen off the bus.
[ 53.545594] NVRM: os_pci_init_handle: invalid context!
[ 53.545595] NVRM: os_pci_init_handle: invalid context!
[ 53.771893] NVRM: RmInitAdapter failed! (0x26:0xffffffff:1183)
[ 53.771903] NVRM: rm_init_adapter(0) failed
[ 53.773481] NVRM: RmInitAdapter failed! (0x23:0x2f:675)
[ 53.773484] NVRM: rm_init_adapter(0) failed |
|
|
Back to top |
|
|
GODLiKE n00b
Joined: 11 Jan 2007 Posts: 62
|
Posted: Mon Sep 17, 2012 10:40 pm Post subject: |
|
|
Then welcome to my world xD I've managed (with the help from the guys at #bumblebee on Freenode) to narrow it down to that same thing. You can join #bumblebee if you like, at least now there'll be two of us with that issue xD.
Next thing I'm going to try when I have some time is booting with an Ubuntu liveusb and see if the problem persists in there. _________________ GERÓNIMOOOOOOOOOOOOOOOOOOOO!!! |
|
Back to top |
|
|
GODLiKE n00b
Joined: 11 Jan 2007 Posts: 62
|
Posted: Wed Sep 19, 2012 1:59 am Post subject: |
|
|
Frack my life. I just booted with an Ubuntu 12.04 amd64 livecd and the GPU works.
Ok, it's an old driver and right now I'm trying to get the latest since it recognizes my card as a Quadro 1100M instead of a K100M, but at least nvidia-xconfig -query-gpu-info is not bailing out, and I have no "GPU has fallen off the bus" errors. (EDIT: tried with current from x-swat PPA. Version is 304.43. It works)
So, should I try an ubuntu-sources kernel inside my Gentoo box? Oh the humanity... _________________ GERÓNIMOOOOOOOOOOOOOOOOOOOO!!! |
|
Back to top |
|
|
GODLiKE n00b
Joined: 11 Jan 2007 Posts: 62
|
Posted: Wed Sep 19, 2012 9:37 pm Post subject: |
|
|
Solved. More info here: http://nvnews.net/vbulletin/showthread.php?p=2576162#post2576162
Summing up: I enabled these 4 kernel options
Code: |
CONFIG_NO_HZ
CONFIG_RCU_FAST_NO_HZ
CONFIG_CALGARY_IOMMU
CONFIG_CALGARY_IOMMU_ENABLED_BY_DEFAULT
|
... after I booted with the Ubuntu kernel in my laptop U_U (no, I'm not proud of it, but I found the culprit). _________________ GERÓNIMOOOOOOOOOOOOOOOOOOOO!!! |
|
Back to top |
|
|
kernelOfTruth Watchman
Joined: 20 Dec 2005 Posts: 6111 Location: Vienna, Austria; Germany; hello world :)
|
|
Back to top |
|
|
opn n00b
Joined: 20 Aug 2012 Posts: 16
|
Posted: Wed Sep 19, 2012 10:29 pm Post subject: |
|
|
Marking this one as solved : )
After enabling the 4 kernel options that godlike suggested (thanks a lot!) I can finally boot in discrete graphics mode without any problems and use external monitors!
kernelOfTruth: Yes you're right. I don't know if there are other models out there that don't share this "problem" but for the W520/W530 that's really the case sadly. |
|
Back to top |
|
|
GODLiKE n00b
Joined: 11 Jan 2007 Posts: 62
|
Posted: Fri Sep 21, 2012 7:08 pm Post subject: |
|
|
kernelOfTruth, it depends on the laptop manufacturer and how gentle they are with BIOS settings and how the GPUs are hardwired to the external ports. In the case of the Thinkpad W530, the VGA is attached to the iGPU and the miniDP is attached to the dGPU. Also, the Thinkpad BIOS lets you choose between booting on the iGPU, the dGPU, or Optimus. If you choose to boot on Optimus, there are some people who managed to connect 2 external monitors (one on the VGA, one on the miniDP) and make them work all together using xrandr/xinerama/synergy. I have not tested that sadly as I don't have spare external monitors at home. Here's one of the links though: http://zachstechnotes.blogspot.com.ar/2012/01/tri-head-display-on-linux-thinkpad-w520.html _________________ GERÓNIMOOOOOOOOOOOOOOOOOOOO!!! |
|
Back to top |
|
|
|