Gentoo Forums
Gentoo Forums
Gentoo Forums
Quick Search: in
[SOLVED] nvidia-drivers irq issue lenovo w530 nvidia k1000M
View unanswered posts
View posts from last 24 hours
View posts from last 7 days

 
Reply to topic    Gentoo Forums Forum Index Desktop Environments
View previous topic :: View next topic  
Author Message
opn
n00b
n00b


Joined: 20 Aug 2012
Posts: 16

PostPosted: Mon Sep 03, 2012 8:10 am    Post subject: [SOLVED] nvidia-drivers irq issue lenovo w530 nvidia k1000M Reply with quote

Hello everybody,

I think my problem is related to this one: (I already posted there as you can see, but no answers as it's marked as [SOLVED] I think)

https://forums.gentoo.org/viewtopic-p-7118702.html

Difference for me is that I can boot fine until xdm starts gdm and loads the nvidia kernel module.

As stated there I can't startx with "Discrete Graphics"-mode turned on in Bios. Integrated Intel graphics works fine, but I can't use any external monitors what's the expected behaviour. As I really need more than the laptop display sometimes, I'm trying hard to get the nvidia gpu running. I'm pretty sure that my kernel is configured correctly as I triple checked the settings from the nvidia guide.

What I already tried:

- various Bios settings (hyperthreading on/off, intel virtualization on/off, optimus on/off, optimus os detection on/off, disable everything I can even mouse / trackpoint)
- Bios update to newest version (Version 1.08 UEFI: 1.08 / ECP: 1.06)
- different kernel parameters: irqpoll, irqfixup, nox2apic, pci=usepirqmask, acpi_os_name="Microsoft Windows", acpi=off, acpi=noirq, nolapic, noapic and even some more I think that I found as suggestions to solve nvidia optimus problems.
- remove 8gb of the 16gb ram I have as someone wrote in some bugtracker it could be an issue with 16gb of ram

The only kernel parameter that allowed me to boot into x is "acpi=noirq" all other tries didn't have an effect. The downside without irq is (again as said already in the other thread) I lose multicore and hyperthreading and my system runs terribly slow : /

Some informations that may help:

cat /proc/interrupts (discrete graphics mode):

Code:

SEE MY 3RD POST PLEASE


dmesg shows that the graphic card should be on irq 17 but as loading the kernel module fails and results in a completely turned off screen black screen (no backlight) it doesn't appear in the above listing I think.

dmesg (discrete graphics mode):

SEE MY 3RD POST PLEASE

Xorg.0.log:

Code:

SEE MY 3RD POST PLEASE


xorg.conf:
Code:

SEE MY 3RD POST PLEASE


I also tried without setting the BusId implicitly.


lspci -vvv (discrete graphics mode):
SEE MY 3RD POST PLEASE

I tried nvidia-drivers 295.71, 302.17-r1, 304.37-r1 and 304.43 with "X acpi kernel_linux multilib tools" use flags enabled.

I'm running 3.5.3-gentoo 64bit with a self compiled kernel based on http://www.kernel-seeds.org/seeds/64_bit/gentoo/ configuration and tried the following other kernel versions:

[1] linux-3.3.8-gentoo
[2] linux-3.4.4-gentoo
[3] linux-3.4.5-gentoo
[4] linux-3.5.0-gentoo
[5] linux-3.5.1-gentoo
[6] linux-3.5.2-gentoo

I also tried genkernel 3.3.8 some time ago without success.

Yesterday I gave the the xorg-server release candidate a shot and installed version 1.12.99.905 in combination with mesa 8.0.4 or 8.0.4-r1

I spent hours searching around the net for a solution and tried everything I found. Many ubuntu users say they just turn on discrete graphics mode, disable os detection and intel virtualization stuff in bios and install the proprietary driver and it works for them. How can that be?

If you need any more input from me let me know.

I'm really stuck and can't believe there is no solution : /

Cheers
Alex


Last edited by opn on Wed Sep 19, 2012 10:30 pm; edited 3 times in total
Back to top
View user's profile Send private message
krinn
Watchman
Watchman


Joined: 02 May 2003
Posts: 7470

PostPosted: Mon Sep 03, 2012 10:58 am    Post subject: Reply with quote

All your info are totally incoherent :

- Showing an xorg.conf with dri disable and a xorg.log with dri enable
- Showing an interrupt list with irq 9 in use and a dmesg with a free irq 9 same for intel_hda that is MSI enable and your log show it has been stop.


I would say first, disable devices in your bios for trying, specially the snd_hda that take irq17 (same as your nvidia as you said) and have trouble with MSI in your log. (by devices i'm not speaking about any cpu features, just devices that take irq)
I would remove/unload vboxdrv too. And re-enable virtualization cpu feature in your bios. Re-read nvidia doc for feature, you are still using xorg opengl in your log.

Try to produce something coherent or with some kind of logic instead of a random infos from your many tries. We are blind, and anytime you will change something your previous infos gaven goes outdate and may even drive us in bad path as we couldn't guess you've made any change.
Back to top
View user's profile Send private message
opn
n00b
n00b


Joined: 20 Aug 2012
Posts: 16

PostPosted: Mon Sep 03, 2012 1:41 pm    Post subject: Reply with quote

Sorry for that. I have tons of text files from the tries and must have gotten confused. Wasn't my intention to post such a crap : / I'll give it another try tomorrow and edit the post then with updated, coherent and new informations as I've got no time left today to give it another shot.

Regarding the snd_hda on irq 17: I think I can't deactivate it. There are plenty of switches for all kinds of devices but none for the onboard audio. I've just searched around a bit and it seems there really is no Bios switch to turn it off. Will it be enough to just remove the snd_hda kernel modules?
Back to top
View user's profile Send private message
opn
n00b
n00b


Joined: 20 Aug 2012
Posts: 16

PostPosted: Tue Sep 04, 2012 1:48 pm    Post subject: Reply with quote

Im back from my next try to give you the new, coherent information.

Setup:

Bios:
- Disabled every device I can, except ethernet so that I can still ssh into the system to provide the informations. I also disabled the virtualization stuff to be sure it's not the cause for the problem. (I enabled it one time to check if it'll work then but it didn't)
- Discrete Graphics Mode, disabled OS detection

Kernel: 3.5.3-gentoo
- without snd_hda_intel configured to be sure it's not sitting on irq 17 (that seems to have worked, see interrupts)
- removed all intel hd configurations to be sure it's not somehow interfering
- checked http://www.gentoo.org/doc/en/nvidia-guide.xml again, everything was set correct and I'm using uvesafb

cat /proc/interrupts (discrete graphics mode):

Code:


           CPU0       CPU1       CPU2       CPU3       CPU4       CPU5       CPU6       CPU7       
  0:         45          0          0          0          0          0          0          0   IO-APIC-edge      timer
  1:          1          8          0          0          0          0          0          0   IO-APIC-edge      i8042
  8:          4          1          0          0          0          0          0          1   IO-APIC-edge      rtc0
  9:        376         92         13         49         20         25          4          4   IO-APIC-fasteoi   acpi
 12:       2363          9          1          1          0          1          1          6   IO-APIC-edge      i8042
 16:         35          0          0          1          0          0          1          0   IO-APIC-fasteoi   ehci_hcd:usb1, mmc0
 23:         40          1          1          1          0          0          0          0   IO-APIC-fasteoi   ehci_hcd:usb2
 41:       7463       5703        923       1045        694        223        377        459   PCI-MSI-edge      ahci
 42:        646        506         17          9          6         16          7          5   PCI-MSI-edge      eth0
 43:          0          0          0          0          0          0          0          0   PCI-MSI-edge      xhci_hcd
NMI:          0          0          0          0          0          0          0          0   Non-maskable interrupts
LOC:      10462      12565       6685       8916       5639       9866       6750       6218   Local timer interrupts
SPU:          0          0          0          0          0          0          0          0   Spurious interrupts
PMI:          0          0          0          0          0          0          0          0   Performance monitoring interrupts
IWI:          0          0          0          0          0          0          0          0   IRQ work interrupts
RTR:          6          0          0          0          0          0          0          0   APIC ICR read retries
RES:       1309        661        673        281        117        153         73        107   Rescheduling interrupts
CAL:        310        292        316        323        314        326        321        321   Function call interrupts
TLB:       1555       1280        999       1485        696       1418        452        613   TLB shootdowns
TRM:          0          0          0          0          0          0          0          0   Thermal event interrupts
THR:          0          0          0          0          0          0          0          0   Threshold APIC interrupts
MCE:          0          0          0          0          0          0          0          0   Machine check exceptions
MCP:          4          4          4          4          4          4          4          4   Machine check polls
ERR:          0
MIS:          0


dmesg (discrete graphics mode):
http://pastebin.com/8h0ZZs18

xorg.log:
http://pastebin.com/LhaCfDjq

Code:


# nvidia-xconfig: X configuration file generated by nvidia-xconfig
# nvidia-xconfig:  version 304.43  (buildmeister@swio-display-x86-rhel47-13)  Sun Aug 19 21:19:28 PDT 2012

Section "ServerLayout"
    Identifier     "Layout0"
    Screen      0  "Screen0"
    InputDevice    "Keyboard0" "CoreKeyboard"
    InputDevice    "Mouse0" "CorePointer"
EndSection

Section "Files"
EndSection

Section "InputDevice"
    # generated from data in "/etc/conf.d/gpm"
    Identifier     "Mouse0"
    Driver         "mouse"
    Option         "Protocol"
    Option         "Device" "/dev/input/mice"
    Option         "Emulate3Buttons" "no"
    Option         "ZAxisMapping" "4 5"
EndSection

Section "InputDevice"
    # generated from default
    Identifier     "Keyboard0"
    Driver         "kbd"
EndSection

Section "Monitor"
    Identifier     "Monitor0"
    VendorName     "Unknown"
    ModelName      "Unknown"
    HorizSync       28.0 - 33.0
    VertRefresh     43.0 - 72.0
    Option         "DPMS"
EndSection

Section "Device"
    Identifier     "Device0"
    Driver         "nvidia"
    VendorName     "NVIDIA Corporation"
EndSection

Section "Screen"
    Identifier     "Screen0"
    Device         "Device0"
    Monitor        "Monitor0"
    DefaultDepth    24
    SubSection     "Display"
        Depth       24
    EndSubSection
EndSection


lspci -vvv (discrete graphics mode):
http://pastebin.com/m3PWgeGA

The logs changed a bit, but the problem stays the same. My problem seems to be in dmesg:

Code:
[   27.210799] NVRM: GPU at 0000:01:00.0 has fallen off the bus.
[   27.210811] NVRM: os_pci_init_handle: invalid context!
[   27.210813] NVRM: os_pci_init_handle: invalid context!
[   27.210818] NVRM: GPU at 0000:01:00.0 has fallen off the bus.
[   27.210822] NVRM: os_pci_init_handle: invalid context!
[   27.210824] NVRM: os_pci_init_handle: invalid context!
[   27.494129] NVRM: RmInitAdapter failed! (0x26:0xffffffff:1183)
[   27.494153] NVRM: rm_init_adapter(0) failed
[   27.495414] NVRM: RmInitAdapter failed! (0x23:0x2f:675)
[   27.495418] NVRM: rm_init_adapter(0) failed
[   27.495435] nvidia-smi (2185) used greatest stack depth: 3928 bytes left


Any suggestions now? Sure i googled already and found a bunch of nvidia bug reports but no real workaround :-/

EDIT: Forgot to say that acpi=noirq as kernel parameter still makes the system bootable with 1 cpu / core and glx also works then.

Thanks for helping and sorry for the bad informations I gave yesterday!

Regards
Alex
Back to top
View user's profile Send private message
madoldman
n00b
n00b


Joined: 11 Sep 2012
Posts: 2

PostPosted: Tue Sep 11, 2012 9:25 am    Post subject: Reply with quote

Have you solved the problem?
I have the same problem too.....
Back to top
View user's profile Send private message
opn
n00b
n00b


Joined: 20 Aug 2012
Posts: 16

PostPosted: Wed Sep 12, 2012 12:31 pm    Post subject: Reply with quote

No, I didn't : / Posting here in the forums was my last resort and I'm completely out of ideas. What's your system? W530 K1000M, too?

Meanwhile I installed Windows 7 on a second hdd to see if the nvidia card could be damaged. In Windows everything runs fine in discrete mode and I can even play games. :)

If you ever find a solution I'd be happy if you let me know. Be sure that I'll do the same!
Back to top
View user's profile Send private message
GODLiKE
n00b
n00b


Joined: 11 Jan 2007
Posts: 62

PostPosted: Wed Sep 12, 2012 8:46 pm    Post subject: Reply with quote

I posted on the other thread you mentioned, have this same issue. After some digging it appears that there are more people with other notebook/video card models who experience the same issue when switching to discrete on the BIOS.

I would be all for using Optimus via Bumblebee here, but that one doesn't work either (in some of my tries, I even saw an Xorg.X.log message saying that the video card was not recognized, and I have the latest NVIDIA blob drivers). Could you try and see if you can get bumblebee running? The relevant ebuilds are in sabayon's overlay. I think maybe these two problems (black screen unless acpi=noirq AND bumblebee not working) are related.
_________________
GERÓNIMOOOOOOOOOOOOOOOOOOOO!!!
Back to top
View user's profile Send private message
opn
n00b
n00b


Joined: 20 Aug 2012
Posts: 16

PostPosted: Thu Sep 13, 2012 7:40 am    Post subject: Reply with quote

At least I'm not alone anymore : )

Alright, I'll try to get Bumblebee running and will report back here. Can take a few days though as I'll hardly find time for it at the weekend.
Back to top
View user's profile Send private message
GODLiKE
n00b
n00b


Joined: 11 Jan 2007
Posts: 62

PostPosted: Thu Sep 13, 2012 4:25 pm    Post subject: Reply with quote

What bothered me is that when I tried with Ubuntu, Bumblebee was running flawlessly.

I have tried to mimic Ubuntu's environment, the only thing I did not get to do was to downgrade X to 1.12 or 1.11 (I don't know which one 12.04 uses). However, no success, even with kernel-3.2 :(
_________________
GERÓNIMOOOOOOOOOOOOOOOOOOOO!!!
Back to top
View user's profile Send private message
opn
n00b
n00b


Joined: 20 Aug 2012
Posts: 16

PostPosted: Mon Sep 17, 2012 9:35 pm    Post subject: Reply with quote

Just installed bumblebee and tried to run glxgears:

xorg.log says:

NVIDIA(0): The NVIDIA GPU at PCI:1:0:0 is not supported by the 304.43

but I guess it's just because of dmesg still saying:

Code:
[   45.574647] NVRM: loading NVIDIA UNIX x86_64 Kernel Module  304.43  Sun Aug 19 20:14:03 PDT 2012
[   53.545546] NVRM: GPU at 0000:01:00.0 has fallen off the bus.
[   53.545586] NVRM: os_pci_init_handle: invalid context!
[   53.545587] NVRM: os_pci_init_handle: invalid context!
[   53.545591] NVRM: GPU at 0000:01:00.0 has fallen off the bus.
[   53.545594] NVRM: os_pci_init_handle: invalid context!
[   53.545595] NVRM: os_pci_init_handle: invalid context!
[   53.771893] NVRM: RmInitAdapter failed! (0x26:0xffffffff:1183)
[   53.771903] NVRM: rm_init_adapter(0) failed
[   53.773481] NVRM: RmInitAdapter failed! (0x23:0x2f:675)
[   53.773484] NVRM: rm_init_adapter(0) failed
Back to top
View user's profile Send private message
GODLiKE
n00b
n00b


Joined: 11 Jan 2007
Posts: 62

PostPosted: Mon Sep 17, 2012 10:40 pm    Post subject: Reply with quote

Then welcome to my world xD I've managed (with the help from the guys at #bumblebee on Freenode) to narrow it down to that same thing. You can join #bumblebee if you like, at least now there'll be two of us with that issue xD.

Next thing I'm going to try when I have some time is booting with an Ubuntu liveusb and see if the problem persists in there.
_________________
GERÓNIMOOOOOOOOOOOOOOOOOOOO!!!
Back to top
View user's profile Send private message
GODLiKE
n00b
n00b


Joined: 11 Jan 2007
Posts: 62

PostPosted: Wed Sep 19, 2012 1:59 am    Post subject: Reply with quote

Frack my life. I just booted with an Ubuntu 12.04 amd64 livecd and the GPU works.

Ok, it's an old driver and right now I'm trying to get the latest since it recognizes my card as a Quadro 1100M instead of a K100M, but at least nvidia-xconfig -query-gpu-info is not bailing out, and I have no "GPU has fallen off the bus" errors. (EDIT: tried with current from x-swat PPA. Version is 304.43. It works)

So, should I try an ubuntu-sources kernel inside my Gentoo box? Oh the humanity...
_________________
GERÓNIMOOOOOOOOOOOOOOOOOOOO!!!
Back to top
View user's profile Send private message
GODLiKE
n00b
n00b


Joined: 11 Jan 2007
Posts: 62

PostPosted: Wed Sep 19, 2012 9:37 pm    Post subject: Reply with quote

Solved. More info here: http://nvnews.net/vbulletin/showthread.php?p=2576162#post2576162

Summing up: I enabled these 4 kernel options

Code:

CONFIG_NO_HZ
CONFIG_RCU_FAST_NO_HZ
CONFIG_CALGARY_IOMMU
CONFIG_CALGARY_IOMMU_ENABLED_BY_DEFAULT


... after I booted with the Ubuntu kernel in my laptop U_U (no, I'm not proud of it, but I found the culprit).
_________________
GERÓNIMOOOOOOOOOOOOOOOOOOOO!!!
Back to top
View user's profile Send private message
kernelOfTruth
Watchman
Watchman


Joined: 20 Dec 2005
Posts: 6111
Location: Vienna, Austria; Germany; hello world :)

PostPosted: Wed Sep 19, 2012 9:41 pm    Post subject: Reply with quote

*subscribes*

I might stumble upon this issue in the near future


it's not 100% related but:

so you can't connect an external monitor with only internal graphics enabled ?

if that's the case - hybrid graphics sucks :?
_________________
https://github.com/kernelOfTruth/ZFS-for-SystemRescueCD/tree/ZFS-for-SysRescCD-4.9.0
https://github.com/kernelOfTruth/pulseaudio-equalizer-ladspa

Hardcore Gentoo Linux user since 2004 :D
Back to top
View user's profile Send private message
opn
n00b
n00b


Joined: 20 Aug 2012
Posts: 16

PostPosted: Wed Sep 19, 2012 10:29 pm    Post subject: Reply with quote

Marking this one as solved : )

After enabling the 4 kernel options that godlike suggested (thanks a lot!) I can finally boot in discrete graphics mode without any problems and use external monitors!

kernelOfTruth: Yes you're right. I don't know if there are other models out there that don't share this "problem" but for the W520/W530 that's really the case sadly.
Back to top
View user's profile Send private message
GODLiKE
n00b
n00b


Joined: 11 Jan 2007
Posts: 62

PostPosted: Fri Sep 21, 2012 7:08 pm    Post subject: Reply with quote

kernelOfTruth, it depends on the laptop manufacturer and how gentle they are with BIOS settings and how the GPUs are hardwired to the external ports. In the case of the Thinkpad W530, the VGA is attached to the iGPU and the miniDP is attached to the dGPU. Also, the Thinkpad BIOS lets you choose between booting on the iGPU, the dGPU, or Optimus. If you choose to boot on Optimus, there are some people who managed to connect 2 external monitors (one on the VGA, one on the miniDP) and make them work all together using xrandr/xinerama/synergy. I have not tested that sadly as I don't have spare external monitors at home. Here's one of the links though: http://zachstechnotes.blogspot.com.ar/2012/01/tri-head-display-on-linux-thinkpad-w520.html
_________________
GERÓNIMOOOOOOOOOOOOOOOOOOOO!!!
Back to top
View user's profile Send private message
Display posts from previous:   
Reply to topic    Gentoo Forums Forum Index Desktop Environments All times are GMT
Page 1 of 1

 
Jump to:  
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum