Gentoo Forums
Gentoo Forums
Gentoo Forums
Quick Search: in
Problem with the latest nvidia drivers
View unanswered posts
View posts from last 24 hours

Goto page 1, 2  Next  
Reply to topic    Gentoo Forums Forum Index Kernel & Hardware
View previous topic :: View next topic  
Author Message
brutico
Tux's lil' helper
Tux's lil' helper


Joined: 16 Feb 2014
Posts: 130

PostPosted: Sun Sep 23, 2018 9:11 am    Post subject: Problem with the latest nvidia drivers Reply with quote

[Moderator note: this thread was posted twice. The first response to this thread was in one copy. The second response was in the other copy. I combined the two threads into one. -Hu]

I did not start X with version 410.57 and I had to download the version. Has anyone had the same?

My video card GTX 1080
Back to top
View user's profile Send private message
DawgG
l33t
l33t


Joined: 17 Sep 2003
Posts: 866

PostPosted: Sun Sep 23, 2018 11:07 am    Post subject: Reply with quote

yes, same problem. run ~amd64 with 4.18.9 on a two-kernel-solution as descibed https://wiki.gentoo.org/wiki/Nouveau_%26_nvidia-drivers_switching#Switching_using_two_kernels
the script supposed to "modpropbe nvidia" hangs at just this point. i checked via ssh and nvidia IS loaded, but cannot be unloaded even if the hanging
Code:
modprobe -q nvidia
is killed manually.
Code:
nvidia-modprobe
does not change a thing; even if module is loaded startx returns "Failed to initialize nvidia kernel module."
Any ideas (besides downgrading)?
_________________
DUMM KLICKT GUT.
Back to top
View user's profile Send private message
Keruskerfuerst
Advocate
Advocate


Joined: 01 Feb 2006
Posts: 2289
Location: near Augsburg, Germany

PostPosted: Sun Sep 23, 2018 3:31 pm    Post subject: Reply with quote

Can you give some more information ?
Back to top
View user's profile Send private message
brutico
Tux's lil' helper
Tux's lil' helper


Joined: 16 Feb 2014
Posts: 130

PostPosted: Mon Sep 24, 2018 4:19 pm    Post subject: Reply with quote

DawgG wrote:
yes, same problem. run ~amd64 with 4.18.9 on a two-kernel-solution as descibed https://wiki.gentoo.org/wiki/Nouveau_%26_nvidia-drivers_switching#Switching_using_two_kernels
the script supposed to "modpropbe nvidia" hangs at just this point. i checked via ssh and nvidia IS loaded, but cannot be unloaded even if the hanging
Code:
modprobe -q nvidia
is killed manually.
Code:
nvidia-modprobe
does not change a thing; even if module is loaded startx returns "Failed to initialize nvidia kernel module."
Any ideas (besides downgrading)?

only supports the 2080 range

https://www.nvidia.es/Download/driverResults.aspx/138290/es
Back to top
View user's profile Send private message
DawgG
l33t
l33t


Joined: 17 Sep 2003
Posts: 866

PostPosted: Tue Sep 25, 2018 9:11 am    Post subject: Reply with quote

as per https://www.nvidia.es/Download/driverResults.aspx/138290/es i say
Quote:
Added support for the following GPUs:
GeForce RTX 2080 Ti
GeForce RTX 2080

does not mean
Quote:
only supports the 2080 range


i downgraded to the 396.54-version which works w/out problems (since i've had my share of nvidia-binary-troubles recently.)
_________________
DUMM KLICKT GUT.
Back to top
View user's profile Send private message
runningnak3d
n00b
n00b


Joined: 05 Sep 2018
Posts: 62

PostPosted: Tue Sep 25, 2018 10:36 pm    Post subject: Reply with quote

I am running 410.57 (on a GTX 1050) without problems on OpenRC .

I use Dantrell's Gnome overlay for Gnome 3.28 if that matters.

I can post any other relevant info that could help you.

-- Brian
Back to top
View user's profile Send private message
ct85711
Veteran
Veteran


Joined: 27 Sep 2005
Posts: 1791

PostPosted: Wed Sep 26, 2018 4:55 am    Post subject: Reply with quote

Well, I checked on my system and I have no problem with running 410.57 drivers with openrc, xfce on a 4.15.10 kernel. From what I have seen, the 2 parts that minimizes the problems with nvidia-drivers I get is NOT updating the kernel as often (nvidia is known to be slow on supporting the newest kernel, so if you are running unstable branch, don't go and use the newest kernel right away unless you need to). The other part, is that it tends to work much better if you do a full system reboot after you updated the nvidia drivers. You may be able to get the drivers reloaded, but I find rebooting instead avoids those problems. The part to keep in mind, if that the nvidia drivers is not just one file; so when you get them reloaded, that doesn't mean all of it's libraries are also reloaded along with it.

Note: I my system has a GT 740 card in it.
Back to top
View user's profile Send private message
i4dnf
Apprentice
Apprentice


Joined: 18 Sep 2005
Posts: 271
Location: Bucharest, Romania

PostPosted: Wed Sep 26, 2018 6:57 am    Post subject: Reply with quote

No go on a 760 here, with a 4.18.9 kernel :(.

ct85711, runningnak3d do you by any chance have NUMA and/or CGROUPS enabled in your kernel config?
(both are disabled here, but I've noticed on some CUDA related issues that NUMA/CGROUPS were [unintended] requirements)
_________________
"The only difference between me and a madman is that I am not MAD" (SALVATOR DALI)


Last edited by i4dnf on Wed Sep 26, 2018 1:08 pm; edited 1 time in total
Back to top
View user's profile Send private message
arnvidr
l33t
l33t


Joined: 19 Aug 2004
Posts: 629
Location: Oslo, Norway

PostPosted: Wed Sep 26, 2018 7:53 am    Post subject: Reply with quote

Running normally for me, on a 4.18.7 kernel
_________________
Noone wrote:
anything
Back to top
View user's profile Send private message
Nikmind
n00b
n00b


Joined: 26 Jun 2002
Posts: 18

PostPosted: Wed Sep 26, 2018 8:10 am    Post subject: Reply with quote

I got the same problem and reverting to the previous driver works. As far as I can see when looking into the Xorg log files, is that it does not recognize that I have a monitor connected. I just says that it can't find anything connected to any of the ports on the graphics card. But I for sure have one connected :P On the previous version of the drivers it finds the monitor just fine.
_________________
/Mikael Grahn - Mindkiller Systems -
Back to top
View user's profile Send private message
ct85711
Veteran
Veteran


Joined: 27 Sep 2005
Posts: 1791

PostPosted: Wed Sep 26, 2018 2:37 pm    Post subject: Reply with quote

I know I have NUMA enabled by default for all kernels that I build, cgroup is enabled but I never use it (mainly just some useless bloat that I haven't trimmed away in the kernel yet). I don't use CUDA, so I never cared about that stuff.

Edit: I have seen the monitor not detected issue, I've seen once and a while; but sadly I don't know anything for sure to fix it. One cause I've seen, is if the monitor isn't turned on before the computer is booted.
Back to top
View user's profile Send private message
j_c_p
Guru
Guru


Joined: 30 Aug 2003
Posts: 319
Location: France - Colmar

PostPosted: Wed Sep 26, 2018 2:48 pm    Post subject: Reply with quote

NVIDIA Driver 410.57, kernel 4.18.10 and GTX960 without problem here (KDE desktop).

Quote:
jcp@phoenix64 ~ $ lspci
00:00.0 Host bridge: Advanced Micro Devices, Inc. [AMD/ATI] RD9x0/RX980 Host Bridge (rev 02)
00:00.2 IOMMU: Advanced Micro Devices, Inc. [AMD/ATI] RD890S/RD990 I/O Memory Management Unit (IOMMU)
00:04.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] RD890/RD9x0/RX980 PCI to PCI bridge (PCI Express GPP Port 0)
00:05.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] RD890/RD9x0/RX980 PCI to PCI bridge (PCI Express GPP Port 1)
00:06.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] RD890/RD9x0/RX980 PCI to PCI bridge (PCI Express GPP Port 2)
00:07.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] RD890/RD9x0/RX980 PCI to PCI bridge (PCI Express GPP Port 3)
00:0b.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] RD890/RD990 PCI to PCI bridge (PCI Express GFX2 port 0)
00:11.0 SATA controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 SATA Controller [AHCI mode] (rev 40)
00:12.0 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB OHCI0 Controller
00:12.2 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB EHCI Controller
00:13.0 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB OHCI0 Controller
00:13.2 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB EHCI Controller
00:14.0 SMBus: Advanced Micro Devices, Inc. [AMD/ATI] SBx00 SMBus Controller (rev 42)
00:14.2 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] SBx00 Azalia (Intel HDA) (rev 40)
00:14.3 ISA bridge: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 LPC host controller (rev 40)
00:14.4 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] SBx00 PCI to PCI Bridge (rev 40)
00:14.5 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB OHCI2 Controller
00:15.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] SB700/SB800/SB900 PCI to PCI bridge (PCIE port 0)
00:15.2 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] SB900 PCI to PCI bridge (PCIE port 2)
00:16.0 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB OHCI0 Controller
00:16.2 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB EHCI Controller
00:18.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 10h Processor HyperTransport Configuration
00:18.1 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 10h Processor Address Map
00:18.2 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 10h Processor DRAM Controller
00:18.3 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 10h Processor Miscellaneous Control
00:18.4 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 10h Processor Link Control
01:00.0 SATA controller: ASMedia Technology Inc. ASM1062 Serial ATA Controller (rev 01)
02:00.0 USB controller: Etron Technology, Inc. EJ188/EJ198 USB 3.0 Host Controller
03:00.0 SATA controller: ASMedia Technology Inc. ASM1062 Serial ATA Controller (rev 01)
04:00.0 USB controller: Etron Technology, Inc. EJ188/EJ198 USB 3.0 Host Controller
05:00.0 VGA compatible controller: NVIDIA Corporation GM206 [GeForce GTX 960] (rev a1)
05:00.1 Audio device: NVIDIA Corporation GM206 High Definition Audio Controller (rev a1)
07:00.0 FireWire (IEEE 1394): VIA Technologies, Inc. VT6315 Series Firewire Controller (rev 01)
08:00.0 Ethernet controller: Intel Corporation 82583V Gigabit Network Connection

_________________
Lian Li PC60 - AMD FX 8300 - Asrock 990FX EXTREME9 - Gigabyte GTX960 G1 Gaming 4Go
Back to top
View user's profile Send private message
runningnak3d
n00b
n00b


Joined: 05 Sep 2018
Posts: 62

PostPosted: Wed Sep 26, 2018 4:52 pm    Post subject: Reply with quote

i4dnf wrote:
No go on a 760 here, with a 4.18.9 kernel :(.

ct85711, runningnak3d do you by any chance have NUMA and/or CGROUPS enabled in your kernel config?
(both are disabled here, but I've noticed on some CUDA related issues that NUMA/CGROUPS were [unintended] requirements)


I have CGROUPS (and everything else that Docker needs) enabled.

Also, forgot to mention that I was running kernel 4.18.5.

-- Brian
Back to top
View user's profile Send private message
i4dnf
Apprentice
Apprentice


Joined: 18 Sep 2005
Posts: 271
Location: Bucharest, Romania

PostPosted: Wed Sep 26, 2018 6:13 pm    Post subject: Reply with quote

Thanks to both of you. Unfortunately neither NUMA nor CGROUPS help.
Guess I'll just wait for the next driver version.
_________________
"The only difference between me and a madman is that I am not MAD" (SALVATOR DALI)
Back to top
View user's profile Send private message
ct85711
Veteran
Veteran


Joined: 27 Sep 2005
Posts: 1791

PostPosted: Wed Sep 26, 2018 11:55 pm    Post subject: Reply with quote

What you may want to do, is just define the screen/monitor for xorg and not rely on the autodetect. I've seen before that the xorg autodetect feature does not always work too well, so setting it up saves you from having to worry if xorg sees it or not (as it will already know what to setup.

https://wiki.archlinux.org/index.php/xorg#Monitor_settings
Back to top
View user's profile Send private message
i4dnf
Apprentice
Apprentice


Joined: 18 Sep 2005
Posts: 271
Location: Bucharest, Romania

PostPosted: Thu Sep 27, 2018 7:21 am    Post subject: Reply with quote

I do have "Monitor" and "Screen" sections, so at least in my case it's not caused by failing autodetect.
_________________
"The only difference between me and a madman is that I am not MAD" (SALVATOR DALI)
Back to top
View user's profile Send private message
krinn
Watchman
Watchman


Joined: 02 May 2003
Posts: 7470

PostPosted: Thu Sep 27, 2018 7:48 am    Post subject: Reply with quote

that whole thread is made of:
1: it doesn't work
2: it work

you guys are going for stats or expect to get real help just from "it doesn't work" ?
Back to top
View user's profile Send private message
i4dnf
Apprentice
Apprentice


Joined: 18 Sep 2005
Posts: 271
Location: Bucharest, Romania

PostPosted: Thu Sep 27, 2018 9:07 am    Post subject: Reply with quote

AFAIKT there's not much we can do other than gathering reports and trying to find a common denominator either for working systems or for not working ones (so far GPU generation doesn't seem to matter, kernel series doesn't seem to mater, some kernel settings like CGROUPS/NUMA don't seem to make a difference, [not] relying on autodetect doesn't make a difference, etc.).
There's not much other info to provide since when/where it hangs, it simply hangs, there's no log output anymore neither in Xorg.log, nor in dmesg or messages, and this on system[s] where the only change is the nvidia driver.
_________________
"The only difference between me and a madman is that I am not MAD" (SALVATOR DALI)
Back to top
View user's profile Send private message
Josef.95
Advocate
Advocate


Joined: 03 Sep 2007
Posts: 4553
Location: Germany

PostPosted: Fri Sep 28, 2018 2:21 am    Post subject: Reply with quote

nvidia-drivers-410.57 is the current beta release: --> https://devtalk.nvidia.com/default/topic/533434/linux/current-graphics-driver-releases
I think you should report bugs upstream.
Back to top
View user's profile Send private message
schnitz81
n00b
n00b


Joined: 05 Dec 2011
Posts: 10
Location: Sweden

PostPosted: Sun Sep 30, 2018 12:35 pm    Post subject: Reply with quote

I just ran into this and everything turned black. For a quick fix, just mask the latest version in package.mask and reemerge nvidia-drivers (to get the old 396.54 driver back, which seems to work fine with the latest kernel).
Back to top
View user's profile Send private message
Cthulhu666
Apprentice
Apprentice


Joined: 02 Apr 2004
Posts: 195

PostPosted: Mon Oct 01, 2018 7:44 pm    Post subject: Reply with quote

There's a new revision of the 410.57 driver that installs some more (new) libraries. I haven't tried it yet, but it may fix the black screen problem some, myself included, have been experiencing.
_________________
A certified Gentoo lover!
Got 4 Gentoo boxes (2 desktops, 1 laptop and a server/MythTV HTPC).

Bought a computer bundled with windows? Don't want to use windows? Demand a refund!
Back to top
View user's profile Send private message
Aradayn
n00b
n00b


Joined: 16 Jul 2017
Posts: 19

PostPosted: Thu Oct 11, 2018 2:01 am    Post subject: Reply with quote

I am getting "No Screens Found" with a 1080 on 410.57. I am running an extremely stripped-down custom kernel.
I tried reverting to the previous driver release and I'm getting the same issues. Also, the video card isn't listed in lspci.

Please let me know if I can provide any additional information that can assist in discovering the problem.
Back to top
View user's profile Send private message
Cthulhu666
Apprentice
Apprentice


Joined: 02 Apr 2004
Posts: 195

PostPosted: Thu Oct 11, 2018 4:56 pm    Post subject: Reply with quote

Aradayn wrote:
I am getting "No Screens Found" with a 1080 on 410.57. I am running an extremely stripped-down custom kernel.
I tried reverting to the previous driver release and I'm getting the same issues. Also, the video card isn't listed in lspci.

Please let me know if I can provide any additional information that can assist in discovering the problem.

Sounds like either a hardware problem or too stripped-down kernel. What bus is the graphics card connected to? I'm assuming it's the PCI-E bus, so you should definitely have that enabled. You could try posting your kernel log, as that might contain something useful.

I finally got around to try the 410.57-r1 driver and had the exact same problem. So the -r1 revision didn't change anything for me. It locks up to the point where even Magic SysReq is unable to do anything except changing the keyboard mode. I tried booting without starting X, which worked fine but it locks up hard, when trying to remount filesystems as read-only, when shutting down. I don't have time to investigate this issue further, so I've downgraded to 396.54 for now.
_________________
A certified Gentoo lover!
Got 4 Gentoo boxes (2 desktops, 1 laptop and a server/MythTV HTPC).

Bought a computer bundled with windows? Don't want to use windows? Demand a refund!
Back to top
View user's profile Send private message
Aradayn
n00b
n00b


Joined: 16 Jul 2017
Posts: 19

PostPosted: Fri Oct 12, 2018 3:21 am    Post subject: Reply with quote

I can't seem to post my full dmesg log. I copied everything mentioning NVIDIA:

Code:

[    1.986561] usb 10-1: new SuperSpeed Gen 1 USB device number 2 using xhci_hcd
[    1.996156] input: HDA NVidia HDMI/DP,pcm=3 as /devices/pci0000:00/0000:00:03.1/0000:0d:00.1/sound/card1/input9
[    1.996180] input: HDA NVidia HDMI/DP,pcm=7 as /devices/pci0000:00/0000:00:03.1/0000:0d:00.1/sound/card1/input10
[    1.996199] input: HDA NVidia HDMI/DP,pcm=8 as /devices/pci0000:00/0000:00:03.1/0000:0d:00.1/sound/card1/input11
[    1.996217] input: HDA NVidia HDMI/DP,pcm=9 as /devices/pci0000:00/0000:00:03.1/0000:0d:00.1/sound/card1/input12
[    1.997367] ata5: SATA link down (SStatus 0 SControl 300)

...

[    3.551550] udevd[695]: starting eudev-3.2.6
[    3.588019] usb 1-2.3: new full-speed USB device number 7 using xhci_hcd
[    3.627589] igb 0000:04:00.0 enp4s0: renamed from eth0
[    3.629461] nvidia_drm: loading out-of-tree module taints kernel.
[    3.629465] nvidia_drm: module license 'MIT' taints kernel.
[    3.629466] Disabling lock debugging due to kernel taint
[    3.686945] nvidia-nvlink: Nvlink Core is being initialized, major device number 252
[    3.687155] nvidia 0000:0d:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=none:owns=io+mem
[    3.687249] NVRM: loading NVIDIA UNIX x86_64 Kernel Module  396.54  Tue Aug 14 19:02:34 PDT 2018 (using threaded interrupts)
[    3.691509] random: crng init done
[    3.691510] random: 7 urandom warning(s) missed due to ratelimiting
[    3.749403] EXT4-fs (sda3): re-mounted. Opts: discard
[    3.800642] usb 1-2.3: not running at top speed; connect to a high speed hub
[    3.838647] usb 1-2.3: New USB device found, idVendor=0bda, idProduct=5400, bcdDevice= 0.06
[    3.838649] usb 1-2.3: New USB device strings: Mfr=17, Product=18, SerialNumber=19
[    3.838649] usb 1-2.3: Product: BillBoard Device
[    3.838650] usb 1-2.3: Manufacturer: Realtek
[    3.838651] usb 1-2.3: SerialNumber: 123456789ABCDEFGH
[    3.948047] usb 1-2.4: new high-speed USB device number 8 using xhci_hcd
[    4.084698] usb 1-2.4: New USB device found, idVendor=0bda, idProduct=5412, bcdDevice= 1.19
[    4.084700] usb 1-2.4: New USB device strings: Mfr=1, Product=2, SerialNumber=0
[    4.084701] usb 1-2.4: Product: 4-Port USB 2.0 Hub
[    4.084702] usb 1-2.4: Manufacturer: Generic
[    4.089715] hub 1-2.4:1.0: USB hub found
[    4.093697] hub 1-2.4:1.0: 2 ports detected
[    4.177457] resource sanity check: requesting [mem 0x000c0000-0x000fffff], which spans more than PCI Bus 0000:00 [mem 0x000c0000-0x000dffff window]
[    4.177458] caller 0xffffffffc07a4a93 mapping multiple BARs
[    5.089690] Adding 33553404k swap on /dev/sda2.  Priority:-2 extents:1 across:33553404k SSDsc
[    5.513643] IPv6: ADDRCONF(NETDEV_UP): enp4s0: link is not ready
[    5.736219] xhci_hcd 0000:0a:00.0: remove, state 1
[    5.736225] usb usb6: USB disconnect, device number 1
Back to top
View user's profile Send private message
Cthulhu666
Apprentice
Apprentice


Joined: 02 Apr 2004
Posts: 195

PostPosted: Fri Oct 12, 2018 4:20 am    Post subject: Reply with quote

The snippet is too limited to be really useful.

You can upload the file to pastebin and link to it.
_________________
A certified Gentoo lover!
Got 4 Gentoo boxes (2 desktops, 1 laptop and a server/MythTV HTPC).

Bought a computer bundled with windows? Don't want to use windows? Demand a refund!
Back to top
View user's profile Send private message
Display posts from previous:   
Reply to topic    Gentoo Forums Forum Index Kernel & Hardware All times are GMT
Goto page 1, 2  Next
Page 1 of 2

 
Jump to:  
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum