Gentoo Forums
Gentoo Forums
Gentoo Forums
Quick Search: in
[SOLVED] Nvidia graphics card freeze
View unanswered posts
View posts from last 24 hours

 
Reply to topic    Gentoo Forums Forum Index Desktop Environments
View previous topic :: View next topic  
Author Message
cfgauss
Guru
Guru


Joined: 18 May 2005
Posts: 531
Location: USA

PostPosted: Wed Oct 19, 2016 11:55 pm    Post subject: [SOLVED] Nvidia graphics card freeze Reply with quote

For the past week my ancient Nvidia GeForce 8600 GT graphics card using nvidia-drivers-340.98 has frozen once per boot. This last freeze was the most serious and didn't recover: I had to hard reboot. Here are the NVRM messages in /var/log/messages:
Code:
Oct 19 18:30:15 localhost kernel: NVRM: GPU at PCI:0000:01:00: GPU-9c0eedb0-fcaa-d6b7-691a-b3598b871b17
Oct 19 18:30:15 localhost kernel: NVRM: Xid (PCI:0000:01:00): 69, Class Error: ChId 000c, Class 0000502d, Offset 00000250, Data ffffffff, ErrorCode 0000000c
Oct 19 18:30:15 localhost kernel: NVRM: Xid (PCI:0000:01:00): 69, Class Error: ChId 000c, Class 0000502d, Offset 00000250, Data ffffffff, ErrorCode 0000000c
Oct 19 18:30:16 localhost kernel: NVRM: Xid (PCI:0000:01:00): 13, Graphics Exception: ChID 000c, Class 00008297, Offset 00000e08, Data 04380000
Oct 19 18:30:16 localhost kernel: NVRM: Xid (PCI:0000:01:00): 69, Class Error: ChId 000e, Class 0000502d, Offset 00000250, Data ffffffff, ErrorCode 0000000c
Oct 19 18:30:17 localhost kernel: NVRM: Xid (PCI:0000:01:00): 13, Graphics Exception: ChID 000c, Class 00008297, Offset 00000e08, Data 04380000
Oct 19 18:30:17 localhost kernel: NVRM: Xid (PCI:0000:01:00): 69, Class Error: ChId 000c, Class 0000502d, Offset 00000250, Data ffffffff, ErrorCode 0000000c
Oct 19 18:30:18 localhost kernel: NVRM: Xid (PCI:0000:01:00): 13, Graphics Exception: ChID 000c, Class 00008297, Offset 00000e08, Data 04380000
Oct 19 18:30:44 localhost kernel: NVRM: Xid (PCI:0000:01:00): 69, Class Error: ChId 000e, Class 0000502d, Offset 00000250, Data ffffffff, ErrorCode 0000000c
Oct 19 18:30:45 localhost kernel: NVRM: Xid (PCI:0000:01:00): 8, Channel 00000007
Oct 19 18:30:47 localhost kernel: NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
Oct 19 18:30:49 localhost kernel: NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
Oct 19 18:30:49 localhost kernel: NVRM: GPU at 0000:01:00.0 has fallen off the bus.

Any debugging hints will be gratefully received. In particular, I'd like to know if I need a new graphics card.

[SOLVED] I had an extra, identical video card but this one produced the same freezes and NVRM messages as the old one. In fact, I'm running VMware and my vmware.log complained of a bug in the 340.98 drivers and strongly suggested that I upgrade to newer drivers! So, I conclude that the NVRM messages listed above are due to buggy drivers rather than buggy hardware (as suggested by Hu, below). I bought a newer card which uses the latest drivers, 375.26, and have had no problems nor NVRM messages for three months. [/SOLVED]


Last edited by cfgauss on Tue Jan 10, 2017 2:49 am; edited 2 times in total
Back to top
View user's profile Send private message
Hu
Moderator
Moderator


Joined: 06 Mar 2007
Posts: 13509

PostPosted: Thu Oct 20, 2016 1:17 am    Post subject: Reply with quote

As I read the eix output, that is an odd choice of version for x11-drivers/nvidia-drivers. It is marked as testing on both x86 and amd64, but there are newer stable and testing versions available. I suggest upgrading to the latest stable or testing version (as appropriate for your system) to see if the problem has been fixed. Before replacing the card, I would also try using the open source drivers.
Back to top
View user's profile Send private message
cfgauss
Guru
Guru


Joined: 18 May 2005
Posts: 531
Location: USA

PostPosted: Thu Oct 20, 2016 3:14 pm    Post subject: Reply with quote

Hu wrote:
As I read the eix output, that is an odd choice of version for x11-drivers/nvidia-drivers. It is marked as testing on both x86 and amd64, but there are newer stable and testing versions available. I suggest upgrading to the latest stable or testing version (as appropriate for your system) to see if the problem has been fixed

For my ancient GeForce 8600 GT card, 340.98 is the latest driver.
.
Hu wrote:
Before replacing the card, I would also try using the open source drivers.

This is a good idea. Thanks.
Back to top
View user's profile Send private message
Section_8
Guru
Guru


Joined: 22 May 2004
Posts: 566
Location: Arlington, TX, US

PostPosted: Thu Oct 20, 2016 5:52 pm    Post subject: Reply with quote

I've had several video cards die in the past because the fan stopped working an the card would overheat and lock up at random times. Have you checked that out?
Back to top
View user's profile Send private message
Hu
Moderator
Moderator


Joined: 06 Mar 2007
Posts: 13509

PostPosted: Fri Oct 21, 2016 1:27 am    Post subject: Reply with quote

Based on years of seeing weird nVidia driver bugs, I would suspect the proprietary nVidia driver of having a bug before I would suspect that the card has died. If newer versions have lost the ability to drive your card, that is unfortunate, but also very typical for nVidia as a company. They have a nasty habit of retiring driver support long before the cards die of old age, which is one of several reasons that I usually push people not to use the nVidia drivers.

As Section_8 suggests, you should definitely do some basic hardware checks.
Back to top
View user's profile Send private message
Roman_Gruber
Advocate
Advocate


Joined: 03 Oct 2006
Posts: 3806
Location: Austro Bavaria

PostPosted: Fri Oct 21, 2016 8:15 am    Post subject: Reply with quote

The support has ended ~4 years ago or so, the reason why I sold my notebook with 9800m gts card.

Afaik patches + some kernel version + binary old driver worked.

My only advise, stay on the oldest available kernel in portage + the patches from this forum whne its needed to get the nvidia-drivers running.

2nd get a second hand gpu and replace it. in my area sometimes guys sell old gpus very cheaply or give them away also

or use the open source drivers...
Back to top
View user's profile Send private message
Aiken
Apprentice
Apprentice


Joined: 22 Jan 2003
Posts: 210
Location: Toowoomba/Australia

PostPosted: Fri Oct 21, 2016 9:02 am    Post subject: Reply with quote

Code:

NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context


I was getting a lot of this message with a pair of machines that each have 9600GT in them. Both had failing fans and were overheating. If it is something as simple as that another fan to blow air on the video cards heat sink may do the trick.
_________________
Beware the grue.
Back to top
View user's profile Send private message
snkmoorthy
Guru
Guru


Joined: 19 Nov 2002
Posts: 376

PostPosted: Fri Oct 21, 2016 9:22 am    Post subject: Reply with quote

Yesterday I re-flowed a 9800M GTX, because it was causing memory corruption, randomly when in 2D and hard locking when in 3D(glxgears).

similar to this - http://www.deskdecode.com/wp-content/uploads/2014/11/graphic-card-gpu-problem.jpg
Back to top
View user's profile Send private message
i4dnf
Apprentice
Apprentice


Joined: 18 Sep 2005
Posts: 265
Location: Bucharest, Romania

PostPosted: Sat Oct 22, 2016 8:43 am    Post subject: Reply with quote

Hu wrote:
They have a nasty habit of retiring driver support long before the cards die of old age, which is one of several reasons that I usually push people not to use the nVidia drivers.


With all due respect, if you care to check here (devtalk.nvidia.com) you'll see that both 304.xx and 340.xx legacy series of drivers have received updates as recent as 26.09.2016 which would hardly qualify as "retired driver support". That makes 10yo cards actively supported, cards that aren't even supported by other sw nowadays and you have to fallback on software rendering (which can hardly be said about other manufacturers, which dropped support even for 2yo cards at some point).

They are not supported by the "mainline" drivers because they lack the features the new drivers rely/improve on.
_________________
"The only difference between me and a madman is that I am not MAD" (SALVATOR DALI)
Back to top
View user's profile Send private message
Hu
Moderator
Moderator


Joined: 06 Mar 2007
Posts: 13509

PostPosted: Sat Oct 22, 2016 5:55 pm    Post subject: Reply with quote

That site presently returns 504 Gateway Time-out after a very long delay. Your claim would be nice if true, but is at odds with every other report I have ever read about the nVidia proprietary drivers and their support for legacy cards.
Back to top
View user's profile Send private message
jburns
Veteran
Veteran


Joined: 18 Jan 2007
Posts: 1031
Location: Massachusetts USA

PostPosted: Sun Oct 23, 2016 1:07 am    Post subject: Reply with quote

The correct URL for current driver releases is https://devtalk.nvidia.com/default/topic/533434/linux/current-graphics-driver-releases/

Edit:
Which is the same as in i4dnf post that did not work when I clicked on it but now works.
Back to top
View user's profile Send private message
asturm
Developer
Developer


Joined: 05 Apr 2007
Posts: 6831
Location: Austria

PostPosted: Sun May 14, 2017 8:39 am    Post subject: Reply with quote

i4dnf wrote:
That makes 10yo cards actively supported, cards that aren't even supported by other sw nowadays and you have to fallback on software rendering (which can hardly be said about other manufacturers, which dropped support even for 2yo cards at some point).

Two other well-known GPU chip manufacturers officially support open source drivers with support going back at least 10 years. You don't even have to worry about the latest kernel or xorg-server version, because it will just work.
_________________
backend.cpp:92:2: warning: #warning TODO - this error message is about as useful as a cooling unit in the arctic
Back to top
View user's profile Send private message
Display posts from previous:   
Reply to topic    Gentoo Forums Forum Index Desktop Environments All times are GMT
Page 1 of 1

 
Jump to:  
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum