Gentoo Forums
Strange kernel behaviour since last update [partly solved]
born
n00b


Joined: 24 May 2004
Posts: 53
Location: Germany

PostPosted: Mon Oct 24, 2005 5:02 pm    Post subject: Strange kernel behaviour since last update [partly solved] Reply with quote

Hello all,

sorry for my bad English, I'm not a native speaker and it's really late. :oops:

Yesterday I upgraded from the 2.6.12-gentoo-r10 to the 2.6.13-gentoo-r3 kernel and after a reboot I got the following kernel error message (I can't copy & paste, sorry):
Code:

Unable to handle kernel NULL pointer dereference at <many0>20 RIP:
[...] do_dbs_timer
[...]
Oops: 0000 [1] SMP
CPU 0
Modules linked in:
Pid: 4, comm: events/0 Not tainted 2.6.12-gentoo-r10
[...] registers [...]
Call Trace: do_dbs_timer, worker_thread, default_wake_function, kthread, child_rip...
RIP do_dbs_timer
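Typing an oops by hand loses detail. If the message reaches the system log, it can be copied from there instead. A minimal sketch, assuming the log is a plain-text file; the function name `extract_oops`, the path `/var/log/messages` and the 30 lines of context are illustrative choices, not something from this thread. If the machine hangs before the log is flushed, nothing may reach the disk.

```shell
#!/bin/sh
# Print every kernel oops found in a saved log, plus the lines that follow it.
# Assumption: the log is plain text, e.g. a syslog file or `dmesg` output
# redirected to a file. The default of 30 context lines is arbitrary.
extract_oops() {
    log_file="$1"
    context="${2:-30}"
    grep -A "$context" 'Unable to handle kernel' "$log_file"
}

# Example (the path is an assumption and depends on the syslog setup):
# extract_oops /var/log/messages > oops.txt
```

The resulting file can be attached to a forum post or bug report in full.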

My first thought was that the new kernel did not work, so I went back to my old kernel. I did not recompile it, yet the error showed up in the old kernel version as well. WTF? Nothing had changed in between, except for
Code:
emerge -av nvidia-glx nvidia-kernel
.

On fgo I found some information, most of it regarding problems with nvidia cards and the kernel:

http://forums.gentoo.org/viewtopic-t-390499.html
http://forums.gentoo.org/viewtopic-t-386755.html
http://forums.gentoo.org/viewtopic-p-2814297.html#2814297
http://forums.gentoo.org/viewtopic-t-392537-highlight-unable+handle+kernel+null+pointer+dereference.html

Almost the same failure is described by dom_cyrus in the second post...
Quote:
Do you still have the problems? Ah yeah, I made some tests. My problem seems to have nothing to do with heavy load, because it can freeze when I just have Firefox open. I ran some apps for many hours (both CPUs at 100%) and nothing happened, but after I closed all apps and waited maybe 30 minutes it froze again...

When it freezes it locks the keyboard (no more numlock switching), but the mouse pointer still moves, the sound is still playing, and no kernel oops appears. Can someone please tell me how I can log such things?


In X I can still open programs and click around with the mouse, but the keyboard hangs. It seems to be the same behaviour.

So I disabled the nvidia driver and tried nv instead, compiled the nvidia drivers with ~amd64, disabled the splash on boot, and tried different combinations with both kernels. Nothing worked. I think this is a completely different problem.

Can anybody help me? It would be very nice.


Last edited by born on Tue Oct 25, 2005 3:14 am; edited 1 time in total
dom_cyrus
Tux's lil' helper


Joined: 06 Jun 2005
Posts: 102

PostPosted: Mon Oct 24, 2005 5:57 pm    Post subject: Reply with quote

Hi,
my problem disappeared once I installed the newest vanilla-sources kernel, 2.6.14-rc4. But I'm still not sure whether the problem has really disappeared, because I haven't used my Gentoo box very often. One thing is for sure: the nvidia x86_64 driver has some problem with all kernels > 2.6.11, so there are two options:
    - Install the latest vanilla kernel
    - Install the nvidia-zander-patch by hand
Note that the latest nvidia-zander-patch is not included in the ebuilds from Gentoo, but someone told me that it should be fixed with the newest gentoo-sources. I hope this helps you a little.
born
n00b


Joined: 24 May 2004
Posts: 53
Location: Germany

PostPosted: Tue Oct 25, 2005 2:07 am    Post subject: Reply with quote

Yeah, but as I wrote, I do not think my problem is exactly the same as yours. I still get the error message after disabling the nvidia driver: I do not load it, I have nv in xorg.conf, and the messages still appear, so it simply cannot be the nvidia driver. Only the symptoms are the same. The error message is also different, because you got the error in X, while mine is in events/0, every time PID 4.

Of course I can try to install the nvidia drivers with the latest patch, but I do not think it would help, since the error also occurs without the drivers.

Update: I will try to compile a kernel with default options and see if the problem remains. If not, I have to check my kernel configuration. The only strange thing is that the old kernel worked perfectly and now it hangs. I ran memtest and it passed several times, so it is unlikely to be a memory failure.
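When checking a kernel configuration against a known-good or default one, a plain diff of the two `.config` files shows which options differ. A rough sketch, assuming both files are available; the function name `config_diff` is made up for illustration, and the example paths follow the usual `/usr/src/linux-<version>` layout but are assumptions.

```shell
#!/bin/sh
# Show options that differ between two kernel .config files.
# Lines such as "# CONFIG_FOO is not set" are kept, because an unset
# option is as informative as a set one.
config_diff() {
    old_cfg="$1"
    new_cfg="$2"
    old_sorted=$(mktemp)
    new_sorted=$(mktemp)
    grep 'CONFIG_' "$old_cfg" | LC_ALL=C sort > "$old_sorted"
    grep 'CONFIG_' "$new_cfg" | LC_ALL=C sort > "$new_sorted"
    # diff exits with status 1 when the files differ; that is expected here.
    diff "$old_sorted" "$new_sorted"
    status=$?
    rm -f "$old_sorted" "$new_sorted"
    return "$status"
}

# Example (paths are assumptions):
# config_diff /usr/src/linux-2.6.12-gentoo-r10/.config \
#             /usr/src/linux-2.6.13-gentoo-r3/.config
```

Sorting first keeps the output readable even when the two kernels order their options differently.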
born
n00b


Joined: 24 May 2004
Posts: 53
Location: Germany

PostPosted: Tue Oct 25, 2005 3:13 am    Post subject: Reply with quote

O.K., I think I found out what the problem was. do_dbs_timer has something to do with CPU frequency scaling, so I disabled it in the kernel (which means PowerNow! is disabled too :() and everything seems to be fine for now. I have not tested it for long, but I use my Gentoo box quite often, so I can reopen this thread if problems come back.
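In the kernel source, do_dbs_timer is a timer routine of the demand-based cpufreq governors (ondemand, for example), so it may be enough to look at which governor is active before disabling frequency scaling altogether. A small sketch, assuming the standard cpufreq sysfs layout; the function name `show_governors` is invented here, and whether another governor avoids this particular oops is untested.

```shell
#!/bin/sh
# Print the active cpufreq governor for each CPU.
# The first argument is the sysfs CPU directory; it defaults to the usual location.
show_governors() {
    cpu_root="${1:-/sys/devices/system/cpu}"
    for gov_file in "$cpu_root"/cpu[0-9]*/cpufreq/scaling_governor; do
        # Skip quietly when cpufreq is disabled and the glob matches nothing.
        [ -r "$gov_file" ] || continue
        cpu_dir=${gov_file%/cpufreq/scaling_governor}
        printf '%s: %s\n' "${cpu_dir##*/}" "$(cat "$gov_file")"
    done
}

# Switching governor instead of disabling scaling (as root, and only if
# the target governor is built into the kernel or loaded as a module):
# echo performance > /sys/devices/system/cpu/cpu0/cpufreq/scaling_governor
```

With frequency scaling compiled out, as described above, the function prints nothing because the `cpufreq` directories do not exist.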

Is it a bug? Should I post it to bgo? Or is it only a misconfiguration of my kernel? Is it realistic (anything is possible, I know) that some PowerNow! functions in my CPU are defective?

I really do not know why this happened. Strange, strange, really strange.

Greets! And thanks for everything :)
dom_cyrus
Tux's lil' helper


Joined: 06 Jun 2005
Posts: 102

PostPosted: Tue Oct 25, 2005 4:00 am    Post subject: Reply with quote

Maybe it's this one --> http://bugzilla.kernel.org/show_bug.cgi?id=4851
Since you also understand German --> http://www.heise.de/newsticker/foren/go.shtml?read=1&msg_id=9058111&forum_id=86419
born
n00b


Joined: 24 May 2004
Posts: 53
Location: Germany

PostPosted: Tue Oct 25, 2005 4:07 am    Post subject: Reply with quote

It isn't the randomize_va bug and it can't be the SMP bug, because my CPU has only one core. But thanks for the links. My Gentoo box didn't crash randomly but always after about 5 minutes of runtime. I think the kernel wanted to slow down the CPU after that time and this triggered the bug. I don't even know if it's a real bug or my own mistake.
urcindalo
Guru


Joined: 08 Feb 2005
Posts: 397
Location: Almeria, Spain

PostPosted: Wed Oct 26, 2005 11:47 am    Post subject: Reply with quote

I use the nvidia driver on AMD64 with the same kernel version and everything works like a charm. Here is some of my info:
Code:
*  media-video/nvidia-glx
      Latest version available: 1.0.7676-r1
      Latest version installed: 1.0.7676-r1
      Size of downloaded files: 14,116 kB
      Homepage:    http://www.nvidia.com/
      Description: NVIDIA X11 driver and GLX libraries
      License:     NVIDIA

*  media-video/nvidia-kernel
      Latest version available: 1.0.7676
      Latest version installed: 1.0.7676
      Size of downloaded files: 14,116 kB
      Homepage:    http://www.nvidia.com/
      Description: Linux kernel module for the NVIDIA X11 driver
      License:     NVIDIA

*  media-video/nvidia-settings
      Latest version available: 1.0.20050729
      Latest version installed: 1.0.20050729
      Size of downloaded files: 1,032 kB
      Homepage:    http://www.nvidia.com/
      Description: NVIDIA Linux X11 Settings Utility
      License:     GPL-2

and
Code:
# emerge info
Portage 2.0.51.22-r3 (default-linux/amd64/2005.1, gcc-3.4.4, glibc-2.3.5-r2, 2.6.13-gentoo-r3 x86_64)
=================================================================
System uname: 2.6.13-gentoo-r3 x86_64 AMD Athlon(tm) 64 Processor 3000+
Gentoo Base System version 1.6.13
dev-lang/python:     2.4.2
sys-apps/sandbox:    1.2.12
sys-devel/autoconf:  2.13, 2.59-r6
sys-devel/automake:  1.4_p6, 1.5, 1.6.3, 1.7.9-r1, 1.8.5-r3, 1.9.6-r1
sys-devel/binutils:  2.15.92.0.2-r10
sys-devel/libtool:   1.5.20
virtual/os-headers:  2.6.11-r2
ACCEPT_KEYWORDS="amd64"
AUTOCLEAN="yes"
CBUILD="x86_64-pc-linux-gnu"
CFLAGS="-march=k8 -O2"
CHOST="x86_64-pc-linux-gnu"
CONFIG_PROTECT="/etc /usr/kde/2/share/config /usr/kde/3/share/config /usr/share/config /var/qmail/control"
CONFIG_PROTECT_MASK="/etc/gconf /etc/env.d"
CXXFLAGS="-march=k8 -O2"
DISTDIR="/usr/portage/distfiles"
FEATURES="autoconfig distlocks sandbox sfperms strict"
GENTOO_MIRRORS="http://distfiles.gentoo.org http://distro.ibiblio.org/pub/Linux/distributions/gentoo"
LINGUAS="en es"
PKGDIR="/home/ramiro/usr/local/portage/pkgdir-backup/"
PORTAGE_TMPDIR="/var/tmp"
PORTDIR="/usr/portage"
PORTDIR_OVERLAY="/usr/local/portage"
SYNC="rsync://rsync.gentoo.org/gentoo-portage"
USE="amd64 X Xaw3d a52 aac aalib accessibility acl acpi adns aim alsa apache2 apm arts audiofile avi bash-completion bcmath berkdb bidi bitmap-fonts bonobo browserplugin bzlib caps cdparanoia cdr crypt ctype cups curl curlwrappers db2 dba dbase dbm dbx dga dio directfb dv dvb dvd dvdr dvdread eds emboss emul-linux-x86 encode esd ethereal evo exif expat fam fastcgi fbcon ffmpeg fftw flac flash flatfile foomaticdb fortran freetds ftp gd gdbm geoip gif ginac glut gmp gnome gnustep gnutls gphoto2 gpm gstreamer gtk gtk2 gtkhtml guile hal hyperwave-api iconv icq ieee1394 imagemagick imap imlib inifile innodb interbase iodbc ipv6 jabber jack java javascript jikes jpeg kde ladcca lcms ldap lesstif libcaca libg++ libgda libwww lzw lzw-tiff m17n-lib mad maildir mailwrapper matroska mbox mcal mcve memlimit mhash mikmod milter mime ming mmap mng motif mozilla mp3 mpeg mpi msession msn msql mysql mysqli nas ncurses neXt netboot netcdf nis nls nptl oci8 ofx ogg openal opengl oracle oracle7 oscar oss ovrimos pam pcntl pcre pdflib perl php pie plotutils png portaudio posix postgres ppds prelude profile python qdbm qt quicktime readline recode ruby samba sapdb sasl scanner sdl sharedext simplexml skey slang sndfile snmp soap sockets socks5 source sox speex spell spl ssl svg sysfs sysvipc szip tcltk tcpd tetex theora tidy tiff tokenizer truetype truetype-fonts type1-fonts udev unicode usb userlocales utf v4l vcd vhosts vorbis wddx wmf wxwindows xface xine xml xml2 xmlrpc xmms xpm xprint xscreensaver xsl xv xvid yahoo yaz zeo zlib linguas_en linguas_es userland_GNU kernel_linux elibc_glibc"
Unset:  ASFLAGS, CTARGET, LANG, LC_ALL, LDFLAGS, MAKEOPTS


Hope that helps you somehow.
born
n00b


Joined: 24 May 2004
Posts: 53
Location: Germany

PostPosted: Wed Oct 26, 2005 12:00 pm    Post subject: Reply with quote

Do you have PowerNow! enabled in your kernel?
urcindalo
Guru


Joined: 08 Feb 2005
Posts: 397
Location: Almeria, Spain

PostPosted: Thu Oct 27, 2005 2:57 am    Post subject: Reply with quote

This is my kernel config in that regard:
Code:
#
# Power management options
#
CONFIG_PM=y
# CONFIG_PM_DEBUG is not set
CONFIG_SOFTWARE_SUSPEND=y
CONFIG_PM_STD_PARTITION=""

#
# ACPI (Advanced Configuration and Power Interface) Support
#
CONFIG_ACPI=y
CONFIG_ACPI_BOOT=y
CONFIG_ACPI_INTERPRETER=y
CONFIG_ACPI_SLEEP=y
CONFIG_ACPI_SLEEP_PROC_FS=y
# CONFIG_ACPI_SLEEP_PROC_SLEEP is not set
# CONFIG_ACPI_AC is not set
# CONFIG_ACPI_BATTERY is not set
CONFIG_ACPI_BUTTON=y
CONFIG_ACPI_VIDEO=m
# CONFIG_ACPI_HOTKEY is not set
CONFIG_ACPI_FAN=y
CONFIG_ACPI_PROCESSOR=y
CONFIG_ACPI_THERMAL=y
# CONFIG_ACPI_ASUS is not set
CONFIG_ACPI_IBM=m
# CONFIG_ACPI_TOSHIBA is not set
CONFIG_ACPI_BLACKLIST_YEAR=0
CONFIG_ACPI_DEBUG=y
CONFIG_ACPI_BUS=y
CONFIG_ACPI_EC=y
CONFIG_ACPI_POWER=y
CONFIG_ACPI_PCI=y
CONFIG_ACPI_SYSTEM=y
CONFIG_ACPI_CONTAINER=m
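The excerpt above covers the power management and ACPI sections. The CPU frequency scaling and PowerNow! options sit in their own section of the config. A minimal sketch for listing them, assuming the kernel sources are reachable through the usual `/usr/src/linux` symlink; the function name `cpufreq_options` is illustrative only.

```shell
#!/bin/sh
# List CPU frequency scaling options, set or unset, from a kernel .config.
# Assumption: /usr/src/linux points at the sources of the kernel in question.
cpufreq_options() {
    cfg="${1:-/usr/src/linux/.config}"
    grep -E 'CONFIG_(CPU_FREQ|X86_POWERNOW)' "$cfg"
}

# Example:
# cpufreq_options /usr/src/linux/.config
```

A line such as `CONFIG_X86_POWERNOW_K8=y` in the output would mean the PowerNow! driver for Athlon 64 CPUs is built in.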
born
n00b


Joined: 24 May 2004
Posts: 53
Location: Germany

PostPosted: Thu Oct 27, 2005 3:55 am    Post subject: Reply with quote

O.K., thanks!

I will try a new run with your ACPI kernel configuration and report my results. Possibly my kernel configuration was wrong.
All times are GMT - 5 Hours