

2.6.31 is the kernel used for openSUSE 11.2 so it will be supported for a while longer, but you are right that .32 will be the longest supported as it is used for SLE11SP1, next week I will rebase the 2.6.32 patches and upload a ebuild.meyerm wrote:Hi,
first let me say a big thank you for providing newer xen-kernels for Gentoo. I'll wait for a 2.6.32-version (since it is said this will be supported longer and stabilized by Suse and Redhat) before trying to go productive but am toying around with your current kernels at the moment.
I've not had any crashes with -r10, but yes -r11 should be very good now, I am running it on several production systems and I've had no problems at all.meyerm wrote:Just to add for those who had problems with 2.6.31-xen-r10: I experienced the crashes under load, too. Now with 2.6.31-xen-r11 the problems seem to be gone. I just started a "cat /dev/urandom > /dev/null" on each CPU and will let it run at least over the night to be really sure. For now it already runs for ca. 2 hours stable.
Great!andylyon wrote:2.6.31 is the kernel used for openSUSE 11.2 so it will be supported for a while longer, but you are right that .32 will be the longest supported as it is used for SLE11SP1, next week I will rebase the 2.6.32 patches and upload a ebuild.
I just reached the 24h runtime. I guess, I'll now switch the default entry in grub from 2.6.29 to the 2.6.31-r11.meyerm wrote:Just to add for those who had problems with 2.6.31-xen-r10: I experienced the crashes under load, too. Now with 2.6.31-xen-r11 the problems seem to be gone. I just started a "cat /dev/urandom > /dev/null" on each CPU and will let it run at least over the night to be really sure. For now it already runs for ca. 2 hours stable.
Code: Select all
Mar 14 18:29:34 endor kernel: [1649126.870036] BUG: soft lockup - CPU#1 stuck for 224s! [swapper:0]
Mar 14 18:29:34 endor kernel: [1649126.870036] Modules linked in: ipv6 ipt_LOG xt_limit xt_state iptable_filter ipt_REDIRECT xt_tcpudp iptable_nat nf_nat nf_conntrack_ipv4 nf_conntrack nf_defrag_ipv4 iptable_mangle ip_tables x_tables scsi_wait_scan
Mar 14 18:29:34 endor kernel: [1649126.870036] CPU 1:
Mar 14 18:29:34 endor kernel: [1649126.870036] Modules linked in: ipv6 ipt_LOG xt_limit xt_state iptable_filter ipt_REDIRECT xt_tcpudp iptable_nat nf_nat nf_conntrack_ipv4 nf_conntrack nf_defrag_ipv4 iptable_mangle ip_tables x_tables scsi_wait_scan
Mar 14 18:29:34 endor kernel: [1649126.870036] Pid: 0, comm: swapper Not tainted 2.6.31-xen-r11 #1 Sun Fire V40z
Mar 14 18:29:34 endor kernel: [1649126.870036] RIP: e030:[<ffffffff802063aa>] [<ffffffff802063aa>] 0xffffffff802063aa
Mar 14 18:29:34 endor kernel: [1649126.870036] RSP: e02b:ffff8803e8483eb8 EFLAGS: 00000246
Mar 14 18:29:34 endor kernel: [1649126.870036] RAX: 0000000000000000 RBX: 0000000000000001 RCX: ffffffff802063aa
Mar 14 18:29:34 endor kernel: [1649126.870036] RDX: ffff8803e8483ed0 RSI: 0000000000000000 RDI: 0000000000000001
Mar 14 18:29:34 endor kernel: [1649126.870036] RBP: 0000000000000000 R08: 0000000000000000 R09: ffff8803e8498000
Mar 14 18:29:34 endor kernel: [1649126.870036] R10: 86634d5592bb9e06 R11: 0000000000000246 R12: 0000000000000000
Mar 14 18:29:34 endor kernel: [1649126.870036] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
Mar 14 18:29:34 endor kernel: [1649126.870036] FS: 00007f9bd973e6f0(0000) GS:ffffc90000010000(0000) knlGS:0000000000000000
Mar 14 18:29:34 endor kernel: [1649126.870036] CS: e033 DS: 002b ES: 002b CR0: 000000008005003b
Mar 14 18:29:34 endor kernel: [1649126.870036] CR2: 00000000006d4ae4 CR3: 00000003e764d000 CR4: 0000000000000660
Mar 14 18:29:34 endor kernel: [1649126.870036] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Mar 14 18:29:34 endor kernel: [1649126.870036] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Mar 14 18:29:34 endor kernel: [1649126.870036] Call Trace:
Mar 14 18:29:34 endor kernel: [1649126.870036] [<ffffffff8020caec>] ? xen_safe_halt+0xb2/0xdd
Mar 14 18:29:34 endor kernel: [1649126.870036] [<ffffffff80210e3e>] ? xen_idle+0x6e/0xe1
Mar 14 18:29:34 endor kernel: [1649126.870036] [<ffffffff80208c20>] ? cpu_idle+0xa1/0xddCode: Select all
[ 27.186394] EXT3 FS on md1, internal journal
[ 28.156511] Adding 2097144k swap on /dev/mapper/vg_swap-dom0. Priority:-1 extents:1 across:2097144k
[ 36.602542] ip_tables: (C) 2000-2006 Netfilter Core Team
[ 36.670856] nf_conntrack version 0.5.0 (16384 buckets, 65536 max)
[ 36.671096] CONFIG_NF_CT_ACCT is deprecated and will be removed soon. Please use
[ 36.671098] nf_conntrack.acct=1 kernel parameter, acct=1 nf_conntrack module option or
[ 36.671100] sysctl net.netfilter.nf_conntrack_acct=1 to enable it.
[ 37.859260] tg3 0000:02:02.0: firmware: using built-in firmware tigon/tg3_tso.bin
[ 39.728844] tg3: eth0: Link is up at 100 Mbps, full duplex.
[ 39.728851] tg3: eth0: Flow control is off for TX and off for RX.
[ 42.599496] tg3 0000:02:03.0: firmware: using built-in firmware tigon/tg3_tso.bin
[ 44.490092] tg3: eth1: Link is up at 100 Mbps, full duplex.
[ 44.490099] tg3: eth1: Flow control is off for TX and off for RX.
[ 142.789967] BUG: soft lockup - CPU#1 stuck for 114s! [swapper:0]
[ 142.789967] Modules linked in: iptable_mangle ipt_REDIRECT iptable_nat nf_nat xt_tcpudp ipt_LOG xt_limit nf_conntrack_ipv4 nf_defrag_ipv4 xt_state nf_conntrack iptable_filter ip_tables x_tables scsi_wait_scan
[ 142.789967] CPU 1:
[ 142.789967] Modules linked in: iptable_mangle ipt_REDIRECT iptable_nat nf_nat xt_tcpudp ipt_LOG xt_limit nf_conntrack_ipv4 nf_defrag_ipv4 xt_state nf_conntrack iptable_filter ip_tables x_tables scsi_wait_scan
[ 142.789967] Pid: 0, comm: swapper Not tainted 2.6.32-xen-r1 #1 Sun Fire V40z
[ 142.789967] RIP: e030:[<ffffffff802063aa>] [<ffffffff802063aa>] 0xffffffff802063aa
[ 142.789967] RSP: e02b:ffff8803e8483ee8 EFLAGS: 00000246
[ 142.789967] RAX: 0000000000000000 RBX: 0000000000000001 RCX: ffffffff802063aa
[ 142.789967] RDX: ffff8803e8483f00 RSI: 0000000000000000 RDI: 0000000000000001
[ 142.789967] RBP: ffffffff80757df0 R08: 0000000000000000 R09: ffff88000101bc30
[ 142.789967] R10: 0000000000009c00 R11: 0000000000000246 R12: 0000000000000000
[ 142.789967] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
[ 142.789967] FS: 00007f481b29b6f0(0000) GS:ffff880001026000(0000) knlGS:0000000000000000
[ 142.789967] CS: e033 DS: 002b ES: 002b CR0: 000000008005003b
[ 142.789967] CR2: 00007f481ad77d2e CR3: 00000003e6a95000 CR4: 0000000000000660
[ 142.789967] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 142.789967] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[ 142.789967] Call Trace:
[ 142.789967] [<ffffffff8020c475>] ? xen_safe_halt+0xdf/0x10f
[ 142.789967] [<ffffffff8020f1e3>] ? xen_idle+0x25/0x5a
[ 142.789967] [<ffffffff80208835>] ? cpu_idle+0x98/0xcb
[ 142.789967] [<ffffffff803ad85a>] ? vgacon_cursor+0x0/0x17d
[ 142.791231] BUG: soft lockup - CPU#3 stuck for 114s! [swapper:0]
[ 142.791231] Modules linked in: iptable_mangle ipt_REDIRECT iptable_nat nf_nat xt_tcpudp ipt_LOG xt_limit nf_conntrack_ipv4 nf_defrag_ipv4 xt_state nf_conntrack iptable_filter ip_tables x_tables scsi_wait_scan
[ 142.791231] CPU 3:
[ 142.791231] Modules linked in: iptable_mangle ipt_REDIRECT iptable_nat nf_nat xt_tcpudp ipt_LOG xt_limit nf_conntrack_ipv4 nf_defrag_ipv4 xt_state nf_conntrack iptable_filter ip_tables x_tables scsi_wait_scan
[ 142.791231] Pid: 0, comm: swapper Not tainted 2.6.32-xen-r1 #1 Sun Fire V40z
[ 142.791231] RIP: e030:[<ffffffff802063aa>] [<ffffffff802063aa>] 0xffffffff802063aa
[ 142.791231] RSP: e02b:ffff8803e8487ee8 EFLAGS: 00000246
[ 142.791231] RAX: 0000000000000000 RBX: 0000000000000003 RCX: ffffffff802063aa
[ 142.791231] RDX: ffff8803e8487f00 RSI: 0000000000000000 RDI: 0000000000000001
[ 142.791231] RBP: ffffffff80757df0 R08: ffff8803e8487ed0 R09: 0000000000000040
[ 142.791231] R10: 000000013fffa42f R11: 0000000000000246 R12: 0000000000000000
[ 142.791231] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
[ 142.791231] FS: 00007fc917ce66f0(0000) GS:ffff88000104a000(0000) knlGS:0000000000000000
[ 142.791231] CS: e033 DS: 002b ES: 002b CR0: 000000008005003b
[ 142.791231] CR2: 00007fc9177c2d2e CR3: 00000003e60ef000 CR4: 0000000000000660
[ 142.791231] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 142.791231] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[ 142.791231] Call Trace:
[ 142.791231] [<ffffffff8020c475>] ? xen_safe_halt+0xdf/0x10f
[ 142.791231] [<ffffffff8020f1e3>] ? xen_idle+0x25/0x5a
[ 142.791231] [<ffffffff80208835>] ? cpu_idle+0x98/0xcbCode: Select all
# xm dmesg
__ __ _ _ ___ ___
\ \/ /___ _ __ | || | / _ \ / _ \
\ // _ \ '_ \ | || |_| | | | | | |
/ \ __/ | | | |__ _| |_| | |_| |
/_/\_\___|_| |_| |_|(_)___(_)___/
(XEN) Xen version 4.0.0 (@local) (gcc version 4.3.4 (Gentoo 4.3.4 p1.1, pie-10.1.5) ) Sun May 9 10:29:43 MSD 2010
...
(XEN) Xen kernel: 64-bit, lsb, compat32
(XEN) Dom0 kernel: 64-bit, lsb, paddr 0x1000000 -> 0x1506000
...Code: Select all
kernel /boot/xen.gz console=vga
module /boot/vmlinuz-xen root=/dev/ram0 real_root=/dev/md1 console=tty0 console=ttyS0,38400I got the same problemTuinslak wrote:The only compression I seem to have is gzip (so yes, it's vmlinuz file instead of bzImage file). I cannot pick bz2 or something. I can try to disable it, but that won't make a lot of difference I guess, as bz2 works fine for my normal kernel.
For some reason I can now boot using the Xen.gz kernel, but it hangs during boot
You'll have to excuse my blurry images, I've used my cell phone to take them. Basicly, it hangs at async/1
The last message: "async/1 used greatest stack depth"
http://www.flickr.com/photos/tuinslak/3720354353/
http://www.flickr.com/photos/tuinslak/3720353783/
I've left the server on for over an hour, and it hasn't moved a bit.
The kernel without xen.gz still gives me error 13.
kernel sizes:
one being bz2, other gzip. Seem like normal sizes to me.Code: Select all
four boot # du -sh kern* 2.7M kernel-2.6.29 3.2M kernel-2.6.30
