Gentoo Forums
Gentoo Forums
Gentoo Forums
Quick Search: in
Qube2 Freezing
View unanswered posts
View posts from last 24 hours

 
Reply to topic    Gentoo Forums Forum Index Gentoo on Alternative Architectures
View previous topic :: View next topic  
Author Message
yarrumretep
n00b
n00b


Joined: 06 May 2006
Posts: 8

PostPosted: Wed May 24, 2006 1:31 am    Post subject: Qube2 Freezing Reply with quote

Greetings!

I have successfully installed Gentoo on my mips based Qube2. Everything works pretty well - I have it controlling several X10 devices via a CM11a via the SerialPort. However, occasionally (well, lately, daily) the box has been freezing - no network, no switching my X10 devices. Reboot is required. I can't find any logs which indicate any problems. Can someone give me some tips on debugging this kind of situation? The serial port is taken over by the X10 stuff - so I can't connect a terminal to see if that would work - but I assume the box is locked if the scripts doing the X10 switching aren't running (confirmed by their logs) and the network interface isn't responding to pings or anything.

Any tips, thoughts, help?

Cheers,

pete
Back to top
View user's profile Send private message
yarrumretep
n00b
n00b


Joined: 06 May 2006
Posts: 8

PostPosted: Wed May 24, 2006 1:36 am    Post subject: Reply with quote

uname -a
Linux cobalt 2.6.15.7-mipsgit-20060109 #4 Sun May 7 01:52:32 EDT 2006 mips Nevada V10.0 FPU V10.0 Cobalt Qube2 GNU/Linux
c
Back to top
View user's profile Send private message
Redhatter
Retired Dev
Retired Dev


Joined: 20 Sep 2003
Posts: 548
Location: Brisbane, QLD, Australia

PostPosted: Wed May 24, 2006 2:17 am    Post subject: Reply with quote

This is unusual... Have you tried updating the kernel? There's a 2.6.16.4 image in my devspace (see the website link below) you can try, I know it's stable -- not sure if it'll have the drivers needed for your X10 equipment though. It's also a 64-bit kernel.

A possibility, perhaps a PCI serial card could be used for your X10 equipment? That would allow for capture of oopses via the serial console. I haven't tested this however, I'm not sure what PCI RS232 controllers would work on the Qube2.

For what it's worth though:
Code:
stuartl@wander ~ $ ssh qube uname -a \; uptime
Linux qube 2.6.16.4-mipsgit-20060320-qube #1 Thu Apr 20 14:40:23 EST 2006 mips64 Nevada V10.0  FPU V10.0 Cobalt Qube2 GNU/Linux
 12:15:17 up 33 days, 21:24,  0 users,  load average: 0.16, 0.03, 0.01


^^ I haven't had any big stabilty problems. That's the box that builds the cobalt stage tarballs.
_________________
Stuart Longland (a.k.a Redhatter, VK4MSL)
I haven't lost my mind - it's backed up on a tape somewhere...

Gentoo/MIPS Cobalt developer, Mozilla herd member.
Back to top
View user's profile Send private message
yarrumretep
n00b
n00b


Joined: 06 May 2006
Posts: 8

PostPosted: Wed May 24, 2006 11:22 am    Post subject: Reply with quote

Thanks, Redhatter, unfortunately, I've already got a wifi pci card in the pci slot. The X10 stuff doesn't use any drivers - just talks through ttyS0. All else I have running is

lighttpd
php (fastcgi)
sshd
dhcpd
heyu (X10 stuff)
syslogd
atd
ntpd
udevd

The wifi card is using the madwifi driver.

There is 256 MB RAM in the box (could it be a bad stick of ram?) / 512MB swap - should be enuf RAM. Normal operation indicates only 60MB RAM used and no swap - so it doesn't seem like an out of memory problem (I've seen that lock up linux as it tries to kill off processes to make room).

And a brand new 100GB disk.

Is there any way to tell the kernel to log to the disk if it has problems? Point the console to a disk file?

Thanks!

-pete
Back to top
View user's profile Send private message
Sanguinary
n00b
n00b


Joined: 17 Oct 2004
Posts: 16

PostPosted: Wed May 24, 2006 9:44 pm    Post subject: Reply with quote

yarrumretep wrote:
Thanks, Redhatter, unfortunately, I've already got a wifi pci card in the pci slot. The X10 stuff doesn't use any drivers - just talks through ttyS0.

<snipped>

There is 256 MB RAM in the box (could it be a bad stick of ram?) / 512MB swap - should be enuf RAM. Normal operation indicates only 60MB RAM used and no swap - so it doesn't seem like an out of memory problem (I've seen that lock up linux as it tries to kill off processes to make room).

And a brand new 100GB disk.


My initial thought is you may be drawing too much power. In my experience with the Qube 2, the power supply starts to get iffy when you add a PCI card. I don't know how much a wifi card draws, but the combination of that with the maxed ram and the hard drive may be just at the edge of what the Qube's power supply can handle, causing it to lockup randomly.

I know my Qube, which has a PCI modem in it, has been happier (i.e., no random lockups) since I replaced the hard drive with a compact flash adapter, which draws much less power.
Back to top
View user's profile Send private message
Redhatter
Retired Dev
Retired Dev


Joined: 20 Sep 2003
Posts: 548
Location: Brisbane, QLD, Australia

PostPosted: Wed May 24, 2006 11:32 pm    Post subject: Reply with quote

yarrumretep: Well, 256MB RAM should definately be enough... I run 128MB in mine... and with a PCI SCSI card, it runs quite well.

syslogd can do remote logging... perhaps it's worth trying that? That, and perhaps netconsole? I'm no authority on setting this up though... Peter Horton would be your best point of reference, he'd know how to go about setting up netconsole. Also have a look at the netconsole docs.

If you do figure out how to get it going, please, let me know... I'm looking at setting something up for the Cobalt netboot images. :-)

Sanguinary: Heh, mine's on its second PSU, having blown up the original. 12v PSUs are pretty easy to come by... if you're stuck, grab an AT PSU, hack one of the molex connectors off and use the Yellow and Black wires to power the Qube2. This is what I did before getting a replacement PSU. :-)
_________________
Stuart Longland (a.k.a Redhatter, VK4MSL)
I haven't lost my mind - it's backed up on a tape somewhere...

Gentoo/MIPS Cobalt developer, Mozilla herd member.
Back to top
View user's profile Send private message
yarrumretep
n00b
n00b


Joined: 06 May 2006
Posts: 8

PostPosted: Thu May 25, 2006 11:12 am    Post subject: Reply with quote

Thanks for the thoughts. Assuming the problem is power related - can I buy/make a beefier power supply? Will that help matters? I assume if the device is drawing too much power, the voltage drops below tolerances for the cpu and it stops? If that's the case (seems reasonably likely as the hard-drive is full-sized and much bigger than the one that was in there and wi-fi probably takes some juice), I probably wouldn't see any kernel messages anyway - the thing would just stop (which seems like what is happening).

It's now happening a couple of times a day, so this is untenable as it stands.

Cheers,

pete
Back to top
View user's profile Send private message
Redhatter
Retired Dev
Retired Dev


Joined: 20 Sep 2003
Posts: 548
Location: Brisbane, QLD, Australia

PostPosted: Fri May 26, 2006 1:20 am    Post subject: Reply with quote

I found the power supply would start cutting out unexpectedly, causing the whole machine to reboot. This was the bare machine with no peripherals. Basically, the power supply was being overloaded, and so the protection circuit cut in, killed the power.

If the machine looses power momentarily, this could cause data in CPU registers to become scrambled, and thus causes a hard lock.
_________________
Stuart Longland (a.k.a Redhatter, VK4MSL)
I haven't lost my mind - it's backed up on a tape somewhere...

Gentoo/MIPS Cobalt developer, Mozilla herd member.
Back to top
View user's profile Send private message
yarrumretep
n00b
n00b


Joined: 06 May 2006
Posts: 8

PostPosted: Fri May 26, 2006 1:46 pm    Post subject: Reply with quote

I've spliced in a beefier 12v power supply from an old laptop that is now defunct. I'm hoping that does the trick. I'll report back with my findings. If that doesn't work, I'll try getting the extra logging set up.

Rock on,

pete
Back to top
View user's profile Send private message
yarrumretep
n00b
n00b


Joined: 06 May 2006
Posts: 8

PostPosted: Sat May 27, 2006 9:01 pm    Post subject: Reply with quote

Update:

Quote:
pete@cobalt ~/aquarium $ uptime
16:59:31 up 1 day, 9:45, 2 users, load average: 0.00, 0.00, 0.00

So far, so good. Looks like it might have been the power supply.

I'll keep you posted!

Cheers,

pete
Back to top
View user's profile Send private message
Display posts from previous:   
Reply to topic    Gentoo Forums Forum Index Gentoo on Alternative Architectures All times are GMT
Page 1 of 1

 
Jump to:  
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum