View previous topic :: View next topic |
Author |
Message |
DONAHUE Watchman
Joined: 09 Dec 2006 Posts: 7651 Location: Goose Creek SC
|
Posted: Fri Sep 21, 2012 8:59 pm Post subject: |
|
|
http://swift.siphos.be/linux_sea/ch07.html#idp35190256 is a very well written discussion of the process we have been following (In a very well written discussion of linux generally). What we have been doing is leading in the nasty direction of some (intermittent? random?) hardware failure. The logic being that the drivers are correct for the hdd controller bus, correct for the hdd motherboard adapter, correct for the file system, and grub was passing the correct root=partition. (Almost) leaving only hardware. gpt/uefi was not part of the reference discussion. While I can vouch for drivers sd, sg, sr, ahci, ata_pIIx, ext4; I have little experience with the efi/gpt drivers -- EFI_PARTITION, EFI, EFI_STUB, FB_EFI, EFI_VARS. These may easily have quirks that I know nothing about.
Intermittent random is justified by multiple cases of panic block-device(0,0), some panic block-device(0,68), one no panic but an oops; however reconfigures were also happening.
Trying again the no panic but an oops kernel may help support or debunk intermittent random.
I'm off to try a gpt install.
BTW any time you boot the cd might be interesting..
if you have a sysresccd: "boot from first hard drive option" might be interesting not sure if it will handle gpt/grub2 but .. _________________ Defund the FCC. |
|
Back to top |
|
|
Jickler n00b
Joined: 09 Sep 2011 Posts: 72
|
Posted: Fri Sep 21, 2012 9:25 pm Post subject: |
|
|
Just got back, I'm compiling the "bad config" now. This would be the kernel where I anticipate not hitting a VFS kernel panic and getting a dmesg log out of. Maybe it'll boot if the file system's repaired, who's knows? (Magic Eight Ball says "Don't count on it")
I'll run e2fsck after I can umount this drive. It would be interesting if the filesystem has broken again..
I made clean, so this'll take a second. Seeing that I have time, I'll start burning sysresccd (I don't have a laptop down here, but I do have an external burner).
-EDIT:
e2fsck appears to be clean after reboot:
Code: | # e2fsck /dev/sde4
e2fsck 1.41.14 (22-Dec-2010)
/dev/sde4: clean, 290139/13590528 files, 1927609/54361745 blocks
|
|
|
Back to top |
|
|
Jickler n00b
Joined: 09 Sep 2011 Posts: 72
|
Posted: Fri Sep 21, 2012 10:07 pm Post subject: |
|
|
VFS panic at 0,0
Ugh, I just don't understand. I'm pretty positive this is the exact config I used when I got my dmesg this morning. I can accept that the SATA drivers aren't working well here because, well, that's what this means. Then, on the other hand,THE Linux kernel can handle this hardware. This computer has ran both Ubuntu and Gentoo in the past -- clearly there exists a set of configuration options for at least two versions of the Linux kernel that reliably and consistently boots on this hardware.
I also tried on the sysresccd Boot From First Hard Disk, same result. Maybe I'll poke through the UEFI settings some more; maybe there's some bizarre "feature" or something.. |
|
Back to top |
|
|
DONAHUE Watchman
Joined: 09 Dec 2006 Posts: 7651 Location: Goose Creek SC
|
Posted: Fri Sep 21, 2012 10:28 pm Post subject: |
|
|
e2fsck /dev/sde4?
the random intermittent nature now heading toward repeatable suggests hardware
suggest boot sysresccd to GUI and run smartmontools against sde drive (or smartctl in cli).
http://wiki.gentoo.org/wiki/Smartmontools
memtest if any other os (except windows ) has been flaky.
different kernel? _________________ Defund the FCC. |
|
Back to top |
|
|
Jickler n00b
Joined: 09 Sep 2011 Posts: 72
|
Posted: Fri Sep 21, 2012 10:43 pm Post subject: |
|
|
I just got it.
Vaguely remembered poking around in GRUB settings before I got my dmesg boot. Decided to do so again before starting hw tests. Noticed GRUB had configured root=/dev/sde4, which doesn't make sense bc the UUID root is in GRUB default. Deleted the UUID line and now I'm sitting at login. Will reset to reproduce, then compile seed config to see if this continues to check out. Must have been a typo or syntax error, will figure out when I'm back on liveCD. |
|
Back to top |
|
|
DONAHUE Watchman
Joined: 09 Dec 2006 Posts: 7651 Location: Goose Creek SC
|
Posted: Fri Sep 21, 2012 10:47 pm Post subject: |
|
|
ok. don't lose what you have.
my grub.cfg also defaults to root=/dev/sda2 not to a root=UUID=28c39827-2409-48a3-a17a-81b577db9681
mine may work because I run everything on a single partition
Quote: | menuentry 'Gentoo GNU/Linux' --class gentoo --class gnu-linux --class gnu --class os $menuentry_id_option 'gnulinux-simple-28c39827-2409-48a3-a17a-81b577db9681' {
load_video
insmod gzio
insmod part_msdos
insmod reiserfs
set root='hd0,msdos2'
if [ x$feature_platform_search_hint = xy ]; then
search --no-floppy --fs-uuid --set=root --hint-bios=hd0,msdos2 --hint-efi=hd0,msdos2 --hint-baremetal=ahci0,msdos2 --hint='hd0,msdos2' 28c39827-2409-48a3-a17a-81b577db9681
else
search --no-floppy --fs-uuid --set=root 28c39827-2409-48a3-a17a-81b577db9681
fi
echo 'Loading Linux 3.5.3-gentoo ...'
linux /boot/vmlinuz-3.5.3-gentoo root=/dev/sda2 ro
} |
_________________ Defund the FCC. |
|
Back to top |
|
|
Jickler n00b
Joined: 09 Sep 2011 Posts: 72
|
Posted: Fri Sep 21, 2012 11:05 pm Post subject: |
|
|
Alrighty, I'm off my iphone and on my liveCD
It wasn't a typo in the actual UUID address at least, I have the UUID from blkid and the UUID from grub.cfg lined up and they match perfectly.
This is the actual line in grub.cfg:
Code: | linux /vmlinuz-3.4.9-gentoo root=/dev/sde4 ro rootfstype=ext4 root=UUID=3c161f01-0c84-4472-a564-7b7b89f2cfca |
This is from blkid:
Code: | /dev/sde4: UUID="3c161f01-0c84-4472-a564-7b7b89f2cfca" TYPE="ext4" |
I think I'm going to backup the entire /usr/src/linux into a golden backup folder, that I will love and cherish indefinitely. Then I'm going to build the kernel seed kernel, because, well, it appears to be perfect as far as I can tell. And then, God willing, I'm going to reboot, make sure grub.cfg is okay, start compiling X after the system boots flawlessly (bruising my knuckles knocking on wood), figure out how to mark this as solved or state those words in another post if that's the task of moderators and find some sort-of celebratory action to perform. |
|
Back to top |
|
|
DONAHUE Watchman
Joined: 09 Dec 2006 Posts: 7651 Location: Goose Creek SC
|
Posted: Fri Sep 21, 2012 11:11 pm Post subject: |
|
|
4 more crossed fingers here. golden folder on a usb in your pocket?
my grub.cfg also defaults to root=/dev/sda2 not to a root=UUID=28c39827-2409-48a3-a17a-81b577db9681 as we are lead to expect by the grub2 advertisements
mine may work because I run everything on a single partition with only one disk making it hard for the kernel to get confused about which disk is which.
your gnulinux-simple-UUID is different from your --set=root UUID's
Quote: | menuentry 'Gentoo GNU/Linux' --class gentoo --class gnu-linux --class gnu --class os $menuentry_id_option 'gnulinux-simple-28c39827-2409-48a3-a17a-81b577db9681' {
load_video
insmod gzio
insmod part_msdos
insmod reiserfs
set root='hd0,msdos2'
if [ x$feature_platform_search_hint = xy ]; then
search --no-floppy --fs-uuid --set=root --hint-bios=hd0,msdos2 --hint-efi=hd0,msdos2 --hint-baremetal=ahci0,msdos2 --hint='hd0,msdos2' 28c39827-2409-48a3-a17a-81b577db9681
else
search --no-floppy --fs-uuid --set=root 28c39827-2409-48a3-a17a-81b577db9681
fi
echo 'Loading Linux 3.5.3-gentoo ...'
linux /boot/vmlinuz-3.5.3-gentoo root=/dev/sda2 ro
} |
_________________ Defund the FCC. |
|
Back to top |
|
|
Jickler n00b
Joined: 09 Sep 2011 Posts: 72
|
Posted: Sat Sep 22, 2012 12:26 am Post subject: |
|
|
Hey, sorry for the wait. Had to celebrate a bit
I backed it up on lvm#1, which isn't exactly external media, but then on the other hand if lvm#1 goes down then I wouldn't want to be that desktop computer or the nearest sledgehammer, so it won't really matter anyways.
I took the seed config and spent some time adding some modules, checking over things, googling things I want and making sure the required kernel config for them matched up, etc. Don't think I changed anything that was actually compiled into the kernel at all.
Now when the computer boots it shows (none)@unknown_domain instead of root and after you log in everything's a read only filesystem. BUT...this is coming back to me. This is exactly what happened when I installed Gentoo the first time, and I figured it out. It had something to do with the ext4 filesystem breaking if I remember correctly. Last time I booted I e2fsck'd and it was indeed broken again. Upon repair I rebooted but the same thing happened. I just rebooted into a liveCD again and:
Code: | ubuntu@ubuntu:~$ sudo e2fsck /dev/sde4
e2fsck 1.41.14 (22-Dec-2010)
/dev/sde4 contains a file system with errors, check forced.
Pass 1: Checking inodes, blocks, and sizes
Pass 2: Checking directory structure
Pass 3: Checking directory connectivity
Pass 4: Checking reference counts
Pass 5: Checking group summary information
/dev/sde4: 338714/13590528 files (0.1% non-contiguous), 2109799/54361745 blocks
|
My dmesg's here if you want to take a look at it: http://pastebin.com/f8mgdsqq
That said, I'm pretty sure I can fix this. I remember it not being a config issue but just something that had to do with the filesystem. I remember not finding any examples of this elsewhere, but I think if I read up on repairing the filesystem that might be how you solve it.
Oh, right, and /etc/conf.d/hostname is set, that's not it. |
|
Back to top |
|
|
DONAHUE Watchman
Joined: 09 Dec 2006 Posts: 7651 Location: Goose Creek SC
|
Posted: Sat Sep 22, 2012 12:31 am Post subject: |
|
|
Code: | mount -o remount,rw |
_________________ Defund the FCC. |
|
Back to top |
|
|
Jickler n00b
Joined: 09 Sep 2011 Posts: 72
|
Posted: Sat Sep 22, 2012 12:36 am Post subject: |
|
|
From within the running system? |
|
Back to top |
|
|
DONAHUE Watchman
Joined: 09 Dec 2006 Posts: 7651 Location: Goose Creek SC
|
Posted: Sat Sep 22, 2012 12:39 am Post subject: |
|
|
as root
may need Code: | mount -o remount,rw / | actually _________________ Defund the FCC. |
|
Back to top |
|
|
DONAHUE Watchman
Joined: 09 Dec 2006 Posts: 7651 Location: Goose Creek SC
|
Posted: Sat Sep 22, 2012 12:48 am Post subject: |
|
|
Code: | rc-update show | wgetpaste | from the booted system if possible; chroot if not _________________ Defund the FCC. |
|
Back to top |
|
|
Jickler n00b
Joined: 09 Sep 2011 Posts: 72
|
Posted: Sat Sep 22, 2012 12:51 am Post subject: |
|
|
Okay, that command succeeds at making / RW. Domain name doesn't update, bash prompt reads (none) ~ # and this doesn't change after exiting and logging in again. Testing reboot now.
Reboot logged into unknown domain, RO again. Noticed kernel saw filesystem with errors and forced fsck while booting this time. |
|
Back to top |
|
|
DONAHUE Watchman
Joined: 09 Dec 2006 Posts: 7651 Location: Goose Creek SC
|
Posted: Sat Sep 22, 2012 12:54 am Post subject: |
|
|
crossed in the mail
you may be missing boot and sysinit services _________________ Defund the FCC. |
|
Back to top |
|
|
Jickler n00b
Joined: 09 Sep 2011 Posts: 72
|
Posted: Sat Sep 22, 2012 12:58 am Post subject: |
|
|
After remounting and dhcpcd could do it: http://bpaste.net/show/46830/
Also noticed kernel boot sequence got a little more red at the end than I remember the first few times. |
|
Back to top |
|
|
DONAHUE Watchman
Joined: 09 Dec 2006 Posts: 7651 Location: Goose Creek SC
|
Posted: Sat Sep 22, 2012 1:10 am Post subject: |
|
|
bootmisc | boot
devfs | sysinit
dmesg | sysinit
fsck | boot
hostname | boot should get the hostname for the prompt, nano /etc/issue, "This is \n.\O (\s \m \r) \t"
hwclock | boot
keymaps | boot
killprocs | shutdown
local | default
localmount | boot
modules | boot
mount-ro | shutdown
mtab | boot
net.lo | boot
netmount | default
procfs | boot
root | boot should remount rw for you with udev in sysinit
savecache | shutdown
sshd | default
swap | boot
sysctl | boot
syslog-ng | default
termencoding | boot
udev | sysinit
udev-postmount | default
urandom | boot
vixie-cron | default _________________ Defund the FCC. |
|
Back to top |
|
|
Jickler n00b
Joined: 09 Sep 2011 Posts: 72
|
Posted: Sat Sep 22, 2012 1:19 am Post subject: |
|
|
But they're not working? Haha, everything I read on /etc/issue is about boot logos, which is pretty cool, but not what I need. Mine looks like that: \n. \O...etc. How can we figure out why these services aren't working?
I dunno if this is any help but I have rc logging on, here's rc.log: http://bpaste.net/show/46832/ |
|
Back to top |
|
|
DONAHUE Watchman
Joined: 09 Dec 2006 Posts: 7651 Location: Goose Creek SC
|
Posted: Sat Sep 22, 2012 1:52 am Post subject: |
|
|
fsck failure is blocking services if it makes sense Code: | emerge smartmontools | if not fix date and/or time zone and
dying drive or time problems are common causes is your bios clock on local or gmt time? _________________ Defund the FCC. |
|
Back to top |
|
|
Jickler n00b
Joined: 09 Sep 2011 Posts: 72
|
Posted: Sat Sep 22, 2012 1:58 am Post subject: |
|
|
GMTA, just finished setting up NTP.
Yup, yup! That fixed it |
|
Back to top |
|
|
Jickler n00b
Joined: 09 Sep 2011 Posts: 72
|
Posted: Sat Sep 22, 2012 2:21 am Post subject: |
|
|
Writing from Lynx --
Yup, this system feels good I don't anticipate anymore super weird problems, everything seems to be running well. I'm sure the problems I run into henceforth will be the everyday noob ones that I can work through with the excellent documentation project. Just wanted to say thanks again, I'm sure this has been expressed many times over on this board but everyone who contributes to forum support is a freaking hero in my book <3 If one day I know enough about Gentoo to provide even a fraction of the excellent support that I've recieved in this thread, on these forums and on irc then doing so will make this hobby of mine sooo worth it |
|
Back to top |
|
|
DONAHUE Watchman
Joined: 09 Dec 2006 Posts: 7651 Location: Goose Creek SC
|
Posted: Sat Sep 22, 2012 2:37 am Post subject: |
|
|
buon viaggio
nvidia-drivers will require a minimal xorg.conf
nvidia-xconfig will provide it
nvidia-drivers will need an emerge for each kernel change
copy your .config to the new /usr/src/linux and use make oldconfig to adapt it _________________ Defund the FCC. |
|
Back to top |
|
|
|