Forums

Skip to content

Advanced search
  • Quick links
    • Unanswered topics
    • Active topics
    • Search
  • FAQ
  • Login
  • Register
  • Board index Assistance Kernel & Hardware
  • Search

Is MCE log still in use?

Kernel not recognizing your hardware? Problems with power management or PCMCIA? What hardware is compatible with Gentoo? See here. (Only for kernels supported by Gentoo.)
Post Reply
Advanced search
5 posts • Page 1 of 1
Author
Message
midnite
Guru
Guru
User avatar
Posts: 474
Joined: Sun Apr 09, 2006 1:37 pm
Location: Hong Kong

Is MCE log still in use?

  • Quote

Post by midnite » Wed Jun 29, 2022 10:21 am

In Handbook, it says:

https://wiki.gentoo.org/wiki/Handbook:A ... figuration
Next select the exact processor type. It is also recommended to enable MCE features (if available) so that users are able to be notified of any hardware problems. On some architectures (such as x86_64), these errors are not printed to dmesg, but to /dev/mcelog. This requires the app-admin/mcelog package.
However, in the app-admin/mcelog package, it says:
Starting with version 2.6.4, the Linux kernel for x86-64 no longer decodes and logs recoverable Machine Check Exception events to the kernel log on its own.
We are using 5.x.x kernels in 2022. Does it mean that we will never get error messages on MCE?

- - - -

On the other hand, in the kernel, the description of X86_MCELOG_LEGACY says:
CONFIG_X86_MCELOG_LEGACY:

│ Enable support for /dev/mcelog which is needed by the old mcelog
│ userspace logging daemon. Consider switching to the new generation
│ rasdaemon solution.
If I did not enable X86_MCELOG_LEGACY, I do not have the device file /dev/mcelog. So I think X86_MCELOG_LEGACY is needed for the kernel to write something into /dev/mcelog, then parsed by app-admin/mcelog.

- - - -

For your information, I am able to test mcelog by following the instruction in http://mcelog.org/README.html . The mce-inject is available in https://github.com/andikleen/mce-inject . Be caution that this test will halt your system, halt for a few seconds then auto reboot. You will loss all unsaved files.

- - - -

To summarise,
  1. Will we still get MCE logs in case of hardware problems?
  2. If no, MCE logs are no longer in use (hinted by app-admin/mcelog and kernel's X86_MCELOG_LEGACY). How to configure "rasdaemon"? (Handbook may need updating if so.)
- midnite.
Top
slaterson
Guru
Guru
Posts: 313
Joined: Wed Feb 26, 2003 3:37 pm

  • Quote

Post by slaterson » Fri Nov 04, 2022 3:59 pm

i have the same question.... i get mce messages to my console and in syslog, however mcelog fails to start with

Code: Select all

mcelog: Cannot open `/dev/mcelog': No such file or directory
can't figure out how to get the device to be created. MCE, MCELOG, and MCE_INTEL are all enabled in my kernel.
Top
Hu
Administrator
Administrator
Posts: 24385
Joined: Tue Mar 06, 2007 5:38 am

  • Quote

Post by Hu » Fri Nov 04, 2022 4:47 pm

git grep mcelog -- '*Kconfig' leads to arch/x86/Kconfig, which has:

Code: Select all

config X86_MCELOG_LEGACY
	bool "Support for deprecated /dev/mcelog character device"
	depends on X86_MCE
	help
	  Enable support for /dev/mcelog which is needed by the old mcelog
	  userspace logging daemon. Consider switching to the new generation
	  rasdaemon solution.
You could enable this deprecated feature, or switch to the recommended new approach.
Top
toralf
Developer
Developer
User avatar
Posts: 3944
Joined: Sun Feb 01, 2004 2:58 pm
Location: Hamburg
Contact:
Contact toralf
Website

  • Quote

Post by toralf » Fri Nov 04, 2022 11:15 pm

I tried rasdaemon a yr ago - did not got it up and running at an i5.
And it crashes constantly here at an amd 5950.
Top
mike155
Advocate
Advocate
Posts: 4438
Joined: Fri Sep 17, 2010 11:33 pm
Location: Frankfurt, Germany

  • Quote

Post by mike155 » Sat Nov 05, 2022 9:21 pm

I'm afraid I haven't understood what mcelog does and how it works.

I remember that I once had a server with a faulty RAM. There was an error message in /var/log/mcelog. That made me think that the mcelog daemon is something like syslog for hardware errors. I thought that the kernel generates an error message and the mcelog daemon writes it to /var/log/mcelog.

But I guess that's wrong. Right now I think that the mcelog daemon (or rasdaemon) detects an error and writes a message to a log file. Furthermore, it informs the kernel about the issue.

Does anyone know a good introduction on mcelog and rasdaemon? I would like to know how this really works and on what kind of machines it's recommended to install mcelog or rasdaemon.
Top
Post Reply

5 posts • Page 1 of 1

Return to “Kernel & Hardware”

Jump to
  • Assistance
  • ↳   News & Announcements
  • ↳   Frequently Asked Questions
  • ↳   Installing Gentoo
  • ↳   Multimedia
  • ↳   Desktop Environments
  • ↳   Networking & Security
  • ↳   Kernel & Hardware
  • ↳   Portage & Programming
  • ↳   Gamers & Players
  • ↳   Other Things Gentoo
  • ↳   Unsupported Software
  • Discussion & Documentation
  • ↳   Documentation, Tips & Tricks
  • ↳   Gentoo Chat
  • ↳   Gentoo Forums Feedback
  • ↳   Duplicate Threads
  • International Gentoo Users
  • ↳   中文 (Chinese)
  • ↳   Dutch
  • ↳   Finnish
  • ↳   French
  • ↳   Deutsches Forum (German)
  • ↳   Diskussionsforum
  • ↳   Deutsche Dokumentation
  • ↳   Greek
  • ↳   Forum italiano (Italian)
  • ↳   Forum di discussione italiano
  • ↳   Risorse italiane (documentazione e tools)
  • ↳   Polskie forum (Polish)
  • ↳   Instalacja i sprzęt
  • ↳   Polish OTW
  • ↳   Portuguese
  • ↳   Documentação, Ferramentas e Dicas
  • ↳   Russian
  • ↳   Scandinavian
  • ↳   Spanish
  • ↳   Other Languages
  • Architectures & Platforms
  • ↳   Gentoo on ARM
  • ↳   Gentoo on PPC
  • ↳   Gentoo on Sparc
  • ↳   Gentoo on Alternative Architectures
  • ↳   Gentoo on AMD64
  • ↳   Gentoo for Mac OS X (Portage for Mac OS X)
  • Board index
  • All times are UTC
  • Delete cookies

© 2001–2026 Gentoo Foundation, Inc.

Powered by phpBB® Forum Software © phpBB Limited

Privacy Policy

 

 

magic