Memory exhausted: Desktop stalls instead of oom killer

Apheus · Guru Joined: 12 Jul 2008 Posts: 422

Hello,

I noticed an annoying problem on at least two gentoo machines, bare metal and virtualbox guest: If memory is exhausted by a user process, the complete desktop becomes unresponsive except for mouse cursor movement. One could suspect swap thrashing, but this happens also without any swap memory. Any action takes minutes to provoke a response, if at all. Sometimes, only Alt+PrintScr+REISUB is possible. In case of the virtual machine, I can see hard disk activity on the host in task manager at this times. Even when the guest does not have any swap, and I have no idea where the guest system reads from or writes to at that moment. I did never have a data loss or anything interesting on the hard drives after a shut off from that state - everything is clear after REISUB, or recovers with "recovering journal" after a hard shut off.

Even ssh login as root from another machine takes ages to show the login prompt - if at all.

Happened yesterday in the virtual machine while emerging palemoon with MAKEOPTS -j 6 and $PORTAGE_TMPDIR in tmpfs - 10 GiB is not enough apparently. I was able - after long waits - to switch to tty1, login and kill the emerge. At that moment, the oom killer killed firefox. Which was neither necessary nor useful at that point in time.

Another test case is the Google Chrome resource exhaustion discovered in the wake of the iMac/iOS CSS bug: https://s3.eu-central-1.amazonaws.com/sabri/chrome-reaper.html (CAUTION with that link of you use Chrome/Chromium et al).

I have a memory limit for the desktop session via cgroups. When I set this up years ago, it was intended to protect root processes (like ssh) from that behavior of desktop processes. A cgroup named "gui" is configured in /etc/cgroup/cgconfig.conf and gets assigned via two lines in /etc/X11/startDM.sh. But the problem persists even if I switch that assigment off. I can only use memory.limit_in_bytes to control the amount of space the desktop processes get before the problem starts.

Desktop is KDE. I would expect problems if kwin_x11 cannot get memory, but still the oom killer should kick in. And it should kick in pretty fast.

Any ideas?

Edit: Typo in path to cgconfig.conf
_________________
My phrenologist says I'm stupid.

eccerr0r · Posted: Wed Sep 19, 2018 2:30 pm Post subject:

Yes it's still "swapping" to do the best it can to prevent an OOM situation, even if you don't have swap.

You should add real swap or add more memory. Not using your GUI when running memory intensive stuff will help the kernel reach true OOM faster.

Ultimately you're still asking the kernel to put 11 liters of water into a 10 liter jug and all it's trying to do is prevent OOM from killing any programs until the very end when RAM is full of data it cannot figure out how to get elsewhere.
_________________
Intel Core i7 2700K/Radeon R7 250/24GB DDR3/256GB SSD
What am I supposed watching?

Apheus · Guru Joined: 12 Jul 2008 Posts: 422

Thanks.

khayyam

Apheus ... from the various details I'm inclined to think that scheduling is the underlying issue, I suspect you're using CFS (Completely Fair Scheduler), and CFQ (IOSCHED_CFQ), for I/O scheduling (probably also CFS_BANDWIDTH, CGROUP_SCHED, etc for cgroups). If that is the case then you might find better scheduling with MuQSS (which is the redesigned/updated/refactored BFS), and BFQ (Budget Fair Queueing I/O scheduler). Anyhow ...

NeddySeagoon · Posted: Mon Sep 24, 2018 11:02 am Post subject:

Apheus,

The kernel has several mechanisms for swapping.
The swap partition only allows the kernel to swap dynamically allocated RAM, so by not having any swap space you rob the kernel of that option.
tmpfs can also be swapped to swap.

As the kernel memory maps code to RAM and doesn't load anything until there is a page fault, when it tries to use it, (the page isn't loaded) it swaps by dropping some of the loaded code.
It will reload it later, if its needed. This dropping code and reloading is swapping.

Dirty buffers can be written to disk to free RAM.
Caches can be cleared and reloaded.

All in all, not having swap is a bad idea. All this other swapping goes on, you just don't see it. A few MB of swap permanently in use is harmless. Much more, and you need more physical RAM.
Swap space not being used is an indicator that all is well.
_________________
Regards,

NeddySeagoon

Computer users fall into two groups:-
those that do backups
those that have never had a hard drive fail.

Apheus · Guru Joined: 12 Jul 2008 Posts: 422

Thanks.

Apheus · Guru Joined: 12 Jul 2008 Posts: 422

I tried =sys-kernel/ck-sources-4.14.69 with BFQ and MuQSS in the virtual machine. Since google chrome fixed the reaper bug, my test case is the new firefox version from reaperbugs.com. Firefox 60.2.1 with empty test profile.

I removed the limit for the "gui" cgroup in cgconfig.conf. 10GiB physical memory, 250 MiB swap partition.

At first, the behavior seemed better: Kinfocenter with the memory charts remained responsive. At one point I could see half the swap space used. I could Alt-Tab between windows, with ~1 minute delay. KDE marked the firefox window as "Not responding". But after some minutes, everything came to a halt and I had to Alt+PrntScrn+REISUB.
_________________
My phrenologist says I'm stupid.

khayyam · Posted: Mon Sep 24, 2018 1:03 pm Post subject:

Apheus ...

hmmm ... ok, can you now test with the following:

Apheus · Guru Joined: 12 Jul 2008 Posts: 422

khayyam · Posted: Mon Sep 24, 2018 4:44 pm Post subject:

Apheus ...

I'm reminded why I'm still on 3.12.x, I got sick of the near constant instability when tracking "stable". But, yes, looks like a deeper problem there, try and reproduce with 4.14, or 4.9 (not sure what kernel you have currently). Sorry I couldn't be of more help.

best ... khay

devsk · Posted: Wed Sep 26, 2018 12:16 am Post subject:

Apheus · Guru Joined: 12 Jul 2008 Posts: 422

devsk · Posted: Wed Sep 26, 2018 4:17 pm Post subject:

lower swappiness => hunt for free pages at the last minute
higher swappiness => hunt for free pages at the earliest memory pressure

Higher swappiness is amortizing the cost of free page hunt over a longer period of time and on under configured systems, is the right choice. Lower swappiness is better for well (over) configured system because you will never encounter even the tiniest of hiccups because of page hunts. "Free page" hunts are world stopping events because they can happen inline in the same process or by kswapd in the kernel. Depending on conditions, they can hard block for a while.