Zucca wrote:
EDIT02: After enabling gfx942 and recompiling with --newuse, nothing significant happened. Emerging took about 30 seconds.
Do we have the same versions we're compiling?
Are we talking about the same package? I'm talking about hipBLASLt. The process that takes all the memory and threads is Tensile<something starting with capital letter here> while building some kind of library related to gfx942. If you have a different GPU higher target may be enabled and I guess it'll take even more memory to build the libraries.
Also since then I think more versions have entered the tree. At that time it was hipBLASLt-6.1.1-r1
EDIT: more targets are enabled for 6.3.2
For 6.1.1-r1 they are AMDGPU_TARGETS="gfx90a gfx940 gfx941 gfx942"
and for 6.3.2: AMDGPU_TARGETS="gfx90a gfx908 gfx940 gfx941 gfx942 gfx1100 gfx1101"
Also the llvm slot has changed to 19. I might give it another try just to verify if my findings are still relevant.
But 30 seconds, we're talking about something else. I remember the was a package that took like a minute on the background of all other packages taking hours.
EDIT2: interesting. Last time it took hours before getting to hipBLASLt and now it took only minutes. I'm not sure what's going on and I definitely don't have the logs. I also think I depcleaned after that experiment. Version 6.3.2 indeed enables more targets:
Code: Select all
-- AMDGPU_TARGETS: gfx1100;gfx1101;gfx908;gfx90a;gfx940;gfx941;gfx942;
I'll see it I can stop it before taking over my swap partition and enable only the targets that compiled successfully without too much RAM usage.
EDIT3: nah, it immediately starts gradually increasing swap usage. I guess I just didn't notice it last time. It definitely requires more RAM and will be faster with more RAM as swap, albeit on an SSD, is still much slower than RAM.
On [1/35] the swap usage jumped to 3 GB and on [2/35] to 32GB. I can only imagine where it can go.
EDIT4: I'm terminating this too. Further experiments are meaningless. It still consumes insane amounts of memory to build that libraries. And I can only guess it'll get even higher for the higher targets. I'm not attempting that anymore.
There are packages built for other distros, I think a -bin ebuild sounds reasonable, but I have no experience with that. If OP wants, they may file a bug and if they are lucky, somebody may write such an ebuild for them.
Best Regards,
Georgi