Gentoo Forums
[SOLVED] Problem with AMDGPU: blank screen at boot

vladimir1986
n00b

Joined: 09 Dec 2010
Posts: 37

Posted: Fri Jan 05, 2018 3:28 pm    Post subject: [SOLVED] Problem with AMDGPU: blank screen at boot

Hello again! (old user coming back).

I am writing because I am having problems with a new AMD GPU on my main desktop. (It had been working fine with an Nvidia card, but I decided to replace it in order to be able to use newer kernels and free drivers.)

The card I chose is an ATI Radeon RX570 (though lspci reports it as an RX 470/480). My CPU is a Ryzen 7, so I don't have an integrated GPU to produce video output. I followed all the AMDGPU instructions and made sure everything was set in the kernel (4.14.11-r2), but at the point where KMS would load I only get a black screen. The computer otherwise works and I am able to access it via ssh. Typing my username/password and running startx doesn't bring up the monitor or X. I don't normally use an initrd image, but after googling I decided to try one, and that didn't change anything. I see that someone had the same problem years ago and it was a firmware problem. However, installing linux-firmware and adding it to the config file didn't make any difference.

The firmware section of my .config is set as follows (the wiki says that the RX570 is POLARIS10):

Code:
CONFIG_FW_LOADER=y
CONFIG_FIRMWARE_IN_KERNEL=y
CONFIG_EXTRA_FIRMWARE="amdgpu/polaris10_ce.bin amdgpu/polaris10_k_smc.bin amdgpu/polaris10_mc.bin amdgpu/polaris10_me.bin amdgpu/polaris10_mec2.bin amdgpu/polaris10_mec.bin amdgpu/polaris10_pfp.bin amdgpu/polaris10_rlc.bin amdgpu/polaris10_sdma1.bin amdgpu/polaris10_sdma.bin amdgpu/polaris10_smc.bin amdgpu/polaris10_smc_sk.bin amdgpu/polaris10_uvd.bin amdgpu/polaris10_vce.bin"
CONFIG_EXTRA_FIRMWARE_DIR="/lib/firmware"


The GPU section is set as follows:

Code:
#
# Graphics support
#
CONFIG_AGP=y
CONFIG_AGP_AMD64=y
# CONFIG_AGP_INTEL is not set
# CONFIG_AGP_SIS is not set
# CONFIG_AGP_VIA is not set
CONFIG_VGA_ARB=y
CONFIG_VGA_ARB_MAX_GPUS=16
# CONFIG_VGA_SWITCHEROO is not set
CONFIG_DRM=y
# CONFIG_DRM_DP_AUX_CHARDEV is not set
# CONFIG_DRM_DEBUG_MM is not set
# CONFIG_DRM_DEBUG_MM_SELFTEST is not set
CONFIG_DRM_KMS_HELPER=y
CONFIG_DRM_KMS_FB_HELPER=y
CONFIG_DRM_FBDEV_EMULATION=y
CONFIG_DRM_FBDEV_OVERALLOC=100
# CONFIG_DRM_LOAD_EDID_FIRMWARE is not set
CONFIG_DRM_TTM=y

#
# I2C encoder or helper chips
#
# CONFIG_DRM_I2C_CH7006 is not set
# CONFIG_DRM_I2C_SIL164 is not set
# CONFIG_DRM_I2C_NXP_TDA998X is not set
# CONFIG_DRM_RADEON is not set
CONFIG_DRM_AMDGPU=y
CONFIG_DRM_AMDGPU_SI=y
CONFIG_DRM_AMDGPU_CIK=y
CONFIG_DRM_AMDGPU_USERPTR=y
CONFIG_DRM_AMDGPU_GART_DEBUGFS=y

#
# Display Panels
#
CONFIG_DRM_BRIDGE=y
CONFIG_DRM_PANEL_BRIDGE=y

#
# Display Interface Bridges
#
# CONFIG_DRM_ANALOGIX_ANX78XX is not set
# CONFIG_DRM_HISI_HIBMC is not set
# CONFIG_DRM_TINYDRM is not set
# CONFIG_DRM_LEGACY is not set
# CONFIG_DRM_LIB_RANDOM is not set

#
# Frame buffer Devices
#
CONFIG_FB=y
# CONFIG_FIRMWARE_EDID is not set
CONFIG_FB_CMDLINE=y
CONFIG_FB_NOTIFY=y
# CONFIG_FB_DDC is not set
# CONFIG_FB_BOOT_VESA_SUPPORT is not set
CONFIG_FB_CFB_FILLRECT=y
CONFIG_FB_CFB_COPYAREA=y
CONFIG_FB_CFB_IMAGEBLIT=y
# CONFIG_FB_CFB_REV_PIXELS_IN_BYTE is not set
CONFIG_FB_SYS_FILLRECT=y
CONFIG_FB_SYS_COPYAREA=y
CONFIG_FB_SYS_IMAGEBLIT=y
# CONFIG_FB_PROVIDE_GET_FB_UNMAPPED_AREA is not set
# CONFIG_FB_FOREIGN_ENDIAN is not set
CONFIG_FB_SYS_FOPS=y
CONFIG_FB_DEFERRED_IO=y
# CONFIG_FB_SVGALIB is not set
# CONFIG_FB_MACMODES is not set
# CONFIG_FB_BACKLIGHT is not set
CONFIG_FB_MODE_HELPERS=y
CONFIG_FB_TILEBLITTING=y

Am I missing something? Thank you in advance!

[Moderator edit: added [code] tags to preserve output layout. -Hu]


Last edited by vladimir1986 on Thu Jan 11, 2018 2:33 pm; edited 1 time in total

Chrishas
n00b

Joined: 19 Jul 2015
Posts: 12

Posted: Fri Jan 05, 2018 6:57 pm

Hi vladimir1986,

Can you provide your dmesg output?
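
When posting dmesg output for a problem like this, it helps to cut the log down to the lines that matter. A minimal sketch, assuming a POSIX shell; the function name `gpu_log_errors` is made up for this example and the patterns are only a starting point:

```shell
# Hypothetical helper: keep only drm/amdgpu lines that report a problem.
# Reads kernel log text on stdin, for example:  dmesg | gpu_log_errors
gpu_log_errors() {
    # First keep lines that mention the graphics stack, then keep only
    # those that also contain an error-like word.
    grep -iE 'drm|amdgpu' | grep -iE 'error|fail|fatal'
}
```

Typical use would be `dmesg | gpu_log_errors`. The full log is still worth keeping around, since the lines just before an error often explain it.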

DancesWithWords
Guru

Joined: 29 Jun 2002
Posts: 341
Location: ottawa, canada

Posted: Fri Jan 05, 2018 7:23 pm

I would forget about installing the firmware into the kernel and do the following.

Code:
# Graphics support
#
CONFIG_AGP=y
CONFIG_AGP_AMD64=y
CONFIG_AGP_INTEL=m
CONFIG_AGP_SIS=m
CONFIG_AGP_VIA=m
CONFIG_INTEL_GTT=m
CONFIG_VGA_ARB=y
CONFIG_VGA_ARB_MAX_GPUS=16
# CONFIG_VGA_SWITCHEROO is not set
CONFIG_DRM=m
# CONFIG_DRM_DP_AUX_CHARDEV is not set
CONFIG_DRM_KMS_HELPER=m
CONFIG_DRM_KMS_FB_HELPER=y
CONFIG_DRM_FBDEV_EMULATION=y
# CONFIG_DRM_LOAD_EDID_FIRMWARE is not set
CONFIG_DRM_TTM=m

#
# I2C encoder or helper chips
#
# CONFIG_DRM_I2C_CH7006 is not set
# CONFIG_DRM_I2C_SIL164 is not set
# CONFIG_DRM_I2C_NXP_TDA998X is not set
# CONFIG_DRM_RADEON is not set
CONFIG_DRM_AMDGPU=m
CONFIG_DRM_AMDGPU_SI=y
CONFIG_DRM_AMDGPU_CIK=y
CONFIG_DRM_AMDGPU_USERPTR=y



That worked for me. Give it a try.

===
DWW

[Moderator edit: added [code] tags to preserve output layout. -Hu]

vladimir1986
n00b

Joined: 09 Dec 2010
Posts: 37

Posted: Fri Jan 05, 2018 8:27 pm

Thank you!

About the dmesg output, I get this:

[ 0.610272] [drm] amdgpu kernel modesetting enabled.
[ 0.611494] amdgpu 0000:24:00.0: Invalid PCI ROM header signature: expecting 0xaa55, got 0xffff
[ 0.612155] amdgpu 0000:24:00.0: SME is active, device will require DMA bounce buffers
[ 0.612320] amdgpu 0000:24:00.0: SME is active, device will require DMA bounce buffers
[ 0.612491] amdgpu 0000:24:00.0: VRAM: 4096M 0x000000F400000000 - 0x000000F4FFFFFFFF (4096M used)
[ 0.612657] amdgpu 0000:24:00.0: GTT: 256M 0x0000000000000000 - 0x000000000FFFFFFF
[ 0.613510] [drm] amdgpu: 4096M of VRAM memory ready
[ 0.613614] [drm] amdgpu: 4096M of GTT memory ready.
[ 0.614370] amdgpu 0000:24:00.0: amdgpu: using MSI.
[ 0.614463] [drm] amdgpu: irq initialized.
[ 0.795847] amdgpu: [powerplay] amdgpu: powerplay sw initialized
[ 0.796067] [drm] AMDGPU Display Connectors
[ 0.799687] amdgpu 0000:24:00.0: fence driver on ring 0 use gpu addr 0x0000000000400040, cpu addr 0xffff975348709040
[ 0.799851] amdgpu 0000:24:00.0: fence driver on ring 1 use gpu addr 0x00000000004000c0, cpu addr 0xffff9753487090c0
[ 0.800019] amdgpu 0000:24:00.0: fence driver on ring 2 use gpu addr 0x0000000000400140, cpu addr 0xffff975348709140
[ 0.800183] amdgpu 0000:24:00.0: fence driver on ring 3 use gpu addr 0x00000000004001c0, cpu addr 0xffff9753487091c0
[ 0.800340] amdgpu 0000:24:00.0: fence driver on ring 4 use gpu addr 0x0000000000400240, cpu addr 0xffff975348709240
[ 0.800499] amdgpu 0000:24:00.0: fence driver on ring 5 use gpu addr 0x00000000004002c0, cpu addr 0xffff9753487092c0
[ 0.800658] amdgpu 0000:24:00.0: fence driver on ring 6 use gpu addr 0x0000000000400340, cpu addr 0xffff975348709340
[ 0.800814] amdgpu 0000:24:00.0: fence driver on ring 7 use gpu addr 0x00000000004003c0, cpu addr 0xffff9753487093c0
[ 0.800970] amdgpu 0000:24:00.0: fence driver on ring 8 use gpu addr 0x0000000000400440, cpu addr 0xffff975348709440
[ 0.801144] amdgpu 0000:24:00.0: fence driver on ring 9 use gpu addr 0x00000000004004e0, cpu addr 0xffff9753487094e0
[ 0.801691] amdgpu 0000:24:00.0: fence driver on ring 10 use gpu addr 0x0000000000400560, cpu addr 0xffff975348709560
[ 0.801863] amdgpu 0000:24:00.0: fence driver on ring 11 use gpu addr 0x00000000004005e0, cpu addr 0xffff9753487095e0
[ 0.802382] amdgpu 0000:24:00.0: fence driver on ring 12 use gpu addr 0x000000f4002ad420, cpu addr 0xffff975348e5a420
[ 0.802672] amdgpu 0000:24:00.0: fence driver on ring 13 use gpu addr 0x00000000004006e0, cpu addr 0xffff9753487096e0
[ 0.802829] amdgpu 0000:24:00.0: fence driver on ring 14 use gpu addr 0x0000000000400760, cpu addr 0xffff975348709760
[ 1.603540] [drm:gfx_v8_0_ring_test_ring] *ERROR* amdgpu: ring 0 test failed (scratch(0xC040)=0xCAFEDEAD)
[ 1.603692] [drm:amdgpu_device_init] *ERROR* hw_init of IP block <gfx_v8_0> failed -22
[ 1.603838] amdgpu 0000:24:00.0: amdgpu_init failed

[ 2.290390] [drm] amdgpu: ttm finalized
[ 2.290488] amdgpu 0000:24:00.0: Fatal error during GPU init
[ 2.290590] [drm] amdgpu: finishing device.
[ 2.291153] amdgpu: probe of 0000:24:00.0 failed with error -22

I recompiled as recommended and had the same result

DancesWithWords
Guru

Joined: 29 Jun 2002
Posts: 341
Location: ottawa, canada

Posted: Fri Jan 05, 2018 8:32 pm

vladimir1986 wrote:
Thank you!

About the dmesg output, I get this:

[ 0.610272] [drm] amdgpu kernel modesetting enabled.
[ 0.611494] amdgpu 0000:24:00.0: Invalid PCI ROM header signature: expecting 0xaa55, got 0xffff
[ 0.612155] amdgpu 0000:24:00.0: SME is active, device will require DMA bounce buffers
[ 0.612320] amdgpu 0000:24:00.0: SME is active, device will require DMA bounce buffers
[ 0.612491] amdgpu 0000:24:00.0: VRAM: 4096M 0x000000F400000000 - 0x000000F4FFFFFFFF (4096M used)
[ 0.612657] amdgpu 0000:24:00.0: GTT: 256M 0x0000000000000000 - 0x000000000FFFFFFF
[ 0.613510] [drm] amdgpu: 4096M of VRAM memory ready
[ 0.613614] [drm] amdgpu: 4096M of GTT memory ready.
[ 0.614370] amdgpu 0000:24:00.0: amdgpu: using MSI.
[ 0.614463] [drm] amdgpu: irq initialized.
[ 0.795847] amdgpu: [powerplay] amdgpu: powerplay sw initialized
[ 0.796067] [drm] AMDGPU Display Connectors
[ 0.799687] amdgpu 0000:24:00.0: fence driver on ring 0 use gpu addr 0x0000000000400040, cpu addr 0xffff975348709040
[ 0.799851] amdgpu 0000:24:00.0: fence driver on ring 1 use gpu addr 0x00000000004000c0, cpu addr 0xffff9753487090c0
[ 0.800019] amdgpu 0000:24:00.0: fence driver on ring 2 use gpu addr 0x0000000000400140, cpu addr 0xffff975348709140
[ 0.800183] amdgpu 0000:24:00.0: fence driver on ring 3 use gpu addr 0x00000000004001c0, cpu addr 0xffff9753487091c0
[ 0.800340] amdgpu 0000:24:00.0: fence driver on ring 4 use gpu addr 0x0000000000400240, cpu addr 0xffff975348709240
[ 0.800499] amdgpu 0000:24:00.0: fence driver on ring 5 use gpu addr 0x00000000004002c0, cpu addr 0xffff9753487092c0
[ 0.800658] amdgpu 0000:24:00.0: fence driver on ring 6 use gpu addr 0x0000000000400340, cpu addr 0xffff975348709340
[ 0.800814] amdgpu 0000:24:00.0: fence driver on ring 7 use gpu addr 0x00000000004003c0, cpu addr 0xffff9753487093c0
[ 0.800970] amdgpu 0000:24:00.0: fence driver on ring 8 use gpu addr 0x0000000000400440, cpu addr 0xffff975348709440
[ 0.801144] amdgpu 0000:24:00.0: fence driver on ring 9 use gpu addr 0x00000000004004e0, cpu addr 0xffff9753487094e0
[ 0.801691] amdgpu 0000:24:00.0: fence driver on ring 10 use gpu addr 0x0000000000400560, cpu addr 0xffff975348709560
[ 0.801863] amdgpu 0000:24:00.0: fence driver on ring 11 use gpu addr 0x00000000004005e0, cpu addr 0xffff9753487095e0
[ 0.802382] amdgpu 0000:24:00.0: fence driver on ring 12 use gpu addr 0x000000f4002ad420, cpu addr 0xffff975348e5a420
[ 0.802672] amdgpu 0000:24:00.0: fence driver on ring 13 use gpu addr 0x00000000004006e0, cpu addr 0xffff9753487096e0
[ 0.802829] amdgpu 0000:24:00.0: fence driver on ring 14 use gpu addr 0x0000000000400760, cpu addr 0xffff975348709760
[ 1.603540] [drm:gfx_v8_0_ring_test_ring] *ERROR* amdgpu: ring 0 test failed (scratch(0xC040)=0xCAFEDEAD)
[ 1.603692] [drm:amdgpu_device_init] *ERROR* hw_init of IP block <gfx_v8_0> failed -22
[ 1.603838] amdgpu 0000:24:00.0: amdgpu_init failed

[ 2.290390] [drm] amdgpu: ttm finalized
[ 2.290488] amdgpu 0000:24:00.0: Fatal error during GPU init
[ 2.290590] [drm] amdgpu: finishing device.
[ 2.291153] amdgpu: probe of 0000:24:00.0 failed with error -22

I recompiled as recommended and had the same result


I've no idea what that is about. Hopefully someone with more experience with amdgpu will chime in to help.

===
DWW


EDIT:

Did you see this? https://forums.gentoo.org/viewtopic-p-8031464.html

vladimir1986
n00b

Joined: 09 Dec 2010
Posts: 37

Posted: Sat Jan 06, 2018 8:30 am

Hmm, I already tried to do the same and avoid loading the firmware, but I still just get a black screen. I am going to try a couple of new things today and see if that helps: I noticed there is a 4.14.12 kernel, and I might also remove the card and check the connectors (the case is very tight on space and maybe the card isn't properly connected). I hope some of that helps.

Marlo
Veteran

Joined: 26 Jul 2003
Posts: 1303

Posted: Sat Jan 06, 2018 12:58 pm

Try this and pay attention to the USE flags

Code:
tux ~ # egrep amdgpu  /etc/portage/make.conf
       VIDEO_CARDS="amdgpu radeonsi"

Code:

tux ~ # emerge -1  x11-libs/libdrm  media-libs/mesa x11-apps/mesa-progs x11-base/xorg-server x11-base/xorg-drivers $(qlist -IC x11-drivers/) -pv

These are the packages that would be merged, in order:

Calculating dependencies... done!
[ebuild   R    ] x11-libs/libdrm-2.4.88::gentoo  USE="libkms -static-libs -valgrind" ABI_X86="(64) -32 (-x32)" VIDEO_CARDS="amdgpu radeon (-exynos) (-freedreno) -intel -nouveau (-omap) (-tegra) (-vc4) (-vivante) -vmware" 0 KiB

[ebuild   R   ~] media-libs/mesa-17.3.1::gentoo  USE="classic dri3 egl gallium gbm gles1 gles2 llvm nptl opencl openmax vaapi vdpau vulkan wayland xa xvmc -bindist -d3d9 -debug -osmesa -pax_kernel -pic (-selinux) -unwind -valgrind" ABI_X86="(64) -32 (-x32)" VIDEO_CARDS="radeonsi (-freedreno) -i915 -i965 (-imx) -intel -nouveau -r100 -r200 -r300 -r600 -radeon (-vc4) -virgl (-vivante) -vmware" 0 KiB

[ebuild   R    ] x11-apps/mesa-progs-8.3.0::gentoo  USE="egl gles2" 0 KiB

[ebuild   R    ] x11-base/xorg-server-1.19.5:0/1.19.5::gentoo  USE="glamor kdrive suid udev wayland xorg xvfb -debug -dmx -doc -ipv6 -libressl -minimal (-selinux) -static-libs (-systemd) -tslib -unwind -xcsecurity -xephyr -xnest" 0 KiB

[ebuild   R    ] x11-drivers/xf86-input-libinput-0.25.1::gentoo  0 KiB
[ebuild   R    ] x11-drivers/xf86-video-amdgpu-1.3.0::gentoo  0 KiB
[ebuild   R    ] x11-drivers/xf86-video-ati-7.9.0::gentoo  USE="glamor udev" 0 KiB

[ebuild   R    ] x11-base/xorg-drivers-1.19::gentoo  INPUT_DEVICES="libinput (-acecad) (-aiptek) -elographics -evdev (-fpit) (-hyperpen) -joystick -keyboard -mouse (-mutouch) -penmount -synaptics -tslib -vmmouse -void -wacom" VIDEO_CARDS="amdgpu radeonsi (-apm) (-ark) -ast (-chips) (-cirrus) -dummy -fbdev (-freedreno) (-geode) -glint (-i128) (-i740) -i915 -i965 -intel (-mach64) -mga (-neomagic) (-newport) -nouveau -nv -nvidia (-omap) -qxl -r128 -radeon (-rendition) (-s3) (-s3virge) (-savage) -siliconmotion (-sis) (-sisusb) (-sunbw2) (-suncg14) (-suncg3) (-suncg6) (-sunffb) (-sunleo) (-suntcx) (-tdfx) (-tegra) (-tga) (-trident) (-tseng) (-vc4) -vesa -via -virtualbox -vmware (-voodoo)" 0 KiB

Total: 8 packages (8 reinstalls), Size of downloads: 0 KiB


And

Code:
tux ~ # cat /etc/X11/xorg.conf.d/10-device.conf
Section "Device"
   Identifier  "AMD Radeon RX460 4G (Island POLARIS11)" ### Set your Identifier
   Driver      "amdgpu"
   BusID       "PCI:1:0:0"
   Option      "DRI"         "3"
   Option      "AccelMethod" "glamor"
   Option      "TearFree"    "on"
EndSection


If that does not work, report back here. Then let's take another look at the kernel.
Good luck!
Ma

EDIT:
llvm is also one of them:
Code:
tux ~ # emerge -1 llvm -pv

These are the packages that would be merged, in order:

Calculating dependencies... done!
[ebuild   R    ] sys-devel/llvm-4.0.1-r1:4::gentoo  USE="gold libffi ncurses -debug -doc -libedit {-test}" ABI_X86="(64) -32 (-x32)" LLVM_TARGETS="AMDGPU BPF NVPTX (X86) -AArch64 -ARM -Hexagon -Lanai -MSP430 -Mips -PowerPC -RISCV -Sparc -SystemZ -XCore" 0 KiB

Code:

tux ~ # euse -i llvm
global use flags (searching: llvm)
************************************************************
no matching entries found

local use flags (searching: llvm)
************************************************************
...
[-      ] llvm
    media-libs/mesa: Enable LLVM backend for Gallium3D.
        [+ B] 17.1.10 [gentoo]
        [+ B] 17.2.6 [gentoo]
        [+ B] 17.2.7 [gentoo]
        [+ B] 17.3.1 [gentoo]
        [+ B] 9999 [gentoo]


_________________
Thank you for your attention, interest and support.


Last edited by Marlo on Sat Jan 06, 2018 1:19 pm; edited 1 time in total

Ant P.
Watchman

Joined: 18 Apr 2009
Posts: 5074

Posted: Sat Jan 06, 2018 1:00 pm

@ above: This is failing in the kernel before X11 even runs. Try and pay attention to what you're replying to next time.

Marlo
Veteran

Joined: 26 Jul 2003
Posts: 1303

Posted: Sat Jan 06, 2018 1:59 pm

If it's just a kernel problem, as Ant P. says, then this is the relevant part of the .config:
Code:
CONFIG_MTRR=y
# CONFIG_TRIM_UNUSED_KSYMS is not set
CONFIG_MTRR_SANITIZER=y
CONFIG_MTRR_SANITIZER_ENABLE_DEFAULT=1
CONFIG_MTRR_SANITIZER_SPARE_REG_NR_DEFAULT=2
CONFIG_BINFMT_ELF=y
CONFIG_COMPAT_BINFMT_ELF=y
CONFIG_BINFMT_SCRIPT=y
CONFIG_BINFMT_MISC=y
CONFIG_FW_LOADER=y
CONFIG_FIRMWARE_IN_KERNEL=y
CONFIG_EXTRA_FIRMWARE="amdgpu/polaris10_ce.bin amdgpu/polaris10_mc.bin amdgpu/polaris10_me.bin amdgpu/polaris10_mec2.bin amdgpu/polaris10_mec.bin amdgpu/polaris10_pfp.bin amdgpu/polaris10_rlc.bin amdgpu/polaris10_sdma1.bin amdgpu/polaris10_sdma.bin amdgpu/polaris10_smc.bin amdgpu/polaris10_smc_sk.bin amdgpu/polaris10_uvd.bin amdgpu/polaris10_vce.bin"
CONFIG_EXTRA_FIRMWARE_DIR="/lib/firmware"
CONFIG_AGP=y
CONFIG_VGA_SWITCHEROO=y
CONFIG_DRM=y
CONFIG_DRM_FBDEV_EMULATION=y
CONFIG_DRM_AMDGPU=m
CONFIG_DRM_AMDGPU_USERPTR=y
CONFIG_FB=y
CONFIG_FB_MODE_HELPERS=y
CONFIG_FB_VESA=y
# CONFIG_DRM_RADEON is not set
# CONFIG_FB_RADEON is not set
# CONFIG_DEBUG_KERNEL is not set


Chrishas
n00b

Joined: 19 Jul 2015
Posts: 12

Posted: Sat Jan 06, 2018 3:49 pm

I had a similar error with my Vega 64 and the latest 4.15-rc kernel, which went away when I disabled AMD Secure Memory Encryption (SME) support under Processor Type and Features, although it could be completely unrelated to your issue.
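
To see whether SME is even built into a given kernel, the relevant symbols can be pulled out of the .config. This is only a sketch: the function name `sme_config_status` is invented here, and the path you pass in (for example /usr/src/linux/.config) is an assumption about where your config lives. The symbol names are the ones behind the "AMD Secure Memory Encryption (SME) support" menu entry.

```shell
# Hypothetical helper: print the SME-related symbols from a kernel .config.
# $1 is the path to the config file (an assumption; adjust to your tree).
sme_config_status() {
    # Matches both "CONFIG_...=y" and "# CONFIG_... is not set" forms.
    grep -E '^(# )?CONFIG_AMD_MEM_ENCRYPT(_ACTIVE_BY_DEFAULT)?[= ]' "$1"
}
```

If the kernel does have it enabled, booting once with `mem_encrypt=off` on the kernel command line should be a quicker test than rebuilding.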

vladimir1986
n00b

Joined: 09 Dec 2010
Posts: 37

Posted: Sat Jan 06, 2018 4:09 pm

Hey, thank you for the replies!

I tried the new .config, in case I had misspelled something in the firmware list, and it still doesn't work... (Yes, I did set the video card as in the make.conf, but I need some video output before tackling X.)

I did get some new error messages in dmesg though, or maybe I just didn't see them before. It is amdgpu that is failing.

Code:
[    5.370013] [drm:gfx_v8_0_ring_test_ring [amdgpu]] *ERROR* amdgpu: ring 0 test failed (scratch(0xC040)=0xCAFEDEAD)
[    5.370031] [drm:amdgpu_device_init [amdgpu]] *ERROR* hw_init of IP block <gfx_v8_0> failed -22
[    5.370032] amdgpu 0000:24:00.0: amdgpu_init failed
[    6.097172] [TTM] Finalizing pool allocator
[    6.097175] [TTM] Finalizing DMA pool allocator
[    6.097194] [TTM] Zone  kernel: Used memory at exit: 12 kiB
[    6.097196] [TTM] Zone   dma32: Used memory at exit: 12 kiB
[    6.097197] [drm] amdgpu: ttm finalized
[    6.097201] amdgpu 0000:24:00.0: Fatal error during GPU init
[    6.097202] [drm] amdgpu: finishing device.


[Moderator edit: added [code] tags to preserve output layout. -Hu]

Chrishas
n00b

Joined: 19 Jul 2015
Posts: 12

Posted: Sat Jan 06, 2018 4:43 pm

Can you provide your kernel command line?

vladimir1986
n00b

Joined: 09 Dec 2010
Posts: 37

Posted: Sat Jan 06, 2018 6:52 pm

Chrishas wrote:
Can you provide your kernel command line?


To be honest, I still have the default from the grub-mkconfig command, so nothing extra is passed:

linux /vmlinuz-4.14.12-gentoo root=/dev/sda2 ro

vladimir1986
n00b

Joined: 09 Dec 2010
Posts: 37

Posted: Wed Jan 10, 2018 2:49 pm

I must add that I tried AMDGPU both as a module and built into the kernel, with no change. It seems this is quite a difficult problem...

I will hold on to the hope that it is some sort of driver problem and, knowing that the 4.15 kernel will include some improvements for AMD GPUs, I'll just wait until it is released. Otherwise I'll try the closed drivers, though I'd rather avoid them.


The only other solution is going back to my previous Nvidia card, and that is the very last thing I want.

I will keep this updated, in case I can mark it as solved.

chithanh
Developer

Joined: 05 Aug 2006
Posts: 2152
Location: Berlin, Germany

Posted: Wed Jan 10, 2018 4:25 pm

The problem is that the amdgpu driver cannot initialize the hardware correctly. As you have already ruled out firmware problems, this is most likely a kernel bug. Some workarounds, such as changing the IOMMU configuration, might be possible, but in general you have to report it to the developers and then wait for a bugfix.

In the meantime you can use unaccelerated graphics if you enable
Code:
CONFIG_X86_SYSFB=y
CONFIG_FB_SIMPLE=y
CONFIG_FRAMEBUFFER_CONSOLE=y

and prevent amdgpu kernel module from loading.
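
One common way to keep a module from being loaded automatically is a blacklist entry under /etc/modprobe.d. A rough sketch follows; the function name `blacklist_module` and the file name it writes are made up for illustration, and the target directory is passed in so it can be tried somewhere harmless first:

```shell
# Hypothetical helper: write a modprobe blacklist entry for one module.
# $1 = module name, $2 = target directory (normally /etc/modprobe.d).
blacklist_module() {
    printf 'blacklist %s\n' "$1" > "$2/blacklist-$1.conf"
}
```

As root, `blacklist_module amdgpu /etc/modprobe.d` would create the entry. This only has an effect when amdgpu is built as a module (CONFIG_DRM_AMDGPU=m); a driver built into the kernel cannot be blacklisted this way. Passing `modprobe.blacklist=amdgpu` on the kernel command line is an alternative for a one-off test.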

NeddySeagoon
Administrator

Joined: 05 Jul 2003
Posts: 40979
Location: 56N 3W

Posted: Wed Jan 10, 2018 4:46 pm

vladimir1986,

It's worth trying the new amdgpu driver in the 4.15.0 kernel.
That's still an -rc version.
_________________
Regards,

NeddySeagoon

Computer users fall into two groups:-
those that do backups
those that have never had a hard drive fail.

vladimir1986
n00b

Joined: 09 Dec 2010
Posts: 37

Posted: Thu Jan 11, 2018 9:47 am

NeddySeagoon wrote:
vladimir1986,

It's worth trying the new amdgpu driver in the 4.15.0 kernel.
That's still an -rc version.


Hey! I remember you answered my first Gentoo question many years ago in this same forum! :D

Anyway, I tried 4.15-rc7, and... it didn't work.

However, the error messages are different now, and they all point to the same problem in dmesg. It seems it can't load some firmware, specifically polaris10_mc.bin (maybe it is a bug?). It doesn't complain about the kernel module anymore. I tried amdgpu and the firmware both built into the kernel and as modules, and I tried both specifying the firmware and leaving it blank, to no avail.

-edit- I went downstairs and the monitor was displaying text, not just a blank screen, so it definitely does something! There isn't any video driver loaded, but at least I got plain VGA. This is an improvement.

Here are the DRM messages:

$ dmesg | grep -i drm
[ 0.608846] [drm] amdgpu kernel modesetting enabled.
[ 0.609088] [drm] initializing kernel modesetting (POLARIS10 0x1002:0x67DF 0x1DA2:0xE366 0xEF).
[ 0.609261] [drm] register mmio base: 0xFE900000
[ 0.609364] [drm] register mmio size: 262144
[ 0.609473] [drm] probing gen 2 caps for device 1022:1453 = 733903/e
[ 0.609579] [drm] probing mlw for device 1022:1453 = 733903
[ 0.609690] [drm] UVD is enabled in VM mode
[ 0.609792] [drm] UVD ENC is enabled in VM mode
[ 0.609895] [drm] VCE enabled in VM mode
[ 0.610467] [drm] GPU post is not needed
[ 0.610661] [drm] vm size is 64 GB, block size is 13-bit, fragment size is 9-bit
[ 0.611452] [drm:gmc_v8_0_sw_init] *ERROR* Failed to load mc firmware!
[ 0.611561] [drm:amdgpu_device_init] *ERROR* sw_init of IP block <gmc_v8_0> failed -2
[ 0.611935] [drm] amdgpu: finishing device.


and the AMDGPU messages:

$ dmesg | grep -i amdgpu
[ 0.608846] [drm] amdgpu kernel modesetting enabled.
[ 0.610179] amdgpu 0000:24:00.0: Invalid PCI ROM header signature: expecting 0xaa55, got 0xffff
[ 0.610826] amdgpu 0000:24:00.0: SME is active, device will require DMA bounce buffers
[ 0.610991] amdgpu 0000:24:00.0: SME is active, device will require DMA bounce buffers
[ 0.611174] amdgpu 0000:24:00.0: Direct firmware load for amdgpu/polaris10_mc.bin failed with error -2
[ 0.611342] mc: Failed to load firmware "amdgpu/polaris10_mc.bin"
[ 0.611561] [drm:amdgpu_device_init] *ERROR* sw_init of IP block <gmc_v8_0> failed -2
[ 0.611726] amdgpu 0000:24:00.0: amdgpu_init failed
[ 0.611830] amdgpu 0000:24:00.0: Fatal error during GPU init
[ 0.611935] [drm] amdgpu: finishing device.
[ 0.612301] amdgpu: probe of 0000:24:00.0 failed with error -2

NeddySeagoon
Administrator

Joined: 05 Jul 2003
Posts: 40979
Location: 56N 3W

Posted: Thu Jan 11, 2018 12:05 pm

vladimir1986,

The 4.15 kernel requires more firmware. I found that out the hard way. :)
It's in linux-firmware if you get a new enough one.

Did you find the extra kernel option?
The old driver is still there too. I think the new one builds on top of it.

Check your firmware in the kernel. From linux-firmware, I have
Code:
ls -l /lib/firmware/amdgpu/ | grep polaris10
-rw-r--r-- 1 root root   8852 Jan  6 21:22 polaris10_ce_2.bin
-rw-r--r-- 1 root root   8832 Jan  6 21:22 polaris10_ce.bin
-rw-r--r-- 1 root root 130228 Jan  6 21:22 polaris10_k_smc.bin
-rw-r--r-- 1 root root  32708 Jan  6 21:22 polaris10_mc.bin
-rw-r--r-- 1 root root  17044 Jan  6 21:22 polaris10_me_2.bin
-rw-r--r-- 1 root root  17024 Jan  6 21:22 polaris10_me.bin
-rw-r--r-- 1 root root 262824 Jan  6 21:22 polaris10_mec2_2.bin
-rw-r--r-- 1 root root 262824 Jan  6 21:22 polaris10_mec_2.bin
-rw-r--r-- 1 root root 262784 Jan  6 21:22 polaris10_mec2.bin
-rw-r--r-- 1 root root 262784 Jan  6 21:22 polaris10_mec.bin
-rw-r--r-- 1 root root  17044 Jan  6 21:22 polaris10_pfp_2.bin
-rw-r--r-- 1 root root  17024 Jan  6 21:22 polaris10_pfp.bin
-rw-r--r-- 1 root root  23984 Jan  6 21:22 polaris10_rlc.bin
-rw-r--r-- 1 root root  12692 Jan  6 21:22 polaris10_sdma1.bin
-rw-r--r-- 1 root root  12692 Jan  6 21:22 polaris10_sdma.bin
-rw-r--r-- 1 root root 130196 Jan  6 21:22 polaris10_smc.bin
-rw-r--r-- 1 root root 130196 Jan  6 21:22 polaris10_smc_sk.bin
-rw-r--r-- 1 root root 369696 Jan  6 21:22 polaris10_uvd.bin
-rw-r--r-- 1 root root 166816 Jan  6 21:22 polaris10_vce.bin
You will need every file.
It's a very long list to enter in the kernel config box, and then you will have your CPU microcode too.
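
Typing that list by hand is error-prone, so here is a rough sketch of generating the line from what is actually installed. The function name `extra_firmware_line` is made up, and it assumes the firmware lives under an amdgpu/ subdirectory of the directory you pass in, as in the listing above:

```shell
# Hypothetical helper: print a CONFIG_EXTRA_FIRMWARE line listing every
# firmware file for one chip.
# $1 = firmware root (the value of CONFIG_EXTRA_FIRMWARE_DIR)
# $2 = chip prefix such as polaris10
extra_firmware_line() {
    (
        cd "$1" || exit 1
        # The glob expands in sorted order; paths stay relative to $1.
        set -- amdgpu/"$2"_*.bin
        # If nothing matched, the pattern is left unexpanded: bail out.
        [ -e "$1" ] || exit 1
        # "$*" joins the file names with single spaces.
        printf 'CONFIG_EXTRA_FIRMWARE="%s"\n' "$*"
    )
}
```

`extra_firmware_line /lib/firmware polaris10` prints one line to paste over the existing CONFIG_EXTRA_FIRMWARE entry. Any other blobs you build in, such as CPU microcode, still have to be appended by hand.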

vladimir1986
n00b

Joined: 09 Dec 2010
Posts: 37

Posted: Thu Jan 11, 2018 1:02 pm

I have all those... I installed the latest linux-firmware from the repository. At the time I am writing this it is 20180103-r1.
Which extra kernel option? 8O

I am going to recompile after putting all that into the kernel conf file again, let's see if that works or gives something new to work with

vladimir1986
n00b

Joined: 09 Dec 2010
Posts: 37

Posted: Thu Jan 11, 2018 1:41 pm

OK, I had missed that post about disabling SME support. Sadly, that didn't do anything.

However, adding the firmware location to the kernel config produced a different error. I wish I knew what "error -2" means. Now it fails on a couple of firmware files, so maybe it is a firmware problem after all...
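
For what it's worth, the kernel reports failures as negated errno values, so the number can be looked up. A small sketch covering only the two codes seen in this thread; the function name `errno_name` is invented for the example:

```shell
# Hypothetical helper: name the two errno values that appear in this thread.
# The kernel logs them negated, so "-2" and "2" are treated alike.
errno_name() {
    case "${1#-}" in
        2)  echo "ENOENT (No such file or directory)" ;;
        22) echo "EINVAL (Invalid argument)" ;;
        *)  echo "unknown; see errno(3)" ;;
    esac
}
```

So "error -2" on a firmware load means the requested file was not found where the loader looked, and -22 means something was rejected as invalid.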

I do have the option of unaccelerated graphics as suggested, or of downgrading to an Nvidia card if I really needed to, but since my laptop is in working order I'd like to track down this problem and learn how to fix it.

$ dmesg | grep drm
[ 0.801110] [drm] amdgpu kernel modesetting enabled.
[ 0.801328] [drm] initializing kernel modesetting (POLARIS10 0x1002:0x67DF 0x1DA2:0xE366 0xEF).
[ 0.801498] [drm] register mmio base: 0xFE900000
[ 0.801599] [drm] register mmio size: 262144
[ 0.801706] [drm] probing gen 2 caps for device 1022:1453 = 733903/e
[ 0.801812] [drm] probing mlw for device 1022:1453 = 733903
[ 0.801921] [drm] UVD is enabled in VM mode
[ 0.802036] [drm] UVD ENC is enabled in VM mode
[ 0.802139] [drm] VCE enabled in VM mode
[ 0.802696] [drm] GPU post is not needed
[ 0.802890] [drm] vm size is 64 GB, block size is 13-bit, fragment size is 9-bit
[ 0.803407] [drm] Detected VRAM RAM=4096M, BAR=256M
[ 0.803509] [drm] RAM width 256bits GDDR5
[ 0.805584] [drm] amdgpu: 4096M of VRAM memory ready
[ 0.805687] [drm] amdgpu: 4096M of GTT memory ready.
[ 0.805797] [drm] GART: num cpu pages 65536, num gpu pages 65536
[ 0.806044] [drm] PCIE GART of 256M enabled (table at 0x000000F400040000).
[ 0.806245] [drm] Supports vblank timestamp caching Rev 2 (21.10.2013).
[ 0.806328] [drm] Driver supports precise vblank timestamp query.
[ 0.806424] [drm] amdgpu: irq initialized.
[ 0.806700] [drm] AMDGPU Display Connectors
[ 0.806780] [drm] Connector 0:
[ 0.806855] [drm] DP-1
[ 0.806929] [drm] HPD6
[ 0.807011] [drm] DDC: 0x4868 0x4868 0x4869 0x4869 0x486a 0x486a 0x486b 0x486b
[ 0.807127] [drm] Encoders:
[ 0.807202] [drm] DFP1: INTERNAL_UNIPHY2
[ 0.807280] [drm] Connector 1:
[ 0.807355] [drm] DP-2
[ 0.807429] [drm] HPD4
[ 0.807503] [drm] DDC: 0x4870 0x4870 0x4871 0x4871 0x4872 0x4872 0x4873 0x4873
[ 0.807619] [drm] Encoders:
[ 0.807694] [drm] DFP2: INTERNAL_UNIPHY2
[ 0.807772] [drm] Connector 2:
[ 0.807847] [drm] HDMI-A-1
[ 0.807921] [drm] HPD1
[ 0.807995] [drm] DDC: 0x486c 0x486c 0x486d 0x486d 0x486e 0x486e 0x486f 0x486f
[ 0.808126] [drm] Encoders:
[ 0.808210] [drm] DFP3: INTERNAL_UNIPHY1
[ 0.808288] [drm] Connector 3:
[ 0.808363] [drm] HDMI-A-2
[ 0.808437] [drm] HPD5
[ 0.808512] [drm] DDC: 0x4874 0x4874 0x4875 0x4875 0x4876 0x4876 0x4877 0x4877
[ 0.808628] [drm] Encoders:
[ 0.808702] [drm] DFP4: INTERNAL_UNIPHY1
[ 0.808780] [drm] Connector 4:
[ 0.808855] [drm] DVI-D-1
[ 0.808929] [drm] HPD3
[ 0.809014] [drm] DDC: 0x487c 0x487c 0x487d 0x487d 0x487e 0x487e 0x487f 0x487f
[ 0.809132] [drm] Encoders:
[ 0.809208] [drm] DFP5: INTERNAL_UNIPHY
[ 0.812286] [drm] Found UVD firmware Version: 1.79 Family ID: 16
[ 0.813098] [drm] Found VCE firmware Version: 52.4 Binary ID: 3
[ 0.813800] [drm:amdgpu_cgs_get_firmware_info] *ERROR* Failed to request firmware
[ 0.895268] [drm:amdgpu_device_init] *ERROR* hw_init of IP block <amdgpu_powerplay> failed -22
[ 0.961735] WARNING: CPU: 2 PID: 1 at drivers/gpu/drm/drm_mm.c:897 drm_mm_takedown+0x1b/0x20
[ 0.962143] RIP: 0010:drm_mm_takedown+0x1b/0x20
[ 0.962297] drm_dev_register+0x15e/0x1f0
[ 0.965906] WARNING: CPU: 2 PID: 1 at drivers/gpu/drm/drm_mm.c:897 drm_mm_takedown+0x1b/0x20
[ 0.966350] RIP: 0010:drm_mm_takedown+0x1b/0x20
[ 0.966738] drm_dev_register+0x15e/0x1f0
[ 0.971383] [drm] amdgpu: ttm finalized
[ 0.971543] [drm] amdgpu: finishing device.


$ dmesg | grep amdgpu
[ 0.801110] [drm] amdgpu kernel modesetting enabled.
[ 0.802412] amdgpu 0000:24:00.0: Invalid PCI ROM header signature: expecting 0xaa55, got 0xffff
[ 0.803079] amdgpu 0000:24:00.0: VRAM: 4096M 0x000000F400000000 - 0x000000F4FFFFFFFF (4096M used)
[ 0.803243] amdgpu 0000:24:00.0: GTT: 256M 0x0000000000000000 - 0x000000000FFFFFFF
[ 0.805584] [drm] amdgpu: 4096M of VRAM memory ready
[ 0.805687] [drm] amdgpu: 4096M of GTT memory ready.
[ 0.806165] amdgpu 0000:24:00.0: amdgpu: using MSI.
[ 0.806424] [drm] amdgpu: irq initialized.
[ 0.806514] amdgpu: [powerplay] amdgpu: powerplay sw initialized
[ 0.809296] amdgpu 0000:24:00.0: Direct firmware load for amdgpu/polaris10_pfp_2.bin failed with error -2
[ 0.809421] amdgpu 0000:24:00.0: Direct firmware load for amdgpu/polaris10_me_2.bin failed with error -2
[ 0.809546] amdgpu 0000:24:00.0: Direct firmware load for amdgpu/polaris10_ce_2.bin failed with error -2
[ 0.809673] amdgpu 0000:24:00.0: Direct firmware load for amdgpu/polaris10_mec_2.bin failed with error -2
[ 0.809798] amdgpu 0000:24:00.0: Direct firmware load for amdgpu/polaris10_mec2_2.bin failed with error -2
[ 0.810141] amdgpu 0000:24:00.0: fence driver on ring 0 use gpu addr 0x0000000000400040, cpu addr 0x0000000009df9e11
[ 0.810300] amdgpu 0000:24:00.0: fence driver on ring 1 use gpu addr 0x00000000004000c0, cpu addr 0x00000000594f905f
[ 0.810452] amdgpu 0000:24:00.0: fence driver on ring 2 use gpu addr 0x0000000000400140, cpu addr 0x000000000bceb655
[ 0.810604] amdgpu 0000:24:00.0: fence driver on ring 3 use gpu addr 0x00000000004001c0, cpu addr 0x000000002aab8f24
[ 0.810754] amdgpu 0000:24:00.0: fence driver on ring 4 use gpu addr 0x0000000000400240, cpu addr 0x00000000c2a29eba
[ 0.810905] amdgpu 0000:24:00.0: fence driver on ring 5 use gpu addr 0x00000000004002c0, cpu addr 0x00000000bc2d43a7
[ 0.811075] amdgpu 0000:24:00.0: fence driver on ring 6 use gpu addr 0x0000000000400340, cpu addr 0x000000001b605875
[ 0.811227] amdgpu 0000:24:00.0: fence driver on ring 7 use gpu addr 0x00000000004003c0, cpu addr 0x00000000370abf85
[ 0.811378] amdgpu 0000:24:00.0: fence driver on ring 8 use gpu addr 0x0000000000400440, cpu addr 0x00000000e2350542
[ 0.811516] amdgpu 0000:24:00.0: fence driver on ring 9 use gpu addr 0x00000000004004e0, cpu addr 0x00000000cde88522
[ 0.811976] amdgpu 0000:24:00.0: fence driver on ring 10 use gpu addr 0x0000000000400560, cpu addr 0x00000000d46a316e
[ 0.812155] amdgpu 0000:24:00.0: fence driver on ring 11 use gpu addr 0x00000000004005e0, cpu addr 0x000000002cd3ccde
[ 0.812651] amdgpu 0000:24:00.0: fence driver on ring 12 use gpu addr 0x000000f4001e6420, cpu addr 0x000000004041f004
[ 0.812799] amdgpu 0000:24:00.0: fence driver on ring 13 use gpu addr 0x00000000004006e0, cpu addr 0x000000005cd50b01
[ 0.812945] amdgpu 0000:24:00.0: fence driver on ring 14 use gpu addr 0x0000000000400760, cpu addr 0x00000000740451a5
[ 0.813248] amdgpu 0000:24:00.0: fence driver on ring 15 use gpu addr 0x00000000004007e0, cpu addr 0x00000000b145e47e
[ 0.813395] amdgpu 0000:24:00.0: fence driver on ring 16 use gpu addr 0x0000000000400860, cpu addr 0x000000004238527b
[ 0.813676] amdgpu 0000:24:00.0: Direct firmware load for amdgpu/polaris10_k_smc.bin failed with error -2
[ 0.813800] [drm:amdgpu_cgs_get_firmware_info] *ERROR* Failed to request firmware
[ 0.851225] amdgpu: [powerplay] SMU Firmware start failed!
[ 0.895015] amdgpu: [powerplay] SMU Firmware start failed!
[ 0.895099] amdgpu: [powerplay] Failed to load SMU ucode.
[ 0.895180] amdgpu: [powerplay] smc start failed
[ 0.895268] [drm:amdgpu_device_init] *ERROR* hw_init of IP block <amdgpu_powerplay> failed -22
[ 0.895386] amdgpu 0000:24:00.0: amdgpu_init failed
[ 0.962297] amdgpu_vram_mgr_fini+0x28/0x50
[ 0.962297] amdgpu_ttm_fini+0xdb/0x210
[ 0.962297] amdgpu_bo_fini+0x9/0x30
[ 0.962297] amdgpu_fini+0x1ee/0x300
[ 0.962297] amdgpu_device_init+0xd88/0x1530
[ 0.962297] amdgpu_driver_load_kms+0x51/0x210
[ 0.962297] amdgpu_pci_probe+0x11d/0x150
[ 0.966738] amdgpu_gtt_mgr_fini+0x1c/0x50
[ 0.966738] amdgpu_ttm_fini+0xe8/0x210
[ 0.966738] amdgpu_bo_fini+0x9/0x30
[ 0.966738] amdgpu_fini+0x1ee/0x300
[ 0.966738] amdgpu_device_init+0xd88/0x1530
[ 0.966738] amdgpu_driver_load_kms+0x51/0x210
[ 0.966738] amdgpu_pci_probe+0x11d/0x150
[ 0.971383] [drm] amdgpu: ttm finalized
[ 0.971461] amdgpu 0000:24:00.0: Fatal error during GPU init
[ 0.971543] [drm] amdgpu: finishing device.
[ 0.971849] amdgpu: probe of 0000:24:00.0 failed with error -22

Thank you very much for all the help so far
vladimir1986
n00b
n00b


Joined: 09 Dec 2010
Posts: 37

PostPosted: Thu Jan 11, 2018 2:39 pm    Post subject: Reply with quote

It is working at last!

In the end, I compiled amdgpu as a module rather than into the kernel, and left all the firmware listed in the config file:

CONFIG_EXTRA_FIRMWARE="amdgpu/polaris10_ce.bin amdgpu/polaris10_mc.bin amdgpu/polaris10_me.bin amdgpu/polaris10_mec2.bin amdgpu/polaris10_mec.bin amdgpu/polaris10_pfp.bin amdgpu/polaris10_rlc.bin amdgpu/polaris10_sdma1.bin amdgpu/polaris10_sdma.bin amdgpu/polaris10_smc.bin amdgpu/polaris10_smc_sk.bin amdgpu/polaris10_uvd.bin amdgpu/polaris10_vce.bin"
CONFIG_EXTRA_FIRMWARE_DIR="/lib/firmware"


When stable 4.15 comes out I'll try removing that line, as I am not sure you need to specify each firmware blob manually, but for now I need a bit of a break :lol:

On a sad note, Doom 4 no longer works with an AMD card, but Doom 1 & 2 do, and now I have full framebuffer support, which makes up for it!

Thank you all for the help!
NeddySeagoon
Administrator
Administrator


Joined: 05 Jul 2003
Posts: 40979
Location: 56N 3W

PostPosted: Thu Jan 11, 2018 3:39 pm    Post subject: Reply with quote

vladimir1986,

The rules for kernel modules that need to load firmware are the same for all modules.

The firmware must be available when the module is initialised.
That means ..
if the module is <*> the firmware must be listed in CONFIG_EXTRA_FIRMWARE
if the module is <M> the firmware must be in /lib/firmware
The other two combinations don't work.

When built-in modules are initialised, the root filesystem is not yet mounted, so /lib/firmware cannot be read.
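
A minimal sketch of that rule as a shell check, in case it helps. It assumes the driver symbol is CONFIG_DRM_AMDGPU and that you pass in the path to your kernel .config; the function name firmware_location is made up for illustration.

```shell
#!/bin/sh
# Hypothetical helper: report where firmware must live for a driver symbol.
# $1 = path to a kernel .config, $2 = config symbol (e.g. CONFIG_DRM_AMDGPU)
firmware_location() {
    config_file=$1
    symbol=$2
    # Print whatever follows "SYMBOL=", e.g. "y" or "m".
    # A "# SYMBOL is not set" line does not match, so value stays empty.
    value=$(sed -n "s/^${symbol}=//p" "$config_file")
    case "$value" in
        y) echo "built-in: list the firmware in CONFIG_EXTRA_FIRMWARE" ;;
        m) echo "module: firmware is read from /lib/firmware" ;;
        *) echo "not enabled" ;;
    esac
}
```

For example, firmware_location /usr/src/linux/.config CONFIG_DRM_AMDGPU (the path is an assumption, use wherever your kernel sources live).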
_________________
Regards,

NeddySeagoon

Computer users fall into two groups:-
those that do backups
those that have never had a hard drive fail.
firephoto
Veteran
Veteran


Joined: 29 Oct 2003
Posts: 1587
Location: +48° 5' 23.40", -119° 48' 30.00"

PostPosted: Fri Jan 12, 2018 4:38 pm    Post subject: Reply with quote

vladimir1986 wrote:

CONFIG_EXTRA_FIRMWARE="amdgpu/polaris10_ce.bin amdgpu/polaris10_mc.bin amdgpu/polaris10_me.bin amdgpu/polaris10_mec2.bin amdgpu/polaris10_mec.bin amdgpu/polaris10_pfp.bin amdgpu/polaris10_rlc.bin amdgpu/polaris10_sdma1.bin amdgpu/polaris10_sdma.bin amdgpu/polaris10_smc.bin amdgpu/polaris10_smc_sk.bin amdgpu/polaris10_uvd.bin amdgpu/polaris10_vce.bin"
CONFIG_EXTRA_FIRMWARE_DIR="/lib/firmware"


Code:

CONFIG_EXTRA_FIRMWARE="amdgpu/polaris10_ce.bin amdgpu/polaris10_mc.bin amdgpu/polaris10_me.bin amdgpu/polaris10_mec2.bin amdgpu/polaris10_mec.bin amdgpu/polaris10_pfp.bin amdgpu/polaris10_rlc.bin amdgpu/polaris10_sdma1.bin amdgpu/polaris10_sdma.bin amdgpu/polaris10_smc.bin amdgpu/polaris10_smc_sk.bin amdgpu/polaris10_uvd.bin amdgpu/polaris10_vce.bin amdgpu/polaris10_k_smc.bin"


You're missing amdgpu/polaris10_k_smc.bin in your CONFIG_EXTRA_FIRMWARE. I had the same problem initially with my RX 580. Earlier in the thread your config shows that file, but I wonder whether that was a paste from the wiki or somewhere else rather than your actual config. The last error in your log before you fixed it by switching to a module shows it.

Quote:

[ 0.813676] amdgpu 0000:24:00.0: Direct firmware load for amdgpu/polaris10_k_smc.bin failed with error -2


So you could add that one extra firmware entry to your kernel config and go back to built-in if you want to.
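
A rough sketch of how that comparison could be automated: pull the file names out of the "Direct firmware load ... failed" lines and print those not present in the CONFIG_EXTRA_FIRMWARE string. The function name missing_from_extra_firmware is hypothetical, and the log line format is assumed to match the dmesg output posted in this thread.

```shell
#!/bin/sh
# Hypothetical helper: list firmware files the kernel log reports as failed
# to load and that are absent from a CONFIG_EXTRA_FIRMWARE value.
# stdin = dmesg output, $1 = the space-separated CONFIG_EXTRA_FIRMWARE string
missing_from_extra_firmware() {
    extra=$1
    # Extract <name> from "Direct firmware load for <name> failed ..." lines.
    sed -n 's/.*Direct firmware load for \([^ ]*\) failed.*/\1/p' |
    while read -r name; do
        case " $extra " in
            *" $name "*) ;;       # already listed, nothing to report
            *) echo "$name" ;;    # requested by the driver but not listed
        esac
    done
}
```

Usage would be something like dmesg | missing_from_extra_firmware "amdgpu/polaris10_ce.bin amdgpu/polaris10_mc.bin". Not every name it prints is necessarily fatal: in the log above, initialisation carries on after the polaris10_*_2.bin failures and only aborts after polaris10_k_smc.bin.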
_________________
#gentoo-kde on freenode
vladimir1986
n00b
n00b


Joined: 09 Dec 2010
Posts: 37

PostPosted: Sat Jan 13, 2018 10:04 am    Post subject: Reply with quote

After what NeddySeagoon wrote I wondered if I had misspelled something, and when you pointed out that I was missing one file, that confirmed it. I am pretty sure that was the problem, and I might try it out of curiosity. However, if I can choose, I always prefer to build everything as a module, so a module it will stay.
NeddySeagoon
Administrator
Administrator


Joined: 05 Jul 2003
Posts: 40979
Location: 56N 3W

PostPosted: Sat Jan 13, 2018 12:02 pm    Post subject: Reply with quote

vladimir1986,

I admit I cheated. I put some rubbish in CONFIG_EXTRA_FIRMWARE= using menuconfig, just as a placeholder.
Then I used $EDITOR on the kernel .config to change the rubbish to all the files given by
Code:
$ cd /lib/firmware
roy@NeddySeagoon_Static /lib/firmware $ ls amdgpu/polaris10*
amdgpu/polaris10_ce_2.bin    amdgpu/polaris10_pfp_2.bin
amdgpu/polaris10_ce.bin      amdgpu/polaris10_pfp.bin
amdgpu/polaris10_k_smc.bin   amdgpu/polaris10_rlc.bin
amdgpu/polaris10_mc.bin      amdgpu/polaris10_sdma1.bin
amdgpu/polaris10_me_2.bin    amdgpu/polaris10_sdma.bin
amdgpu/polaris10_me.bin      amdgpu/polaris10_smc.bin
amdgpu/polaris10_mec2_2.bin  amdgpu/polaris10_smc_sk.bin
amdgpu/polaris10_mec_2.bin   amdgpu/polaris10_uvd.bin
amdgpu/polaris10_mec2.bin    amdgpu/polaris10_vce.bin
amdgpu/polaris10_mec.bin

This makes sure that I list all the files I actually have, and copy/paste takes care of the typos too.

I know I always say never, ever use $EDITOR on the kernel .config. This is probably the exception that proves the rule.
Don't forget to add your CPU microcode too. Microcode updates for most CPUs no longer work as modules.
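
The copy/paste step above could also be scripted. This is only a sketch: the function name extra_firmware_line is hypothetical, and it assumes the firmware root you pass in is the same directory that CONFIG_EXTRA_FIRMWARE_DIR points at (/lib/firmware earlier in this thread).

```shell
#!/bin/sh
# Hypothetical helper: print a CONFIG_EXTRA_FIRMWARE line built from the
# firmware files that actually exist on disk.
# $1 = firmware root (e.g. /lib/firmware), $2 = glob relative to that root
extra_firmware_line() {
    root=$1
    pattern=$2
    (
        cd "$root" || exit 1
        # Leave $pattern unquoted so the shell expands the glob relative to
        # the firmware root; the matches become the positional parameters.
        set -- $pattern
        # If nothing matched, the glob is left as literal text: bail out.
        [ -e "$1" ] || { echo "no files match $pattern" >&2; exit 1; }
        # "$*" joins all matches with single spaces.
        printf 'CONFIG_EXTRA_FIRMWARE="%s"\n' "$*"
    )
}
```

Running extra_firmware_line /lib/firmware 'amdgpu/polaris10*' prints a line that can be pasted over the placeholder. Quote the glob on the command line so it is expanded inside the function rather than in your current directory.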