uname -a
Linux pc 5.4.48-gentoo #1 SMP Mon Jun 29 21:49:54 WEST 2020 x86_64 Intel(R) Core(TM) i7-6850K CPU @ 3.60GHz GenuineIntel GNU/Linux
Kernel parameters:
GRUB_CMDLINE_LINUX_DEFAULT="iommu=on intel_iommu=on swiotlb=1024 nvidia-drm.modeset=1 kvm-intel.nested=1 kvm-intel.enable_shadow_vmcs=1 kvm-intel.enable_apicv=1 kvm-intel.ept=1 quiet"
I have the following error in PCIe bus since I installed kernel 5.4;
[ 118.509751] pcieport 0000:00:02.0: AER: Multiple Corrected error received: 0000:00:02.0
[ 118.509756] pcieport 0000:00:02.0: AER: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Receiver ID)
[ 118.509760] pcieport 0000:00:02.0: AER: device [8086:6f04] error status/mask=00000040/00002000
[ 118.509762] pcieport 0000:00:02.0: AER: [ 6] BadTLP
[ 224.241123] pcieport 0000:00:1c.3: Enabling MPC IRBNCE
[ 224.241125] pcieport 0000:00:1c.3: Intel PCH root port ACS workaround enabled
lspci
00:00.0 Host bridge: Intel Corporation Xeon E7 v4/Xeon E5 v4/Xeon E3 v4/Xeon D DMI2 (rev 01)
00:01.0 PCI bridge: Intel Corporation Xeon E7 v4/Xeon E5 v4/Xeon E3 v4/Xeon D PCI Express Root Port 1 (rev 01)
00:01.1 PCI bridge: Intel Corporation Xeon E7 v4/Xeon E5 v4/Xeon E3 v4/Xeon D PCI Express Root Port 1 (rev 01)
00:02.0 PCI bridge: Intel Corporation Xeon E7 v4/Xeon E5 v4/Xeon E3 v4/Xeon D PCI Express Root Port 2 (rev 01)
00:03.0 PCI bridge: Intel Corporation Xeon E7 v4/Xeon E5 v4/Xeon E3 v4/Xeon D PCI Express Root Port 3 (rev 01)
00:03.2 PCI bridge: Intel Corporation Xeon E7 v4/Xeon E5 v4/Xeon E3 v4/Xeon D PCI Express Root Port 3 (rev 01)
00:1c.0 PCI bridge: Intel Corporation C610/X99 series chipset PCI Express Root Port #1 (rev d5)
00:1c.3 PCI bridge: Intel Corporation C610/X99 series chipset PCI Express Root Port #4 (rev d5)
00:1c.4 PCI bridge: Intel Corporation C610/X99 series chipset PCI Express Root Port #5 (rev d5)
Also read in redhat bugzilla that setting pcie_aspm=off could get rid of this kind of errors:
https://bugzilla.redhat.com/show_bug.cgi?id=1616364
What would you suggests me to do?
Thanks
----------------
Seems that kernel 5.4 is more verbose than 4, since I never saw this error before.
I didn't notice any performance issue around IO.
My concern was if this issue could be hardware or software related. As I could see from redhat bugzilla, someone could disable this errors configuring pcie_aspm=off, so could be related to power management.
----------------
Could be related to my graphic card gtx 1060:
https://www.nvidia.com/en-us/geforce/fo ... us-errors/
Here they suggests pci=nommconf to stabilize the cpu.
----------------
After adding pcie_aspm=off to grub in kernel boot parameters, system boot little faster and don't see until now the error happening again.
[Moderator edit: Moved poster's 3 self-responses into main post, then deleted those self-responses to return the thread to the unanswered list.
OP: Generally, if post length permits, you should edit extra information into your first post instead of replying to it. Some users run a canned search for threads with zero replies as a way of finding posts to answer. When you replied to yourself, you removed your thread from that list, even though you were not replying to announce that the thread was solved. Thus, they would no longer see your thread on the list of unanswered threads.
-Hu]
