jagdpanther wrote:I currently don't think my new system issues are a hardware fault. If anyone disagrees with my reasons, stated below, please let me know. I have almost a week to return the parts if needed from the vendor.
Sorry, i do
Any software error will endup log somewhere, only hardware error could disallow a software logging an error (that's a too high error that prevent the system itself to even logging the error) ; that's why mce were made: because hardware can log itself the error internally and report it to a software.
It's a tiny raw logic so easy to get: if someone cut electricity, your hardware could have a "if i get short of electricty, i'll write and keep that i was short of electricity to later report that" ; but how could a sofware could ask anything to a cpu that is out of electricity to work (the work to do should be "write something in a log")?
your unability to have software logging any error is not helping toward thinking about a software only issue. Still because nothing is impossible, it might be a software issue, but i tend to stay within "acceptable" logic, and extremly hard to catch error by software seems far less possible than a brand new broken hardware (that's quiet common, hardware tend to be more and more reliable with time and tend to break earlier if any defect was made in the process), we have in french word for that "rodage" which is translate (bad?) into "running-in period"
2. Turning off the CSM co...
your UEFI not UEFI is also a broken test.
You seems to accept "oh if it work in UEFI than it's because it's a new machine that love UEFI".
Nah, if really a m/b could work only in UEFI, than the m/b would only have UEFI ; if the m/b allow non UEFI it's because this is suppose to work.
So your UEFI test may be of help to track the issue if you want, but still, you cannot assume UEFI should be stable and non UEFI broken and that is perfectly ok.
I do have an UEFI m/b myself, i cannot even tell you if UEFI booting is working, because i never use it, but i assume it is, anyway, my system have no problem running in non UEFI (and this shouldn't be a surprise, just normality)
4. On a seperate SATA drive (not the NVMe M2 that I have Gentoo on), today, I installed Win10 in UEFI mode, the Nvidia driver, Steam and played a little "Rise of the Tomb Raider" and saw no issues.
Many, but many (early) ryzen users could tell you about this, amd has even recommand to get a "fixed!" cpu only for linux users, while their Windows users (not seeing the problem) were gently leave with a broken cpu (the second market for ryzen will be a nightmare!)
So, as ryzen prove, not working in Windows and linux prove it's not working, but working in Windows doesn't prove it is working

Why, just because ryzen users in linux were using gcc (which is not kidding with cpu) and running Windows is a peace of cake for such cpu.
I'm not really impress by your Tombe Raider test too, that should push your videocard to its limit maybe, but certainly not a beast with 28 cores.
Your ubuntu case (#3) seems more interresting, however ubuntu is a binary distro, choices made in their kernel could be the difference between a gcc working fully and a gcc slowdown by a feature (like kernel scheduler or whatever) that may not expose the problem (while still the problem exists, just not exhibit in that environnment). I even think some ryzen users weren't affect even running gcc test in some linux distro (but i'm not certain).
While i agree a kernel may expose strange issue, when you start testing different kernels (not made by you so) and keep getting the issue, i'm incline to think the issue may not be the kernel.
This is what i would do if i was you (i'm not saying that's the best to do, but what I would do)
1/ considering the vendor policy: if they accept return without proof or anything (that's important, don't return something the vendor won't see himself, a "my cpu doesn't work" will get your ass kick if "your cpu work with us" (because they use Windows certainly).
2/ if vendor need proof, then argue about a prize for the service to not giving the proof, but having the vendor himself find the cause. So instead of proving yourself the material is faulty, tell the vendor the material is 100% sure faulty and you need him to point where the fault is, and you will pay for the service.
Vendors always like to have their time paid, and when they find the cause, they will be the first to offer an exchange of the faulty material to make their customer happy.
3/ if vendor trust you and doesn't ask proof, send back what is not QVL approve from your list, and ask an exchange or upgrade to QVL materials your m/b have in its list ; like i said earlier, non QVL is not a proof the material may not work with it, but QVL is a proof it should, removing any doubt about incompatibilty.
4/ because 8 DIMM is only support by 6 cores cpu on that m/b (the m/b support 4 DIMM for 4xcore cpu, and only 6+ core cpu allow 8 DIMM), this kind of oddity always make me feel better to avoid "corner case": this to say, if i were you, i would reach my 64Gb of ram with 4xDIMM instead of the 8x8 combo you have goes for, and i will do my best to pickup ones from the QVL of m/b, so looking hard to get QVL 4x16 DIMM).