64 Bit Raspberry Pi 4B Benchmarks

roylongbottom · n00b Joined: 13 Feb 2017 Posts: 64 Location: Essex, UK

64 Bit Raspberry Pi 4B Benchmarks

Previously, I have run my 32 bit and 64 bit benchmarks on the appropriate range of Raspberry Pi computers, up to model 3B+. Details of the benchmarks, results and download links are available from ResearchGate in a

https://www.researchgate.net/publication/327467963_Raspberry_Pi_3B_32_bit_and_64_bit_Benchmarks_and_Stress_Tests

I have also run the 32 bit versions on the Raspberry Pi 4, with results in

https://www.researchgate.net/publication/333973011_Raspberry_Pi_4B_32_Bit_Benchmarks">Raspberry-Pi-4-Benchmarks.pdf

This report contains brief reminders of the benchmarks, with 64 bit results on the Raspberry Pi 4 using Gentoo Operating System. Existing benchmarks were used to provide comparisons with the old 3B+ model and the Pi 4B system using 32 bit Raspbian. The first part is for my original single core programs.

Whetstone Benchmark

This has a number of simple programming loops, with the overall MWIPS rating dependent on floating point calculations, lately those identified as COS and EXP. The last three can be over optimised (N/A), but the time does not affect the overall rating much.

For this simple code, at 64 bits, average Pi 4 performance gain, over the Pi 3B+, was 2.12 times, but only around 1.3 times for straightforward floating point calculations. Then, as should be expected, the Pi 4B 32 bit speed was not much slower.

Sakaki · Guru Joined: 21 May 2014 Posts: 409

roylongbottom,

very interesting analysis as always, thanks for all your continued hard work on this!

Will you be posting these results to your Raspberry Pi Benchmarking thread on the RPi forums in due course?
_________________
Regards,

sakaki

roylongbottom · n00b Joined: 13 Feb 2017 Posts: 64 Location: Essex, UK

NeddySeagoon · Posted: Wed Aug 21, 2019 4:13 pm Post subject:

roylongbottom,

Did you use the same binaries on the Pi3 and Pi4 or rebuild to code to take advantage of the out of order execution available on the Pi4?
Here, I'm being lazy and using Pi3 64 bit code everywhere.

Thank you for your Pi benchmark work.
_________________
Regards,

NeddySeagoon

Computer users fall into two groups:-
those that do backups
those that have never had a hard drive fail.

roylongbottom · n00b Joined: 13 Feb 2017 Posts: 64 Location: Essex, UK

NeddySeagoon · Posted: Wed Aug 21, 2019 7:28 pm Post subject:

roylongbottom,

On the Pi3, for 64 bit code, I use

Sakaki · Guru Joined: 21 May 2014 Posts: 409

Interesting read about this topic here:

https://community.arm.com/developer/tools-software/tools/b/tools-software-ides-blog/posts/compiler-flags-across-architectures-march-mtune-and-mcpu
_________________
Regards,

sakaki

roylongbottom · n00b Joined: 13 Feb 2017 Posts: 64 Location: Essex, UK

sakaki

I became distracted from reporting some benchmark results after building ATLAS Linear Algebra Subprograms overnight (13 hours), in order to run the High Performance Linpack Benchmark. All went well until the final stage compiling the HPL program, where mpicc could not be found. It was there on the 3B Gentoo, where I successfully installed and ran HPL on a Pi 3B+.

Is mpicc available for downloading for Pi 4 Gentoo?
_________________
Regards

Roy

Sakaki · Guru Joined: 21 May 2014 Posts: 409

roylongbottom · n00b Joined: 13 Feb 2017 Posts: 64 Location: Essex, UK

Sakaki

Thanks

Nearly there, mpicc is used but now error is mpif77: Command not found
_________________
Regards

Roy

roylongbottom · n00b Joined: 13 Feb 2017 Posts: 64 Location: Essex, UK

Memory Benchmarks

This batch of programs measure speed dependent on data from caches and RAM.

MemSpeed Benchmark

MemSpeed benchmark measures data reading speeds in MegaBytes per second, carrying out calculations on arrays of cache and RAM data, normally sized 2 x 4 KB to 2 x 4 MB. Calculations are as shown in the result headings. For the first two double precision tests, speed MFLOPS can be calculated by dividing MB/second by 8 and 16. For single precision divide by 4 and 8.

Results are provided below for the Gentoo 64 bit version on the Pi 3B+ and Pi 4B, and the Raspbian 32 bit variety on the Pi 4B, then a sample of relative performance, covering data from L1 cache, L2 cache and RAM.

Gains, greater than the 7% CPU MHz difference, were recorded all round by the Pi 4B over the Pi 3B+. The most impressive were on using L2 cache based data and the more intensive floating point calculations.

On the Pi 4B, speeds of 64 bit and 32 bit compilations were similar using RAM based data and executing some integer tests, but significantly faster from cache based floating point calculations.

Sakaki · Guru Joined: 21 May 2014 Posts: 409

roylongbottom · n00b Joined: 13 Feb 2017 Posts: 64 Location: Essex, UK

roylongbottom · n00b Joined: 13 Feb 2017 Posts: 64 Location: Essex, UK

Sakaki

My recompile worked, so I now have a working Gentoo Pi 4 HPL Benchmark, but the speed is disappointing, same as the 32 bit version with a maximum of just over 10 GFLOPS (with 4 GB RAM). It might need some compiling parameters changing for HPL (or ATLAS) and wonder if I could find anyone to advise how and where.

At least it is three times faster than using the Gentoo Pi 3B+ version.
_________________
Regards

Roy

roylongbottom · n00b Joined: 13 Feb 2017 Posts: 64 Location: Essex, UK

sakaki

For an up to date comparison, I have been running that HPL benchmark and my other MP tests on my Pi 3B+, using the new Gentoo. All ran without any problems, but there were two things to report.

The first was that TV display started at 1024 x 786. Settings did not provide an option anywhere near 1920 x 1080.

The second point was that WiFi connected, without any intervention, using the originally entered password. Back on the Pi 4, it still did not connect.
_________________
Regards

Roy

Sakaki · Guru Joined: 21 May 2014 Posts: 409

NeddySeagoon · Posted: Fri Aug 23, 2019 1:02 pm Post subject:

roylongbottom,

See Pi 4 Wifi.
_________________
Regards,

NeddySeagoon

Computer users fall into two groups:-
those that do backups
those that have never had a hard drive fail.

NeddySeagoon · Posted: Fri Aug 23, 2019 1:09 pm Post subject:

Team,

Sakaki · Guru Joined: 21 May 2014 Posts: 409

Sakaki · Guru Joined: 21 May 2014 Posts: 409

NeddySeagoon · Posted: Fri Aug 23, 2019 2:01 pm Post subject:

Sakaki,

That's my thinking too but I've not done it yet.

My Acer R13 Chromebook in a big.LITTLE device but for now, it just runs my Pi3 Gentoo.
_________________
Regards,

NeddySeagoon

Computer users fall into two groups:-
those that do backups
those that have never had a hard drive fail.

roylongbottom · n00b Joined: 13 Feb 2017 Posts: 64 Location: Essex, UK

Optimisers or Misers

I have been trying the suggested compiling parameters on various benchmarks, via Gentoo on a Pi 4B, but have not found one where they made a great deal of difference - unlike hardware architecture. No doubt there are some.

Below are result for the Livermore loops, comprising 24 program kernels, the most critical at Lawrence Livermore Laboratory for selecting a new supercomputer. The tables show the compile parameters used. The first table indicating the measured MFLOPS for each kernel, and the second one relative ratios compared with

roylongbottom · n00b Joined: 13 Feb 2017 Posts: 64 Location: Essex, UK

roylongbottom · n00b Joined: 13 Feb 2017 Posts: 64 Location: Essex, UK

Sakaki

My WiFi is now working using v1.5.1 bugfix release, on my two Pi 4s and a Pi 3B+. I have also found that two monitors and a TV display at the correct resolution.
_________________
Regards

Roy

Sakaki · Guru Joined: 21 May 2014 Posts: 409