-= Binary Translated =-
$ ./rvlinux ../binaries/STREAM/stream-tuned-rv64gvb
-------------------------------------------------------------
STREAM version $Revision: 5.10 $
-------------------------------------------------------------
This system uses 4 bytes per array element.
-------------------------------------------------------------
Array size = 20000000 (elements), Offset = 0 (elements)
Memory per array = 76.3 MiB (= 0.1 GiB).
Total memory required = 228.9 MiB (= 0.2 GiB).
Each kernel will be executed 10 times.
 The *best* time for each kernel (excluding the first iteration)
 will be used to compute the reported bandwidth.
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 14130 microseconds.
   (= 14130 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function    Best Rate MB/s  Avg time     Min time     Max time
Copy:           35281.1     0.004607     0.004535     0.004767
Scale:          33712.6     0.004784     0.004746     0.004818
Add:            32693.1     0.007415     0.007341     0.007539
Triad:          31716.7     0.007608     0.007567     0.007661
-------------------------------------------------------------
Solution Validates: avg error less than 1.000000e-06 on all three arrays
-------------------------------------------------------------
>>> Program exited, exit code = 0 (0x0)
Instructions executed: 1305098370  Runtime: 388.267ms  Insn/s: 3361mi/s
Pages in use: 20 (80 kB virtual memory, total 364 kB)
-= Interpreted =-
$ ./rvlinux ../binaries/STREAM/build/stream
-------------------------------------------------------------
STREAM version $Revision: 5.10 $
-------------------------------------------------------------
This system uses 4 bytes per array element.
-------------------------------------------------------------
Array size = 20000000 (elements), Offset = 0 (elements)
Memory per array = 76.3 MiB (= 0.1 GiB).
Total memory required = 228.9 MiB (= 0.2 GiB).
Each kernel will be executed 10 times.
 The *best* time for each kernel (excluding the first iteration)
 will be used to compute the reported bandwidth.
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 96551 microseconds.
   (= 96551 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function    Best Rate MB/s  Avg time     Min time     Max time
Copy:           18262.5     0.010872     0.008761     0.012888
Scale:           9456.8     0.017738     0.016919     0.022430
Add:            10730.6     0.022911     0.022366     0.024496
Triad:           7793.5     0.032249     0.030795     0.036608
-------------------------------------------------------------
Solution Validates: avg error less than 1.000000e-06 on all three arrays
-------------------------------------------------------------
>>> Program exited, exit code = 0 (0x0)
Instructions executed: 1177805251  Runtime: 1489.893ms  Insn/s: 791mi/s
Pages in use: 233 (932 kB virtual memory, total 1978 kB)
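
The two logs say that the best (minimum) time per kernel is used to compute the reported bandwidth. As a rough cross-check, the sketch below recomputes "Best Rate MB/s" from the "Min time" column and derives the per-kernel speedup of the binary-translated run over the interpreted one. It assumes the standard STREAM byte accounting (Copy and Scale touch two arrays, Add and Triad touch three, with 1 MB = 10^6 bytes). The dictionary and function names are made up for this illustration and are not part of rvlinux or STREAM. Because the logged times are rounded to six decimals, the recomputed rates can differ from the log in the last digit.

```python
# Values copied from the two runs above.
ARRAY_ELEMENTS = 20_000_000
BYTES_PER_ELEMENT = 4

# Assumption: standard STREAM accounting of arrays read plus written per kernel.
ARRAYS_TOUCHED = {"Copy": 2, "Scale": 2, "Add": 3, "Triad": 3}

# "Min time" column, in seconds (hypothetical names, data from the logs).
MIN_TIME_TRANSLATED = {"Copy": 0.004535, "Scale": 0.004746,
                       "Add": 0.007341, "Triad": 0.007567}
MIN_TIME_INTERPRETED = {"Copy": 0.008761, "Scale": 0.016919,
                        "Add": 0.022366, "Triad": 0.030795}


def best_rate_mb_s(kernel, min_time):
    """Bandwidth in MB/s (10^6 bytes) from the best time of one kernel."""
    bytes_moved = ARRAYS_TOUCHED[kernel] * BYTES_PER_ELEMENT * ARRAY_ELEMENTS
    return 1.0e-6 * bytes_moved / min_time


def speedup(kernel):
    """How many times faster the translated run is for one kernel."""
    return MIN_TIME_INTERPRETED[kernel] / MIN_TIME_TRANSLATED[kernel]


if __name__ == "__main__":
    for kernel in ARRAYS_TOUCHED:
        translated = best_rate_mb_s(kernel, MIN_TIME_TRANSLATED[kernel])
        interpreted = best_rate_mb_s(kernel, MIN_TIME_INTERPRETED[kernel])
        print(f"{kernel:<6} translated {translated:9.1f} MB/s  "
              f"interpreted {interpreted:9.1f} MB/s  "
              f"speedup {speedup(kernel):.2f}x")
    # Whole-program wall time, from the "Runtime:" lines of both logs.
    print(f"Total runtime ratio: {1489.893 / 388.267:.2f}x")
```

The per-kernel ratios are not uniform: Copy gains the least and Triad the most, while the whole-program runtime ratio also includes the untimed setup and validation phases.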
STREAM with libtcc as JIT-compiler: