Test results for aarch64, pi5, crypto_hashblocks/sha512
[Page version: 20250609 19:52:35]
Computer: pi5
Microarchitecture: aarch64; Cortex-A76 (414fd0b1)
Architecture: aarch64
CPU ID: 414fd0b1
SUPERCOP version: 20250415
Operation: crypto_hashblocks
Primitive: sha512
Time | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version |
9771 | 2612 0 0 | 11528 768 736 | wflip | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
9820 | 3200 0 0 | 12984 768 736 | wflip | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
9832 | 2820 0 0 | 14369 824 728 | wflip | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
10067 | 2716 0 0 | 10656 752 720 | wflip | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
10508 | 11472 0 0 | 21280 768 736 | inplace | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
10611 | 11492 0 0 | 21264 768 736 | ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
10815 | 11536 0 0 | 20464 768 736 | ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
10822 | 11500 0 0 | 20448 768 736 | inplace | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
10979 | 4248 0 0 | 13063 760 728 | wflip | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
11147 | 3828 0 0 | 15409 824 728 | compact | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
11315 | 4012 0 0 | 13840 768 736 | compact | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
11326 | 3476 0 0 | 12424 768 736 | compact | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
11627 | 10856 0 0 | 18800 752 720 | ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
11645 | 10852 0 0 | 18808 752 720 | inplace | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
11799 | 3444 0 0 | 11408 752 720 | compact | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
12134 | 11124 0 0 | 19975 760 728 | inplace | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
12140 | 11144 0 0 | 19999 760 728 | ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
12339 | 11884 0 0 | 23449 824 728 | inplace | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
12432 | 11896 0 0 | 23441 824 728 | ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
12512 | 3624 0 0 | 12487 760 728 | compact | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
13200 | 1664 0 0 | 13209 824 728 | compact4 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
14390 | 1228 0 0 | 9176 752 720 | compact4 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
14636 | 1436 0 0 | 10368 768 736 | compact4 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
14711 | 2036 0 0 | 11856 768 736 | compact4 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
15180 | 1400 0 0 | 10231 760 728 | compact4 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
19903 | 2156 0 0 | 11976 768 736 | compact2 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
19952 | 1616 0 0 | 10560 768 736 | compact2 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
21206 | 1708 0 0 | 13265 824 728 | compact3 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
21307 | 1444 0 0 | 10392 768 736 | compact3 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
21394 | 2972 0 0 | 14521 824 728 | compact2 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
21539 | 2064 0 0 | 11872 768 736 | compact3 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
21559 | 1564 0 0 | 9552 752 720 | compact2 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
22245 | 1364 0 0 | 9320 752 720 | compact3 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
22433 | 1692 0 0 | 10567 760 728 | compact2 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
23670 | 1572 0 0 | 10415 760 728 | compact3 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
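The table above lists several compiler configurations per implementation, so the best result for each one is easy to miss. The following is a minimal sketch, not part of SUPERCOP, that parses rows in the pipe-delimited layout shown above and keeps the lowest Time value per implementation. The helper names `parse_row` and `fastest_per_implementation` are hypothetical, and `ROWS` holds only a hand-picked subset of the rows copied from the table.

```python
# Subset of rows copied verbatim from the table above (not the full table).
ROWS = """\
9771 | 2612 0 0 | 11528 768 736 | wflip | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
9820 | 3200 0 0 | 12984 768 736 | wflip | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
10508 | 11472 0 0 | 21280 768 736 | inplace | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
10611 | 11492 0 0 | 21264 768 736 | ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
10815 | 11536 0 0 | 20464 768 736 | ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
21206 | 1708 0 0 | 13265 824 728 | compact3 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415 |
"""


def parse_row(line):
    """Split one pipe-delimited row into a dict of typed fields."""
    fields = [field.strip() for field in line.split("|")]
    # The trailing "|" leaves an empty last field, so only the first
    # five columns are unpacked here.
    time, object_size, test_size, impl, compiler = fields[:5]
    return {
        "time": int(time),
        "object_size": tuple(int(x) for x in object_size.split()),
        "test_size": tuple(int(x) for x in test_size.split()),
        "impl": impl,
        "compiler": compiler,
    }


def fastest_per_implementation(text):
    """Return {implementation: row} keeping the row with the lowest time."""
    best = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        row = parse_row(line)
        current = best.get(row["impl"])
        if current is None or row["time"] < current["time"]:
            best[row["impl"]] = row
    return best


best = fastest_per_implementation(ROWS)
ref_time = best["ref"]["time"]
for impl, row in sorted(best.items(), key=lambda item: item[1]["time"]):
    # Ratio relative to the fastest "ref" build in this subset.
    print(f"{impl:9s} {row['time']:6d} {row['time'] / ref_time:.2f}x ref")
```

Feeding the whole table to `fastest_per_implementation` instead of the subset would give one line per implementation, sorted by Time.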
Compiler output
blocks.c: blocks.c:15:2: warning: "FEAT_SHA512 is not enabled in the compiler. To enable FEAT_SHA512, use for instance \"-mcpu=neoverse-v2+crypto+sha3\" on Graviton 4 or NVidia Grace." [-W#warnings]
blocks.c: #warning "FEAT_SHA512 is not enabled in the compiler. To enable FEAT_SHA512, use for instance \"-mcpu=neoverse-v2+crypto+sha3\" on Graviton 4 or NVidia Grace."
blocks.c: ^
blocks.c: blocks.c:181:5: warning: implicit declaration of function 'vsha512hq_u64' is invalid in C99 [-Wimplicit-function-declaration]
blocks.c: F4_v2( w0, GETK( 0), w2, GETK( 2), ab, cd, ef, gh);
blocks.c: ^
blocks.c: blocks.c:52:19: note: expanded from macro 'F4_v2'
blocks.c: half_round_v2 = vsha512hq_u64(k0plusw0_swapped_plusgh_v2, vextq_u64(ef, gh, 1), vextq_u64(cd, ef, 1)); \
blocks.c: ^
blocks.c: blocks.c:181:5: error: assigning to 'uint64x2_t' (vector of 2 'uint64_t' values) from incompatible type 'int'
blocks.c: F4_v2( w0, GETK( 0), w2, GETK( 2), ab, cd, ef, gh);
blocks.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blocks.c: blocks.c:52:17: note: expanded from macro 'F4_v2'
blocks.c: half_round_v2 = vsha512hq_u64(k0plusw0_swapped_plusgh_v2, vextq_u64(ef, gh, 1), vextq_u64(cd, ef, 1)); \
blocks.c: ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blocks.c: blocks.c:181:5: warning: implicit declaration of function 'vsha512h2q_u64' is invalid in C99 [-Wimplicit-function-declaration]
blocks.c: blocks.c:53:8: note: expanded from macro 'F4_v2'
blocks.c: gh = vsha512h2q_u64(half_round_v2, cd, ab); \
blocks.c: ^
blocks.c: blocks.c:181:5: error: assigning to 'uint64x2_t' (vector of 2 'uint64_t' values) from incompatible type 'int'
blocks.c: F4_v2( w0, GETK( 0), w2, GETK( 2), ab, cd, ef, gh);
blocks.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blocks.c: blocks.c:53:6: note: expanded from macro 'F4_v2'
blocks.c: gh = vsha512h2q_u64(half_round_v2, cd, ab); \
blocks.c: ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blocks.c: ...
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
dolbeau/arm-sha512 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
Compiler output
blocks.c: blocks.c:15:2: warning: #warning "FEAT_SHA512 is not enabled in the compiler. To enable FEAT_SHA512, use for instance \"-mcpu=neoverse-v2+crypto+sha3\" on Graviton 4 or NVidia Grace." [-Wcpp]
blocks.c: 15 | #warning "FEAT_SHA512 is not enabled in the compiler. To enable FEAT_SHA512, use for instance \"-mcpu=neoverse-v2+crypto+sha3\" on Graviton 4 or NVidia Grace."
blocks.c: | ^~~~~~~
blocks.c: In file included from blocks.c:11:
blocks.c: /usr/lib/gcc/aarch64-linux-gnu/12/include/arm_neon.h: In function 'crypto_hashblocks_sha512_dolbeau_arm_sha512_constbranchindex':
blocks.c: /usr/lib/gcc/aarch64-linux-gnu/12/include/arm_neon.h:29671:1: error: inlining failed in call to 'always_inline' 'vsha512h2q_u64': target specific option mismatch
blocks.c: 29671 | vsha512h2q_u64 (uint64x2_t __a, uint64x2_t __b, uint64x2_t __c)
blocks.c: | ^~~~~~~~~~~~~~
blocks.c: blocks.c:59:8: note: called from here
blocks.c: 59 | ef = vsha512h2q_u64(half_round_v2, ab, gh); \
blocks.c: | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blocks.c: blocks.c:212:5: note: in expansion of macro 'F4_v2'
blocks.c: 212 | F4_v2(w12, GETK(76), w14, GETK(78), ef, gh, ab, cd);
blocks.c: | ^~~~~
blocks.c: /usr/lib/gcc/aarch64-linux-gnu/12/include/arm_neon.h:29664:1: error: inlining failed in call to 'always_inline' 'vsha512hq_u64': target specific option mismatch
blocks.c: 29664 | vsha512hq_u64 (uint64x2_t __a, uint64x2_t __b, uint64x2_t __c)
blocks.c: | ^~~~~~~~~~~~~
blocks.c: blocks.c:58:19: note: called from here
blocks.c: 58 | half_round_v2 = vsha512hq_u64(k2plusw2_swapped_plusef_v2, vextq_u64(cd, ef, 1), vextq_u64(ab, cd, 1)); \
blocks.c: | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blocks.c: blocks.c:212:5: note: in expansion of macro 'F4_v2'
blocks.c: 212 | F4_v2(w12, GETK(76), w14, GETK(78), ef, gh, ab, cd);
blocks.c: | ^~~~~
blocks.c: /usr/lib/gcc/aarch64-linux-gnu/12/include/arm_neon.h:29671:1: error: inlining failed in call to 'always_inline' 'vsha512h2q_u64': target specific option mismatch
blocks.c: 29671 | vsha512h2q_u64 (uint64x2_t __a, uint64x2_t __b, uint64x2_t __c)
blocks.c: ...
Number of similar (implementation,compiler) pairs: 4, namely:
Implementation | Compiler |
dolbeau/arm-sha512 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
dolbeau/arm-sha512 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
dolbeau/arm-sha512 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
dolbeau/arm-sha512 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
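The dolbeau/arm-sha512 failures above appear to share one cause: the compiler flags used here do not enable FEAT_SHA512, so the `vsha512hq_u64` and `vsha512h2q_u64` intrinsics cannot be compiled. Before adding a flag such as the `+sha3` suffix suggested in the warning, it may help to check whether the CPU reports the feature at all. The sketch below assumes a Linux system whose `/proc/cpuinfo` has a `Features` line that lists a `sha512` flag when the extension is present. The function names and both sample strings are hypothetical and are not taken from the benchmarked machine.

```python
def cpu_features(cpuinfo_text):
    """Return the set of flags on the first 'Features' line, or an empty set."""
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip() == "Features":
            return set(value.split())
    return set()


def has_sha512(cpuinfo_text):
    """True if the 'sha512' flag is listed (exact match, so 'sha2' does not count)."""
    return "sha512" in cpu_features(cpuinfo_text)


# Hypothetical sample inputs for illustration only.
SAMPLE_WITHOUT = "processor\t: 0\nFeatures\t: fp asimd aes pmull sha1 sha2 crc32\n"
SAMPLE_WITH = "processor\t: 0\nFeatures\t: fp asimd aes sha1 sha2 sha3 sha512\n"

print(has_sha512(SAMPLE_WITHOUT))
print(has_sha512(SAMPLE_WITH))
```

On a real Linux machine the input would come from reading `/proc/cpuinfo`, for example `has_sha512(open("/proc/cpuinfo").read())`. If the flag is absent, enabling the extension in the compiler would only move the failure from build time to run time.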
Passed TIMECOP
TIMECOP iterations: 10
Number of similar (implementation,compiler) pairs: 35, namely:
Implementation | Compiler |
compact | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
compact | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
compact | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
compact | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
compact | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
compact2 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
compact2 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
compact2 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
compact2 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
compact2 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
compact3 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
compact3 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
compact3 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
compact3 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
compact3 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
compact4 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
compact4 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
compact4 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
compact4 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
compact4 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
inplace | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
inplace | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
inplace | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
inplace | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
inplace | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
wflip | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
wflip | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
wflip | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
wflip | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
wflip | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |