Test results for aarch64, pi5, crypto_hashblocks/sha512

[Page version: 20250609 19:52:35]

Computer: pi5
Microarchitecture: aarch64; Cortex-A76 (414fd0b1)
Architecture: aarch64
CPU ID: 414fd0b1
SUPERCOP version: 20250415
Operation: crypto_hashblocks
Primitive: sha512
Time | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version
9771 | 2612 0 0 | 11528 768 736 | wflip | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
9820 | 3200 0 0 | 12984 768 736 | wflip | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
9832 | 2820 0 0 | 14369 824 728 | wflip | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
10067 | 2716 0 0 | 10656 752 720 | wflip | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
10508 | 11472 0 0 | 21280 768 736 | inplace | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
10611 | 11492 0 0 | 21264 768 736 | ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
10815 | 11536 0 0 | 20464 768 736 | ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
10822 | 11500 0 0 | 20448 768 736 | inplace | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
10979 | 4248 0 0 | 13063 760 728 | wflip | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
11147 | 3828 0 0 | 15409 824 728 | compact | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
11315 | 4012 0 0 | 13840 768 736 | compact | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
11326 | 3476 0 0 | 12424 768 736 | compact | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
11627 | 10856 0 0 | 18800 752 720 | ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
11645 | 10852 0 0 | 18808 752 720 | inplace | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
11799 | 3444 0 0 | 11408 752 720 | compact | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
12134 | 11124 0 0 | 19975 760 728 | inplace | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
12140 | 11144 0 0 | 19999 760 728 | ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
12339 | 11884 0 0 | 23449 824 728 | inplace | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
12432 | 11896 0 0 | 23441 824 728 | ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
12512 | 3624 0 0 | 12487 760 728 | compact | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
13200 | 1664 0 0 | 13209 824 728 | compact4 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
14390 | 1228 0 0 | 9176 752 720 | compact4 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
14636 | 1436 0 0 | 10368 768 736 | compact4 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
14711 | 2036 0 0 | 11856 768 736 | compact4 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
15180 | 1400 0 0 | 10231 760 728 | compact4 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
19903 | 2156 0 0 | 11976 768 736 | compact2 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
19952 | 1616 0 0 | 10560 768 736 | compact2 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
21206 | 1708 0 0 | 13265 824 728 | compact3 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
21307 | 1444 0 0 | 10392 768 736 | compact3 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
21394 | 2972 0 0 | 14521 824 728 | compact2 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
21539 | 2064 0 0 | 11872 768 736 | compact3 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
21559 | 1564 0 0 | 9552 752 720 | compact2 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
22245 | 1364 0 0 | 9320 752 720 | compact3 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
22433 | 1692 0 0 | 10567 760 728 | compact2 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415
23670 | 1572 0 0 | 10415 760 728 | compact3 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250522 | 20250415

Compiler output


blocks.c: blocks.c:15:2: warning: "FEAT_SHA512 is not enabled in the compiler. To enable FEAT_SHA512, use for instance \"-mcpu=neoverse-v2+crypto+sha3\" on Graviton 4 or NVidia Grace." [-W#warnings]
blocks.c: #warning "FEAT_SHA512 is not enabled in the compiler. To enable FEAT_SHA512, use for instance \"-mcpu=neoverse-v2+crypto+sha3\" on Graviton 4 or NVidia Grace."
blocks.c:  ^
blocks.c: blocks.c:181:5: warning: implicit declaration of function 'vsha512hq_u64' is invalid in C99 [-Wimplicit-function-declaration]
blocks.c:     F4_v2( w0, GETK( 0),  w2, GETK( 2), ab, cd, ef, gh);
blocks.c:     ^
blocks.c: blocks.c:52:19: note: expanded from macro 'F4_v2'
blocks.c:   half_round_v2 = vsha512hq_u64(k0plusw0_swapped_plusgh_v2, vextq_u64(ef, gh, 1), vextq_u64(cd, ef, 1)); \
blocks.c:                   ^
blocks.c: blocks.c:181:5: error: assigning to 'uint64x2_t' (vector of 2 'uint64_t' values) from incompatible type 'int'
blocks.c:     F4_v2( w0, GETK( 0),  w2, GETK( 2), ab, cd, ef, gh);
blocks.c:     ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blocks.c: blocks.c:52:17: note: expanded from macro 'F4_v2'
blocks.c:   half_round_v2 = vsha512hq_u64(k0plusw0_swapped_plusgh_v2, vextq_u64(ef, gh, 1), vextq_u64(cd, ef, 1)); \
blocks.c:                 ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blocks.c: blocks.c:181:5: warning: implicit declaration of function 'vsha512h2q_u64' is invalid in C99 [-Wimplicit-function-declaration]
blocks.c: blocks.c:53:8: note: expanded from macro 'F4_v2'
blocks.c:   gh = vsha512h2q_u64(half_round_v2, cd, ab);                           \
blocks.c:        ^
blocks.c: blocks.c:181:5: error: assigning to 'uint64x2_t' (vector of 2 'uint64_t' values) from incompatible type 'int'
blocks.c:     F4_v2( w0, GETK( 0),  w2, GETK( 2), ab, cd, ef, gh);
blocks.c:     ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blocks.c: blocks.c:53:6: note: expanded from macro 'F4_v2'
blocks.c:   gh = vsha512h2q_u64(half_round_v2, cd, ab);                           \
blocks.c:      ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blocks.c: ...

Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler
dolbeau/arm-sha512 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)

Compiler output


blocks.c: blocks.c:15:2: warning: #warning "FEAT_SHA512 is not enabled in the compiler. To enable FEAT_SHA512, use for instance \"-mcpu=neoverse-v2+crypto+sha3\" on Graviton 4 or NVidia Grace." [-Wcpp]
blocks.c:    15 | #warning "FEAT_SHA512 is not enabled in the compiler. To enable FEAT_SHA512, use for instance \"-mcpu=neoverse-v2+crypto+sha3\" on Graviton 4 or NVidia Grace."
blocks.c:       |  ^~~~~~~
blocks.c: In file included from blocks.c:11:
blocks.c: /usr/lib/gcc/aarch64-linux-gnu/12/include/arm_neon.h: In function 'crypto_hashblocks_sha512_dolbeau_arm_sha512_constbranchindex':
blocks.c: /usr/lib/gcc/aarch64-linux-gnu/12/include/arm_neon.h:29671:1: error: inlining failed in call to 'always_inline' 'vsha512h2q_u64': target specific option mismatch
blocks.c: 29671 | vsha512h2q_u64 (uint64x2_t __a, uint64x2_t __b, uint64x2_t __c)
blocks.c:       | ^~~~~~~~~~~~~~
blocks.c: blocks.c:59:8: note: called from here
blocks.c:    59 |   ef = vsha512h2q_u64(half_round_v2, ab, gh);                           \
blocks.c:       |        ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blocks.c: blocks.c:212:5: note: in expansion of macro 'F4_v2'
blocks.c:   212 |     F4_v2(w12, GETK(76), w14, GETK(78), ef, gh, ab, cd);
blocks.c:       |     ^~~~~
blocks.c: /usr/lib/gcc/aarch64-linux-gnu/12/include/arm_neon.h:29664:1: error: inlining failed in call to 'always_inline' 'vsha512hq_u64': target specific option mismatch
blocks.c: 29664 | vsha512hq_u64 (uint64x2_t __a, uint64x2_t __b, uint64x2_t __c)
blocks.c:       | ^~~~~~~~~~~~~
blocks.c: blocks.c:58:19: note: called from here
blocks.c:    58 |   half_round_v2 = vsha512hq_u64(k2plusw2_swapped_plusef_v2, vextq_u64(cd, ef, 1), vextq_u64(ab, cd, 1)); \
blocks.c:       |                   ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blocks.c: blocks.c:212:5: note: in expansion of macro 'F4_v2'
blocks.c:   212 |     F4_v2(w12, GETK(76), w14, GETK(78), ef, gh, ab, cd);
blocks.c:       |     ^~~~~
blocks.c: /usr/lib/gcc/aarch64-linux-gnu/12/include/arm_neon.h:29671:1: error: inlining failed in call to 'always_inline' 'vsha512h2q_u64': target specific option mismatch
blocks.c: 29671 | vsha512h2q_u64 (uint64x2_t __a, uint64x2_t __b, uint64x2_t __c)
blocks.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:
Implementation | Compiler
dolbeau/arm-sha512 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
dolbeau/arm-sha512 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
dolbeau/arm-sha512 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
dolbeau/arm-sha512 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
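
Both failures above come from calling the vsha512hq_u64 / vsha512h2q_u64 intrinsics in a build where the compiler has not enabled FEAT_SHA512. The following is a minimal sketch, not taken from dolbeau/arm-sha512, of how code could test for the feature at compile time using the ACLE macro __ARM_FEATURE_SHA512 and fall back to another code path instead of failing to build. The helper names have_sha512_intrinsics and sha512_backend_name are hypothetical and exist only for this illustration.

```c
#include <stdio.h>

/* ACLE defines __ARM_FEATURE_SHA512 when the compiler targets a CPU
 * (or is given flags) with the SHA-512 instructions enabled. Only in
 * that case is it safe to include <arm_neon.h> and call
 * vsha512hq_u64 / vsha512h2q_u64. */
#if defined(__ARM_FEATURE_SHA512)
#include <arm_neon.h>
#endif

/* Hypothetical helper: 1 if the SHA-512 intrinsics are usable in this
 * build, 0 otherwise. */
int have_sha512_intrinsics(void)
{
#if defined(__ARM_FEATURE_SHA512)
    return 1;
#else
    return 0;
#endif
}

/* Hypothetical helper: name of the code path this build would select. */
const char *sha512_backend_name(void)
{
    return have_sha512_intrinsics() ? "FEAT_SHA512 intrinsics"
                                    : "portable C fallback";
}
```

Calling printf("%s\n", sha512_backend_name()) from a test program shows which path a given set of compiler flags selects; the result depends entirely on the flags and target, so no particular output is assumed here.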

Passed TIMECOP


TIMECOP iterations: 10

Number of similar (implementation,compiler) pairs: 35, namely:
Implementation | Compiler
compact | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)
compact | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
compact | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
compact | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
compact | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
compact2 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)
compact2 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
compact2 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
compact2 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
compact2 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
compact3 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)
compact3 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
compact3 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
compact3 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
compact3 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
compact4 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)
compact4 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
compact4 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
compact4 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
compact4 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
inplace | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)
inplace | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
inplace | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
inplace | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
inplace | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)
ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
wflip | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)
wflip | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
wflip | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
wflip | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
wflip | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)