Test results for amd64, hertz, crypto_hash/simd512
[Page version: 20250926 22:06:17]
Computer: hertz
Microarchitecture: amd64; Zen 4 (a60f12)
Architecture: amd64
CPU ID: AuthenticAMD-00a60f12-178bfbff
SUPERCOP version: 20250922
Operation: crypto_hash
Primitive: simd512
Time | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version |
9721 | 15953 416 0 | 28189 1268 1032 | T:vect128 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250925 | 20250922 |
9751 | 20521 416 0 | 34709 1268 1032 | T:vect128 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250925 | 20250922 |
9759 | 14439 416 0 | 25144 1244 936 | T:vect128 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250925 | 20250922 |
24708 | 45299 0 0 | 61751 828 968 | T:sphlib | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250925 | 20250922 |
25360 | 36483 0 0 | 50999 828 968 | T:sphlib | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250925 | 20250922 |
29568 | 60546 416 0 | 74757 1268 1032 | T:opt | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250925 | 20250922 |
32497 | 27482 0 0 | 39193 820 968 | T:sphlib | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250925 | 20250922 |
38226 | 57348 0 0 | 71230 804 1032 | T:sphlib | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250925 | 20250922 |
39139 | 52946 0 0 | 64806 804 1032 | T:sphlib | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250925 | 20250922 |
44205 | 39291 388 0 | 54546 1248 968 | T:opt | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250925 | 20250922 |
45631 | 32972 0 0 | 46886 804 1032 | T:sphlib-small | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250925 | 20250922 |
46368 | 28786 0 0 | 40678 804 1032 | T:sphlib-small | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250925 | 20250922 |
46825 | 38279 388 0 | 51514 1248 968 | T:opt | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250925 | 20250922 |
47321 | 18970 388 0 | 30804 1240 968 | T:opt | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250925 | 20250922 |
48410 | 47975 0 0 | 64223 828 968 | T:sphlib-small | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250925 | 20250922 |
48899 | 37127 0 0 | 51423 828 968 | T:sphlib-small | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250925 | 20250922 |
56172 | 28267 0 0 | 40033 820 968 | T:sphlib-small | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250925 | 20250922 |
63066 | 20002 416 0 | 32293 1268 1032 | T:opt | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250925 | 20250922 |
67330 | 52631 0 0 | 62985 780 936 | T:sphlib | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250925 | 20250922 |
76950 | 26382 0 0 | 36761 780 936 | T:sphlib-small | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250925 | 20250922 |
90887 | 13422 416 0 | 24136 1244 936 | T:opt | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250925 | 20250922 |
5156658 | 5306 388 0 | 17252 1240 968 | T:ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250925 | 20250922 |
5849578 | 47269 388 0 | 64314 1248 968 | T:ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250925 | 20250922 |
5849909 | 33205 388 0 | 48250 1248 968 | T:ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250925 | 20250922 |
6816374 | 13061 416 0 | 27253 1268 1032 | T:ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250925 | 20250922 |
6875690 | 6566 416 0 | 18789 1268 1032 | T:ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250925 | 20250922 |
8174133 | 4891 416 0 | 15576 1244 936 | T:ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250925 | 20250922 |
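Reading the table: rows are sorted by the Time column, so implementations can be compared by dividing one entry by another. The helper below is a minimal sketch of that arithmetic; the name `time_ratio` is hypothetical, and since this page does not state the unit of the Time column, the result is treated as a unitless ratio.

```c
#include <assert.h>

/* Ratio of two entries from the Time column. Both entries share the same
   (unstated) unit, so the unit cancels. `time_ratio` is a hypothetical name. */
static double time_ratio(unsigned long slow, unsigned long fast)
{
    assert(fast > 0);   /* guard against division by zero */
    return (double)slow / (double)fast;
}
```

For example, `time_ratio(8174133, 9721)` compares the slowest T:ref row with the fastest T:vect128 row and comes out at roughly 841, while `time_ratio(24708, 9721)` compares the fastest T:sphlib row with the same T:vect128 row and comes out at roughly 2.5.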
Compiler output
optimized.c: optimized.c:437:9: warning: unused variable 'j' [-Wunused-variable]
optimized.c: 437 | int i,j;
optimized.c: | ^
optimized.c: 1 warning generated.
Number of similar (implementation,compiler) pairs: 3, namely:
Implementation | Compiler |
T:opt | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_20.1.8_(0ubuntu4)) |
T:opt | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_20.1.8_(0ubuntu4)) |
T:opt | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_20.1.8_(0ubuntu4)) |
Compiler output
optimized.c: optimized.c: In function 'SIMD_Compress':
optimized.c: optimized.c:437:9: warning: unused variable 'j' [-Wunused-variable]
optimized.c: 437 | int i,j;
optimized.c: | ^
Number of similar (implementation,compiler) pairs: 3, namely:
Implementation | Compiler |
T:opt | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (15.2.0) |
T:opt | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (15.2.0) |
T:opt | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (15.2.0) |
Compiler output
reference.c: reference.c:69:82: warning: expression result unused [-Wunused-value]
reference.c: 69 | state->A[j] = state->D[j] + w[j] + F(state->A[j], state->B[j], state->C[j]), s;
reference.c: | ^
reference.c: 1 warning generated.
Number of similar (implementation,compiler) pairs: 3, namely:
Implementation | Compiler |
T:ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_20.1.8_(0ubuntu4)) |
T:ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_20.1.8_(0ubuntu4)) |
T:ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_20.1.8_(0ubuntu4)) |
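Note on the -Wunused-value warning above: in C, assignment binds tighter than the comma operator, so a statement of the form `lhs = expr, s;` assigns `expr` and then evaluates and discards `s`. The sketch below reproduces only that statement shape. All function and parameter names are hypothetical, the arithmetic is a stand-in for the real expression, and this page does not show what `s` was intended for.

```c
#include <stdint.h>

/* Hypothetical stand-in with the same statement shape as reference.c:69.
   The statement parses as (*a = d + w + f), s; so the value of s plays no
   part in the result. The (void) cast makes the discard explicit, which
   lets this sketch compile without the warning; the flagged source line
   has a plain `s` there instead. */
static uint32_t step_with_trailing_operand(uint32_t *a, uint32_t d,
                                           uint32_t w, uint32_t f, int s)
{
    *a = d + w + f, (void)s;
    return *a;
}

/* Same observable behaviour with the trailing operand removed. */
static uint32_t step_without_trailing_operand(uint32_t *a, uint32_t d,
                                              uint32_t w, uint32_t f)
{
    *a = d + w + f;
    return *a;
}
```

Both functions store the same value for any `s`, which is why the compiler reports the trailing operand as unused.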
Compiler output
vector.c: vector.c:73:9: warning: 'X' macro redefined [-Wmacro-redefined]
vector.c: 73 | #define X(i) X##i
vector.c: | ^
vector.c: vector.c:68:9: note: previous definition is here
vector.c: 68 | #define X(i) A[i]
vector.c: | ^
vector.c: vector.c:129:3: error: use of unknown builtin '__builtin_ia32_pcmpgtw128' [-Wimplicit-function-declaration]
vector.c: 129 | DO_REDUCE_FULL_S(0);
vector.c: | ^
vector.c: vector.c:56:12: note: expanded from macro 'DO_REDUCE_FULL_S'
vector.c: 56 | X(i) = EXTRA_REDUCE_S(X(i)); \
vector.c: | ^
vector.c: vector.c:42:32: note: expanded from macro 'EXTRA_REDUCE_S'
vector.c: 42 | v16_sub(x, v16_and(V257.v16, v16_cmp(x, V128.v16)))
vector.c: | ^
vector.c: ./vector.h:92:22: note: expanded from macro 'v16_cmp'
vector.c: 92 | #define v16_cmp __builtin_ia32_pcmpgtw128
vector.c: | ^
vector.c: vector.c:129:3: error: cannot convert between scalar type 'int' and vector type 'v16' (aka 'v8hi') as implicit conversion would cause truncation
vector.c: vector.c:56:12: note: expanded from macro 'DO_REDUCE_FULL_S'
vector.c: 56 | X(i) = EXTRA_REDUCE_S(X(i)); \
vector.c: | ^
vector.c: vector.c:42:14: note: expanded from macro 'EXTRA_REDUCE_S'
vector.c: 42 | v16_sub(x, v16_and(V257.v16, v16_cmp(x, V128.v16)))
vector.c: | ^
vector.c: ...
Number of similar (implementation,compiler) pairs: 3, namely:
Implementation | Compiler |
T:vect128 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_20.1.8_(0ubuntu4)) |
T:vect128 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_20.1.8_(0ubuntu4)) |
T:vect128 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_20.1.8_(0ubuntu4)) |
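Note on the clang errors above: the diagnostics show `v16_cmp` expanding to `__builtin_ia32_pcmpgtw128`, which this clang reports as an unknown builtin. One possible way to express the same lane-wise signed 16-bit comparison without that builtin is the comparison operator of the GCC/Clang vector extension (the `_mm_cmpgt_epi16` intrinsic from `<emmintrin.h>` is another). The following is a sketch, not the implementation's code: the function names are hypothetical, and it assumes that `V257` and `V128` hold 257 and 128 in every lane.

```c
/* Eight signed 16-bit lanes in 128 bits, via the GCC/Clang vector extension. */
typedef short v8hi __attribute__((vector_size(16)));

/* Lane-wise signed "greater than": all-ones (-1) where x > y, else 0. */
static v8hi v16_cmp_portable(v8hi x, v8hi y)
{
    return (v8hi)(x > y);
}

/* Same shape as the EXTRA_REDUCE_S expansion in the diagnostic:
   x - (257 & (x > 128)), applied to each lane.
   ASSUMPTION: V257 and V128 are all-lanes constants 257 and 128. */
static v8hi extra_reduce_s_sketch(v8hi x)
{
    const v8hi v257 = {257, 257, 257, 257, 257, 257, 257, 257};
    const v8hi v128 = {128, 128, 128, 128, 128, 128, 128, 128};
    return x - (v257 & v16_cmp_portable(x, v128));
}
```

Under that assumption, lanes greater than 128 have 257 subtracted and all other lanes pass through unchanged, so 129 maps to -128 and 128 stays 128.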
Compiler output
vector.c: vector.c: In function 'fft64':
vector.c: vector.c:73:9: warning: 'X' redefined
vector.c: 73 | #define X(i) X##i
vector.c: | ^
vector.c: vector.c:68:9: note: this is the location of the previous definition
vector.c: 68 | #define X(i) A[i]
vector.c: | ^
vector.c: vector.c: In function 'fft128_msg_final':
vector.c: vector.c:326:7: warning: unused variable 'i' [-Wunused-variable]
vector.c: 326 | int i;
vector.c: | ^
vector.c: vector.c: In function 'rounds512':
vector.c: vector.c:796:9: warning: 'STEP_1' redefined
vector.c: 796 | #define STEP_1(a,b,c,d,w,fun,r,s,z) \
vector.c: | ^~~~~~
vector.c: vector.c:542:9: note: this is the location of the previous definition
vector.c: 542 | #define STEP_1(a,b,c,d,w,fun,r,s,z) \
vector.c: | ^~~~~~
vector.c: vector.c:805:9: warning: 'STEP_2' redefined
vector.c: 805 | #define STEP_2(a,b,c,d,w,fun,r,s) \
vector.c: | ^~~~~~
vector.c: vector.c:566:9: note: this is the location of the previous definition
vector.c: 566 | #define STEP_2(a,b,c,d,w,fun,r,s) \
vector.c: | ^~~~~~
vector.c: vector.c:808:9: warning: 'STEP' redefined
vector.c: ...
Number of similar (implementation,compiler) pairs: 3, namely:
Implementation | Compiler |
T:vect128 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (15.2.0) |
T:vect128 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (15.2.0) |
T:vect128 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (15.2.0) |
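Note on the "redefined" warnings above: they appear when a macro name such as `X` is given a new, different body while the previous definition is still active. A common way to silence them is to `#undef` the name before reusing it. The sketch below uses the two `X(i)` bodies quoted in the diagnostics; the surrounding functions are hypothetical and exist only to make the example compile.

```c
/* First meaning of X(i): index into an array named A. */
#define X(i) A[i]
static int sum_indexed(const int *A)
{
    return X(0) + X(1);        /* expands to A[0] + A[1] */
}
#undef X                       /* drop the old body before reusing the name */

/* Second meaning of X(i): paste the index onto the identifier X. */
#define X(i) X##i
static int sum_named(int X0, int X1)
{
    return X(0) + X(1);        /* expands to X0 + X1 */
}
#undef X
```

With the `#undef` lines in place, each definition of `X` is the only one active at that point, so neither gcc nor clang has a previous definition to report.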
Namespace violations
nist.o Final T
nist.o Hash T
nist.o IV_224 D
nist.o IV_256 D
nist.o IV_384 D
nist.o IV_512 D
nist.o IncreaseCounter T
nist.o Init T
nist.o InitIV T
nist.o Update T
optimized.o FFT_128_full T
optimized.o FFT_128_halfzero T
optimized.o FFT_256_halfzero T
optimized.o FFT_64 T
optimized.o RequiredAlignment T
optimized.o Round4 T
optimized.o Round8 T
optimized.o SIMD_Compress T
optimized.o SupportedLength T
optimized.o VERSION T
optimized.o fft128_natural T
optimized.o fft256_natural T
optimized.o p8_xor R
optimized.o revbin T
Number of similar (implementation,compiler) pairs: 6, namely:
Implementation | Compiler |
T:opt | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_20.1.8_(0ubuntu4)) |
T:opt | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_20.1.8_(0ubuntu4)) |
T:opt | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_20.1.8_(0ubuntu4)) |
T:opt | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (15.2.0) |
T:opt | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (15.2.0) |
T:opt | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (15.2.0) |
Namespace violations
nist.o Final T
nist.o Hash T
nist.o IV_224 D
nist.o IV_256 D
nist.o IV_384 D
nist.o IV_512 D
nist.o IncreaseCounter T
nist.o Init T
nist.o InitIV T
nist.o Update T
reference.o IF T
reference.o MAJ T
reference.o P R
reference.o RequiredAlignment T
reference.o Round T
reference.o SIMD_Compress T
reference.o Step T
reference.o SupportedLength T
reference.o VERSION T
reference.o message_expansion T
reference.o p4 R
reference.o p8 R
Number of similar (implementation,compiler) pairs: 6, namely:
Implementation | Compiler |
T:ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_20.1.8_(0ubuntu4)) |
T:ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_20.1.8_(0ubuntu4)) |
T:ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_20.1.8_(0ubuntu4)) |
T:ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (15.2.0) |
T:ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (15.2.0) |
T:ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (15.2.0) |
Namespace violations
simd.o sph_simd224 T
simd.o sph_simd224_addbits_and_close T
simd.o sph_simd224_close T
simd.o sph_simd224_init T
simd.o sph_simd256 T
simd.o sph_simd256_addbits_and_close T
simd.o sph_simd256_close T
simd.o sph_simd256_init T
simd.o sph_simd384 T
simd.o sph_simd384_addbits_and_close T
simd.o sph_simd384_close T
simd.o sph_simd384_init T
simd.o sph_simd512 T
simd.o sph_simd512_addbits_and_close T
simd.o sph_simd512_close T
simd.o sph_simd512_init T
Number of similar (implementation,compiler) pairs: 12, namely:
Implementation | Compiler |
T:sphlib | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_20.1.8_(0ubuntu4)) |
T:sphlib | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_20.1.8_(0ubuntu4)) |
T:sphlib | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_20.1.8_(0ubuntu4)) |
T:sphlib | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (15.2.0) |
T:sphlib | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (15.2.0) |
T:sphlib | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (15.2.0) |
T:sphlib-small | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_20.1.8_(0ubuntu4)) |
T:sphlib-small | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_20.1.8_(0ubuntu4)) |
T:sphlib-small | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_20.1.8_(0ubuntu4)) |
T:sphlib-small | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (15.2.0) |
T:sphlib-small | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (15.2.0) |
T:sphlib-small | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (15.2.0) |
Namespace violations
nist.o Final T
nist.o Hash T
nist.o IV_224 D
nist.o IV_256 D
nist.o IV_384 D
nist.o IV_512 D
nist.o IncreaseCounter T
nist.o Init T
nist.o InitIV T
nist.o Update T
vector.o RequiredAlignment T
vector.o SIMD_Compress T
vector.o SupportedLength T
vector.o VERSION T
vector.o fft128_msg T
vector.o fft128_msg_final T
vector.o fft128_natural T
vector.o fft256_msg T
vector.o rounds T
vector.o rounds512 T
Number of similar (implementation,compiler) pairs: 3, namely:
Implementation | Compiler |
T:vect128 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (15.2.0) |
T:vect128 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (15.2.0) |
T:vect128 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (15.2.0) |
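Note on the "Namespace violations" listings above: each entry gives an object file, a symbol name, and a letter that matches the symbol-type codes printed by `nm` (T for code, D for initialized data, R for read-only data). Assuming the check flags externally visible symbols that fall outside the expected name prefix, one way to keep helpers out of such a listing is to give them internal linkage with `static` and to export only prefixed entry points. The sketch below illustrates the idea; every name in it is hypothetical and none is taken from the benchmarked implementations.

```c
#include <stdint.h>

/* Internal linkage: not visible to other object files, so a check on
   externally visible symbols would not list these. */
static const uint32_t demo_iv[2] = {0x01234567u, 0x89abcdefu};

static uint32_t demo_rotl32(uint32_t x, unsigned n)
{
    /* n must be in 1..31 for this expression to be well defined */
    return (x << n) | (x >> (32u - n));
}

/* The only externally visible symbol, carrying a (hypothetical) prefix. */
uint32_t demo_prefix_mix(uint32_t x)
{
    return demo_rotl32(x ^ demo_iv[0], 8) + demo_iv[1];
}
```

Compiling this to an object file and listing its symbols should show `demo_prefix_mix` as the only global definition, with the table and the rotate helper marked as local.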