Test results for amd64, hydra5, crypto_sign/haetae5

[Page version: 20240911 13:24:11]

Measurements for amd64, hydra5, crypto_sign Test results for amd64, hydra5, crypto_sign Test results for crypto_sign/haetae5
Computer: hydra5
Microarchitecture: amd64; K10 32nm (300f10)
Architecture: amd64
CPU ID: AuthenticAMD-00300f10-178bfbff
SUPERCOP version: 20240909
Operation: crypto_sign
Primitive: haetae5
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
9038505196443 0 0217240 868 1728refclang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2024081120240808
9038816196443 0 0217240 868 1728refclang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2024081120240808
921653063437 0 083336 868 1728refclang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2024081120240808
966040140408 0 058920 868 1728refclang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2024081120240808
981660141438 848 061077 1668 1792refgcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2024081120240808
1006204334968 0 052674 860 1728refclang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2024081120240808
1022909437559 848 056261 1668 1792refgcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2024081120240808
1058837677426 0 098341 812 1792refgcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2024081120240808
1121716235462 1104 052909 1916 1760refgcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2024081120240808

Compiler output


aes256ctr.c: aes256ctr.c:91:3: error: '__builtin_ia32_aeskeygenassist128' needs target feature aes
aes256ctr.c:   BLOCK1(0x01);
aes256ctr.c:   ^
aes256ctr.c: aes256ctr.c:72:11: note: expanded from macro 'BLOCK1'
aes256ctr.c:   temp1 = _mm_aeskeygenassist_si128(temp2, IMM);                        \
aes256ctr.c:           ^
aes256ctr.c: /usr/lib/llvm-14/lib/clang/14.0.0/include/__wmmintrin_aes.h:136:13: note: expanded from macro '_mm_aeskeygenassist_si128'
aes256ctr.c:   ((__m128i)__builtin_ia32_aeskeygenassist128((__v2di)(__m128i)(C), (int)(R)))
aes256ctr.c:             ^
aes256ctr.c: aes256ctr.c:92:3: error: '__builtin_ia32_aeskeygenassist128' needs target feature aes
aes256ctr.c:   BLOCK2(0x01);
aes256ctr.c:   ^
aes256ctr.c: aes256ctr.c:82:11: note: expanded from macro 'BLOCK2'
aes256ctr.c:   temp1 = _mm_aeskeygenassist_si128(temp0, IMM);                        \
aes256ctr.c:           ^
aes256ctr.c: /usr/lib/llvm-14/lib/clang/14.0.0/include/__wmmintrin_aes.h:136:13: note: expanded from macro '_mm_aeskeygenassist_si128'
aes256ctr.c:   ((__m128i)__builtin_ia32_aeskeygenassist128((__v2di)(__m128i)(C), (int)(R)))
aes256ctr.c:             ^
aes256ctr.c: aes256ctr.c:94:3: error: '__builtin_ia32_aeskeygenassist128' needs target feature aes
aes256ctr.c:   BLOCK1(0x02);
aes256ctr.c:   ^
aes256ctr.c: aes256ctr.c:72:11: note: expanded from macro 'BLOCK1'
aes256ctr.c:   temp1 = _mm_aeskeygenassist_si128(temp2, IMM);                        \
aes256ctr.c:           ^
aes256ctr.c: /usr/lib/llvm-14/lib/clang/14.0.0/include/__wmmintrin_aes.h:136:13: note: expanded from macro '_mm_aeskeygenassist_si128'
aes256ctr.c: ...

Number of similar (implementation,compiler) pairs: 5, namely:
ImplementationCompiler
avx2clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
avx2clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
avx2clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
avx2clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
avx2clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

Compiler output


aes256ctr.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:37,
aes256ctr.c:                  from aes256ctr.c:8:
aes256ctr.c: aes256ctr.c: In function 'aesni_encrypt4':
aes256ctr.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/tmmintrin.h:136:1: error: inlining failed in call to 'always_inline' '_mm_shuffle_epi8': target specific option mismatch
aes256ctr.c:   136 | _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
aes256ctr.c:       | ^~~~~~~~~~~~~~~~
aes256ctr.c: aes256ctr.c:23:8: note: called from here
aes256ctr.c:    23 |   f3 = _mm_shuffle_epi8(_mm_add_epi64(f,_mm_set_epi64x(3,0)),idx);
aes256ctr.c:       |        ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
aes256ctr.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:37,
aes256ctr.c:                  from aes256ctr.c:8:
aes256ctr.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/tmmintrin.h:136:1: error: inlining failed in call to 'always_inline' '_mm_shuffle_epi8': target specific option mismatch
aes256ctr.c:   136 | _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
aes256ctr.c:       | ^~~~~~~~~~~~~~~~
aes256ctr.c: aes256ctr.c:22:8: note: called from here
aes256ctr.c:    22 |   f2 = _mm_shuffle_epi8(_mm_add_epi64(f,_mm_set_epi64x(2,0)),idx);
aes256ctr.c:       |        ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
aes256ctr.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:37,
aes256ctr.c:                  from aes256ctr.c:8:
aes256ctr.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/tmmintrin.h:136:1: error: inlining failed in call to 'always_inline' '_mm_shuffle_epi8': target specific option mismatch
aes256ctr.c:   136 | _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
aes256ctr.c:       | ^~~~~~~~~~~~~~~~
aes256ctr.c: aes256ctr.c:21:8: note: called from here
aes256ctr.c:    21 |   f1 = _mm_shuffle_epi8(_mm_add_epi64(f,_mm_set_epi64x(1,0)),idx);
aes256ctr.c:       |        ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
aes256ctr.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
avx2gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
avx2gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
avx2gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
avx2gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)

Compiler output


sampler.c: sampler.c:229:43: warning: variable 'cnt' set but not used [-Wunused-but-set-variable]
sampler.c:     size_t bytecnt = buflen, coefcnt = 0, cnt = 0;
sampler.c:                                           ^
sampler.c: 1 warning generated.

Number of similar (implementation,compiler) pairs: 5, namely:
ImplementationCompiler
refclang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
refclang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
refclang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
refclang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
refclang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

Namespace violations


decompose.o cryptolab_haetae5_decompose_hint T
decompose.o cryptolab_haetae5_decompose_vk T
decompose.o cryptolab_haetae5_decompose_z1 T
encoding.o cryptolab_haetae5_decode_h T
encoding.o cryptolab_haetae5_decode_hb_z1 T
encoding.o cryptolab_haetae5_encode_h T
encoding.o cryptolab_haetae5_encode_hb_z1 T
fft.o brv8 R
fft.o complex_fp_sqabs T
fft.o fft T
fft.o fft_init_and_bitrev T
fips202.o haetae_fips202_KeccakF_RoundConstants R
fips202.o haetae_fips202_sha3_256 T
fips202.o haetae_fips202_sha3_512 T
fips202.o haetae_fips202_shake128 T
fips202.o haetae_fips202_shake128_absorb T
fips202.o haetae_fips202_shake128_absorb_once T
fips202.o haetae_fips202_shake128_finalize T
fips202.o haetae_fips202_shake128_init T
fips202.o haetae_fips202_shake128_squeeze T
fips202.o haetae_fips202_shake128_squeezeblocks T
fips202.o haetae_fips202_shake256 T
fips202.o haetae_fips202_shake256_absorb T
fips202.o haetae_fips202_shake256_absorb_once T
fips202.o haetae_fips202_shake256_finalize T
fips202.o haetae_fips202_shake256_init T
fips202.o haetae_fips202_shake256_squeeze T
fips202.o haetae_fips202_shake256_squeezeblocks T
fixpoint.o cryptolab_haetae5_fixpoint_add T
fixpoint.o cryptolab_haetae5_fixpoint_mul_rnd13 T
fixpoint.o cryptolab_haetae5_fixpoint_newton_invsqrt T
fixpoint.o cryptolab_haetae5_fixpoint_square T
fixpoint.o start_cube R
fixpoint.o start_times_threehalves R
ntt.o cryptolab_haetae5_invntt_tomont T
ntt.o cryptolab_haetae5_ntt T
packing.o cryptolab_haetae5_pack_pk T
packing.o cryptolab_haetae5_pack_sig T
packing.o cryptolab_haetae5_pack_sk T
packing.o cryptolab_haetae5_unpack_pk T
packing.o cryptolab_haetae5_unpack_sig T
packing.o cryptolab_haetae5_unpack_sk T
poly.o cryptolab_haetae5_poly2eta_pack T
poly.o cryptolab_haetae5_poly2eta_unpack T
poly.o cryptolab_haetae5_poly_add T
poly.o cryptolab_haetae5_poly_challenge T
poly.o cryptolab_haetae5_poly_compose T
poly.o cryptolab_haetae5_poly_decomposed_pack T
poly.o cryptolab_haetae5_poly_decomposed_unpack T
poly.o cryptolab_haetae5_poly_freeze T
poly.o cryptolab_haetae5_poly_freeze2q T
poly.o cryptolab_haetae5_poly_fromcrt T
poly.o cryptolab_haetae5_poly_fromcrt0 T
poly.o cryptolab_haetae5_poly_highbits T
poly.o cryptolab_haetae5_poly_invntt_tomont T
poly.o cryptolab_haetae5_poly_lowbits T
poly.o cryptolab_haetae5_poly_lsb T
poly.o cryptolab_haetae5_poly_ntt T
poly.o cryptolab_haetae5_poly_pack_highbits T
poly.o cryptolab_haetae5_poly_pack_lsb T
poly.o cryptolab_haetae5_poly_pointwise_montgomery T
poly.o cryptolab_haetae5_poly_reduce2q T
poly.o cryptolab_haetae5_poly_sub T
poly.o cryptolab_haetae5_poly_uniform T
poly.o cryptolab_haetae5_poly_uniform_eta T
poly.o cryptolab_haetae5_polyeta_pack T
poly.o cryptolab_haetae5_polyeta_unpack T
poly.o cryptolab_haetae5_polyq_pack T
poly.o cryptolab_haetae5_polyq_unpack T
poly.o hammingWeight_8 T
polyfix.o cryptolab_haetae5_polyfix_add T
polyfix.o cryptolab_haetae5_polyfix_round T
polyfix.o cryptolab_haetae5_polyfixfixveck_sub T
polyfix.o cryptolab_haetae5_polyfixfixvecl_sub T
polyfix.o cryptolab_haetae5_polyfixveck_add T
polyfix.o cryptolab_haetae5_polyfixveck_double T
polyfix.o cryptolab_haetae5_polyfixveck_round T
polyfix.o cryptolab_haetae5_polyfixvecl_add T
polyfix.o cryptolab_haetae5_polyfixvecl_double T
polyfix.o cryptolab_haetae5_polyfixvecl_round T
polyfix.o cryptolab_haetae5_polyfixveclk_sample_hyperball T
polyfix.o cryptolab_haetae5_polyfixveclk_sqnorm2 T
polyfix.o fix_round T
polyfix.o polyfixfix_sub T
polymat.o cryptolab_haetae5_polymatkl_double T
polymat.o cryptolab_haetae5_polymatkl_expand T
polymat.o cryptolab_haetae5_polymatkl_pointwise_montgomery T
polymat.o cryptolab_haetae5_polymatkm_expand T
polymat.o cryptolab_haetae5_polymatkm_pointwise_montgomery T
polyvec.o cryptolab_haetae5_polyveck_add T
polyvec.o cryptolab_haetae5_polyveck_caddDQ2ALPHA T
polyvec.o cryptolab_haetae5_polyveck_caddq T
polyvec.o cryptolab_haetae5_polyveck_cneg T
polyvec.o cryptolab_haetae5_polyveck_csubDQ2ALPHA T
polyvec.o cryptolab_haetae5_polyveck_decompose_vk T
polyvec.o cryptolab_haetae5_polyveck_div2 T
polyvec.o cryptolab_haetae5_polyveck_double T
polyvec.o cryptolab_haetae5_polyveck_double_negate T
polyvec.o cryptolab_haetae5_polyveck_expand T
polyvec.o cryptolab_haetae5_polyveck_freeze T
polyvec.o cryptolab_haetae5_polyveck_freeze2q T
polyvec.o cryptolab_haetae5_polyveck_frommont T
polyvec.o cryptolab_haetae5_polyveck_highbits_hint T
polyvec.o cryptolab_haetae5_polyveck_invntt_tomont T
polyvec.o cryptolab_haetae5_polyveck_mul_alpha T
polyvec.o cryptolab_haetae5_polyveck_ntt T
polyvec.o cryptolab_haetae5_polyveck_pack_highbits T
polyvec.o cryptolab_haetae5_polyveck_poly_fromcrt T
polyvec.o cryptolab_haetae5_polyveck_poly_pointwise_montgomery T
polyvec.o cryptolab_haetae5_polyveck_reduce2q T
polyvec.o cryptolab_haetae5_polyveck_sqnorm2 T
polyvec.o cryptolab_haetae5_polyveck_sub T
polyvec.o cryptolab_haetae5_polyvecl_cneg T
polyvec.o cryptolab_haetae5_polyvecl_highbits T
polyvec.o cryptolab_haetae5_polyvecl_lowbits T
polyvec.o cryptolab_haetae5_polyvecl_ntt T
polyvec.o cryptolab_haetae5_polyvecl_pointwise_acc_montgomery T
polyvec.o cryptolab_haetae5_polyvecl_sqnorm2 T
polyvec.o cryptolab_haetae5_polyvecm_ntt T
polyvec.o cryptolab_haetae5_polyvecm_pointwise_acc_montgomery T
polyvec.o cryptolab_haetae5_polyvecmk_sqsing_value T
polyvec.o cryptolab_haetae5_polyvecmk_uniform_eta T
reduce.o cryptolab_haetae5_caddq T
reduce.o cryptolab_haetae5_freeze T
reduce.o cryptolab_haetae5_freeze2q T
reduce.o cryptolab_haetae5_montgomery_reduce T
reduce.o cryptolab_haetae5_reduce32_2q T
sampler.o cryptolab_haetae5_rej_eta T
sampler.o cryptolab_haetae5_rej_uniform T
sampler.o cryptolab_haetae5_sample_gauss_N T
sampler.o sample_gauss T
sign.o cryptolab_haetae5_signature T
sign.o cryptolab_haetae5_verify T
symmetric-shake.o cryptolab_haetae5_haetae_shake128_stream_init T
symmetric-shake.o cryptolab_haetae5_haetae_shake256_absorb_twice T
symmetric-shake.o cryptolab_haetae5_haetae_shake256_stream_init T

Number of similar (implementation,compiler) pairs: 9, namely:
ImplementationCompiler
refclang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
refclang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
refclang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
refclang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
refclang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
refgcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
refgcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
refgcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
refgcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)

Passed TIMECOP


TIMECOP iterations: 10

Number of similar (implementation,compiler) pairs: 9, namely:
ImplementationCompiler
refclang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
refclang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
refclang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
refclang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
refclang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
refgcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
refgcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
refgcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
refgcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)