Test results for amd64, saber214, crypto_core/weightsntrup1277
[Page version: 20241023 11:56:08]
Measurements for amd64, saber214, crypto_core
Test results for amd64, saber214, crypto_core
Test results for crypto_core/weightsntrup1277
Computer: saber214
Microarchitecture: amd64; Bulldozer (600f20)
Architecture: amd64
CPU ID: AuthenticAMD-00600f20-1789c3f5
SUPERCOP version: 20241022
Operation: crypto_core
Primitive: weightsntrup1277
Time | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version |
3847 | 108 0 0 | 12388 816 760 | ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20241013 | 20241011 |
3848 | 108 0 0 | 14780 816 776 | ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20241013 | 20241011 |
3859 | 192 0 0 | 11789 768 832 | ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20241013 | 20241011 |
3878 | 190 0 0 | 11934 776 832 | ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20241013 | 20241011 |
3878 | 190 0 0 | 13398 776 832 | ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20241013 | 20241011 |
3972 | 109 0 0 | 13516 816 760 | ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20241013 | 20241011 |
3981 | 108 0 0 | 11132 816 760 | ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20241013 | 20241011 |
5083 | 94 0 0 | 10742 808 760 | ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20241013 | 20241011 |
5087 | 174 0 0 | 10601 752 800 | ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20241013 | 20241011 |
Compiler output
weight.c: weight.c:26:11: error: always_inline function '_mm256_add_epi8' requires target feature 'avx2', but would be inlined into function 'crypto_core_weightsntrup1277_avx_constbranchindex' that is compiled without support for 'avx2'
weight.c: sum = _mm256_add_epi8(sum,bits);
weight.c: ^
weight.c: weight.c:31:11: error: always_inline function '_mm256_srli_epi16' requires target feature 'avx2', but would be inlined into function 'crypto_core_weightsntrup1277_avx_constbranchindex' that is compiled without support for 'avx2'
weight.c: sumhi = _mm256_srli_epi16(sum,8);
weight.c: ^
weight.c: weight.c:33:9: error: always_inline function '_mm256_add_epi16' requires target feature 'avx2', but would be inlined into function 'crypto_core_weightsntrup1277_avx_constbranchindex' that is compiled without support for 'avx2'
weight.c: sum = _mm256_add_epi16(sum,sumhi);
weight.c: ^
weight.c: weight.c:36:9: error: always_inline function '_mm256_hadd_epi16' requires target feature 'avx2', but would be inlined into function 'crypto_core_weightsntrup1277_avx_constbranchindex' that is compiled without support for 'avx2'
weight.c: sum = _mm256_hadd_epi16(sum,sum);
weight.c: ^
weight.c: weight.c:38:9: error: always_inline function '_mm256_hadd_epi16' requires target feature 'avx2', but would be inlined into function 'crypto_core_weightsntrup1277_avx_constbranchindex' that is compiled without support for 'avx2'
weight.c: sum = _mm256_hadd_epi16(sum,sum);
weight.c: ^
weight.c: weight.c:40:9: error: always_inline function '_mm256_hadd_epi16' requires target feature 'avx2', but would be inlined into function 'crypto_core_weightsntrup1277_avx_constbranchindex' that is compiled without support for 'avx2'
weight.c: sum = _mm256_hadd_epi16(sum,sum);
weight.c: ^
weight.c: 6 errors generated.
Number of similar (implementation,compiler) pairs: 4, namely:
Implementation | Compiler |
avx | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
avx | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
avx | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
avx | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
Compiler output
weight.c: weight.c:20:9: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'crypto_core_weightsntrup1277_avx_constbranchindex' that is compiled without support for 'avx'
weight.c: sum = _mm256_loadu_si256((__m256i *) (in+p-32));
weight.c: ^
weight.c: weight.c:20:9: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
weight.c: weight.c:21:10: error: always_inline function '_mm256_set_epi8' requires target feature 'avx', but would be inlined into function 'crypto_core_weightsntrup1277_avx_constbranchindex' that is compiled without support for 'avx'
weight.c: sum &= endingmask;
weight.c: ^
weight.c: ./params.h:2:20: note: expanded from macro 'endingmask'
weight.c: #define endingmask _mm256_set_epi8(1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,0,0,0)
weight.c: ^
weight.c: weight.c:21:10: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
weight.c: ./params.h:2:20: note: expanded from macro 'endingmask'
weight.c: #define endingmask _mm256_set_epi8(1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,0,0,0)
weight.c: ^
weight.c: weight.c:24:20: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'crypto_core_weightsntrup1277_avx_constbranchindex' that is compiled without support for 'avx'
weight.c: __m256i bits = _mm256_loadu_si256((__m256i *) in);
weight.c: ^
weight.c: weight.c:24:20: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
weight.c: weight.c:25:13: error: always_inline function '_mm256_set1_epi8' requires target feature 'avx', but would be inlined into function 'crypto_core_weightsntrup1277_avx_constbranchindex' that is compiled without support for 'avx'
weight.c: bits &= _mm256_set1_epi8(1);
weight.c: ^
weight.c: weight.c:25:13: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
weight.c: weight.c:26:11: error: always_inline function '_mm256_add_epi8' requires target feature 'avx2', but would be inlined into function 'crypto_core_weightsntrup1277_avx_constbranchindex' that is compiled without support for 'avx2'
weight.c: sum = _mm256_add_epi8(sum,bits);
weight.c: ^
weight.c: ...
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
avx | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
Compiler output
weight.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:47,
weight.c: from weight.c:1:
weight.c: weight.c: In function 'crypto_core_weightsntrup1277_avx_constbranchindex':
weight.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/avx2intrin.h:105:1: error: inlining failed in call to 'always_inline' '_mm256_add_epi8': target specific option mismatch
weight.c: 105 | _mm256_add_epi8 (__m256i __A, __m256i __B)
weight.c: | ^~~~~~~~~~~~~~~
weight.c: weight.c:26:11: note: called from here
weight.c: 26 | sum = _mm256_add_epi8(sum,bits);
weight.c: | ^~~~~~~~~~~~~~~~~~~~~~~~~
weight.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:47,
weight.c: from weight.c:1:
weight.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/avx2intrin.h:105:1: error: inlining failed in call to 'always_inline' '_mm256_add_epi8': target specific option mismatch
weight.c: 105 | _mm256_add_epi8 (__m256i __A, __m256i __B)
weight.c: | ^~~~~~~~~~~~~~~
weight.c: weight.c:26:11: note: called from here
weight.c: 26 | sum = _mm256_add_epi8(sum,bits);
weight.c: | ^~~~~~~~~~~~~~~~~~~~~~~~~
weight.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:47,
weight.c: from weight.c:1:
weight.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/avx2intrin.h:287:1: error: inlining failed in call to 'always_inline' '_mm256_hadd_epi16': target specific option mismatch
weight.c: 287 | _mm256_hadd_epi16 (__m256i __X, __m256i __Y)
weight.c: | ^~~~~~~~~~~~~~~~~
weight.c: weight.c:40:9: note: called from here
weight.c: 40 | sum = _mm256_hadd_epi16(sum,sum);
weight.c: | ^~~~~~~~~~~~~~~~~~~~~~~~~~
weight.c: ...
Number of similar (implementation,compiler) pairs: 4, namely:
Implementation | Compiler |
avx | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |
avx | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |
avx | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |
avx | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |
TIMECOP error (can be valgrind bug)
error 111
Process terminating with default action of signal 4 (SIGILL)
Illegal opcode at address 0x109FBD
at 0x...: salsa20.part.0 (try-anything.c:102)
by 0x...: salsa20 (try-anything.c:85)
by 0x...: canary (try-anything.c:148)
by 0x...: output_prepare (try-anything.c:178)
by 0x...: test (try.c:99)
by 0x...: main (try-anything.c:345)
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |
TIMECOP error (can be valgrind bug)
error 111
Process terminating with default action of signal 4 (SIGILL)
Illegal opcode at address 0x109C0A
at 0x...: core (try-anything.c:53)
by 0x...: salsa20.part.0 (try-anything.c:101)
by 0x...: salsa20 (try-anything.c:85)
by 0x...: canary (try-anything.c:148)
by 0x...: output_prepare (try-anything.c:178)
by 0x...: test (try.c:99)
by 0x...: main (try-anything.c:345)
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |
TIMECOP error (can be valgrind bug)
error 111
Process terminating with default action of signal 4 (SIGILL)
Illegal opcode at address 0x109804
at 0x...: memcpy (string_fortified.h:29)
by 0x...: test (try.c:149)
by 0x...: main (try-anything.c:345)
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |
Passed TIMECOP
TIMECOP iterations: 1
Number of similar (implementation,compiler) pairs: 6, namely:
Implementation | Compiler |
ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |