Test results for amd64, phoenix, crypto_core/mult3sntrup761
[Page version: 20250923 21:27:52]
Measurements for amd64, phoenix, crypto_core
Test results for amd64, phoenix, crypto_core
Test results for crypto_core/mult3sntrup761
Computer: phoenix
Microarchitecture: amd64; Zen 4 (a70f41)
Architecture: amd64
CPU ID: AuthenticAMD-00a70f41-178bfbff 
SUPERCOP version: 20250922
Operation: crypto_core
Primitive: mult3sntrup761
| Time | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version | 
|---|
| 4196 | 36741 0 0 | 52356 820 1032 | avx2unsigned | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 4379 | 34549 0 0 | 50100 820 1032 | avx2unsigned | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 4589 | 23338 0 0 | 35478 812 1032 | avx2unsigned | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 4710 | 43399 0 0 | 57180 780 1064 | avx2unsigned | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 5985 | 23784 0 0 | 34749 820 936 | avx2unsigned | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 6468 | 15869 0 0 | 27964 780 1064 | avx2unsigned | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 6670 | 16757 0 0 | 28699 772 1000 | avx2unsigned | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 6804 | 12093 0 0 | 22639 756 968 | avx2unsigned | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 7587 | 15315 0 0 | 31324 820 1032 | avx800 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 7602 | 20855 0 0 | 34204 820 1032 | avx | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 7683 | 16217 0 0 | 32284 820 1032 | avx800 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 7722 | 18616 0 0 | 28806 812 1032 | avx | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 7729 | 21773 0 0 | 35180 820 1032 | avx | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 7757 | 13757 0 0 | 26262 812 1032 | avx800 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 8323 | 19914 0 0 | 28789 820 936 | avx | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 8389 | 13958 0 0 | 25469 820 936 | avx800 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 8693 | 13446 0 0 | 25339 772 1000 | avx800 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 8791 | 17084 0 0 | 30940 780 1064 | round2 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 9169 | 16569 0 0 | 28435 772 1000 | avx | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 9309 | 16165 0 0 | 30828 820 1032 | round2 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 9339 | 17067 0 0 | 31788 820 1032 | round2 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 9480 | 13292 0 0 | 25420 780 1064 | round2 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 9509 | 11966 0 0 | 23878 812 1032 | round2 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 9573 | 10789 0 0 | 22747 772 1000 | round2 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 10108 | 12936 0 0 | 23581 820 936 | round2 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 10198 | 11003 0 0 | 21535 756 968 | round2 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 12516 | 15269 0 0 | 29068 780 1064 | avx800 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 12882 | 18158 0 0 | 31972 780 1064 | avx | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 13211 | 13029 0 0 | 23535 756 968 | avx800 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 13248 | 13445 0 0 | 25500 780 1064 | avx800 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 13526 | 15454 0 0 | 25959 756 968 | avx | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 13597 | 16334 0 0 | 28372 780 1064 | avx | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 22088 | 13822 0 0 | 30044 820 1032 | round1 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 22119 | 13356 0 0 | 29524 820 1032 | round1 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 22457 | 13120 0 0 | 26948 780 1064 | round1 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 22978 | 8537 0 0 | 21206 812 1032 | round1 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 23919 | 9288 0 0 | 21300 780 1064 | round1 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 27102 | 7857 0 0 | 18391 756 968 | round1 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 27197 | 9144 0 0 | 20669 820 936 | round1 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 27725 | 8835 0 0 | 20804 780 1000 | round1 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 60273 | 4173 0 0 | 20284 820 1032 | compact | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 63453 | 4173 0 0 | 20332 820 1032 | compact | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 79295 | 2068 0 0 | 15868 780 1064 | compact | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 93225 | 2734 0 0 | 18876 820 1032 | ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 93583 | 2734 0 0 | 18828 820 1032 | ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 95979 | 4658 0 0 | 18420 780 1064 | ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 106756 | 1467 0 0 | 19437 820 936 | compact | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 165527 | 5336 0 0 | 19092 780 1064 | 32 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 168904 | 5281 0 0 | 21452 820 1032 | 32 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 169058 | 3522 0 0 | 19644 820 1032 | 32 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 169819 | 2404 0 0 | 14452 780 1064 | 32 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 183274 | 1173 0 0 | 13003 772 1000 | 32 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 192283 | 1220 0 0 | 13838 812 1032 | compact | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 198790 | 2613 0 0 | 20589 820 936 | 32 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 201298 | 820 0 0 | 11303 756 968 | 32 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 202492 | 2375 0 0 | 14966 812 1032 | 32 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 204394 | 862 0 0 | 12333 820 936 | 32 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 248953 | 2686 0 0 | 20661 820 936 | ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 286986 | 2509 0 0 | 15142 812 1032 | ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 1031397 | 1043 0 0 | 13020 780 1064 | ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 1071077 | 433 0 0 | 12436 780 1064 | compact | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 1087170 | 412 0 0 | 11877 820 936 | compact | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 1419770 | 569 0 0 | 12411 772 1000 | ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 1434865 | 332 0 0 | 10815 756 968 | compact | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 1460314 | 563 0 0 | 12029 820 936 | ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 1825626 | 494 0 0 | 10975 756 968 | ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
| 2903870 | 428 0 0 | 12267 772 1000 | compact | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250814 | 20250415 | 
Compiler output
mult768.c: mult768.c:210:7: error: always_inline function '_mm256_set1_epi16' requires target feature 'avx', but would be inlined into function 'crypto_core_mult3sntrup761_avx_constbranchindex' that is compiled without support for 'avx'
mult768.c:   210 |   x = const_x16(0);
mult768.c:       |       ^
mult768.c: mult768.c:12:19: note: expanded from macro 'const_x16'
mult768.c:    12 | #define const_x16 _mm256_set1_epi16
mult768.c:       |                   ^
mult768.c: mult768.c:210:7: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
mult768.c: mult768.c:12:19: note: expanded from macro 'const_x16'
mult768.c:    12 | #define const_x16 _mm256_set1_epi16
mult768.c:       |                   ^
mult768.c: mult768.c:211:35: error: always_inline function '_mm256_storeu_si256' requires target feature 'avx', but would be inlined into function 'crypto_core_mult3sntrup761_avx_constbranchindex' that is compiled without support for 'avx'
mult768.c:   211 |   for (i = p&~15;i < 768;i += 16) store_x16(&f[i],x);
mult768.c:       |                                   ^
mult768.c: mult768.c:11:24: note: expanded from macro 'store_x16'
mult768.c:    11 | #define store_x16(p,v) _mm256_storeu_si256((int16x16 *) (p),(v))
mult768.c:       |                        ^
mult768.c: mult768.c:211:35: error: AVX vector argument of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
mult768.c: mult768.c:11:24: note: expanded from macro 'store_x16'
mult768.c:    11 | #define store_x16(p,v) _mm256_storeu_si256((int16x16 *) (p),(v))
mult768.c:       |                        ^
mult768.c: mult768.c:212:35: error: always_inline function '_mm256_storeu_si256' requires target feature 'avx', but would be inlined into function 'crypto_core_mult3sntrup761_avx_constbranchindex' that is compiled without support for 'avx'
mult768.c:   212 |   for (i = p&~15;i < 768;i += 16) store_x16(&g[i],x);
mult768.c:       |                                   ^
mult768.c: mult768.c:11:24: note: expanded from macro 'store_x16'
mult768.c:    11 | #define store_x16(p,v) _mm256_storeu_si256((int16x16 *) (p),(v))
mult768.c: ...
Number of similar (implementation,compiler) pairs: 1, namely:
| Implementation | Compiler | 
| avx | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
Compiler output
mult3_32x32.c: mult3_32x32.c:220:6: warning: unused function 'cvt_2vecs_to_vecx2_4' [-Wunused-function]
mult3_32x32.c:   220 | void cvt_2vecs_to_vecx2_4( __m256i * r , const __m256i * v0 , const __m256i * v1 )
mult3_32x32.c:       |      ^~~~~~~~~~~~~~~~~~~~
mult3_32x32.c: 1 warning generated.
Number of similar (implementation,compiler) pairs: 4, namely:
| Implementation | Compiler | 
| avx2unsigned | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| avx2unsigned | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| avx2unsigned | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| avx2unsigned | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
Compiler output
mult.c: mult.c:48:59: error: always_inline function '_mm256_load_si256' requires target feature 'avx', but would be inlined into function 'crypto_core_mult3sntrup761_avx2unsigned_constbranchindex' that is compiled without support for 'avx'
mult.c:    48 |     _mm256_store_si256( (__m256i*)(a+i) , cvt_to_unsigned(_mm256_load_si256((__m256i*)(a+i))) );
mult.c:       |                                                           ^
mult.c: mult.c:48:59: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
mult.c: mult.c:48:43: warning: AVX vector argument of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI [-Wpsabi]
mult.c:    48 |     _mm256_store_si256( (__m256i*)(a+i) , cvt_to_unsigned(_mm256_load_si256((__m256i*)(a+i))) );
mult.c:       |                                           ^
mult.c: mult.c:48:5: error: always_inline function '_mm256_store_si256' requires target feature 'avx', but would be inlined into function 'crypto_core_mult3sntrup761_avx2unsigned_constbranchindex' that is compiled without support for 'avx'
mult.c:    48 |     _mm256_store_si256( (__m256i*)(a+i) , cvt_to_unsigned(_mm256_load_si256((__m256i*)(a+i))) );
mult.c:       |     ^
mult.c: mult.c:48:5: error: AVX vector argument of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
mult.c: mult.c:51:59: error: always_inline function '_mm256_load_si256' requires target feature 'avx', but would be inlined into function 'crypto_core_mult3sntrup761_avx2unsigned_constbranchindex' that is compiled without support for 'avx'
mult.c:    51 |     _mm256_store_si256( (__m256i*)(b+i) , cvt_to_unsigned(_mm256_load_si256((__m256i*)(b+i))) );
mult.c:       |                                                           ^
mult.c: mult.c:51:59: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
mult.c: mult.c:51:43: warning: AVX vector argument of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI [-Wpsabi]
mult.c:    51 |     _mm256_store_si256( (__m256i*)(b+i) , cvt_to_unsigned(_mm256_load_si256((__m256i*)(b+i))) );
mult.c:       |                                           ^
mult.c: mult.c:51:5: error: always_inline function '_mm256_store_si256' requires target feature 'avx', but would be inlined into function 'crypto_core_mult3sntrup761_avx2unsigned_constbranchindex' that is compiled without support for 'avx'
mult.c:    51 |     _mm256_store_si256( (__m256i*)(b+i) , cvt_to_unsigned(_mm256_load_si256((__m256i*)(b+i))) );
mult.c:       |     ^
mult.c: mult.c:51:5: error: AVX vector argument of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
mult.c: mult.c:64:54: error: always_inline function '_mm256_load_si256' requires target feature 'avx', but would be inlined into function 'crypto_core_mult3sntrup761_avx2unsigned_constbranchindex' that is compiled without support for 'avx'
mult.c:    64 |     _mm256_store_si256( (__m256i*)(c+i) , cvt_to_int(_mm256_load_si256((__m256i*)(c+i))) );
mult.c:       |                                                      ^
mult.c: ...
Number of similar (implementation,compiler) pairs: 1, namely:
| Implementation | Compiler | 
| avx2unsigned | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
Compiler output
mult768.c: mult768.c:210:7: error: always_inline function '_mm256_set1_epi16' requires target feature 'avx', but would be inlined into function 'crypto_core_mult3sntrup761_avx800_constbranchindex' that is compiled without support for 'avx'
mult768.c:   210 |   x = const_x16(0);
mult768.c:       |       ^
mult768.c: mult768.c:12:19: note: expanded from macro 'const_x16'
mult768.c:    12 | #define const_x16 _mm256_set1_epi16
mult768.c:       |                   ^
mult768.c: mult768.c:210:7: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
mult768.c: mult768.c:12:19: note: expanded from macro 'const_x16'
mult768.c:    12 | #define const_x16 _mm256_set1_epi16
mult768.c:       |                   ^
mult768.c: mult768.c:211:35: error: always_inline function '_mm256_storeu_si256' requires target feature 'avx', but would be inlined into function 'crypto_core_mult3sntrup761_avx800_constbranchindex' that is compiled without support for 'avx'
mult768.c:   211 |   for (i = p&~15;i < 768;i += 16) store_x16(&f[i],x);
mult768.c:       |                                   ^
mult768.c: mult768.c:11:24: note: expanded from macro 'store_x16'
mult768.c:    11 | #define store_x16(p,v) _mm256_storeu_si256((int16x16 *) (p),(v))
mult768.c:       |                        ^
mult768.c: mult768.c:211:35: error: AVX vector argument of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
mult768.c: mult768.c:11:24: note: expanded from macro 'store_x16'
mult768.c:    11 | #define store_x16(p,v) _mm256_storeu_si256((int16x16 *) (p),(v))
mult768.c:       |                        ^
mult768.c: mult768.c:212:35: error: always_inline function '_mm256_storeu_si256' requires target feature 'avx', but would be inlined into function 'crypto_core_mult3sntrup761_avx800_constbranchindex' that is compiled without support for 'avx'
mult768.c:   212 |   for (i = p&~15;i < 768;i += 16) store_x16(&g[i],x);
mult768.c:       |                                   ^
mult768.c: mult768.c:11:24: note: expanded from macro 'store_x16'
mult768.c:    11 | #define store_x16(p,v) _mm256_storeu_si256((int16x16 *) (p),(v))
mult768.c: ...
Number of similar (implementation,compiler) pairs: 1, namely:
| Implementation | Compiler | 
| avx800 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
Compiler output
mult.c: mult.c:147:22: error: invalid output size for constraint '=&x'
mult.c:   147 |   MULSTEP_fromzero(0,h0,h1,h2,h3,h4)
mult.c:       |                      ^
mult.c: mult.c:149:26: error: invalid output size for constraint '+x'
mult.c:   149 |     MULSTEP_noload(j + 1,h1,h2,h3,h4,h0)
mult.c:       |                          ^
mult.c: mult.c:150:26: error: invalid output size for constraint '+x'
mult.c:   150 |     MULSTEP_noload(j + 2,h2,h3,h4,h0,h1)
mult.c:       |                          ^
mult.c: mult.c:151:26: error: invalid output size for constraint '+x'
mult.c:   151 |     MULSTEP_noload(j + 3,h3,h4,h0,h1,h2)
mult.c:       |                          ^
mult.c: mult.c:152:26: error: invalid output size for constraint '+x'
mult.c:   152 |     MULSTEP_noload(j + 4,h4,h0,h1,h2,h3)
mult.c:       |                          ^
mult.c: mult.c:153:26: error: invalid output size for constraint '+x'
mult.c:   153 |     MULSTEP_noload(j + 5,h0,h1,h2,h3,h4)
mult.c:       |                          ^
mult.c: mult.c:155:24: error: invalid output size for constraint '+x'
mult.c:   155 |   MULSTEP_noload(j + 1,h1,h2,h3,h4,h0)
mult.c:       |                        ^
mult.c: mult.c:156:24: error: invalid output size for constraint '+x'
mult.c:   156 |   MULSTEP_noload(j + 2,h2,h3,h4,h0,h1)
mult.c:       |                        ^
mult.c: mult.c:157:24: error: invalid output size for constraint '+x'
mult.c: ...
Number of similar (implementation,compiler) pairs: 1, namely:
| Implementation | Compiler | 
| round1 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
Compiler output
mult768.c: mult768.c:210:7: error: always_inline function '_mm256_set1_epi16' requires target feature 'avx', but would be inlined into function 'crypto_core_mult3sntrup761_round2_constbranchindex' that is compiled without support for 'avx'
mult768.c:   210 |   x = const_x16(0);
mult768.c:       |       ^
mult768.c: mult768.c:11:19: note: expanded from macro 'const_x16'
mult768.c:    11 | #define const_x16 _mm256_set1_epi16
mult768.c:       |                   ^
mult768.c: mult768.c:210:7: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
mult768.c: mult768.c:11:19: note: expanded from macro 'const_x16'
mult768.c:    11 | #define const_x16 _mm256_set1_epi16
mult768.c:       |                   ^
mult768.c: mult768.c:211:35: error: always_inline function '_mm256_storeu_si256' requires target feature 'avx', but would be inlined into function 'crypto_core_mult3sntrup761_round2_constbranchindex' that is compiled without support for 'avx'
mult768.c:   211 |   for (i = p&~15;i < 768;i += 16) store_x16(&f[i],x);
mult768.c:       |                                   ^
mult768.c: mult768.c:10:24: note: expanded from macro 'store_x16'
mult768.c:    10 | #define store_x16(p,v) _mm256_storeu_si256((int16x16 *) (p),(v))
mult768.c:       |                        ^
mult768.c: mult768.c:211:35: error: AVX vector argument of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
mult768.c: mult768.c:10:24: note: expanded from macro 'store_x16'
mult768.c:    10 | #define store_x16(p,v) _mm256_storeu_si256((int16x16 *) (p),(v))
mult768.c:       |                        ^
mult768.c: mult768.c:212:35: error: always_inline function '_mm256_storeu_si256' requires target feature 'avx', but would be inlined into function 'crypto_core_mult3sntrup761_round2_constbranchindex' that is compiled without support for 'avx'
mult768.c:   212 |   for (i = p&~15;i < 768;i += 16) store_x16(&g[i],x);
mult768.c:       |                                   ^
mult768.c: mult768.c:10:24: note: expanded from macro 'store_x16'
mult768.c:    10 | #define store_x16(p,v) _mm256_storeu_si256((int16x16 *) (p),(v))
mult768.c: ...
Number of similar (implementation,compiler) pairs: 1, namely:
| Implementation | Compiler | 
| round2 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
TIMECOP error (can be valgrind bug)
error 132
Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10A46A
   at 0x...: core (try-anything.c:61)
   by 0x...: salsa20 (try-anything.c:89)
   by 0x...: canary (try-anything.c:148)
   by 0x...: output_prepare (try-anything.c:178)
   by 0x...: test (try.c:99)
   by 0x...: main (try-anything.c:345)
Illegal instruction
Number of similar (implementation,compiler) pairs: 8, namely:
| Implementation | Compiler | 
| 32 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| avx | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| avx2unsigned | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| avx800 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| compact | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| round1 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| round2 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
TIMECOP error (can be valgrind bug)
error 132
Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10AF44
   at 0x...: salsa20 (try-anything.c:90)
   by 0x...: canary (try-anything.c:148)
   by 0x...: output_prepare (try-anything.c:178)
   by 0x...: test (try.c:99)
   by 0x...: main (try-anything.c:345)
Illegal instruction
Number of similar (implementation,compiler) pairs: 8, namely:
| Implementation | Compiler | 
| 32 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| avx | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| avx2unsigned | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| avx800 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| compact | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| round1 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| round2 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
TIMECOP error (can be valgrind bug)
error 132
Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x109EB3
   at 0x...: core (try-anything.c:64)
   by 0x...: salsa20 (try-anything.c:89)
   by 0x...: canary (try-anything.c:148)
   by 0x...: output_prepare (try-anything.c:178)
   by 0x...: test (try.c:99)
   by 0x...: main (try-anything.c:345)
Illegal instruction
Number of similar (implementation,compiler) pairs: 8, namely:
| Implementation | Compiler | 
| 32 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| avx | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| avx2unsigned | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| avx800 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| compact | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| round1 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| round2 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
TIMECOP error (can be valgrind bug)
error 132
Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x109D3A
   at 0x...: core (try-anything.c:64)
   by 0x...: salsa20 (try-anything.c:89)
   by 0x...: canary (try-anything.c:148)
   by 0x...: output_prepare (try-anything.c:178)
   by 0x...: test (try.c:99)
   by 0x...: main (try-anything.c:345)
Illegal instruction
Number of similar (implementation,compiler) pairs: 8, namely:
| Implementation | Compiler | 
| 32 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| avx | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| avx2unsigned | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| avx800 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| compact | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| round1 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| round2 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
TIMECOP error (can be valgrind bug)
error 132
Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x109EF1
   at 0x...: salsa20.part.0 (try-anything.c:88)
   by 0x...: salsa20 (try-anything.c:85)
   by 0x...: canary (try-anything.c:148)
   by 0x...: output_prepare (try-anything.c:178)
   by 0x...: test (try.c:99)
   by 0x...: main (try-anything.c:345)
Illegal instruction
Number of similar (implementation,compiler) pairs: 8, namely:
| Implementation | Compiler | 
| 32 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| avx | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| avx2unsigned | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| avx800 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| compact | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| round1 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| round2 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
TIMECOP error (can be valgrind bug)
error 132
Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x109EEA
   at 0x...: st32 (try-anything.c:47)
   by 0x...: core (try-anything.c:78)
   by 0x...: salsa20.part.0 (try-anything.c:89)
   by 0x...: salsa20 (try-anything.c:85)
   by 0x...: canary (try-anything.c:148)
   by 0x...: output_prepare (try-anything.c:178)
   by 0x...: test (try.c:99)
   by 0x...: main (try-anything.c:345)
Illegal instruction
Number of similar (implementation,compiler) pairs: 8, namely:
| Implementation | Compiler | 
| 32 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| avx | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| avx2unsigned | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| avx800 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| compact | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| round1 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| round2 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
TIMECOP error (can be valgrind bug)
error 132
Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x109B3C
   at 0x...: core (try-anything.c:64)
   by 0x...: salsa20.part.0 (try-anything.c:89)
   by 0x...: salsa20 (try-anything.c:85)
   by 0x...: canary (try-anything.c:148)
   by 0x...: output_prepare (try-anything.c:178)
   by 0x...: test (try.c:99)
   by 0x...: main (try-anything.c:345)
Illegal instruction
Number of similar (implementation,compiler) pairs: 8, namely:
| Implementation | Compiler | 
| 32 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| avx | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| avx2unsigned | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| avx800 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| compact | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| round1 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| round2 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
TIMECOP error (can be valgrind bug)
error 132
Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10A59C
   at 0x...: good (mult768.c:52)
   by 0x...: mult768 (mult768.c:149)
   by 0x...: crypto_core_mult3sntrup761_avx_constbranchindex (mult768.c:225)
   by 0x...: test (try.c:106)
   by 0x...: main (try-anything.c:345)
Illegal instruction
Number of similar (implementation,compiler) pairs: 1, namely:
| Implementation | Compiler | 
| avx | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
TIMECOP error (can be valgrind bug)
error 132
Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10A624
   at 0x...: cvt_to_unsigned (const_inline.h:38)
   by 0x...: crypto_core_mult3sntrup761_avx2unsigned_constbranchindex (mult.c:48)
   by 0x...: test (try.c:106)
   by 0x...: main (try-anything.c:345)
Illegal instruction
Number of similar (implementation,compiler) pairs: 1, namely:
| Implementation | Compiler | 
| avx2unsigned | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
TIMECOP error (can be valgrind bug)
error 132
Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10A59C
   at 0x...: good (mult768.c:52)
   by 0x...: mult768 (mult768.c:149)
   by 0x...: crypto_core_mult3sntrup761_avx800_constbranchindex (mult768.c:225)
   by 0x...: test (try.c:106)
   by 0x...: main (try-anything.c:345)
Illegal instruction
Number of similar (implementation,compiler) pairs: 1, namely:
| Implementation | Compiler | 
| avx800 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
TIMECOP error (can be valgrind bug)
error 132
Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10B376
   at 0x...: _mm256_mulhrs_epi16 (avx2intrin.h:533)
   by 0x...: squeezeadd16 (mult.c:114)
   by 0x...: mult768_mix2_m256i (mult.c:468)
   by 0x...: crypto_core_mult3sntrup761_round1_constbranchindex (mult.c:752)
   by 0x...: test (try.c:106)
   by 0x...: main (try-anything.c:345)
Illegal instruction
Number of similar (implementation,compiler) pairs: 1, namely:
| Implementation | Compiler | 
| round1 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
TIMECOP error (can be valgrind bug)
error 132
Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10A59C
   at 0x...: good (mult768.c:51)
   by 0x...: mult768 (mult768.c:147)
   by 0x...: crypto_core_mult3sntrup761_round2_constbranchindex (mult768.c:225)
   by 0x...: test (try.c:106)
   by 0x...: main (try-anything.c:345)
Illegal instruction
Number of similar (implementation,compiler) pairs: 1, namely:
| Implementation | Compiler | 
| round2 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
Passed TIMECOP
TIMECOP iterations: 1
Number of similar (implementation,compiler) pairs: 6, namely:
| Implementation | Compiler | 
| 32 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| 32 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| compact | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| compact | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) |