Test results for amd64, alder, crypto_core/mult3sntrup653
[Page version: 20250924 17:11:48]
Computer: alder
Microarchitecture: amd64; Golden Cove (90675-00)
Architecture: amd64
CPU ID: GenuineIntel-00090675-00-bfebfbff 
SUPERCOP version: 20250922
Operation: crypto_core
Primitive: mult3sntrup653
| Time | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version |
|---|---|---|---|---|---|---|
| 7655 | 15818 0 0 | 32037 820 1032 | avx800 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 7671 | 18208 0 0 | 31877 820 1032 | avx | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 7674 | 18382 0 0 | 32189 820 1032 | avx | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 7710 | 15628 0 0 | 31677 820 1032 | avx800 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 7985 | 16047 0 0 | 30534 820 1032 | round2 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 7987 | 16269 0 0 | 30894 820 1032 | round2 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 8212 | 13215 0 0 | 25542 812 1032 | avx800 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 8293 | 13920 0 0 | 25405 820 936 | avx800 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 8365 | 15134 0 0 | 25414 812 1032 | avx | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 8388 | 16642 0 0 | 25733 820 936 | avx | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 8543 | 12515 0 0 | 22886 820 936 | round2 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 8709 | 11120 0 0 | 22672 812 1032 | round2 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 9155 | 13738 0 0 | 25643 772 1000 | avx800 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 9164 | 16476 0 0 | 30309 780 1000 | round2 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 9425 | 10969 0 0 | 22676 772 1000 | round2 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 9496 | 12941 0 0 | 24749 780 1000 | round2 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 9711 | 10815 0 0 | 21048 756 968 | round2 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 | 
| 9920 | 15049 0 0 | 26931 772 1000 | avx | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 12596 | 15221 0 0 | 29260 780 1000 | avx800 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 12636 | 13104 0 0 | 23567 756 968 | avx800 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 13167 | 16878 0 0 | 30900 780 1000 | avx | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 13181 | 13996 0 0 | 24447 756 968 | avx | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 13197 | 13638 0 0 | 25636 780 1000 | avx800 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 13784 | 15295 0 0 | 27276 780 1000 | avx | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 44248 | 2118 0 0 | 18301 820 1032 | compact | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 44977 | 1589 0 0 | 15612 780 1000 | compact | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 49528 | 5138 0 0 | 21245 820 1032 | compact | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 76246 | 2367 0 0 | 18533 820 1032 | ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 76434 | 3057 0 0 | 19173 820 1032 | ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 87188 | 1834 0 0 | 19765 820 936 | compact | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 106401 | 3968 0 0 | 17940 780 1000 | ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 184197 | 2991 0 0 | 20917 820 936 | ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 542855 | 582 0 0 | 12492 780 1000 | ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 571716 | 563 0 0 | 11997 820 936 | ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 596633 | 372 0 0 | 12774 812 1032 | compact | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 629878 | 412 0 0 | 11845 820 936 | compact | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 635992 | 499 0 0 | 12910 812 1032 | ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 878926 | 443 0 0 | 12380 780 1000 | compact | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 915473 | 570 0 0 | 12427 772 1000 | ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 924145 | 332 0 0 | 10767 756 968 | compact | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 1365447 | 494 0 0 | 10927 756 968 | ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
| 1746080 | 432 0 0 | 12283 772 1000 | compact | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 | 
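
The table above is sorted by the Time column, so the best result for each implementation has to be picked out by eye. A minimal sketch of how rows in this layout could be reduced to the lowest Time per implementation is given below. The helper names `parse_row` and `fastest_per_implementation` are hypothetical and are not part of SUPERCOP; the sample rows are copied verbatim from the table, and the code assumes that no cell contains a `|` character.

```python
# Sketch only: parse_row and fastest_per_implementation are hypothetical
# helpers, not SUPERCOP tooling. Rows are copied from the table above.
ROWS = """\
| 7655 | 15818 0 0 | 32037 820 1032 | avx800 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
| 7671 | 18208 0 0 | 31877 820 1032 | avx | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
| 7985 | 16047 0 0 | 30534 820 1032 | round2 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 |
| 44248 | 2118 0 0 | 18301 820 1032 | compact | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
| 76246 | 2367 0 0 | 18533 820 1032 | ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
| 106401 | 3968 0 0 | 17940 780 1000 | ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
"""


def parse_row(line):
    """Split one pipe-delimited table row into a dict of the fields used here."""
    cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
    time, _object_size, _test_size, impl, compiler, _date, _version = cells
    return {"time": int(time), "impl": impl, "compiler": compiler}


def fastest_per_implementation(text):
    """Return, for each implementation, the row with the smallest Time value."""
    best = {}
    for line in text.splitlines():
        if not line.startswith("|"):
            continue  # skip anything that is not a table row
        row = parse_row(line)
        current = best.get(row["impl"])
        if current is None or row["time"] < current["time"]:
            best[row["impl"]] = row
    return best


if __name__ == "__main__":
    results = fastest_per_implementation(ROWS)
    for impl, row in sorted(results.items(), key=lambda kv: kv[1]["time"]):
        print(impl, row["time"], row["compiler"])
```

Feeding the full table through the same function instead of the six sample rows would give one line per implementation, which makes it easier to compare avx, avx800, round2, compact, and ref directly.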
Compiler output
mult768.c: mult768.c:210:7: error: always_inline function '_mm256_set1_epi16' requires target feature 'avx', but would be inlined into function 'crypto_core_mult3sntrup653_avx_constbranchindex' that is compiled without support for 'avx'
mult768.c:   210 |   x = const_x16(0);
mult768.c:       |       ^
mult768.c: mult768.c:12:19: note: expanded from macro 'const_x16'
mult768.c:    12 | #define const_x16 _mm256_set1_epi16
mult768.c:       |                   ^
mult768.c: mult768.c:210:7: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
mult768.c: mult768.c:12:19: note: expanded from macro 'const_x16'
mult768.c:    12 | #define const_x16 _mm256_set1_epi16
mult768.c:       |                   ^
mult768.c: mult768.c:211:35: error: always_inline function '_mm256_storeu_si256' requires target feature 'avx', but would be inlined into function 'crypto_core_mult3sntrup653_avx_constbranchindex' that is compiled without support for 'avx'
mult768.c:   211 |   for (i = p&~15;i < 768;i += 16) store_x16(&f[i],x);
mult768.c:       |                                   ^
mult768.c: mult768.c:11:24: note: expanded from macro 'store_x16'
mult768.c:    11 | #define store_x16(p,v) _mm256_storeu_si256((int16x16 *) (p),(v))
mult768.c:       |                        ^
mult768.c: mult768.c:211:35: error: AVX vector argument of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
mult768.c: mult768.c:11:24: note: expanded from macro 'store_x16'
mult768.c:    11 | #define store_x16(p,v) _mm256_storeu_si256((int16x16 *) (p),(v))
mult768.c:       |                        ^
mult768.c: mult768.c:212:35: error: always_inline function '_mm256_storeu_si256' requires target feature 'avx', but would be inlined into function 'crypto_core_mult3sntrup653_avx_constbranchindex' that is compiled without support for 'avx'
mult768.c:   212 |   for (i = p&~15;i < 768;i += 16) store_x16(&g[i],x);
mult768.c:       |                                   ^
mult768.c: mult768.c:11:24: note: expanded from macro 'store_x16'
mult768.c:    11 | #define store_x16(p,v) _mm256_storeu_si256((int16x16 *) (p),(v))
mult768.c: ...
Number of similar (implementation,compiler) pairs: 1, namely:
| Implementation | Compiler |
|---|---|
| avx | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
Compiler output
mult768.c: mult768.c:210:7: error: always_inline function '_mm256_set1_epi16' requires target feature 'avx', but would be inlined into function 'crypto_core_mult3sntrup653_avx800_constbranchindex' that is compiled without support for 'avx'
mult768.c:   210 |   x = const_x16(0);
mult768.c:       |       ^
mult768.c: mult768.c:12:19: note: expanded from macro 'const_x16'
mult768.c:    12 | #define const_x16 _mm256_set1_epi16
mult768.c:       |                   ^
mult768.c: mult768.c:210:7: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
mult768.c: mult768.c:12:19: note: expanded from macro 'const_x16'
mult768.c:    12 | #define const_x16 _mm256_set1_epi16
mult768.c:       |                   ^
mult768.c: mult768.c:211:35: error: always_inline function '_mm256_storeu_si256' requires target feature 'avx', but would be inlined into function 'crypto_core_mult3sntrup653_avx800_constbranchindex' that is compiled without support for 'avx'
mult768.c:   211 |   for (i = p&~15;i < 768;i += 16) store_x16(&f[i],x);
mult768.c:       |                                   ^
mult768.c: mult768.c:11:24: note: expanded from macro 'store_x16'
mult768.c:    11 | #define store_x16(p,v) _mm256_storeu_si256((int16x16 *) (p),(v))
mult768.c:       |                        ^
mult768.c: mult768.c:211:35: error: AVX vector argument of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
mult768.c: mult768.c:11:24: note: expanded from macro 'store_x16'
mult768.c:    11 | #define store_x16(p,v) _mm256_storeu_si256((int16x16 *) (p),(v))
mult768.c:       |                        ^
mult768.c: mult768.c:212:35: error: always_inline function '_mm256_storeu_si256' requires target feature 'avx', but would be inlined into function 'crypto_core_mult3sntrup653_avx800_constbranchindex' that is compiled without support for 'avx'
mult768.c:   212 |   for (i = p&~15;i < 768;i += 16) store_x16(&g[i],x);
mult768.c:       |                                   ^
mult768.c: mult768.c:11:24: note: expanded from macro 'store_x16'
mult768.c:    11 | #define store_x16(p,v) _mm256_storeu_si256((int16x16 *) (p),(v))
mult768.c: ...
Number of similar (implementation,compiler) pairs: 1, namely:
| Implementation | Compiler |
|---|---|
| avx800 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
Compiler output
mult768.c: mult768.c:210:7: error: always_inline function '_mm256_set1_epi16' requires target feature 'avx', but would be inlined into function 'crypto_core_mult3sntrup653_round2_constbranchindex' that is compiled without support for 'avx'
mult768.c:   210 |   x = const_x16(0);
mult768.c:       |       ^
mult768.c: mult768.c:11:19: note: expanded from macro 'const_x16'
mult768.c:    11 | #define const_x16 _mm256_set1_epi16
mult768.c:       |                   ^
mult768.c: mult768.c:210:7: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
mult768.c: mult768.c:11:19: note: expanded from macro 'const_x16'
mult768.c:    11 | #define const_x16 _mm256_set1_epi16
mult768.c:       |                   ^
mult768.c: mult768.c:211:35: error: always_inline function '_mm256_storeu_si256' requires target feature 'avx', but would be inlined into function 'crypto_core_mult3sntrup653_round2_constbranchindex' that is compiled without support for 'avx'
mult768.c:   211 |   for (i = p&~15;i < 768;i += 16) store_x16(&f[i],x);
mult768.c:       |                                   ^
mult768.c: mult768.c:10:24: note: expanded from macro 'store_x16'
mult768.c:    10 | #define store_x16(p,v) _mm256_storeu_si256((int16x16 *) (p),(v))
mult768.c:       |                        ^
mult768.c: mult768.c:211:35: error: AVX vector argument of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
mult768.c: mult768.c:10:24: note: expanded from macro 'store_x16'
mult768.c:    10 | #define store_x16(p,v) _mm256_storeu_si256((int16x16 *) (p),(v))
mult768.c:       |                        ^
mult768.c: mult768.c:212:35: error: always_inline function '_mm256_storeu_si256' requires target feature 'avx', but would be inlined into function 'crypto_core_mult3sntrup653_round2_constbranchindex' that is compiled without support for 'avx'
mult768.c:   212 |   for (i = p&~15;i < 768;i += 16) store_x16(&g[i],x);
mult768.c:       |                                   ^
mult768.c: mult768.c:10:24: note: expanded from macro 'store_x16'
mult768.c:    10 | #define store_x16(p,v) _mm256_storeu_si256((int16x16 *) (p),(v))
mult768.c: ...
Number of similar (implementation,compiler) pairs: 1, namely:
| Implementation | Compiler |
|---|---|
| round2 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
TIMECOP error (can be valgrind bug)
error 132
Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10A7EF
   at 0x...: main (try-anything.c:369)
Illegal instruction
Number of similar (implementation,compiler) pairs: 3, namely:
| Implementation | Compiler |
|---|---|
| avx | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| avx800 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| round2 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
TIMECOP error (can be valgrind bug)
error 132
Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10A7FF
   at 0x...: main (try-anything.c:369)
Illegal instruction
Number of similar (implementation,compiler) pairs: 3, namely:
| Implementation | Compiler |
|---|---|
| avx | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| avx800 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| round2 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
TIMECOP error (can be valgrind bug)
error 132
Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10A139
   at 0x...: main (try-anything.c:369)
Illegal instruction
Number of similar (implementation,compiler) pairs: 5, namely:
| Implementation | Compiler |
|---|---|
| avx | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
| avx800 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
| compact | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
| ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
| round2 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
TIMECOP error (can be valgrind bug)
error 132
Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10B522
   at 0x...: F3_freeze (mult.c:15)
   by 0x...: crypto_core_mult3sntrup653_compact_constbranchindex (mult.c:42)
   by 0x...: test (try.c:106)
   by 0x...: main (try-anything.c:345)
Illegal instruction
Number of similar (implementation,compiler) pairs: 1, namely:
| Implementation | Compiler |
|---|---|
| compact | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
TIMECOP error (can be valgrind bug)
error 132
Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10C112
   at 0x...: F3_freeze (mult.c:15)
   by 0x...: crypto_core_mult3sntrup653_compact_constbranchindex (mult.c:42)
   by 0x...: test (try.c:106)
   by 0x...: main (try-anything.c:345)
Illegal instruction
Number of similar (implementation,compiler) pairs: 1, namely:
| Implementation | Compiler |
|---|---|
| compact | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
TIMECOP error (can be valgrind bug)
error 132
Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10B619
   at 0x...: F3_freeze (mult.c:13)
   by 0x...: crypto_core_mult3sntrup653_ref_constbranchindex (mult.c:48)
   by 0x...: test (try.c:106)
   by 0x...: main (try-anything.c:345)
Illegal instruction
Number of similar (implementation,compiler) pairs: 1, namely:
| Implementation | Compiler |
|---|---|
| ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
TIMECOP error (can be valgrind bug)
error 132
Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10B629
   at 0x...: F3_freeze (mult.c:13)
   by 0x...: crypto_core_mult3sntrup653_ref_constbranchindex (mult.c:48)
   by 0x...: test (try.c:106)
   by 0x...: main (try-anything.c:345)
Illegal instruction
Number of similar (implementation,compiler) pairs: 1, namely:
| Implementation | Compiler |
|---|---|
| ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
Passed TIMECOP
TIMECOP iterations: 1
Number of similar (implementation,compiler) pairs: 27, namely:
| Implementation | Compiler |
|---|---|
| avx | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| avx | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| avx | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| avx | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) | 
| avx | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| avx800 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| avx800 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| avx800 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| avx800 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) | 
| avx800 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| compact | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| compact | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| compact | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| compact | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| compact | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) | 
| compact | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) | 
| ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| round2 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) | 
| round2 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| round2 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) | 
| round2 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) | 
| round2 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) |