Test results for amd64, r24000, crypto_core/invsntrup761
[Page version: 20250926 17:39:37]
Computer: r24000
Microarchitecture: amd64; Coffee Lake (906ea)
Architecture: amd64
CPU ID: GenuineIntel-000906ea-bfebfbff 
SUPERCOP version: 20250922
Operation: crypto_core
Primitive: invsntrup761
| Time | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version | 
|---|---|---|---|---|---|---|
| 565111 | 249805 0 0 | 259094 828 1032 | jumpdivsteps | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250924 | 20250922 | 
| 565398 | 236457 0 0 | 245622 828 1032 | jumpdivsteps | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250924 | 20250922 | 
| 578833 | 211515 0 0 | 224048 820 1032 | jumpdivsteps | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250924 | 20250922 | 
| 597340 | 240830 0 0 | 246183 828 936 | jumpdivsteps | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250924 | 20250922 | 
| 602830 | 245232 0 0 | 259676 812 1000 | jumpdivsteps | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250924 | 20250922 | 
| 621858 | 260998 0 0 | 273388 812 1000 | jumpdivsteps | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250924 | 20250922 | 
| 687081 | 3403 0 0 | 18766 828 1032 | avx | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250429 | 20250415 | 
| 687879 | 4403 0 0 | 19846 828 1032 | avx | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250429 | 20250415 | 
| 713451 | 263108 0 0 | 275460 812 1000 | jumpdivsteps | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250924 | 20250922 | 
| 725435 | 1703 0 0 | 13215 828 936 | avx | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250429 | 20250415 | 
| 728476 | 4164 0 0 | 18525 804 1000 | avx | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250429 | 20250415 | 
| 747627 | 1811 0 0 | 13804 796 1000 | avx | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250429 | 20250415 | 
| 756730 | 1770 0 0 | 14037 804 1000 | avx | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250429 | 20250415 | 
| 758564 | 1580 0 0 | 14000 820 1032 | avx | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250429 | 20250415 | 
| 784094 | 204810 0 0 | 216131 796 968 | jumpdivsteps | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250924 | 20250922 | 
| 810091 | 1484 0 0 | 12516 788 968 | avx | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250429 | 20250415 | 
| 1140345 | 7526 0 0 | 21917 804 1000 | portable | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250429 | 20250415 | 
| 2026630 | 5777 0 0 | 21158 828 1032 | portable | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250429 | 20250415 | 
| 2027400 | 6769 0 0 | 22230 828 1032 | portable | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250429 | 20250415 | 
| 7730661 | 5055 0 0 | 20553 836 1032 | ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250429 | 20250415 | 
| 7735193 | 4015 0 0 | 19433 836 1032 | ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250429 | 20250415 | 
| 7753467 | 5731 0 0 | 20007 828 936 | portable | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250429 | 20250415 | 
| 8001768 | 4042 0 0 | 18471 812 1000 | ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250429 | 20250415 | 
| 15112689 | 1811 0 0 | 13295 828 936 | portable | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250429 | 20250415 | 
| 15357196 | 1742 0 0 | 14168 820 1032 | portable | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250429 | 20250415 | 
| 15660836 | 1948 0 0 | 14253 804 1000 | portable | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250429 | 20250415 | 
| 17022466 | 1780 0 0 | 13796 796 1000 | portable | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250429 | 20250415 | 
| 19782008 | 3801 0 0 | 18153 836 936 | ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250429 | 20250415 | 
| 29147035 | 1100 0 0 | 13431 812 1000 | ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250429 | 20250415 | 
| 31613051 | 1179 0 0 | 12737 836 936 | ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250429 | 20250415 | 
| 33505137 | 1049 0 0 | 13562 828 1032 | ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250429 | 20250415 | 
| 36666900 | 954 0 0 | 12030 796 968 | ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250429 | 20250415 | 
| 37746424 | 1158 0 0 | 13132 796 1000 | ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250429 | 20250415 | 
| 38271614 | 1513 0 0 | 12548 788 968 | portable | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250429 | 20250415 | 
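The rows above can be reduced to the fastest (implementation, compiler) pair per implementation. The following is a minimal Python sketch, not part of SUPERCOP: the helper names `parse_rows` and `fastest_per_implementation` are hypothetical, the `ROWS` string holds only a few rows copied from the table, and the Time column is assumed to be a cycle count where lower is better.

```python
# A few rows copied verbatim from the table above; paste in more as needed.
ROWS = """\
| Time | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version |
| 565111 | 249805 0 0 | 259094 828 1032 | jumpdivsteps | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250924 | 20250922 |
| 602830 | 245232 0 0 | 259676 812 1000 | jumpdivsteps | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250924 | 20250922 |
| 687081 | 3403 0 0 | 18766 828 1032 | avx | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250429 | 20250415 |
| 1140345 | 7526 0 0 | 21917 804 1000 | portable | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250429 | 20250415 |
| 7730661 | 5055 0 0 | 20553 836 1032 | ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250429 | 20250415 |
| 8001768 | 4042 0 0 | 18471 812 1000 | ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250429 | 20250415 |
"""


def parse_rows(text):
    """Yield (time, implementation, compiler) for each data row."""
    for line in text.splitlines():
        cells = [c.strip() for c in line.strip().strip("|").split("|")]
        # Data rows have seven cells and a numeric first cell; this skips
        # the header row, separator rows, and blank lines.
        if len(cells) != 7 or not cells[0].isdigit():
            continue
        yield int(cells[0]), cells[3], cells[4]


def fastest_per_implementation(rows):
    """Map each implementation name to its lowest (time, compiler)."""
    best = {}
    for time, impl, compiler in rows:
        if impl not in best or time < best[impl][0]:
            best[impl] = (time, compiler)
    return best


best = fastest_per_implementation(parse_rows(ROWS))
for impl, (time, compiler) in sorted(best.items(), key=lambda kv: kv[1][0]):
    print(f"{impl:14s} {time:>10d}  {compiler}")
```

Sorting by the stored time lists implementations from fastest to slowest; the same dictionary can be used to compare any implementation against `ref`.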
Compiler output
recip.c: recip.c:73:19: error: always_inline function '_mm256_set1_epi16' requires target feature 'avx', but would be inlined into function 'vectormodq_swapeliminate' that is compiled without support for 'avx'
recip.c:    73 |   __m256i f0vec = _mm256_set1_epi16(f0);
recip.c:       |                   ^
recip.c: recip.c:73:19: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
recip.c: recip.c:74:19: error: always_inline function '_mm256_set1_epi16' requires target feature 'avx', but would be inlined into function 'vectormodq_swapeliminate' that is compiled without support for 'avx'
recip.c:    74 |   __m256i g0vec = _mm256_set1_epi16(g0);
recip.c:       |                   ^
recip.c: recip.c:74:19: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
recip.c: recip.c:75:48: error: always_inline function '_mm256_set1_epi16' requires target feature 'avx', but would be inlined into function 'vectormodq_swapeliminate' that is compiled without support for 'avx'
recip.c:    75 |   __m256i f0vecqinv = _mm256_mullo_epi16(f0vec,qinvvec);
recip.c:       |                                                ^
recip.c: recip.c:59:17: note: expanded from macro 'qinvvec'
recip.c:    59 | #define qinvvec _mm256_set1_epi16(qinv)
recip.c:       |                 ^
recip.c: recip.c:75:48: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
recip.c: recip.c:59:17: note: expanded from macro 'qinvvec'
recip.c:    59 | #define qinvvec _mm256_set1_epi16(qinv)
recip.c:       |                 ^
recip.c: recip.c:75:23: error: always_inline function '_mm256_mullo_epi16' requires target feature 'avx2', but would be inlined into function 'vectormodq_swapeliminate' that is compiled without support for 'avx2'
recip.c:    75 |   __m256i f0vecqinv = _mm256_mullo_epi16(f0vec,qinvvec);
recip.c:       |                       ^
recip.c: recip.c:75:23: error: AVX vector argument of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
recip.c: recip.c:76:48: error: always_inline function '_mm256_set1_epi16' requires target feature 'avx', but would be inlined into function 'vectormodq_swapeliminate' that is compiled without support for 'avx'
recip.c:    76 |   __m256i g0vecqinv = _mm256_mullo_epi16(g0vec,qinvvec);
recip.c:       |                                                ^
recip.c: ...
Number of similar (implementation,compiler) pairs: 1, namely:
| Implementation | Compiler | 
|---|---|
| avx | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) | 
Compiler output
avx-768.c: avx-768.c:716:35: warning: unused variable 'h1' [-Wunused-variable]
avx-768.c:   716 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h1;
avx-768.c:       |                                   ^~
avx-768.c: avx-768.c:847:35: warning: unused variable 'h1' [-Wunused-variable]
avx-768.c:   847 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h1;
avx-768.c:       |                                   ^~
avx-768.c: avx-768.c:973:35: warning: unused variable 'h1' [-Wunused-variable]
avx-768.c:   973 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h1;
avx-768.c:       |                                   ^~
avx-768.c: avx-768.c:1176:23: warning: unused variable 'f4' [-Wunused-variable]
avx-768.c:  1176 |   __m256i f0,f1,f2,f3,f4,f5,g0,g1,g2,g3,g4,g5;
avx-768.c:       |                       ^~
avx-768.c: avx-768.c:1176:26: warning: unused variable 'f5' [-Wunused-variable]
avx-768.c:  1176 |   __m256i f0,f1,f2,f3,f4,f5,g0,g1,g2,g3,g4,g5;
avx-768.c:       |                          ^~
avx-768.c: avx-768.c:1237:17: warning: unused variable 'f2' [-Wunused-variable]
avx-768.c:  1237 |   __m256i f0,f1,f2,f3,f4,f5,g0,g1,g2,g3,g4,g5;
avx-768.c:       |                 ^~
avx-768.c: avx-768.c:1237:20: warning: unused variable 'f3' [-Wunused-variable]
avx-768.c:  1237 |   __m256i f0,f1,f2,f3,f4,f5,g0,g1,g2,g3,g4,g5;
avx-768.c:       |                    ^~
avx-768.c: avx-768.c:1237:23: warning: unused variable 'f4' [-Wunused-variable]
avx-768.c:  1237 |   __m256i f0,f1,f2,f3,f4,f5,g0,g1,g2,g3,g4,g5;
avx-768.c:       |                       ^~
avx-768.c: avx-768.c:1237:26: warning: unused variable 'f5' [-Wunused-variable]
avx-768.c: ...
avx.c: avx.c:23:21: warning: unused function 'mullo' [-Wunused-function]
avx.c:    23 | static inline int16 mullo(int16 x,int16 y)
avx.c:       |                     ^~~~~
avx.c: avx.c:28:21: warning: unused function 'mulhi' [-Wunused-function]
avx.c:    28 | static inline int16 mulhi(int16 x,int16 y)
avx.c:       |                     ^~~~~
avx.c: avx.c:33:21: warning: unused function 'mulhrs' [-Wunused-function]
avx.c:    33 | static inline int16 mulhrs(int16 x,int16 y)
avx.c:       |                     ^~~~~~
avx.c: 3 warnings generated.
avx3-512.c: avx3-512.c:1055:35: warning: unused variable 'h0' [-Wunused-variable]
avx3-512.c:  1055 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c:       |                                   ^~
avx3-512.c: avx3-512.c:1055:38: warning: unused variable 'h1' [-Wunused-variable]
avx3-512.c:  1055 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c:       |                                      ^~
avx3-512.c: avx3-512.c:1055:41: warning: unused variable 'h2' [-Wunused-variable]
avx3-512.c:  1055 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c:       |                                         ^~
avx3-512.c: avx3-512.c:1055:44: warning: unused variable 'h3' [-Wunused-variable]
avx3-512.c:  1055 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c:       |                                            ^~
avx3-512.c: avx3-512.c:1056:10: warning: unused variable 'origf' [-Wunused-variable]
avx3-512.c:  1056 |   int16 *origf = f;
avx3-512.c:       |          ^~~~~
avx3-512.c: avx3-512.c:1019:13: warning: unused function 'ntt512_7681' [-Wunused-function]
avx3-512.c:  1019 | static void ntt512_7681(int16 *f,int reps)
avx3-512.c:       |             ^~~~~~~~~~~
avx3-512.c: avx3-512.c:1030:13: warning: unused function 'ntt512_10753' [-Wunused-function]
avx3-512.c:  1030 | static void ntt512_10753(int16 *f,int reps)
avx3-512.c:       |             ^~~~~~~~~~~~
avx3-512.c: avx3-512.c:1041:13: warning: unused function 'ntt512_12289' [-Wunused-function]
avx3-512.c:  1041 | static void ntt512_12289(int16 *f,int reps)
avx3-512.c:       |             ^~~~~~~~~~~~
avx3-512.c: 8 warnings generated.
jump32divsteps.c: jump32divsteps.c:113:31: warning: unused variable 'gx' [-Wunused-variable]
jump32divsteps.c:   113 |   __m256i minusdelta_v, temp, gx, ux, vx, qx, rx;
jump32divsteps.c:       |                               ^~
jump32divsteps.c: 1 warning generated.
polymul_ntt.c: polymul_ntt.c:166:13: warning: unused function 'polymulntt512_256x256_test' [-Wunused-function]
polymul_ntt.c:   166 | static void polymulntt512_256x256_test(__m256i *h, __m256i *f, __m256i *g)
polymul_ntt.c:       |             ^~~~~~~~~~~~~~~~~~~~~~~~~~
polymul_ntt.c: 1 warning generated.
Number of similar (implementation,compiler) pairs: 4, namely:
| Implementation | Compiler | 
|---|---|
| jumpdivsteps | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) | 
| jumpdivsteps | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) | 
| jumpdivsteps | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) | 
| jumpdivsteps | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) | 
Compiler output
avx-768.c: avx-768.c:544:36: error: invalid output size for constraint '+x'
avx-768.c:   544 |   __asm__("vpsubw %1,%0,%0" : "+x"(a),"+x"(b));
avx-768.c:       |                                    ^
avx-768.c: avx-768.c:550:36: error: invalid output size for constraint '+x'
avx-768.c:   550 |   __asm__("vpaddw %1,%0,%0" : "+x"(a),"+x"(b));
avx-768.c:       |                                    ^
avx-768.c: avx-768.c:716:35: warning: unused variable 'h1' [-Wunused-variable]
avx-768.c:   716 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h1;
avx-768.c:       |                                   ^~
avx-768.c: avx-768.c:847:35: warning: unused variable 'h1' [-Wunused-variable]
avx-768.c:   847 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h1;
avx-768.c:       |                                   ^~
avx-768.c: avx-768.c:973:35: warning: unused variable 'h1' [-Wunused-variable]
avx-768.c:   973 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h1;
avx-768.c:       |                                   ^~
avx-768.c: avx-768.c:1176:23: warning: unused variable 'f4' [-Wunused-variable]
avx-768.c:  1176 |   __m256i f0,f1,f2,f3,f4,f5,g0,g1,g2,g3,g4,g5;
avx-768.c:       |                       ^~
avx-768.c: avx-768.c:1176:26: warning: unused variable 'f5' [-Wunused-variable]
avx-768.c:  1176 |   __m256i f0,f1,f2,f3,f4,f5,g0,g1,g2,g3,g4,g5;
avx-768.c:       |                          ^~
avx-768.c: avx-768.c:1237:17: warning: unused variable 'f2' [-Wunused-variable]
avx-768.c:  1237 |   __m256i f0,f1,f2,f3,f4,f5,g0,g1,g2,g3,g4,g5;
avx-768.c:       |                 ^~
avx-768.c: avx-768.c:1237:20: warning: unused variable 'f3' [-Wunused-variable]
avx-768.c: ...
Number of similar (implementation,compiler) pairs: 1, namely:
| Implementation | Compiler | 
|---|---|
| jumpdivsteps | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) | 
Compiler output
avx-768.c: avx-768.c: In function 'invntt64':
avx-768.c: avx-768.c:716:35: warning: unused variable 'h1' [-Wunused-variable]
avx-768.c:   716 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h1;
avx-768.c:       |                                   ^~
avx-768.c: avx-768.c: In function 'ntt256':
avx-768.c: avx-768.c:847:35: warning: unused variable 'h1' [-Wunused-variable]
avx-768.c:   847 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h1;
avx-768.c:       |                                   ^~
avx-768.c: avx-768.c: In function 'invntt256':
avx-768.c: avx-768.c:973:35: warning: unused variable 'h1' [-Wunused-variable]
avx-768.c:   973 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h1;
avx-768.c:       |                                   ^~
avx-768.c: avx-768.c: In function 'ntt768_512':
avx-768.c: avx-768.c:1176:26: warning: unused variable 'f5' [-Wunused-variable]
avx-768.c:  1176 |   __m256i f0,f1,f2,f3,f4,f5,g0,g1,g2,g3,g4,g5;
avx-768.c:       |                          ^~
avx-768.c: avx-768.c:1176:23: warning: unused variable 'f4' [-Wunused-variable]
avx-768.c:  1176 |   __m256i f0,f1,f2,f3,f4,f5,g0,g1,g2,g3,g4,g5;
avx-768.c:       |                       ^~
avx-768.c: avx-768.c: In function 'ntt768_256':
avx-768.c: avx-768.c:1237:32: warning: unused variable 'g1' [-Wunused-variable]
avx-768.c:  1237 |   __m256i f0,f1,f2,f3,f4,f5,g0,g1,g2,g3,g4,g5;
avx-768.c:       |                                ^~
avx-768.c: avx-768.c:1237:26: warning: unused variable 'f5' [-Wunused-variable]
avx-768.c:  1237 |   __m256i f0,f1,f2,f3,f4,f5,g0,g1,g2,g3,g4,g5;
avx-768.c: ...
avx.c: avx.c:437:21: warning: argument 1 of type 'int16x16[16][8]' {aka '__m256i[16][8]'} with mismatched bound [-Warray-parameter=]
avx.c:   437 | void fft64(int16x16 fpad[16][8],const int16x16 f[64])
avx.c:       |            ~~~~~~~~~^~~~~~~~~~~
avx.c: In file included from avx.c:5:
avx.c: avx.h:53:19: note: previously declared as 'int16x16 (*)[8]' {aka '__m256i (*)[8]'}
avx.c:    53 | extern void fft64(int16x16 (*)[8],const int16x16 *);
avx.c:       |                   ^~~~~~~~~~~~~~~
avx.c: avx.c:437:48: warning: argument 2 of type 'const int16x16[64]' {aka 'const __m256i[64]'} with mismatched bound [-Warray-parameter=]
avx.c:   437 | void fft64(int16x16 fpad[16][8],const int16x16 f[64])
avx.c:       |                                 ~~~~~~~~~~~~~~~^~~~~
avx.c: avx.h:53:35: note: previously declared as 'const int16x16 *' {aka 'const __m256i *'}
avx.c:    53 | extern void fft64(int16x16 (*)[8],const int16x16 *);
avx.c:       |                                   ^~~~~~~~~~~~~~~~
avx.c: avx.c:687:31: warning: argument 1 of type 'int16x16[64]' {aka '__m256i[64]'} with mismatched bound [-Warray-parameter=]
avx.c:   687 | void unfft64_scale16(int16x16 f[64],int16x16 fpad[16][8])
avx.c:       |                      ~~~~~~~~~^~~~~
avx.c: avx.h:54:29: note: previously declared as 'int16x16 *' {aka '__m256i *'}
avx.c:    54 | extern void unfft64_scale16(int16x16 *,int16x16 (*)[8]);
avx.c:       |                             ^~~~~~~~~~
avx.c: avx.c:687:46: warning: argument 2 of type 'int16x16[16][8]' {aka '__m256i[16][8]'} with mismatched bound [-Warray-parameter=]
avx.c:   687 | void unfft64_scale16(int16x16 f[64],int16x16 fpad[16][8])
avx.c:       |                                     ~~~~~~~~~^~~~~~~~~~~
avx.c: avx.h:54:40: note: previously declared as 'int16x16 (*)[8]' {aka '__m256i (*)[8]'}
avx.c:    54 | extern void unfft64_scale16(int16x16 *,int16x16 (*)[8]);
avx.c:       |                                        ^~~~~~~~~~~~~~~
avx.c: ...
avx3-512.c: avx3-512.c: In function 'invntt512':
avx3-512.c: avx3-512.c:1056:10: warning: unused variable 'origf' [-Wunused-variable]
avx3-512.c:  1056 |   int16 *origf = f;
avx3-512.c:       |          ^~~~~
avx3-512.c: avx3-512.c:1055:44: warning: unused variable 'h3' [-Wunused-variable]
avx3-512.c:  1055 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c:       |                                            ^~
avx3-512.c: avx3-512.c:1055:41: warning: unused variable 'h2' [-Wunused-variable]
avx3-512.c:  1055 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c:       |                                         ^~
avx3-512.c: avx3-512.c:1055:38: warning: unused variable 'h1' [-Wunused-variable]
avx3-512.c:  1055 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c:       |                                      ^~
avx3-512.c: avx3-512.c:1055:35: warning: unused variable 'h0' [-Wunused-variable]
avx3-512.c:  1055 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c:       |                                   ^~
avx3-512.c: avx3-512.c: At top level:
avx3-512.c: avx3-512.c:1041:13: warning: 'ntt512_12289' defined but not used [-Wunused-function]
avx3-512.c:  1041 | static void ntt512_12289(int16 *f,int reps)
avx3-512.c:       |             ^~~~~~~~~~~~
avx3-512.c: avx3-512.c:1030:13: warning: 'ntt512_10753' defined but not used [-Wunused-function]
avx3-512.c:  1030 | static void ntt512_10753(int16 *f,int reps)
avx3-512.c:       |             ^~~~~~~~~~~~
avx3-512.c: avx3-512.c:1019:13: warning: 'ntt512_7681' defined but not used [-Wunused-function]
avx3-512.c:  1019 | static void ntt512_7681(int16 *f,int reps)
avx3-512.c: ...
jump32divsteps.c: jump32divsteps.c: In function 'crypto_core_invsntrup761_jumpdivsteps_constbranchindex_jump32xdivsteps':
jump32divsteps.c: jump32divsteps.c:113:31: warning: unused variable 'gx' [-Wunused-variable]
jump32divsteps.c:   113 |   __m256i minusdelta_v, temp, gx, ux, vx, qx, rx;
jump32divsteps.c:       |                               ^~
polymul_ntt.c: polymul_ntt.c:166:13: warning: 'polymulntt512_256x256_test' defined but not used [-Wunused-function]
polymul_ntt.c:   166 | static void polymulntt512_256x256_test(__m256i *h, __m256i *f, __m256i *g)
polymul_ntt.c:       |             ^~~~~~~~~~~~~~~~~~~~~~~~~~
Number of similar (implementation,compiler) pairs: 4, namely:
| Implementation | Compiler | 
|---|---|
| jumpdivsteps | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) | 
| jumpdivsteps | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) | 
| jumpdivsteps | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) | 
| jumpdivsteps | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) | 
TIMECOP error (may be a Valgrind bug)
error 1
VEX temporary storage exhausted.
Pool = TEMP,  start 0x597cd5e8 curr 0x59c92118 end 0x59c92127 (size 5000000)
vex: the `impossible' happened:
VEX temporary storage exhausted.
Increase N_{TEMPORARY,PERMANENT}_BYTES and recompile.
vex storage: T total 478083664 bytes allocated
vex storage: P total 512 bytes allocated
valgrind: the 'impossible' happened:
LibVEX called failure_exit().
host stacktrace:
   at 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
   by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
   by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
   by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
   by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
   by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
   by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
   by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
   by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
   by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
   by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
   by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
   by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
   by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
sched status:
running_tid=1
Thread 1: status = VgTs_Runnable (lwpid 3960405)
   at 0x...: _mm256_mullo_epi16 (avx2intrin.h:555)
   by 0x...: barrett_fake (gf4591_avx2.h:58)
   by 0x...: crypto_core_invsntrup761_jumpdivsteps_constbranchindex_gf_polymul_128x128_avx2_divR (polymul_NxN.c:431)
   by 0x...: crypto_core_invsntrup761_jumpdivsteps_constbranchindex_gf_polymul_128x128_avx2_uvqr_fg (polymul_NxN.c:537)
   by 0x...: crypto_core_invsntrup761_jumpdivsteps_constbranchindex_jump256divsteps (jumpNdivsteps.c:271)
   by 0x...: crypto_core_invsntrup761_jumpdivsteps_constbranchindex_rq_recip3 (rq_recip3_new.c:200)
   by 0x...: crypto_core_invsntrup761_jumpdivsteps_constbranchindex (wrapper.c:10)
   by 0x...: test (try.c:106)
   by 0x...: main (try-anything.c:345)
client stack range: [0x1FFEFEE000 0x1FFF000FFF] client SP: 0x1FFEFEEC00
valgrind stack range: [0x10090AE000 0x10091ADFFF] top usage: 18232 of 1048576
Note: see also the FAQ in the source distribution.
It contains workarounds to several common problems.
In particular, if Valgrind aborted or crashed after
identifying problems in your program, there's a good chance
that fixing those problems will prevent Valgrind aborting or
crashing, especially if it happened in m_mallocfree.c.
If that doesn't help, please report this bug to: www.valgrind.org
In the bug report, send all the above text, the valgrind
version, and what OS and version you are using.  Thanks.
Number of similar (implementation,compiler) pairs: 1, namely:
| Implementation | Compiler | 
|---|---|
| jumpdivsteps | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) | 
Passed TIMECOP
TIMECOP iterations: 1
Number of similar (implementation,compiler) pairs: 33, namely:
| Implementation | Compiler | 
|---|---|
| avx | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) | 
| avx | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) | 
| avx | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) | 
| avx | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) | 
| avx | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) | 
| avx | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) | 
| avx | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) | 
| avx | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) | 
| jumpdivsteps | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) | 
| jumpdivsteps | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) | 
| jumpdivsteps | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) | 
| jumpdivsteps | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) | 
| jumpdivsteps | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) | 
| jumpdivsteps | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) | 
| jumpdivsteps | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) | 
| portable | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) | 
| portable | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) | 
| portable | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) | 
| portable | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) | 
| portable | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) | 
| portable | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) | 
| portable | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) | 
| portable | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) | 
| portable | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) | 
| ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) | 
| ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) | 
| ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) | 
| ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) | 
| ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) | 
| ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) | 
| ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) | 
| ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) | 
| ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |