Test results for amd64, wolfdale, crypto_kem/ntrulpr4591761
[Page version: 20250925 09:46:33]
Measurements for amd64, wolfdale, crypto_kem
Test results for amd64, wolfdale, crypto_kem
Test results for crypto_kem/ntrulpr4591761
Computer: wolfdale
Microarchitecture: amd64; Core 2 45nm (1067a)
Architecture: amd64
CPU ID: GenuineIntel-0001067a-bfebfbff 
SUPERCOP version: 20250922
Operation: crypto_kem
Primitive: ntrulpr4591761
| Time | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version | 
|---|
| 56184510 | 4906 0 0 | 24278 876 1720 | T:ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250428 | 20250415 | 
| 56192478 | 7723 0 0 | 29104 884 1720 | T:ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250428 | 20250415 | 
| 56197678 | 12326 0 0 | 34576 884 1720 | T:ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250428 | 20250415 | 
| 56207216 | 12499 0 0 | 34560 884 1720 | T:ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250428 | 20250415 | 
| 57553695 | 3913 0 0 | 23969 868 1752 | T:ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250428 | 20250415 | 
| 57731975 | 3357 0 0 | 22345 860 1720 | T:ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250428 | 20250415 | 
| 57735726 | 3931 0 0 | 24006 876 1720 | T:ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250428 | 20250415 | 
| 59347748 | 13019 0 0 | 34289 868 1752 | T:ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250428 | 20250415 | 
| 59380269 | 3937 0 0 | 24625 868 1752 | T:ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250428 | 20250415 | 
Compiler output
mult.c: mult.c:147:22: error: invalid output size for constraint '=&x'
mult.c:   MULSTEP_fromzero(0,h0,h1,h2,h3,h4)
mult.c:                      ^
mult.c: mult.c:149:26: error: invalid output size for constraint '+x'
mult.c:     MULSTEP_noload(j + 1,h1,h2,h3,h4,h0)
mult.c:                          ^
mult.c: mult.c:150:26: error: invalid output size for constraint '+x'
mult.c:     MULSTEP_noload(j + 2,h2,h3,h4,h0,h1)
mult.c:                          ^
mult.c: mult.c:151:26: error: invalid output size for constraint '+x'
mult.c:     MULSTEP_noload(j + 3,h3,h4,h0,h1,h2)
mult.c:                          ^
mult.c: mult.c:152:26: error: invalid output size for constraint '+x'
mult.c:     MULSTEP_noload(j + 4,h4,h0,h1,h2,h3)
mult.c:                          ^
mult.c: mult.c:153:26: error: invalid output size for constraint '+x'
mult.c:     MULSTEP_noload(j + 5,h0,h1,h2,h3,h4)
mult.c:                          ^
mult.c: mult.c:155:24: error: invalid output size for constraint '+x'
mult.c:   MULSTEP_noload(j + 1,h1,h2,h3,h4,h0)
mult.c:                        ^
mult.c: mult.c:156:24: error: invalid output size for constraint '+x'
mult.c:   MULSTEP_noload(j + 2,h2,h3,h4,h0,h1)
mult.c:                        ^
mult.c: mult.c:157:24: error: invalid output size for constraint '+x'
mult.c: ...
Number of similar (implementation,compiler) pairs: 5, namely:
| Implementation | Compiler | 
| T:avx | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
| T:avx | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
| T:avx | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
| T:avx | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
| T:avx | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
Compiler output
mult.c: mult.c: In function 'add':
mult.c: mult.c:82:1: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
mult.c:    82 | {
mult.c:       | ^
mult.c: mult.c: In function 'squeezeadd16':
mult.c: mult.c:110:23: note: the ABI for passing parameters with 32-byte alignment has changed in GCC 4.6
mult.c:   110 | static inline __m256i squeezeadd16(__m256i x,__m256i y)
mult.c:       |                       ^~~~~~~~~~~~
mult.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/10/include/immintrin.h:111,
mult.c:                  from mult.c:2:
mult.c: mult.c: In function 'fastadd':
mult.c: /usr/lib/gcc/x86_64-linux-gnu/10/include/fmaintrin.h:63:1: error: inlining failed in call to 'always_inline' '_mm256_fmadd_ps': target specific option mismatch
mult.c:    63 | _mm256_fmadd_ps (__m256 __A, __m256 __B, __m256 __C)
mult.c:       | ^~~~~~~~~~~~~~~
mult.c: mult.c:88:10: note: called from here
mult.c:    88 |   return _mm256_fmadd_ps(y,v1_float,x);
mult.c:       |          ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~
mult.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/10/include/immintrin.h:51,
mult.c:                  from mult.c:2:
mult.c: /usr/lib/gcc/x86_64-linux-gnu/10/include/avxintrin.h:1311:1: error: inlining failed in call to 'always_inline' '_mm256_set1_ps': target specific option mismatch
mult.c:  1311 | _mm256_set1_ps (float __A)
mult.c:       | ^~~~~~~~~~~~~~
mult.c: mult.c:88:10: note: called from here
mult.c:    88 |   return _mm256_fmadd_ps(y,v1_float,x);
mult.c:       |          ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Number of similar (implementation,compiler) pairs: 4, namely:
| Implementation | Compiler | 
| T:avx | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) | 
| T:avx | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) | 
| T:avx | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) | 
| T:avx | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) | 
Namespace violations
hide.o hide T
randomweightw.o small_random_weightw T
randomweightw.o small_seeded_weightw T
rq_fromseed.o rq_fromseed T
rq_mult.o rq_mult T
rq_round3.o rq_round3 T
rq_rounded.o rq_decoderounded T
rq_rounded.o rq_encoderounded T
small.o small_decode T
small.o small_encode T
Number of similar (implementation,compiler) pairs: 9, namely:
| Implementation | Compiler | 
| T:ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
| T:ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
| T:ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
| T:ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
| T:ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
| T:ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) | 
| T:ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) | 
| T:ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) | 
| T:ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |