Test results for amd64, wooden, crypto_kem/ntrulpr4591761
[Page version: 20250929 19:04:32]
Measurements for amd64, wooden, crypto_kem
Test results for amd64, wooden, crypto_kem
Test results for crypto_kem/ntrulpr4591761
Computer: wooden
Microarchitecture: amd64; Goldmont (506c9)
Architecture: amd64
CPU ID: GenuineIntel-000506c9-1fc9cbf5 
SUPERCOP version: 20250922
Operation: crypto_kem
Primitive: ntrulpr4591761
| Time | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version | 
|---|
| 55685638 | 14690 0 0 | 38509 784 1600 | T:ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250520 | 20250415 | 
| 55741338 | 7194 0 0 | 29628 800 1568 | T:ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250520 | 20250415 | 
| 55742708 | 11955 0 0 | 36700 800 1568 | T:ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250520 | 20250415 | 
| 55744912 | 4220 0 0 | 27157 784 1600 | T:ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250520 | 20250415 | 
| 55745240 | 12499 0 0 | 36020 800 1568 | T:ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250520 | 20250415 | 
| 55781980 | 3992 0 0 | 26301 784 1600 | T:ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250520 | 20250415 | 
| 55798036 | 3372 0 0 | 23837 776 1568 | T:ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250520 | 20250415 | 
| 56712322 | 3955 0 0 | 25466 792 1568 | T:ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250520 | 20250415 | 
| 57247848 | 4890 0 0 | 25658 792 1568 | T:ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250520 | 20250415 | 
Compiler output
mult.c: mult.c:147:22: error: invalid output size for constraint '=&x'
mult.c:   MULSTEP_fromzero(0,h0,h1,h2,h3,h4)
mult.c:                      ^
mult.c: mult.c:149:26: error: invalid output size for constraint '+x'
mult.c:     MULSTEP_noload(j + 1,h1,h2,h3,h4,h0)
mult.c:                          ^
mult.c: mult.c:150:26: error: invalid output size for constraint '+x'
mult.c:     MULSTEP_noload(j + 2,h2,h3,h4,h0,h1)
mult.c:                          ^
mult.c: mult.c:151:26: error: invalid output size for constraint '+x'
mult.c:     MULSTEP_noload(j + 3,h3,h4,h0,h1,h2)
mult.c:                          ^
mult.c: mult.c:152:26: error: invalid output size for constraint '+x'
mult.c:     MULSTEP_noload(j + 4,h4,h0,h1,h2,h3)
mult.c:                          ^
mult.c: mult.c:153:26: error: invalid output size for constraint '+x'
mult.c:     MULSTEP_noload(j + 5,h0,h1,h2,h3,h4)
mult.c:                          ^
mult.c: mult.c:155:24: error: invalid output size for constraint '+x'
mult.c:   MULSTEP_noload(j + 1,h1,h2,h3,h4,h0)
mult.c:                        ^
mult.c: mult.c:156:24: error: invalid output size for constraint '+x'
mult.c:   MULSTEP_noload(j + 2,h2,h3,h4,h0,h1)
mult.c:                        ^
mult.c: mult.c:157:24: error: invalid output size for constraint '+x'
mult.c: ...
Number of similar (implementation,compiler) pairs: 5, namely:
| Implementation | Compiler | 
| T:avx | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
| T:avx | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
| T:avx | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
| T:avx | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
| T:avx | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
Compiler output
mult.c: mult.c: In function 'add':
mult.c: mult.c:82:1: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
mult.c:    82 | {
mult.c:       | ^
mult.c: mult.c: In function 'squeezeadd16':
mult.c: mult.c:110:23: note: the ABI for passing parameters with 32-byte alignment has changed in GCC 4.6
mult.c:   110 | static inline __m256i squeezeadd16(__m256i x,__m256i y)
mult.c:       |                       ^~~~~~~~~~~~
mult.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/10/include/immintrin.h:111,
mult.c:                  from mult.c:2:
mult.c: mult.c: In function 'fastadd':
mult.c: /usr/lib/gcc/x86_64-linux-gnu/10/include/fmaintrin.h:63:1: error: inlining failed in call to 'always_inline' '_mm256_fmadd_ps': target specific option mismatch
mult.c:    63 | _mm256_fmadd_ps (__m256 __A, __m256 __B, __m256 __C)
mult.c:       | ^~~~~~~~~~~~~~~
mult.c: mult.c:88:10: note: called from here
mult.c:    88 |   return _mm256_fmadd_ps(y,v1_float,x);
mult.c:       |          ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~
mult.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/10/include/immintrin.h:51,
mult.c:                  from mult.c:2:
mult.c: /usr/lib/gcc/x86_64-linux-gnu/10/include/avxintrin.h:1311:1: error: inlining failed in call to 'always_inline' '_mm256_set1_ps': target specific option mismatch
mult.c:  1311 | _mm256_set1_ps (float __A)
mult.c:       | ^~~~~~~~~~~~~~
mult.c: mult.c:88:10: note: called from here
mult.c:    88 |   return _mm256_fmadd_ps(y,v1_float,x);
mult.c:       |          ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Number of similar (implementation,compiler) pairs: 4, namely:
| Implementation | Compiler | 
| T:avx | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) | 
| T:avx | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) | 
| T:avx | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) | 
| T:avx | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) | 
Namespace violations
hide.o hide T
randomweightw.o small_random_weightw T
randomweightw.o small_seeded_weightw T
rq_fromseed.o rq_fromseed T
rq_mult.o rq_mult T
rq_round3.o rq_round3 T
rq_rounded.o rq_decoderounded T
rq_rounded.o rq_encoderounded T
small.o small_decode T
small.o small_encode T
Number of similar (implementation,compiler) pairs: 9, namely:
| Implementation | Compiler | 
| T:ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
| T:ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
| T:ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
| T:ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
| T:ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
| T:ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) | 
| T:ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) | 
| T:ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) | 
| T:ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |