Test results for amd64, hunsnivy, crypto_encode/953x3

[Page version: 20260701 00:14:32]

Measurements for amd64, hunsnivy, crypto_encode Test results for amd64, hunsnivy, crypto_encode Test results for crypto_encode/953x3

Computer: hunsnivy
Microarchitecture: amd64; Ivy Bridge+AES (306a9)
Architecture: amd64
CPU ID: GenuineIntel-000306a9-bfebfbff
SUPERCOP version: 20260627
Operation: crypto_encode
Primitive: 953x3

Time	Object size	Test size	Implementation	Compiler	Benchmark date	SUPERCOP version
331	4019 0 0	25960 1272 1440	`ref`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20260302	20260217
333	743 0 0	21760 1272 1440	`ref`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20260302	20260217
361	1205 0 0	22439 1200 1536	`ref`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20260302	20260217
992	167 0 0	21976 1272 1440	`ref`	`clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20260302	20260217
1031	123 0 0	19560 1272 1440	`ref`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20260302	20260217
1104	142 0 0	19887 1200 1536	`ref`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20260302	20260217
1111	137 0 0	19278 1192 1536	`ref`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20260302	20260217
1134	115 0 0	18194 1176 1504	`ref`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20260302	20260217
1481	106 0 0	18846 1264 1440	`ref`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20260302	20260217

Compiler output

encode.c: encode.c:34:18: error: always_inline function '_mm256_packus_epi16' requires target feature 'avx2', but would be inlined into function 'crypto_encode_953x3_avx_constbranchindex' that is compiled without support for 'avx2'
encode.c:     __m256i a0 = _mm256_packus_epi16(f0&lobytes,f1&lobytes);
encode.c:                  ^
encode.c: encode.c:37:38: error: always_inline function '_mm256_srli_epi16' requires target feature 'avx2', but would be inlined into function 'crypto_encode_953x3_avx_constbranchindex' that is compiled without support for 'avx2'
encode.c:     __m256i a1 = _mm256_packus_epi16(_mm256_srli_epi16(f0,8),_mm256_srli_epi16(f1,8));
encode.c:                                      ^
encode.c: encode.c:37:62: error: always_inline function '_mm256_srli_epi16' requires target feature 'avx2', but would be inlined into function 'crypto_encode_953x3_avx_constbranchindex' that is compiled without support for 'avx2'
encode.c:     __m256i a1 = _mm256_packus_epi16(_mm256_srli_epi16(f0,8),_mm256_srli_epi16(f1,8));
encode.c:                                                              ^
encode.c: encode.c:37:18: error: always_inline function '_mm256_packus_epi16' requires target feature 'avx2', but would be inlined into function 'crypto_encode_953x3_avx_constbranchindex' that is compiled without support for 'avx2'
encode.c:     __m256i a1 = _mm256_packus_epi16(_mm256_srli_epi16(f0,8),_mm256_srli_epi16(f1,8));
encode.c:                  ^
encode.c: encode.c:39:18: error: always_inline function '_mm256_packus_epi16' requires target feature 'avx2', but would be inlined into function 'crypto_encode_953x3_avx_constbranchindex' that is compiled without support for 'avx2'
encode.c:     __m256i a2 = _mm256_packus_epi16(f2&lobytes,f3&lobytes);
encode.c:                  ^
encode.c: encode.c:40:38: error: always_inline function '_mm256_srli_epi16' requires target feature 'avx2', but would be inlined into function 'crypto_encode_953x3_avx_constbranchindex' that is compiled without support for 'avx2'
encode.c:     __m256i a3 = _mm256_packus_epi16(_mm256_srli_epi16(f2,8),_mm256_srli_epi16(f3,8));
encode.c:                                      ^
encode.c: encode.c:40:62: error: always_inline function '_mm256_srli_epi16' requires target feature 'avx2', but would be inlined into function 'crypto_encode_953x3_avx_constbranchindex' that is compiled without support for 'avx2'
encode.c:     __m256i a3 = _mm256_packus_epi16(_mm256_srli_epi16(f2,8),_mm256_srli_epi16(f3,8));
encode.c:                                                              ^
encode.c: encode.c:40:18: error: always_inline function '_mm256_packus_epi16' requires target feature 'avx2', but would be inlined into function 'crypto_encode_953x3_avx_constbranchindex' that is compiled without support for 'avx2'
encode.c:     __m256i a3 = _mm256_packus_epi16(_mm256_srli_epi16(f2,8),_mm256_srli_epi16(f3,8));
encode.c:                  ^
encode.c: encode.c:42:29: error: always_inline function '_mm256_slli_epi16' requires target feature 'avx2', but would be inlined into function 'crypto_encode_953x3_avx_constbranchindex' that is compiled without support for 'avx2'
encode.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:

Implementation	Compiler
`avx`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)`
`avx`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)`
`avx`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)`
`avx`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)`

Compiler output

encode.c: encode.c:27:18: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'crypto_encode_953x3_avx_constbranchindex' that is compiled without support for 'avx'
encode.c:     __m256i f0 = _mm256_loadu_si256((const __m256i *) (f+0));
encode.c:                  ^
encode.c: encode.c:27:18: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
encode.c: encode.c:28:18: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'crypto_encode_953x3_avx_constbranchindex' that is compiled without support for 'avx'
encode.c:     __m256i f1 = _mm256_loadu_si256((const __m256i *) (f+32));
encode.c:                  ^
encode.c: encode.c:28:18: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
encode.c: encode.c:29:18: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'crypto_encode_953x3_avx_constbranchindex' that is compiled without support for 'avx'
encode.c:     __m256i f2 = _mm256_loadu_si256((const __m256i *) (f+64));
encode.c:                  ^
encode.c: encode.c:29:18: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
encode.c: encode.c:30:18: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'crypto_encode_953x3_avx_constbranchindex' that is compiled without support for 'avx'
encode.c:     __m256i f3 = _mm256_loadu_si256((const __m256i *) (f+96));
encode.c:                  ^
encode.c: encode.c:30:18: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
encode.c: encode.c:34:18: error: always_inline function '_mm256_packus_epi16' requires target feature 'avx2', but would be inlined into function 'crypto_encode_953x3_avx_constbranchindex' that is compiled without support for 'avx2'
encode.c:     __m256i a0 = _mm256_packus_epi16(f0&lobytes,f1&lobytes);
encode.c:                  ^
encode.c: encode.c:34:18: error: AVX vector argument of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
encode.c: encode.c:37:38: error: always_inline function '_mm256_srli_epi16' requires target feature 'avx2', but would be inlined into function 'crypto_encode_953x3_avx_constbranchindex' that is compiled without support for 'avx2'
encode.c:     __m256i a1 = _mm256_packus_epi16(_mm256_srli_epi16(f0,8),_mm256_srli_epi16(f1,8));
encode.c:                                      ^
encode.c: encode.c:37:38: error: AVX vector argument of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
encode.c: encode.c:37:62: error: always_inline function '_mm256_srli_epi16' requires target feature 'avx2', but would be inlined into function 'crypto_encode_953x3_avx_constbranchindex' that is compiled without support for 'avx2'
encode.c: ...

Number of similar (implementation,compiler) pairs: 1, namely:

Implementation	Compiler
`avx`	`clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)`

Compiler output

encode.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/12/include/immintrin.h:47,
encode.c:                  from encode.c:2:
encode.c: /usr/lib/gcc/x86_64-linux-gnu/12/include/avx2intrin.h: In function 'crypto_encode_953x3_avx_constbranchindex':
encode.c: /usr/lib/gcc/x86_64-linux-gnu/12/include/avx2intrin.h:105:1: error: inlining failed in call to 'always_inline' '_mm256_add_epi8': target specific option mismatch
encode.c:   105 | _mm256_add_epi8 (__m256i __A, __m256i __B)
encode.c:       | ^~~~~~~~~~~~~~~
encode.c: encode.c:55:10: note: called from here
encode.c:    55 |     b0 = _mm256_add_epi8(b0,_mm256_set1_epi8(85));
encode.c:       |          ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
encode.c: /usr/lib/gcc/x86_64-linux-gnu/12/include/avx2intrin.h:1044:1: error: inlining failed in call to 'always_inline' '_mm256_permutevar8x32_epi32': target specific option mismatch
encode.c:  1044 | _mm256_permutevar8x32_epi32 (__m256i __X, __m256i __Y)
encode.c:       | ^~~~~~~~~~~~~~~~~~~~~~~~~~~
encode.c: encode.c:53:10: note: called from here
encode.c:    53 |     b0 = _mm256_permutevar8x32_epi32(b0,_mm256_set_epi32(7,3,6,2,5,1,4,0));
encode.c:       |          ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
encode.c: /usr/lib/gcc/x86_64-linux-gnu/12/include/avx2intrin.h:105:1: error: inlining failed in call to 'always_inline' '_mm256_add_epi8': target specific option mismatch
encode.c:   105 | _mm256_add_epi8 (__m256i __A, __m256i __B)
encode.c:       | ^~~~~~~~~~~~~~~
encode.c: encode.c:51:10: note: called from here
encode.c:    51 |     b0 = _mm256_add_epi8(b0,_mm256_slli_epi16(b2&_mm256_set1_epi8(15),4));
encode.c:       |          ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
encode.c: /usr/lib/gcc/x86_64-linux-gnu/12/include/avx2intrin.h:668:1: error: inlining failed in call to 'always_inline' '_mm256_slli_epi16': target specific option mismatch
encode.c:   668 | _mm256_slli_epi16 (__m256i __A, int __B)
encode.c:       | ^~~~~~~~~~~~~~~~~
encode.c: encode.c:51:10: note: called from here
encode.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:

Implementation	Compiler
`avx`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)`
`avx`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)`
`avx`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)`
`avx`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)`

Passed TIMECOP

TIMECOP iterations: 10

Number of similar (implementation,compiler) pairs: 9, namely:

Implementation	Compiler
`ref`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)`
`ref`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)`
`ref`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)`
`ref`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)`
`ref`	`clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)`
`ref`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)`
`ref`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)`
`ref`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)`
`ref`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)`