Test results for amd64, dali, crypto_sort/int64

[Page version: 20260408 05:54:07]

Measurements for amd64, dali, crypto_sort Test results for amd64, dali, crypto_sort Test results for crypto_sort/int64
Computer: dali
Microarchitecture: amd64; Zen (820f01)
Architecture: amd64
CPU ID: AuthenticAMD-00820f01-178bfbff
SUPERCOP version: 20260330
Operation: crypto_sort
Primitive: int64
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
2690733983 0 056488 1272 14722026avx2clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
2694431775 0 053680 1272 14722026avx2clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
2702730987 0 050782 1264 15042026avx2clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
2741135388 0 057624 1272 1472sse42clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
2765935388 0 058224 1272 1472sse42clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
2779530815 0 050176 1272 14722026avx2clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
2803834616 0 054742 1264 1504sse42clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
2885234484 0 054128 1272 1472sse42clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
3143635573 0 055911 1200 1536sse42gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
3162836570 0 058399 1200 1536sse42gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
3220334573 0 054343 1200 1536sse42gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
3319838145 0 056827 1184 1504sse42gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
3346837181 0 059047 1200 15362026avx2gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
3369034485 0 053187 1184 15042026avx2gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
3376233126 0 052919 1200 15362026avx2gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
3437033877 0 054247 1200 15362026avx2gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
5522211252 0 032903 1200 1536x863gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
5602111592 0 034544 1272 1472x863clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
560839992 0 032344 1272 1472x863clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
5655711689 0 033888 1272 1472x863clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
5674410804 0 030959 1200 1536x863gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
569728644 0 028862 1264 1504x863clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
5700810271 0 029847 1200 1536x863gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
570789498 0 029256 1272 1472x863clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
62784918 0 023120 1272 1472portable4clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
62954837 0 023192 1272 1472portable4clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
62969917 0 023872 1272 1472portable4clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
64251853 0 020632 1272 1472portable4clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
64940678 0 020902 1264 1504portable4clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
653693906 0 022379 1184 1504x863gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
70493857 0 022511 1200 1536portable4gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
70816857 0 021015 1200 1536portable4gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
911551329 0 024293 1296 1568T:stdsortg++ -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2026033120260330
913621397 0 025405 1296 1568T:stdsortg++ -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2026033120260330
95606936 0 021787 1280 1536T:stdsortg++ -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2026033120260330
958411312 0 024105 1352 1504T:stdsortclang++ -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2026033120260330
964091709 0 026832 1352 1472T:stdsortclang++ -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2026033120260330
971831167 0 023395 1296 1568T:stdsortg++ -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2026033120260330
972221472 0 025892 1352 1472T:stdsortclang++ -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2026033120260330
981021555 0 023823 1360 1472T:stdsortclang++ -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2026033120260330
98700981 0 020559 1200 1536portable4gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
145851664 0 019147 1184 1504portable4gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2026030220260217

Compiler output


sort.c: sort.c:123:8: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'int64_sort_8through16' that is compiled without support for 'avx'
sort.c:   x0 = int64x4_load(x+0); // 0 4 8 12
sort.c:        ^
sort.c: sort.c:17:25: note: expanded from macro 'int64x4_load'
sort.c: #define int64x4_load(z) _mm256_loadu_si256((__m256i *) (z))
sort.c:                         ^
sort.c: sort.c:123:8: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
sort.c: sort.c:17:25: note: expanded from macro 'int64x4_load'
sort.c: #define int64x4_load(z) _mm256_loadu_si256((__m256i *) (z))
sort.c:                         ^
sort.c: sort.c:124:8: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'int64_sort_8through16' that is compiled without support for 'avx'
sort.c:   x1 = int64x4_load(x+4); // 1 5 9 13
sort.c:        ^
sort.c: sort.c:17:25: note: expanded from macro 'int64x4_load'
sort.c: #define int64x4_load(z) _mm256_loadu_si256((__m256i *) (z))
sort.c:                         ^
sort.c: sort.c:124:8: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
sort.c: sort.c:17:25: note: expanded from macro 'int64x4_load'
sort.c: #define int64x4_load(z) _mm256_loadu_si256((__m256i *) (z))
sort.c:                         ^
sort.c: sort.c:125:11: error: always_inline function '_mm256_set1_epi64x' requires target feature 'avx', but would be inlined into function 'int64_sort_8through16' that is compiled without support for 'avx'
sort.c:   infty = int64x4_broadcast(int64_largest);
sort.c:           ^
sort.c: sort.c:41:27: note: expanded from macro 'int64x4_broadcast'
sort.c: #define int64x4_broadcast _mm256_set1_epi64x
sort.c: ...

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
2026avx2clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)

Compiler output


sort.c: sort.c:130:93: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'int64_sort_8through16' that is compiled without support for 'ssse3'
sort.c:   x4 = int8x16_iftopthenelse(int64x2_smaller_mask(int64x2_set(8,9),int64x2_broadcast(pos4)),int8x16_varextract(int64x2_load(x+pos4-2),int8x16_sub(int8x16_set(16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31),int8x16_broadcast(8*(pos4&1)))),infty); // 4 12
sort.c:                                                                                             ^
sort.c: sort.c:39:28: note: expanded from macro 'int8x16_varextract'
sort.c: #define int8x16_varextract _mm_shuffle_epi8
sort.c:                            ^
sort.c: sort.c:130:30: error: always_inline function '_mm_cmpgt_epi64' requires target feature 'sse4.2', but would be inlined into function 'int64_sort_8through16' that is compiled without support for 'sse4.2'
sort.c:   x4 = int8x16_iftopthenelse(int64x2_smaller_mask(int64x2_set(8,9),int64x2_broadcast(pos4)),int8x16_varextract(int64x2_load(x+pos4-2),int8x16_sub(int8x16_set(16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31),int8x16_broadcast(8*(pos4&1)))),infty); // 4 12
sort.c:                              ^
sort.c: sort.c:20:35: note: expanded from macro 'int64x2_smaller_mask'
sort.c: #define int64x2_smaller_mask(a,b) _mm_cmpgt_epi64(b,a)
sort.c:                                   ^
sort.c: sort.c:130:8: error: always_inline function '_mm_blendv_epi8' requires target feature 'sse4.1', but would be inlined into function 'int64_sort_8through16' that is compiled without support for 'sse4.1'
sort.c:   x4 = int8x16_iftopthenelse(int64x2_smaller_mask(int64x2_set(8,9),int64x2_broadcast(pos4)),int8x16_varextract(int64x2_load(x+pos4-2),int8x16_sub(int8x16_set(16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31),int8x16_broadcast(8*(pos4&1)))),infty); // 4 12
sort.c:        ^
sort.c: sort.c:23:38: note: expanded from macro 'int8x16_iftopthenelse'
sort.c: #define int8x16_iftopthenelse(c,t,e) _mm_blendv_epi8(e,t,c)
sort.c:                                      ^
sort.c: sort.c:132:95: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'int64_sort_8through16' that is compiled without support for 'ssse3'
sort.c:   x5 = int8x16_iftopthenelse(int64x2_smaller_mask(int64x2_set(10,11),int64x2_broadcast(pos5)),int8x16_varextract(int64x2_load(x+pos5-2),int8x16_sub(int8x16_set(16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31),int8x16_broadcast(8*(pos5&1)))),infty); // 5 13
sort.c:                                                                                               ^
sort.c: sort.c:39:28: note: expanded from macro 'int8x16_varextract'
sort.c: #define int8x16_varextract _mm_shuffle_epi8
sort.c:                            ^
sort.c: sort.c:132:30: error: always_inline function '_mm_cmpgt_epi64' requires target feature 'sse4.2', but would be inlined into function 'int64_sort_8through16' that is compiled without support for 'sse4.2'
sort.c: ...

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
sse42clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)

Compiler output


try.cpp: In file included from try-anything.cpp:22:
try.cpp: .../supercop-data/dali/amd64/include/stq.h:32:13: warning: unused function 'stq_sort' [-Wunused-function]
try.cpp: static void stq_sort(stq_doublearray S,long long n)
try.cpp:             ^
try.cpp: 1 warning generated.
try.cpp: In file included from try-anything.cpp:22:
try.cpp: .../supercop-data/dali/amd64/include/stq.h:32:13: warning: unused function 'stq_sort' [-Wunused-function]
try.cpp: static void stq_sort(stq_doublearray S,long long n)
try.cpp:             ^
try.cpp: 1 warning generated.
measure.cpp: In file included from measure-anything.cpp:19:
measure.cpp: .../supercop-data/dali/amd64/include/stq.h:32:13: warning: unused function 'stq_sort' [-Wunused-function]
measure.cpp: static void stq_sort(stq_doublearray S,long long n)
measure.cpp:             ^
measure.cpp: 1 warning generated.

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
T:stdsortclang++ -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)
T:stdsortclang++ -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)
T:stdsortclang++ -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)
T:stdsortclang++ -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)

Compiler output


try.cpp: In file included from try-anything.cpp:22:
try.cpp: .../supercop-data/dali/amd64/include/stq.h:32:13: warning: 'void stq_sort(std::vector<double>, long long int)' defined but not used [-Wunused-function]
try.cpp:    32 | static void stq_sort(stq_doublearray S,long long n)
try.cpp:       |             ^~~~~~~~
try.cpp: In file included from try-anything.cpp:22:
try.cpp: .../supercop-data/dali/amd64/include/stq.h:32:13: warning: 'void stq_sort(std::vector<double>, long long int)' defined but not used [-Wunused-function]
try.cpp:    32 | static void stq_sort(stq_doublearray S,long long n)
try.cpp:       |             ^~~~~~~~
measure.cpp: In file included from measure-anything.cpp:19:
measure.cpp: .../supercop-data/dali/amd64/include/stq.h:32:13: warning: 'void stq_sort(std::vector<double>, long long int)' defined but not used [-Wunused-function]
measure.cpp:    32 | static void stq_sort(stq_doublearray S,long long n)
measure.cpp:       |             ^~~~~~~~

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
T:stdsortg++ -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
T:stdsortg++ -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
T:stdsortg++ -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
T:stdsortg++ -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)

Passed TIMECOP


TIMECOP iterations: 10

Number of similar (implementation,compiler) pairs: 34, namely:
ImplementationCompiler
2026avx2clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)
2026avx2clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)
2026avx2clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)
2026avx2clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)
2026avx2gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
2026avx2gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
2026avx2gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
2026avx2gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
portable4clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)
portable4clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)
portable4clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)
portable4clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)
portable4clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)
portable4gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
portable4gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
portable4gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
portable4gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
sse42clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)
sse42clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)
sse42clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)
sse42clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)
sse42gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
sse42gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
sse42gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
sse42gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
x863clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)
x863clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)
x863clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)
x863clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)
x863clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)
x863gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
x863gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
x863gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)
x863gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)