Test results for aarch64, pi3aplus, crypto_stream/chacha8
[Page version: 20250510 09:23:11]
Test results for crypto_stream/chacha8
Computer: pi3aplus
Microarchitecture: aarch64; Cortex-A53 (410fd034)
Architecture: aarch64
CPU ID: 410fd034 
SUPERCOP version: 20250415
Operation: crypto_stream
Primitive: chacha8
| Time | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version | 
|---|---|---|---|---|---|---|
| 3995 | 4796 0 4 | 16756 768 800 | dolbeau/arm-neon | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250409 | 20250307 | 
| 4002 | 3820 0 4 | 14828 768 800 | dolbeau/arm-neon | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250409 | 20250307 | 
| 4482 | 3500 0 4 | 13452 752 784 | dolbeau/arm-neon | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250409 | 20250307 | 
| 6000 | 4220 0 4 | 15195 760 792 | dolbeau/arm-neon | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250409 | 20250307 | 
| 7745 | 2948 0 4 | 13956 768 800 | e/merged | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250409 | 20250307 | 
| 7748 | 3596 0 4 | 15556 768 800 | e/merged | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250409 | 20250307 | 
| 8044 | 3276 0 4 | 15252 768 800 | dolbeau/mipsel-msa | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250409 | 20250307 | 
| 8048 | 3308 0 4 | 15268 768 800 | e/ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250409 | 20250307 | 
| 8569 | 4752 0 4 | 16716 768 800 | e/regs | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250409 | 20250307 | 
| 12130 | 2624 0 4 | 12580 752 784 | e/merged | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250409 | 20250307 | 
| 16606 | 2228 0 4 | 13252 768 800 | dolbeau/mipsel-msa | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250409 | 20250307 | 
| 16632 | 2260 0 4 | 13284 768 800 | e/ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250409 | 20250307 | 
| 17048 | 2788 0 4 | 13812 768 800 | e/regs | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250409 | 20250307 | 
| 17528 | 3724 0 4 | 14691 760 792 | e/merged | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250409 | 20250307 | 
| 20525 | 2076 0 4 | 12044 752 784 | dolbeau/mipsel-msa | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250409 | 20250307 | 
| 20528 | 2076 0 4 | 12028 752 784 | e/ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250409 | 20250307 | 
| 23550 | 3112 0 4 | 14083 760 792 | e/regs | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250409 | 20250307 | 
| 24846 | 2436 0 4 | 12380 752 784 | e/regs | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250409 | 20250307 | 
| 30444 | 2648 0 4 | 13635 760 792 | dolbeau/mipsel-msa | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250409 | 20250307 | 
| 30444 | 2648 0 4 | 13619 760 792 | e/ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250409 | 20250307 | 
Compiler output
chacha.S: chacha.S: Assembler messages:
chacha.S: chacha.S:19: Error: operand 1 must be an integer register -- `mov %rsp,%r11'
chacha.S: chacha.S:20: Error: operand 1 must be an integer or stack pointer register -- `and $31,%r11'
chacha.S: chacha.S:21: Error: operand 1 must be an integer or stack pointer register -- `add $384,%r11'
chacha.S: chacha.S:22: Error: operand 1 must be an integer or stack pointer register -- `sub %r11,%rsp'
chacha.S: chacha.S:23: Error: operand 1 must be an integer register -- `mov %rdi,%r8'
chacha.S: chacha.S:24: Error: operand 1 must be an integer register -- `mov %rsi,%rsi'
chacha.S: chacha.S:25: Error: operand 1 must be an integer register -- `mov %rsi,%rdi'
chacha.S: chacha.S:26: Error: operand 1 must be an integer register -- `mov %rdx,%rdx'
chacha.S: chacha.S:27: Error: operand 1 must be an integer or stack pointer register -- `cmp $0,%rdx'
chacha.S: chacha.S:29: Error: unknown mnemonic `jbe' -- `jbe ._done'
chacha.S: chacha.S:31: Error: operand 1 must be an integer register -- `mov $0,%rax'
chacha.S: chacha.S:33: Error: operand 1 must be an integer register -- `mov %rdx,%rcx'
chacha.S: chacha.S:35: Error: unknown mnemonic `rep' -- `rep stosb'
chacha.S: chacha.S:37: Error: operand 1 must be an integer or stack pointer register -- `sub %rdx,%rdi'
chacha.S: chacha.S:39: Error: unknown mnemonic `jmp' -- `jmp ._start'
chacha.S: chacha.S:47: Error: operand 1 must be an integer register -- `mov %rsp,%r11'
chacha.S: chacha.S:48: Error: operand 1 must be an integer or stack pointer register -- `and $31,%r11'
chacha.S: chacha.S:49: Error: operand 1 must be an integer or stack pointer register -- `add $384,%r11'
chacha.S: chacha.S:50: Error: operand 1 must be an integer or stack pointer register -- `sub %r11,%rsp'
chacha.S: chacha.S:52: Error: operand 1 must be an integer register -- `mov %rdi,%r8'
chacha.S: chacha.S:54: Error: operand 1 must be an integer register -- `mov %rsi,%rsi'
chacha.S: chacha.S:56: Error: operand 1 must be an integer register -- `mov %rdx,%rdi'
chacha.S: chacha.S:58: Error: operand 1 must be an integer register -- `mov %rcx,%rdx'
chacha.S: chacha.S:60: Error: operand 1 must be an integer or stack pointer register -- `cmp $0,%rdx'
chacha.S: ...
Number of similar (implementation,compiler) pairs: 4, namely:
| Implementation | Compiler | 
|---|---|
| amd64-ssse3 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| amd64-ssse3 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| amd64-ssse3 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| amd64-ssse3 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
Compiler output
stream.cpp: stream.cpp:1:10: fatal error: cryptopp/chacha.h: No such file or directory
stream.cpp:     1 | #include <cryptopp/chacha.h>
stream.cpp:       |          ^~~~~~~~~~~~~~~~~~~
stream.cpp: compilation terminated.
Number of similar (implementation,compiler) pairs: 4, namely:
| Implementation | Compiler | 
|---|---|
| T:cryptopp | g++ -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| T:cryptopp | g++ -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| T:cryptopp | g++ -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| T:cryptopp | g++ -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
Compiler output
chacha.c: In file included from chacha.c:94:
chacha.c: u4.h: In function 'crypto_stream_chacha8_dolbeau_arm_neon_constbranchindex_ECRYPT_encrypt_bytes':
chacha.c: u4.h:77:14: warning: unused variable 't_15' [-Wunused-variable]
chacha.c:    77 |   uint32x4_t t_15;
chacha.c:       |              ^~~~
chacha.c: u4.h:76:14: warning: unused variable 't_14' [-Wunused-variable]
chacha.c:    76 |   uint32x4_t t_14;
chacha.c:       |              ^~~~
chacha.c: u4.h:75:14: warning: unused variable 't_13' [-Wunused-variable]
chacha.c:    75 |   uint32x4_t t_13;
chacha.c:       |              ^~~~
chacha.c: u4.h:74:14: warning: unused variable 't_12' [-Wunused-variable]
chacha.c:    74 |   uint32x4_t t_12;
chacha.c:       |              ^~~~
chacha.c: u4.h:69:14: warning: unused variable 't_7' [-Wunused-variable]
chacha.c:    69 |   uint32x4_t t_7;
chacha.c:       |              ^~~
chacha.c: u4.h:68:14: warning: unused variable 't_6' [-Wunused-variable]
chacha.c:    68 |   uint32x4_t t_6;
chacha.c:       |              ^~~
chacha.c: u4.h:67:14: warning: unused variable 't_5' [-Wunused-variable]
chacha.c:    67 |   uint32x4_t t_5;
chacha.c:       |              ^~~
chacha.c: u4.h:66:14: warning: unused variable 't_4' [-Wunused-variable]
chacha.c:    66 |   uint32x4_t t_4;
chacha.c: ...
Number of similar (implementation,compiler) pairs: 4, namely:
| Implementation | Compiler | 
|---|---|
| dolbeau/arm-neon | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| dolbeau/arm-neon | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| dolbeau/arm-neon | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| dolbeau/arm-neon | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
Compiler output
chacha.c: In file included from chacha.c:1:
chacha.c: e/chacha.c: In function 'crypto_stream_chacha8_e_merged_constbranchindex_ECRYPT_encrypt_bytes':
chacha.c: e/chacha.c:179:46: warning: 'ctarget' may be used uninitialized [-Wmaybe-uninitialized]
chacha.c:   179 |         for (i = 0;i < bytes;++i) ctarget[i] = c[i];
chacha.c:       |                                   ~~~~~~~~~~~^~~~~~
chacha.c: e/chacha.c:64:7: note: 'ctarget' was declared here
chacha.c:    64 |   u8 *ctarget;
chacha.c:       |       ^~~~~~~
Number of similar (implementation,compiler) pairs: 4, namely:
| Implementation | Compiler | 
|---|---|
| e/merged | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| e/merged | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| e/merged | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| e/merged | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
Compiler output
stream.c: stream.c:11:10: fatal error: immintrin.h: No such file or directory
stream.c:    11 | #include <immintrin.h>
stream.c:       |          ^~~~~~~~~~~~~
stream.c: compilation terminated.
Number of similar (implementation,compiler) pairs: 4, namely:
| Implementation | Compiler | 
|---|---|
| goll_gueron | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| goll_gueron | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| goll_gueron | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| goll_gueron | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
Compiler output
stream.c: stream.c:8:10: fatal error: immintrin.h: No such file or directory
stream.c:     8 | #include <immintrin.h>
stream.c:       |          ^~~~~~~~~~~~~
stream.c: compilation terminated.
Number of similar (implementation,compiler) pairs: 4, namely:
| Implementation | Compiler | 
|---|---|
| krovetz/avx2 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| krovetz/avx2 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| krovetz/avx2 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| krovetz/avx2 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
Compiler output
stream.c: stream.c:80:2: error: #error -- Implementation supports only machines with neon, altivec or SSE2
stream.c:    80 | #error -- Implementation supports only machines with neon, altivec or SSE2
stream.c:       |  ^~~~~
stream.c: stream.c: In function 'crypto_stream_chacha8_krovetz_vec128_constbranchindex_xor':
stream.c: stream.c:151:14: warning: implicit declaration of function 'NONCE' [-Wimplicit-function-declaration]
stream.c:   151 |     vec s3 = NONCE(np);
stream.c:       |              ^~~~~
stream.c: stream.c:151:14: error: incompatible types when initializing type 'vec' {aka '__vector(4) unsigned int'} using type 'int'
stream.c: stream.c:91:19: error: 'VBPI' undeclared (first use in this function); did you mean 'BPI'?
stream.c:    91 | #define BPI      (VBPI + GPR_TOO)  /* Blocks computed per loop iteration   */
stream.c:       |                   ^~~~
stream.c: stream.c:152:36: note: in expansion of macro 'BPI'
stream.c:   152 |     for (iters = 0; iters < inlen/(BPI*64); iters++) {
stream.c:       |                                    ^~~
stream.c: stream.c:91:19: note: each undeclared identifier is reported only once for each function it appears in
stream.c:    91 | #define BPI      (VBPI + GPR_TOO)  /* Blocks computed per loop iteration   */
stream.c:       |                   ^~~~
stream.c: stream.c:152:36: note: in expansion of macro 'BPI'
stream.c:   152 |     for (iters = 0; iters < inlen/(BPI*64); iters++) {
stream.c:       |                                    ^~~
stream.c: stream.c:91:26: error: 'GPR_TOO' undeclared (first use in this function)
stream.c:    91 | #define BPI      (VBPI + GPR_TOO)  /* Blocks computed per loop iteration   */
stream.c:       |                          ^~~~~~~
stream.c: stream.c:152:36: note: in expansion of macro 'BPI'
stream.c:   152 |     for (iters = 0; iters < inlen/(BPI*64); iters++) {
stream.c: ...
Number of similar (implementation,compiler) pairs: 4, namely:
| Implementation | Compiler | 
|---|---|
| krovetz/vec128 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| krovetz/vec128 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| krovetz/vec128 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| krovetz/vec128 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
Passed TIMECOP
TIMECOP iterations: 10
Number of similar (implementation,compiler) pairs: 20, namely:
| Implementation | Compiler | 
|---|---|
| dolbeau/arm-neon | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| dolbeau/arm-neon | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| dolbeau/arm-neon | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| dolbeau/arm-neon | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| dolbeau/mipsel-msa | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| dolbeau/mipsel-msa | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| dolbeau/mipsel-msa | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| dolbeau/mipsel-msa | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| e/merged | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| e/merged | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| e/merged | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| e/merged | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| e/ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| e/ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| e/ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| e/ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| e/regs | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| e/regs | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| e/regs | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| e/regs | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |